NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F089274

Overview

Basic Information
Family ID: F089274
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 109
Average Sequence Length: 119 residues
Representative Sequence: GPGTGISTSLGLLDARGPALARTLAMRGLDPREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA
Number of Associated Samples: 82
Number of Associated Scaffolds: 109

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 8.49 %
% of genes near scaffold ends (potentially truncated): 87.16 %
% of genes from short scaffolds (< 2000 bps): 92.66 %
Associated GOLD sequencing projects: 78
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.71


Most Common Taxonomy
Group: Unclassified (68.807 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (34.862 % of family members)
Environment Ontology (ENVO): Unclassified (36.697 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (48.624 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 23.61%, β-sheet 22.22%, Coil/Unstructured 54.17%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.71

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.3.1.12: IgG-specific endopeptidase IdeS (Sib38) | d1y08a_ | 1y08 | 0.7051
d.3.1.1: Papain-like | d1pcia_ | 1pci | 0.70439
d.3.1.1: Papain-like | d3pdfa2 | 3pdf | 0.68565
d.3.1.1: Papain-like | d3f5va_ | 3f5v | 0.68367
d.3.1.0: automated matches | d3qsda_ | 3qsd | 0.67685


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 109 Family Scaffolds
PF12666 | PrgI | 4.59
PF00501 | AMP-binding | 3.67
PF13751 | DDE_Tnp_1_6 | 0.92
PF03446 | NAD_binding_2 | 0.92
PF01850 | PIN | 0.92
PF09913 | DUF2142 | 0.92
PF01636 | APH | 0.92
PF01381 | HTH_3 | 0.92
PF13847 | Methyltransf_31 | 0.92
PF12840 | HTH_20 | 0.92
PF02899 | Phage_int_SAM_1 | 0.92
PF01068 | DNA_ligase_A_M | 0.92
PF12680 | SnoaL_2 | 0.92
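
The frequencies above are percentages of the 109 family scaffolds, so each value should correspond to a whole number of scaffolds. The following is a minimal sketch of that conversion, assuming the percentages were rounded to two decimals; the function name `scaffolds_from_percent` is a hypothetical helper and not part of NMPFamsDB.

```python
# Convert "% Frequency in 109 Family Scaffolds" back to an approximate scaffold count.
# Assumption: the page rounds count / 109 * 100 to two decimal places.
FAMILY_SCAFFOLDS = 109  # "Number of Associated Scaffolds" reported for F089274


def scaffolds_from_percent(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Return the nearest whole number of scaffolds for a percent frequency."""
    return round(percent / 100 * total)


# Three values taken from the Pfam table.
pfam_percent = {"PF12666": 4.59, "PF00501": 3.67, "PF01850": 0.92}
pfam_counts = {pfam: scaffolds_from_percent(p) for pfam, p in pfam_percent.items()}
print(pfam_counts)
```

Read this way, PrgI (4.59 %) neighbors the family on about 5 scaffolds, AMP-binding (3.67 %) on about 4, and every 0.92 % entry on a single scaffold.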

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 109 Family Scaffolds
COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 0.92
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 0.92
COG4973 | Site-specific recombinase XerC | Replication, recombination and repair [L] | 0.92
COG4974 | Site-specific recombinase XerD | Replication, recombination and repair [L] | 0.92


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 68.81 %
All Organisms | root | All Organisms | 31.19 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300001538|A10PFW1_11331414 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1053
3300005178|Ga0066688_10045449 | All Organisms → cellular organisms → Bacteria | 2516
3300005181|Ga0066678_11121709 | Not Available | 505
3300005332|Ga0066388_105613185 | Not Available | 635
3300005332|Ga0066388_108661559 | Not Available | 506
3300005454|Ga0066687_10359932 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 835
3300005467|Ga0070706_100384041 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Thermomicrobia → Sphaerobacteridae → Sphaerobacterales → Sphaerobacterineae → Sphaerobacteraceae → Nitrolancea → Nitrolancea hollandica | 1308
3300005468|Ga0070707_101775588 | Not Available | 584
3300005471|Ga0070698_101334552 | Not Available | 668
3300005536|Ga0070697_100962703 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 758
3300005537|Ga0070730_10736002 | Not Available | 624
3300005552|Ga0066701_10910602 | Not Available | 521
3300005557|Ga0066704_10114726 | All Organisms → cellular organisms → Bacteria | 1781
3300005559|Ga0066700_10403510 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 963
3300005559|Ga0066700_11177494 | Not Available | 500
3300005561|Ga0066699_10674896 | Not Available | 739
3300005586|Ga0066691_10618095 | Not Available | 644
3300005764|Ga0066903_101367027 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1328
3300005764|Ga0066903_105943454 | Not Available | 640
3300006175|Ga0070712_100433850 | Not Available | 1091
3300006797|Ga0066659_11434842 | Not Available | 577
3300006797|Ga0066659_11792173 | Not Available | 519
3300009088|Ga0099830_10511600 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 980
3300009088|Ga0099830_10924014 | Not Available | 722
3300009089|Ga0099828_10511615 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1082
3300009090|Ga0099827_10843329 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 793
3300009137|Ga0066709_101513323 | Not Available | 967
3300009156|Ga0111538_11821784 | Not Available | 766
3300009156|Ga0111538_12640743 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 630
3300009156|Ga0111538_13952044 | Not Available | 512
3300009553|Ga0105249_12429464 | Not Available | 597
3300010038|Ga0126315_10358988 | Not Available | 909
3300010040|Ga0126308_10306715 | Not Available | 1044
3300010046|Ga0126384_11788146 | Not Available | 583
3300010048|Ga0126373_10724351 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1054
3300010166|Ga0126306_11095037 | Not Available | 652
3300010166|Ga0126306_11121858 | Not Available | 644
3300010358|Ga0126370_12592102 | Not Available | 507
3300010360|Ga0126372_12371986 | Not Available | 581
3300010376|Ga0126381_101604798 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 940
3300011269|Ga0137392_10698437 | Not Available | 840
3300011269|Ga0137392_11172411 | Not Available | 626
3300011270|Ga0137391_10865326 | Not Available | 741
3300011270|Ga0137391_10928453 | Not Available | 711
3300011270|Ga0137391_11517319 | Not Available | 515
3300011271|Ga0137393_10777087 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 819
3300011992|Ga0120146_1061351 | Not Available | 615
3300011999|Ga0120148_1086346 | Not Available | 611
3300011999|Ga0120148_1099573 | Not Available | 561
3300012011|Ga0120152_1083941 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 935
3300012014|Ga0120159_1030100 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1883
3300012019|Ga0120139_1017906 | Not Available | 1619
3300012019|Ga0120139_1165676 | Not Available | 580
3300012096|Ga0137389_11804415 | Not Available | 507
3300012189|Ga0137388_10626293 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1000
3300012189|Ga0137388_10663890 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 969
3300012201|Ga0137365_11163073 | Not Available | 553
3300012203|Ga0137399_10954292 | Not Available | 722
3300012206|Ga0137380_10788640 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 821
3300012207|Ga0137381_11162078 | Not Available | 663
3300012209|Ga0137379_10875478 | Not Available | 803
3300012209|Ga0137379_10949061 | All Organisms → cellular organisms → Bacteria | 765
3300012210|Ga0137378_10892930 | Not Available | 802
3300012350|Ga0137372_10041435 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 4146
3300012351|Ga0137386_10075932 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2350
3300012354|Ga0137366_10416188 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 977
3300012355|Ga0137369_10129717 | Not Available | 2026
3300012355|Ga0137369_10689693 | Not Available | 703
3300012357|Ga0137384_10205808 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1646
3300012357|Ga0137384_10547351 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 948
3300012359|Ga0137385_10524576 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1001
3300012360|Ga0137375_10630045 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 888
3300012360|Ga0137375_11307058 | Not Available | 547
3300012360|Ga0137375_11399732 | Not Available | 521
3300012363|Ga0137390_10498331 | Not Available | 1192
3300012363|Ga0137390_11037407 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 771
3300012532|Ga0137373_10992885 | Not Available | 608
3300012532|Ga0137373_11123629 | Not Available | 560
3300012971|Ga0126369_11375822 | Not Available | 796
3300013763|Ga0120179_1068840 | Not Available | 794
3300013772|Ga0120158_10194650 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1066
3300014052|Ga0120109_1027297 | Not Available | 1278
3300014325|Ga0163163_11950496 | Not Available | 647
3300015371|Ga0132258_13817244 | All Organisms → cellular organisms → Bacteria | 1026
3300015372|Ga0132256_103337198 | Not Available | 540
3300015374|Ga0132255_105946936 | Not Available | 516
3300015374|Ga0132255_106035217 | Not Available | 513
3300016341|Ga0182035_10697892 | Not Available | 884
3300018061|Ga0184619_10439431 | All Organisms → cellular organisms → Bacteria | 584
3300018071|Ga0184618_10185986 | Not Available | 863
3300018431|Ga0066655_10950366 | Not Available | 591
3300018468|Ga0066662_12483867 | Not Available | 546
3300018468|Ga0066662_12612053 | Not Available | 534
3300021384|Ga0213876_10062938 | Not Available | 1959
3300021384|Ga0213876_10212842 | Not Available | 1027
3300021861|Ga0213853_10829561 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 2258
3300021861|Ga0213853_11138208 | Not Available | 592
3300025910|Ga0207684_10827682 | Not Available | 781
3300027857|Ga0209166_10449746 | Not Available | 665
3300027875|Ga0209283_10805894 | Not Available | 577
3300031723|Ga0318493_10757010 | Not Available | 546
3300031805|Ga0318497_10424552 | Not Available | 744
3300031910|Ga0306923_11735687 | Not Available | 644
3300031947|Ga0310909_11164377 | Not Available | 625
3300033289|Ga0310914_10916612 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 776
3300034268|Ga0372943_0411382 | Not Available | 874
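
Each scaffold identifier in this table joins a sample's Taxon OID (the same numbers listed under Associated Samples) and a scaffold name with a `|` character. The sketch below splits one into its two parts; `ScaffoldRef` and `parse_scaffold` are hypothetical names introduced for illustration, and the all-digits check on the Taxon OID is an assumption drawn from the identifiers shown on this page.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # sample Taxon OID, e.g. "3300001538"
    scaffold_id: str  # scaffold name within that sample, e.g. "A10PFW1_11331414"


def parse_scaffold(identifier: str) -> ScaffoldRef:
    """Split a '<taxon OID>|<scaffold ID>' string into its two components."""
    taxon_oid, sep, scaffold_id = identifier.strip().partition("|")
    # Assumption: Taxon OIDs are purely numeric, as in every row of the table.
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"not a '<taxon OID>|<scaffold ID>' identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold("3300001538|A10PFW1_11331414")
print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the Taxon OID as a string avoids accidental numeric formatting when it is later used to look up the matching sample record.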



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 34.86%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 10.09%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 10.09%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 5.50%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.50%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 4.59%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.59%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 3.67%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.67%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 2.75%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.75%
Watersheds | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Watersheds | 1.83%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.83%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.83%
Plant Roots | Host-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots | 1.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere | 0.92%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.92%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.92%
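
Because the same habitat name can appear under several classification paths, it can help to total the distribution at a chosen level of the path. The following is a rough sketch of that aggregation using the rows of the table above; `totals_by_level` is a hypothetical helper, and splitting the path on "→" is an assumption about how the classification string is laid out.

```python
from collections import defaultdict

# (habitat classification path, % of family members), copied from the table above.
HABITATS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 34.86),
    ("Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost", 10.09),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 10.09),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil", 5.50),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 5.50),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 4.59),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil", 4.59),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil", 3.67),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 3.67),
    ("Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere", 2.75),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 2.75),
    ("Environmental → Aquatic → Freshwater → Sediment → Unclassified → Watersheds", 1.83),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 1.83),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil", 1.83),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 1.83),
    ("Host-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots", 1.83),
    ("Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere", 0.92),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere", 0.92),
    ("Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere", 0.92),
]


def totals_by_level(rows, level=0):
    """Sum percentages by the path element at `level` (0 = top-level category)."""
    totals = defaultdict(float)
    for path, percent in rows:
        parts = [part.strip() for part in path.split("→")]
        totals[parts[level]] += percent
    # Round to two decimals to match the precision of the table.
    return {name: round(total, 2) for name, total in totals.items()}


top_level = totals_by_level(HABITATS)
for name, total in top_level.items():
    print(f"{name}: {total:.2f}%")
```

Passing `level=1` groups the same rows by the second path element (for example Terrestrial, Aquatic, and Plants) instead of the top-level category.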


Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001538 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A10-PF 4A)- 1 week illumina | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated
3300010038 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot106 | Environmental
3300010040 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010166 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot27 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011992 | Permafrost microbial communities from Nunavut, Canada - A23_65cm_12M | Environmental
3300011999 | Permafrost microbial communities from Nunavut, Canada - A28_65cm_6M | Environmental
3300012011 | Permafrost microbial communities from Nunavut, Canada - A30_65cm_6M | Environmental
3300012014 | Permafrost microbial communities from Nunavut, Canada - A10_80cm_6M | Environmental
3300012019 | Permafrost microbial communities from Nunavut, Canada - A7_5cm_12M | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300013763 | Permafrost microbial communities from Nunavut, Canada - A15_65cm_0M | Environmental
3300013772 | Permafrost microbial communities from Nunavut, Canada - A10_80_0.25M | Environmental
3300014052 | Permafrost microbial communities from Nunavut, Canada - A23_35cm_12M | Environmental
3300014325 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaG | Host-Associated
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300021384 | Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9 | Host-Associated
3300021861 | Metatranscriptome of freshwater sediment microbial communities from post-fracked creek in Pennsylvania, United States - ABR_2016 (Metagenome Metatranscriptome) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300031723 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f23 | Environmental
3300031805 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f23 | Environmental
3300031910 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2) | Environmental
3300031947 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000H | Environmental
3300033289 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108 | Environmental
3300034268 | Forest soil microbial communities from Eldorado National Forest, California, USA - SNFC_MG_FRD_1.2 | Environmental

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
A10PFW1_113314141 | 3300001538 | Permafrost | GPGTGISTSLGLLDARGPALARTLAMRGLDPREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0066688_100454494 | 3300005178 | Soil | ARALAARGLYPREPRTSDGQLRPLGSIAELLAWLDMGPLLMDGARWFGEGHWFVGIGYDAGGLSIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVADCSTTLALRSRPCEESP*
Ga0066678_111217091 | 3300005181 | Soil | GAYGQQLGSLDDAVALVGPGSGISASLGLLDARGPALARSLAARGLDAREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0066388_1056131852 | 3300005332 | Tropical Forest Soil | LVQALTRAGLRARSPGTRPLGSIADLKFWLDQGPLLMDGATWFNEGHWFVGIGYDQNGIYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0066388_1086615591 | 3300005332 | Tropical Forest Soil | AWLTWRKVDCSAAALDWLLGAYGQTLASIDEAIALVGPNRGISTSLGLLDARGPALAAALTSRGFTPREPHDAAGRLRPLASVKELQAWLDQGPLLMDGASWFGEGHWFVAVGYDQSGVYIRDSSGWDTRYLPWSRLYGEVGFSGWVVGVT*
Ga0066687_103599322 | 3300005454 | Soil | AAALDWLLGAYGQPQGSIDAAIALVGPYTGISTSLGLLDARGPALASALASRGFTPREPRDGAGRLRPLASPKELQTWLDQGPLLMDGASWFGEGHWFVGVGYDQNGVYIRDSSGWDTRYLPWSRLYGEVGFSGWVVGVAT*
Ga0070706_1003840413 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | GTPLARALARRGLQPREPRTANGQLRPLNSITELKAWLQQGPLLMDGASWFGEGHWFVGIGYDGSGIYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAT*
Ga0070707_1017755881 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | DCSAAALDWLLGAYGQPLASIEDAIAMIGANTGISTTLGLLDARGPALARALAARGLHPRTPGQQPLNSTAQLKVWLDQGPLLLDGARWFGEGHWFVAIGYDQKGIYTRDSSGWDTRYLTWSRLYGEVGFSGWVVGIAS*
Ga0070698_1013345521 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | VALVAPGSGISASLGLLDARGPALTRALAVRGLDPREPRASNGQLKPLGSIAELEAWLNQGPLLMDSARWFGERHWFVGIDYDAGEISIGDSSGWDNRHLSWSRLYGEVGLSGWGVGVAA
Ga0070697_1009627032 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | ERGTALSNAVSREGLKPRTPAQRPLGSISELKSWLDQGPLLMDGARWFGEGHWFVAVGYDKNGVYTRDSSGWDTRYLTWARLYGEVGFSGWVVGVAP*
Ga0070730_107360022 | 3300005537 | Surface Soil | ALDWLLGSYGRAVGSIDDAIAVIGPSSGISTTLGLLDERGPALAQALSREGLKPRTPGQRPLGSTRELEAWLDRGPLLMDGARWFGEGHWFVAIGYDKNGVYTRDSSGWDTRYLTWSRLYGEVGFTGWVVGVAQ*
Ga0066701_109106022 | 3300005552 | Soil | LAARGLQPRTPGQRPLDSIAELEAWLDQGPLLMDGARWFGEGHWFVAIGYDTRGIYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVADCSTTLALRSRPCEESP*
Ga0066704_101147263 | 3300005557 | Soil | RTSDGQLRPLGSIAELLAWLDMGPLLMDGARWFGEGHWFVGIGYDAGGLSIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVADCSTTLALRSRPCEESP*
Ga0066700_104035101 | 3300005559 | Soil | WLSWEKVDCSAAALDWLLGAYGRSIGSIDDAIALIGPGTGISTKLGLLDERGTALSNAVSREGLKPRTPAQRPLGSISELKSWLDQGPLLMDGARWFGEGHWFVAVGYDKNGVYTRDSSGWDTRYLTWARLYGEVGFSGWVVGVAP*
Ga0066700_111774941 | 3300005559 | Soil | NTGISTTLGLLDARGPALASALSTRGFTARQPRDRDGRLRPLVSTRELQAWLDQGPLLMDGASWFGEGHWFVGVGYDQNGIYVRDSSGWDTRYLTWSRLYGEVGFSGWVVGAG*
Ga0066699_106748962 | 3300005561 | Soil | ALSARGFTPREPRDAAGRLRPLASTRELQAWLDQGPLLMDGASWFGEGHWFVAVGYDQSGIYIRDSSGWDTRYLAWSRLYGEVGFSGWVVGVA*
Ga0066691_106180952 | 3300005586 | Soil | WFLGAYGQPVASIDDAIALVGPNTKISTILGLLDARGPALASALSTRGFTPRQPQDRDGRLRPLTSIRELQAWLDQGPLLLDGASWFGEGHWFVGVGYDKNGVYIRDSSGWDTRYLTWSRLYGEVGFSGWVVGAT*
Ga0066903_1013670271 | 3300005764 | Tropical Forest Soil | EQSWITWRSVDCSAAALDWLLGAYGQQLGSLDDAVALVGPGTGISTSLGLLDARGTALARALSERGLRPRTPGPRPLGSIAALETWLDQGPLLMDGARWFGQGHWFVGVGYDRNGVYVRDSSGWDTRYLTWSRLYSGVGFSGWVVGVAA*
Ga0066903_1059434541 | 3300005764 | Tropical Forest Soil | IALVGPGSGISSRLGLLDARGTPLARGLAARGLEPREPHTASGQLRPLNSIAELKAWLDQGPLLMDGASWFGEGHWFVDIGYDASGIYIRDSSGWDTRYLSWSRLYGKVGFSGWVVGVATRRRR*
Ga0066903_1082751802 | 3300005764 | Tropical Forest Soil | RGLSPREPRSGGGQLRPLGSTAELQAWLEQGPLLMDGSRWFGEGHWFVGIGYDSSGIYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0070712_1004338503 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | DWLLGAYGLRLGSIDQAIALIGPNTGISTSLGLLDATGRPLAKAIATSGFNPRNGQVHSIGELESWLDRGPLALDGAAWFGEGHWFVATGYDQNGIYIRDSSGWDNRYLTWSRLYGEVGFSGRVVGVSA*
Ga0066659_114348422 | 3300006797 | Soil | STSLGLLDARGPALARALAARGLHPREPRTSDGQLRPLGSIAELLAWLDMGPLLMDGARWFGEGHWFVGIGYDAGGLSIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVADCSTTLALRSRPCEESP*
Ga0066659_117921731 | 3300006797 | Soil | YGRTVSSLDDAIAVVGPGTGISTSLGLLDERGPALAQALEKEGLHARRPDQKPLGSIAELKAWLDQGPLLMDGARWYGKGHWFVAVGYDQNGIYTRDSSGWDTRYLNWSRLYGEVGFSGWVVGVG*
Ga0099830_105116002 | 3300009088 | Vadose Zone Soil | SNYRTDASWLTWRNADCSAAALDWLLGAYGQQLGSLDDAVALVGPGSGISASLGLLDARGPALARSLAARGLDAREPRASNGQLRPLGSIVELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSG*
Ga0099830_109240141 | 3300009088 | Vadose Zone Soil | LGAYGQPLGSLDDAIALVGPGTGISTSLGLLDARGTALARALAARGLQPRTPGRPLGSITELEAWLNQGPLLMDGGRWFGEGHWFVGIGYDAGGIYIRDSSGWDTRYLSWSRLYGDVGFSGWVVGVAG*
Ga0099828_105116151 | 3300009089 | Vadose Zone Soil | DWLLGAYGQQLGGLDDAIALVGPGTGLSPSLGLLDARGTAFARALAARGLQPRTPGQRPLGSIAELEAWLNQGPLLMDGARWFGVGHWFVGIGYDAGGIYIRDSSGWDTRHLSWSRLYGDVGFSGWVVGVST*
Ga0099827_108433291 | 3300009090 | Vadose Zone Soil | IALLGPGTGISTSLGLLDARGPALARALAVRSLDPREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA
Ga0066709_1015133231 | 3300009137 | Grasslands Soil | SIDDAIGLIGPNTGIAPGPGLLDARGPALARALTSRGLRPRTPGENPLASTAQLKEWLDQGPLLVDGVAWFGKGHWFVAVGYDKNGVYIRDSSGWDTRYLTWARLYGEVGFTGWVVGVAP
Ga0111538_118217842 | 3300009156 | Populus Rhizosphere | ALIGPNTGISTSLGLLDARGSGLASALARRGLQPRQPRDAAGRLRPVTSPSELEAWLDQGPLLMGGDRWFGEGHWFVGIGYDKNGVYIRDSSGWDTRYLTWSRLYGEVGFGGWVVGVAP*
Ga0111538_126407432 | 3300009156 | Populus Rhizosphere | VHTVRNGRTLGSLEDAITLISPNTGISTRLGLPDACGPALVRALSSQGLRPRSPGQKPLASVAELKAWLDLGPLLMGGARWFGTGHWFVAVGYDQNGIYTRDSSGWDTRYLTWSRLYGEVGFSGWVVGVSP*
Ga0111538_139520442 | 3300009156 | Populus Rhizosphere | LGSVEDAIALIGPNSGMSIRLGLLDARGRALARALSSQGSRPRMPGQTPLGSVAELEAWLDQGPLLMDGARWFGQGHWFVAVGYDHNGIYTRDSSGWDTRYLTW
Ga0105249_124294641 | 3300009553 | Switchgrass Rhizosphere | ALIGPNTGISTALGLLDARGSGLASALARRGLQPRQPRDTAGRLRPLTSTSELQAWLDQGPLLMGGDRWFGEGHWFVGIGYDRNGVYIRDSSGWDNRYLTWSRLYGDVGFNGWVVGVTP*
Ga0126315_103589882 | 3300010038 | Serpentine Soil | LGAYGEPLASMDDAIALVGPNTGISTTLGLLDARGPALASALSARGFTPREPRDAAGRLRPLASTKELQTWLDQGPLLMDGASWFGEGHWFVAVGYDQSGIYVRDSSGWDTRYLAWSRLYGEVGFSGWVVGVA*
Ga0126308_103067152 | 3300010040 | Serpentine Soil | AAALDWFLGAYGQPLGAIDDAIALIGPNTGISAALGLLDARGPGLASAVTRRGLQPRQPRDPAGRLRPLTSISELQAWLDQGPLLMGGDRWFGEGHWFVGIGYDRNGVYIRDSSGWDTRYLAWSRLYGEVGFSGWVVGVAR*
Ga0126384_117881461 | 3300010046 | Tropical Forest Soil | MPALDQFDRRYYASDRAWRTWRSSCAYGVQVQSIDAAIELIGPYTGISPSLGLLDARGPALAEALDRRGLRARVPRDRGGRPAALGSVAELQAWLDRGPLLMDGARWFGEGHWFVGMSYDRDGIYIRDSSGYDTQYLTWARLYGEVGFSGWVVGVA*
Ga0126373_107243512 | 3300010048 | Tropical Forest Soil | ASALSARGFMPREPRDGDGRLRPLGYVQELKAWLDQGPLLMDGAVWFGEGHWFVGVGYDQNGVYIRDSSGWDTRYLTWQRLYGEVGFSGWVVGVES*
Ga0126306_110950372 | 3300010166 | Serpentine Soil | ASALARRGLQPRQPRDAASKLRPLTSTSELQAWLDQGPLLMGGDRWFGEGHWFVGIGYDRNGVYIRDSSGWDNLYLTWSRLYGEVGFNGWVVGVTP*
Ga0126306_111218581 | 3300010166 | Serpentine Soil | DARGPALDRALAGRGLRPRAPAGRPLGSTAELKVWLDQGPLLMDGARWFGAGHWFVGISYDAGGISIRDSSGWDTRYLNWSRLYGEVGFSGWVVGVAA*
Ga0126370_125921021 | 3300010358 | Tropical Forest Soil | VAARSSATCGLAPRTPGARPLGSITELQSWLDQGPLLMDGARWFGEGHWFVATGYDSNGISIRDSSGWDTRYLSWNRLYGDVGFSGWVVGVRA*
Ga0126372_123719862 | 3300010360 | Tropical Forest Soil | LLGAYGQELSSLDEAIALVGPGTGISSRIGLLDARGPALARALDARGLAAREPRAGNGQLRPLGSIAELEAWLDQGPLLMDGARWFGEGHWFVGIGYDGSGIYIRDSSGWDTRYLTWSRLYGEVGFSGWVVGVG*
Ga0126381_1016047982 | 3300010376 | Tropical Forest Soil | LGPNTGISTTVGLLDARGLALAAALAERGLAPRQPRAGDGRLRSLGNTGELQAWLDRGPLLMDGASWFGEGHWFVGVGYDQNGVYIRDSSGWDTRYLTWGRLYGEVGFSGWVVGVG*
Ga0137392_1069843713300011269Vadose Zone SoilYGQPLGSIDDAIGLIGPGSGISTKLGLLDARGPALARALEDRGLHPRAPGQRPLGSIAELKGWLDQGPLLMDGARWFGGGHWFVAVGYDTGGVFTRDSSGWDTRYLSWSRLYGEVGFSGWVVGVTP*
Ga0137392_1117241113300011269Vadose Zone SoilLLGAYGQRLGSLDDAIALLGQGTGISTSLGLLDARGPALARALAVRGLDPREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0137391_1086532613300011270Vadose Zone SoilDWLLGAYGQQLGSLDDAIALVGPGTEISPSLGLLDWRGPALARALGSRGLSPREPRTGDGQLKPLASVADLEAWLNQGPLLTDGARWFGEEGHWFVGIGYHTGGIFIRDSSGLDNRCLSWSRLYDEGLSGWVVGVAA*
Ga0137391_1092845323300011270Vadose Zone SoilDWFLGAYGQPVASIDDAIALVGPNTKISTTLGLLDARGPALASALSTRGFTPRQPQDRDGRLRPLTSIRELQAWLDQGPLLMDGASWFGEGHWFVGVGYDQNGIYVRDSSGWDTRYLTWPRLYGEVGFSGWVVGVA*
Ga0137391_1151731913300011270Vadose Zone SoilSLGLLDARGPALARALAARGLQPRTPGQRPLGSIADLETWLDQGPLLMDGARWFGVGHWFVGIGYDAGGIYIRDSSGWDTRYLSWSRLYGDVGFSGWVVGVA*
Ga0137393_1077708723300011271Vadose Zone SoilLDWLLGAYGQQLGSLDDAIVLLGPGTGISTSFGLLDARGPALARALAVRGLDPREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0120146_106135113300011992PermafrostGPGSAISASLGLLDARGPALARALAARGLDPREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGQWFVGIGYDAGGIYVRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0120148_108634613300011999PermafrostLGLLDARGPALARALAGRGLRPRAPDQRPLGSIAELKGWLDQGPLLMDGARWFGEGHWFVAVGYDTGGVFTRDSSGWDTRYLSWSRLYGEVGFSGWVVGVTP*
Ga0120148_109957323300011999PermafrostDLAAGVGGHGAQPAEGLGLVQLDAHALGSLDDAIALLGPGTGISTSLGLLDARGPALARALAARGLDPREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0120152_108394123300012011PermafrostLLGAYGQPFASIEDATALIGPNTGISTRLGLLDARGLALARALAARGLHTRTPGQQPLNSIAQLKAWMDQGPLLLDGARWFGEGHWFVAVGYDQNGIYTRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0120159_103010013300012014PermafrostLARALAGRGLRPRAPDQRPLGSIAELKDWLDQGPLLMDGARWFGEGHWFVAVGYDNGGVFTRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAP*
Ga0120139_101790633300012019PermafrostDWMLGAYGQPLGSIDDAIGLIGPNTGIAPGPGLLDARGPALARALTSRGLQPRTPGKNPLSSIAQLKDWLDQGPLLLDGAAWFGKGHWFVAVGYDKNGVYIRDSSGWDNRYLTWARLYGEVGFSGWVVGVSP*
Ga0120139_116567613300012019PermafrostALARAIAGRGLRPRAPDQRPLGSIAELKGWLDQGPLLMDGARWFGEGHWFVAVGYDTGGVFTRDSSGWDTRYLSWSRLYGEVGFSGWVVGVTP*
Ga0137389_1180441523300012096Vadose Zone SoilPGTGISTKLGLLDARGTALARALASRGLRPRSPGQRPLGSIAELEAWLDQGPLLMDGARWFAEGHWFVAIGYDQGGVYIRDSSGWGTWYLSWSRLYGAVGFSGWVVGVTP*
Ga0137388_1062629313300012189Vadose Zone SoilVALVGPGSGISASLGLLDARGPALARSLAARGLDAREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA
Ga0137388_1066389013300012189Vadose Zone SoilLARALAGRGLRPRAPDQRPLGSIAELKGWLDQGPLLMDGARWFGEGHWFVAVGYDKGGVFTRDSSGWDTRYLSWSRLYGEVGFSGWVVGVTP*
Ga0137365_1116307313300012201Vadose Zone SoilSLDDAIALLGPGTGISTSLGLLDARGPALARALAVRGLDPREPRASNGQLRPLGSIGELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0137399_1095429223300012203Vadose Zone SoilLLDSRGPALARALAARGLHPRTPGQQPLNSIAQLKAWLDHGPLLLDGARWFGEGHWFVAVGYDQNGIYTRDSSGWDTRYLTWSRLYGEVGFSGWVVGVAS*
Ga0137380_1078864023300012206Vadose Zone SoilARALAVRGLDPREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0137381_1116207813300012207Vadose Zone SoilPLDSIDAAIGLIGPGTGISTKLGLLDARGTALARALASRGLRPRSPGQRPLGSVAELEAWLDQGPLLMDGAHWFGEGHWFVAIGYDRGGVYTRDSSGWDTRYLSWSRLYGEVGFSGWAVGVAP*
Ga0137379_1081028623300012209Vadose Zone SoilLAARAVRPREPRGADGHLRPLASVAELEAWLDQGPLLDGARWFGEGHWFVGIGYDAGGISIRDSSSRDTRYLNWSRLGSEVGFSSWVVGVAA*
Ga0137379_1087547823300012209Vadose Zone SoilRALAARGLHTRTPGQQPLNSIAQLKAWLDQGPLLLDGAPWFGEGHWFVAIGYDQNGIYTRDSSGWDTRYLTWSRLYGEVGFSGWVVGVTS*
Ga0137379_1094906113300012209Vadose Zone SoilLARGLAARGLQPRTPSQRPLSSIAELEAWLNQGPLLMDGARWFGDGHWFVGIGYDAGGIYIRDSSGGDTRYLSWSRLYGEVGFSGWVVGVST*
Ga0137378_1089293013300012210Vadose Zone SoilQLGSLDDAVALVGLGSGISASLGLLDARGPALARSLAARGLDAREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSGLYGEVGFSGWVVGVAH*
Ga0137372_1004143563300012350Vadose Zone SoilGSLDDAIALVGPNTNISTTLGLLDARGPALARALAARGFRPRTPGERPLASAAALKAWLDQGPLLMDGASWFGAGHWFVGVGYDQNGVYIRDSSGWDTRYLTWSRLYGEVGFSGWVIGVET*
Ga0137386_1007593223300012351Vadose Zone SoilLARGLAARGLQPRTPGRRPLGSIAELQAWLEQGPLLMDGARWFGEGHWFVGVGYDNNGVYIRDSSGWDTRYLTWSRLYGEVGFSGWVVGAT*
Ga0137366_1041618833300012354Vadose Zone SoilRALAARGLEPREARTSTGQLRPLSSVAELEAWLDRGPLLMDGARWFGEGHWFVGIGYDAGGIYIRDSSGWDTRYLSWSRLYGDVGFSGWVVGVAG*
Ga0137369_1012971733300012355Vadose Zone SoilWQTWRAADCSAAALDWFLGAYGQGLASIDDAIALIGPNTGISTSLGLLDARGVRLAAVLNGRGLTARQPLDPAGHPRPLGSASELQQWLDRGPLLMDGSRWFGEGHWFVGIGYDKNGIYVRDSSGWDTRYLTWSGLYGEVGFSGWVVGVA*
Ga0137369_1068969313300012355Vadose Zone SoilAYGQPLASLDAAIALVGPNTNISTSLGLLDARGPALARALAARGFRPRTPSDRPLASTAAVKALLDQGPLLMDGASWFGEGHWFVGVGYDQNGVYIRDSSGWDTRYLTWSRLYGEVGFSGWVVGVAS*
Ga0137384_1020580833300012357Vadose Zone SoilDSIDAAIGLIGPGIGISTKLGLLDARGTALARALASRGLRPRSPGQRPLGSVAELEAWLDQGPLLMDGAHWFGEGHWFVAIGYDRGGVYTRDSSGWDTRYLSWSRLYGEVGFSGWAVGVAP*
Ga0137384_1054735123300012357Vadose Zone SoilLIGPGSGISTKLGLLDARGPALARALAGRGLRPRAPDQRPLGSIAELKGWLDQGPLLMDGARWFGEGHWFVAVGYDNGGVFTRDSSGWDTRYLSWSRLYGEVGFSGWVVGVTP*
Ga0137385_1052457613300012359Vadose Zone SoilASIDDAIALVGPNTKISTTLGLLDARGPALASALSTRGFTPREPQDRDGRLRPLTSIRELQAWLDQGPLLMDGASWFGEGHWFVGVGYDQNGIYIRDSSGWDTRYLTWSRLYGEVGFSGWVVGAG*
Ga0137375_1063004523300012360Vadose Zone SoilGLLDARGTALARALAVRGLEPREPRTSTGQLRPLGSIAELEAWLDQGPLLMDGARWFGEGHWFVGIGYDAGGIYIRDSSGWDTRYLTWSRLYGDVGFSGWVVGVAA*
Ga0137375_1130705813300012360Vadose Zone SoilRALAARGLEPREPRTASGQLRPLGSVAELKAWLDQGPLLMDGARWFGEGHWFVGIGYDASGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVAA*
Ga0137375_1139973213300012360Vadose Zone SoilADCSAAALDWFLGAYGQGLASIDDAIALIGPNTGISTSLGLLDARGVRLAAVLNGRGLTARQPLDPAGHPRPLGSASELQQWLDRGPLLMDGSRWFGEGHWFVGIGYDKNGIYVRDSSGWDTRYLTWSGLYGEVGFSGWVVGVA*
Ga0137390_1049833113300012363Vadose Zone SoilGTALARALAARGLQPRTPGRPLGSITELEAWLNQGPLLMDGGRWFGEGHWFVGIGYDAGGIYIRDSSGWDTRYLSWSRLYGDVGFSGWVVGVAG*
Ga0137390_1103740723300012363Vadose Zone SoilVRRRNVALAQGLAARGLSPRAPQDVNGRPRPLGSIGELEAWLDRGPLLMDGARWFGEGHWFVAIAYDKSGVYIRDSSGWDNRYLSWSRLYGEVGFSGWVVGVAP*
Ga0137373_1099288513300012532Vadose Zone SoilLDWLLGAYGQPLASLDDAIALVGPGTGISTGLGLLDARGTPLARALAARGLRPREPHAASGQLRPLGSIAELQAWLDQGPLLMDGARWFGEGHWFVGIGYDASGIYIRDSSG
Ga0137373_1112362913300012532Vadose Zone SoilYGQPLPYLEDAVALIGPGTGISTSLGLLDARGTALARALTARGLQPRTPGQRPLASVAELQAWLDRGPLLLDGGRWIGQGHWFVAVGYDKNGVYTRDSSGWDTRYLTWSGLYGEVGFSGWVVGVGS*
Ga0137416_1194965213300012927Vadose Zone SoilDAMWLTWRNADCSAAALDWFLGVYGQPLGAIDDAIALIGPNTGISAALGLLDARGPGLAAALARRGLQPRQPKDAAGRVRPLTSISELQAWVDQGPLLMGGDRWFGEGHWFVGIGSDRNGVYLRDSSGWDTRYLSWSRLYGEVGFNGWVVGVGR*
Ga0126369_1137582213300012971Tropical Forest SoilLGPNTGISTTVGLLDARGLALAAALAERGLAPRQPRAGDGRLRSLGNTGELRTWLDRGPLLMDGASWFGEGHWFVGVGYDQNGVYIRDSSGWDTRYLTWGRLYGEVGFSGWVVGVA*
Ga0120179_106884013300013763PermafrostGLLDARGPALARALAGRGLRPRAPDQRPLGSIAELKGWLDQGPLLMDGARWFGEGHWFVAVGYDTGGVFTRDSSGWDTRYLSWSRLYGEVGFSGWVVGVTP*
Ga0120158_1019465013300013772PermafrostLAGRGLRPRAPDQRPLGSIAELKVWLDQGPLLMDGASWFGEGHWFVAIGYDKNGVYIRDSSGWDNRYLSWSRLYGEVGFSGWVVGVTP*
Ga0120109_102729733300014052PermafrostARGPALARAIADRGLRPRAPDQRPLGSIAELKGWLDQGPLLMDGARWFGEGHWFVAVGYDNGGVFTRDSNGWDTRYLSWSRLYGEVGFSGWVVGVTP*
Ga0163163_1195049623300014325Switchgrass RhizosphereIGPGTGISTRLGLLDARGPGLARALAVRGLQPRTPGAQPLRSVAELKMWLDGGPLLMDGARWFGAGHWFVAIGYDGGGIYIRDSSGWDNRYLTWARLYGEVGFSGWVVGVST*
Ga0132258_1381724413300015371Arabidopsis RhizosphereALIGPNTGISAAVGLLDACGPGLAAALAARGLTPRQPRDGGGRLRPLGSTSELQAWLDQGPLLMGGDRWFGEGHWFVGIGYNRNGVYIRDSSGWDTRYLTWSRMYGEVGFHGWVVGVAP*
Ga0132256_10333719823300015372Arabidopsis RhizosphereAIDDAIALIGPNTGISPSLGLLDARGSGLASALARRGLQPRQPRDAAGRLRPVTSPSELEAWLDQGPLLMGGDRWFGEGHWFVGIGYDKNGVYIRDSSGWDTRYLTWSRLYGEVGFDGWVVGVAP*
Ga0132255_10594693613300015374Arabidopsis RhizosphereWQAAACSAAALDWLLGAYGVRLGGIDQAIALIGPNTGISTSLGLLDATGRPLAKAIAASGLNPRNGQVHSIGELESWLDQGPLALDGASWFGEGHWFVATGYDQNGIYIRDSSGWDNRYLTWSRLYGEVGFSGWVAGVRGV*
Ga0132255_10603521723300015374Arabidopsis RhizosphereVGLLDARGPALARALAARGLQPRTPRGPLRSVAELRAWLDAGPLLMDGARWFGQGHWFVEVGYDQNDIYTHDSSGWDTRYLTWSRLYGEVGFSGWVVGVAV*
Ga0182035_1069789213300016341SoilANLDEAIALVGAGTGISTSLGLLDARGTALARALLARGLQPRTPGQRPLGSIAELQSWLNQGPLLMDGARWFGEGHWFVGIGYDSGGIYIRDSSGWDNRYLTWSRLYGEVGFSGWVVGVS
Ga0184619_1043943123300018061Groundwater SedimentNSGISTTLGLLDARGPALARAIAARGLHTRTPGQQPLNSIAQLKAWLDQGPLLLDGARWFGEGHWFVAVGYDQNGIYTRDSSGWDTRYLTWSRLYGEVGFSGWVVGVAS
Ga0184618_1018598623300018071Groundwater SedimentLTWRNVDCSAAALNWLLGAYGQPLPSIEDAIALIGPNTGISTVLGLLDARGPALARALAARGLHTRTPGQQPLNSIAQLKAWLDQGPLLLDGARWFGEGHWFVAVGYDQNGIYTRDSSGWDTRYLTWSRLYGEVGFSGWVVGVAS
Ga0066655_1095036623300018431Grasslands SoilPALARALAARGLQARTPGQRPLGSIAELESWLDQGPLLMDGGRWFGEGHWFVGIGHDAGGIFIRDSSGWDTRYLTWSRPYGELGFSGWVVGVRV
Ga0066662_1248386713300018468Grasslands SoilWLTWRNADCSAAALDWLLGAYGQPLGSLEDAIALVGPGTGISPALGLLDARGPALARALAARGLQPRTPDQRPLGSIAELEAWLDRGPLLMDGARWFGEGHWFVGIGYDAGGIAIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVLA
Ga0066662_1261205313300018468Grasslands SoilGSIDEAIGLIGPNTGIAPGPGLLDARGPALARALTSRGLQPRIPGKNPLSSIAQLKEWLDQGPLLLDGVAWFGKGHWFVAVGYDKNGVYIRDSSGWDNRYLTWARLYGEVGFTGWVVGVA
Ga0213876_1006293813300021384Plant RootsVSSLDDAITLIGPSTGISTSLGLLDERGPALAQALAKEGLHARRPDQKPLGSIAELKAWLDQGPLLMDGARWFGKGHWFVAVGYDQNGIYTRDSSGWDTRYLNWSRLYGEVGFSGWVVGV
Ga0213876_1021284213300021384Plant RootsGLLDARGPALARALQARGLHPRLPGQQPLGSIAELKAWLDQGPLLLDGARWFTVGHWFVAIGYDQNGIYTRDSSGWDTRYLTWSRLYGEVGFSGWVVGVAP
Ga0213853_1082956153300021861WatershedsISTSLGLLDARGPALARTLAMRGLDPREPRASNGQLRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGIYIRDSSGWDTRYLSWARLYGEVGFSGWVVGVAA
Ga0213853_1113820813300021861WatershedsTSLGLLDARGPALARTLAMRGLDPREPRASNGQLRPLGSTAELEVWLNQGPLLMDGARRFGEGHWFVGIGYDAGGVYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVTA
Ga0207684_1082768213300025910Corn, Switchgrass And Miscanthus RhizosphereGQQLGSLDEAVALVGPGSGISASLGLLDARGLALARALAARGLDPRAPRASNGQLRPLGSIAELEAWLNQGPLLMDGACWFGEGHWFVGIGYDAGGVYIRDSSGWDIRYLSWSRLYGEVGFSGWVVGVST
Ga0209166_1044974613300027857Surface SoilNVDCSAAALDWLLGSYGRAVGSIDDAIAVIGPSSGISTTLGLLDERGPALAQALSREGLKPRTPGQRPLGSTRELEAWLDRGPLLMDGARWFGEGHWFVAIGYDKNGVYTRDSSGWDTRYLTWSRLYGEVGFTGWVVGVAQ
Ga0209283_1080589413300027875Vadose Zone SoilLAKIGDHPGHPWIHCHGQGQRGLQPRAPGPRPLGSIAELEAWLNQGPLLMDGARWFGEGHWFVGIGYDAGGVFIRDSSGWGTRYLSWLRLYGEVGFTGWVVGVST
Ga0318493_1075701013300031723SoilHWLLGAYGHTLGSLDAAIALIGPGTGISTTLGLTDARGPALAQALNSQGLRPRTPGARPLGSIAELKAWLDQGPLLMDGARWFGEGHWFVGIGYDQNGLYIRDSSGYDTRYLTWSRLYGEVGFSGWVVGVAPR
Ga0318497_1042455213300031805SoilVWQEWRACSAAALDWLLGAYGVRLGRIDRAIALIGPNTGISTTLGLLDASGAPLAQAIAGTGLVPRHRNVHSIAELERWLDQGPLALDGAAWFGVGHWFVATGYDQNGIFIRDSSGWDTRYLPWSRLYGQVGFDGWVVGVEE
Ga0306923_1173568713300031910SoilACSAATLDWLLEAYGVRLGSIDQAIALIGPDTGISPSLGLLDATGRPLAHALSASGLSSRNAQVHSIGELESWLNQGPLALDGARWFGEGHWFVATGYDQNGIYIRDSSGWDTRYLSWSRLYGEVGFSGWVVGVWA
Ga0310909_1116437713300031947SoilAACSAATLDWLLGAYGVRLASIDQAITLIGPNTGISPSLGLLDATGRPLAHALSASGLSSRNAQVHSIGELESWLNQGPLALDGARWFGEGHWFVATGYDQNGIYVRDSSGWDTRYLSWPRLYGEVGFSGWVVGVSA
Ga0310914_1091661233300033289SoilWGCWTNAAWPWPHALSTEALHPRTPGARPLGSIAELKTWLDQGPLLMDGARWFGKGHWFVAVGYDQQGVYTRDSSGWDTRYLDWSRLYGEVGFSGWVVGVSP
Ga0372943_0411382_2_4183300034268SoilCSAAALDWLLGAYGQPIASIEDAIALIGPNTGISTTLGLLDARGPALARALGARGLHTRTPGQQPLTSIAQLKAWLDQGPLLLDGARWFGEGHWFVAVGYDQNGIYTRDSSGWDTRYLTWSRLYGEVGFSGWVVGVAP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice