NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F090052

Metagenome / Metatranscriptome Family F090052

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F090052
Family Type Metagenome / Metatranscriptome
Number of Sequences 108
Average Sequence Length 125 residues
Representative Sequence MRQLLLVLLGRMRRGAPHVLGLGLPLMSLALVVVGCASPSLMPETQAVAIKDRDRALASHADAIHAAISQSGNVGALAFLDAKDGRLVVLPGDSSADAWSRYTSSPESGTGRVSVPPVLTFVHRA
Number of Associated Samples 98
Number of Associated Scaffolds 108

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 92.16 %
% of genes near scaffold ends (potentially truncated) 94.44 %
% of genes from short scaffolds (< 2000 bps) 89.81 %
Associated GOLD sequencing projects 95
AlphaFold2 3D model prediction Yes
3D model pTM-score0.55

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (90.741 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(11.111 % of family members)
Environment Ontology (ENVO) Unclassified
(37.963 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(50.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 43.79%    β-sheet: 10.46%    Coil/Unstructured: 45.75%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.55
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 108 Family Scaffolds
PF06472ABC_membrane_2 31.48
PF01061ABC2_membrane 0.93
PF08666SAF 0.93
PF00589Phage_integrase 0.93
PF00005ABC_tran 0.93
PF13011LZ_Tnp_IS481 0.93



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms90.74 %
UnclassifiedrootN/A9.26 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2228664021|ICCgaii200_c0297552All Organisms → cellular organisms → Bacteria533Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101288658All Organisms → cellular organisms → Bacteria634Open in IMG/M
3300000443|F12B_13441160All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium509Open in IMG/M
3300002558|JGI25385J37094_10099848All Organisms → cellular organisms → Bacteria862Open in IMG/M
3300004114|Ga0062593_101811614All Organisms → cellular organisms → Bacteria671Open in IMG/M
3300004463|Ga0063356_101212362All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1097Open in IMG/M
3300004479|Ga0062595_102201008All Organisms → cellular organisms → Bacteria540Open in IMG/M
3300004480|Ga0062592_100313756All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1195Open in IMG/M
3300005093|Ga0062594_100270288All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1264Open in IMG/M
3300005294|Ga0065705_11110022All Organisms → cellular organisms → Bacteria520Open in IMG/M
3300005330|Ga0070690_101508839All Organisms → cellular organisms → Bacteria543Open in IMG/M
3300005332|Ga0066388_108709012All Organisms → cellular organisms → Bacteria504Open in IMG/M
3300005336|Ga0070680_101837080All Organisms → cellular organisms → Bacteria525Open in IMG/M
3300005354|Ga0070675_101636522All Organisms → cellular organisms → Bacteria594Open in IMG/M
3300005444|Ga0070694_101028246All Organisms → cellular organisms → Bacteria685Open in IMG/M
3300005445|Ga0070708_100538665All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1101Open in IMG/M
3300005471|Ga0070698_100019523All Organisms → cellular organisms → Bacteria7113Open in IMG/M
3300005471|Ga0070698_100930165All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium815Open in IMG/M
3300005518|Ga0070699_102141913All Organisms → cellular organisms → Bacteria510Open in IMG/M
3300005536|Ga0070697_101026114Not Available733Open in IMG/M
3300005549|Ga0070704_101826937All Organisms → cellular organisms → Bacteria563Open in IMG/M
3300005617|Ga0068859_100568686All Organisms → cellular organisms → Bacteria1227Open in IMG/M
3300005617|Ga0068859_102968822All Organisms → cellular organisms → Bacteria519Open in IMG/M
3300005719|Ga0068861_100568578All Organisms → cellular organisms → Bacteria1036Open in IMG/M
3300005841|Ga0068863_102674255All Organisms → cellular organisms → Bacteria508Open in IMG/M
3300005842|Ga0068858_100657055All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1019Open in IMG/M
3300005842|Ga0068858_102127894All Organisms → cellular organisms → Bacteria555Open in IMG/M
3300005844|Ga0068862_102266811All Organisms → cellular organisms → Bacteria555Open in IMG/M
3300006806|Ga0079220_11089057All Organisms → cellular organisms → Bacteria645Open in IMG/M
3300006904|Ga0075424_100071862All Organisms → cellular organisms → Bacteria → Proteobacteria3623Open in IMG/M
3300006914|Ga0075436_101081487All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium603Open in IMG/M
3300009087|Ga0105107_10507180All Organisms → cellular organisms → Bacteria840Open in IMG/M
3300009089|Ga0099828_11842052All Organisms → cellular organisms → Bacteria531Open in IMG/M
3300009098|Ga0105245_11562874All Organisms → cellular organisms → Bacteria711Open in IMG/M
3300009101|Ga0105247_11666950All Organisms → cellular organisms → Bacteria526Open in IMG/M
3300009157|Ga0105092_10369445All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium814Open in IMG/M
3300009166|Ga0105100_10627240All Organisms → cellular organisms → Bacteria660Open in IMG/M
3300009174|Ga0105241_10729338All Organisms → cellular organisms → Bacteria907Open in IMG/M
3300009815|Ga0105070_1116755All Organisms → cellular organisms → Bacteria535Open in IMG/M
3300010047|Ga0126382_11069276All Organisms → cellular organisms → Bacteria713Open in IMG/M
3300010399|Ga0134127_10599259All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1130Open in IMG/M
3300011119|Ga0105246_10441713All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1091Open in IMG/M
3300011271|Ga0137393_10279732All Organisms → cellular organisms → Bacteria1419Open in IMG/M
3300011428|Ga0137456_1230840All Organisms → cellular organisms → Bacteria503Open in IMG/M
3300011442|Ga0137437_1272500All Organisms → cellular organisms → Bacteria581Open in IMG/M
3300012202|Ga0137363_10144496All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1863Open in IMG/M
3300012362|Ga0137361_10343914All Organisms → cellular organisms → Bacteria1368Open in IMG/M
3300012362|Ga0137361_10641361Not Available972Open in IMG/M
3300012363|Ga0137390_10497817All Organisms → cellular organisms → Bacteria1193Open in IMG/M
3300012923|Ga0137359_11211625All Organisms → cellular organisms → Bacteria643Open in IMG/M
3300012929|Ga0137404_10270809All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1464Open in IMG/M
3300012931|Ga0153915_12342413All Organisms → cellular organisms → Bacteria625Open in IMG/M
3300012931|Ga0153915_12698345All Organisms → cellular organisms → Bacteria581Open in IMG/M
3300012986|Ga0164304_11140847All Organisms → cellular organisms → Bacteria626Open in IMG/M
3300013297|Ga0157378_10994771All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium873Open in IMG/M
3300014326|Ga0157380_10135495All Organisms → cellular organisms → Bacteria2107Open in IMG/M
3300015241|Ga0137418_10013048All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria7678Open in IMG/M
3300015373|Ga0132257_101124719All Organisms → cellular organisms → Bacteria992Open in IMG/M
3300017997|Ga0184610_1096146All Organisms → cellular organisms → Bacteria935Open in IMG/M
3300018028|Ga0184608_10175346All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium933Open in IMG/M
3300018056|Ga0184623_10095143All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1375Open in IMG/M
3300018075|Ga0184632_10192552All Organisms → cellular organisms → Bacteria898Open in IMG/M
3300018076|Ga0184609_10200402All Organisms → cellular organisms → Bacteria929Open in IMG/M
3300018076|Ga0184609_10238225All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium850Open in IMG/M
3300018079|Ga0184627_10378258All Organisms → cellular organisms → Bacteria739Open in IMG/M
3300018429|Ga0190272_10631195All Organisms → cellular organisms → Bacteria947Open in IMG/M
3300018429|Ga0190272_11214137All Organisms → cellular organisms → Bacteria742Open in IMG/M
3300018429|Ga0190272_13113796All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300018469|Ga0190270_13270285All Organisms → cellular organisms → Bacteria514Open in IMG/M
3300019869|Ga0193705_1075520All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium658Open in IMG/M
3300019885|Ga0193747_1117917All Organisms → cellular organisms → Bacteria631Open in IMG/M
3300021073|Ga0210378_10169957All Organisms → cellular organisms → Bacteria838Open in IMG/M
3300021078|Ga0210381_10343132Not Available545Open in IMG/M
3300022694|Ga0222623_10142202All Organisms → cellular organisms → Bacteria934Open in IMG/M
3300023071|Ga0247752_1008114All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1414Open in IMG/M
3300025885|Ga0207653_10134126All Organisms → cellular organisms → Bacteria901Open in IMG/M
3300025910|Ga0207684_10348490All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1275Open in IMG/M
3300025910|Ga0207684_10573028All Organisms → cellular organisms → Bacteria965Open in IMG/M
3300025922|Ga0207646_10800490All Organisms → cellular organisms → Bacteria839Open in IMG/M
3300025942|Ga0207689_10056185All Organisms → cellular organisms → Bacteria3239Open in IMG/M
3300025961|Ga0207712_10571057All Organisms → cellular organisms → Bacteria975Open in IMG/M
3300026041|Ga0207639_11018599All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium776Open in IMG/M
3300026118|Ga0207675_102067103All Organisms → cellular organisms → Bacteria586Open in IMG/M
3300026285|Ga0209438_1081243All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1038Open in IMG/M
3300026285|Ga0209438_1225685All Organisms → cellular organisms → Bacteria506Open in IMG/M
3300026354|Ga0257180_1048567All Organisms → cellular organisms → Bacteria600Open in IMG/M
3300026480|Ga0257177_1059980All Organisms → cellular organisms → Bacteria599Open in IMG/M
3300026514|Ga0257168_1083427All Organisms → cellular organisms → Bacteria708Open in IMG/M
3300027490|Ga0209899_1023444All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1372Open in IMG/M
3300027655|Ga0209388_1058679All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1110Open in IMG/M
3300027671|Ga0209588_1210358All Organisms → cellular organisms → Bacteria604Open in IMG/M
3300027765|Ga0209073_10104138All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1005Open in IMG/M
3300027910|Ga0209583_10399752Not Available654Open in IMG/M
3300027915|Ga0209069_10660944All Organisms → cellular organisms → Bacteria609Open in IMG/M
3300028381|Ga0268264_10382956All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1347Open in IMG/M
3300028819|Ga0307296_10381100All Organisms → cellular organisms → Bacteria770Open in IMG/M
(restricted) 3300031197|Ga0255310_10020458All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1702Open in IMG/M
3300033433|Ga0326726_11616242All Organisms → cellular organisms → Bacteria631Open in IMG/M
3300033480|Ga0316620_11068611All Organisms → cellular organisms → Bacteria787Open in IMG/M
3300033486|Ga0316624_11646024All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium592Open in IMG/M
3300033502|Ga0326731_1163651All Organisms → cellular organisms → Bacteria509Open in IMG/M
3300034354|Ga0364943_0352272All Organisms → cellular organisms → Bacteria563Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil11.11%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere10.19%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil8.33%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere8.33%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment6.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil3.70%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment2.78%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment2.78%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.78%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.78%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere2.78%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands1.85%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil1.85%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.85%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.85%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.85%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.85%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.85%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.85%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil1.85%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.85%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.85%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.93%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.93%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.93%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.93%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.93%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.93%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.93%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.93%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.93%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.93%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.93%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.93%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.93%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.93%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.93%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.93%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.93%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.93%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2228664021Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000443Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.2B clc assemlyEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005330Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaGEnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005354Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaGHost-AssociatedOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300005841Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2Host-AssociatedOpen in IMG/M
3300005842Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2Host-AssociatedOpen in IMG/M
3300005844Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2Host-AssociatedOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300009087Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm September2015EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009101Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaGHost-AssociatedOpen in IMG/M
3300009157Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009166Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm May2015EnvironmentalOpen in IMG/M
3300009174Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaGHost-AssociatedOpen in IMG/M
3300009177Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaGHost-AssociatedOpen in IMG/M
3300009815Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011428Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT615_2EnvironmentalOpen in IMG/M
3300011442Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT138_2EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300019869Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m2EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m2EnvironmentalOpen in IMG/M
3300020065Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT499_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300023071Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S019-104C-5EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026041Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026354Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300027490Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033480Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_BEnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300033502Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF9FY SIP fractionEnvironmentalOpen in IMG/M
3300034354Sediment microbial communities from East River floodplain, Colorado, United States - 23_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
ICCgaii200_029755212228664021SoilMRQLLLGRVRRGVSHALRSGLLVLSLALLIVGCASQTLMPKEQAVAIDERDRALVSHGAAIQAAIRQSGTIGALAFLDAKDGTLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFVYRADVLKAPXSV
INPhiseqgaiiFebDRAFT_10128865813300000364SoilMRQPLLVLLGRTRRRVLHVLGSCLLLVGLPLLVVGCAGHSLVPAAQSEAIRGRERALAPHADAIQAAIRQSGNIGGLAFLDASDGGLVVLPGDSPADAWGRYASSPESGTGRVSVPVVVTFVHRADVPKAPEPVTRS
F12B_1344116013300000443SoilMRQLLFVLLGRIRRGAPQVLGSGLPLMSLALALVVGCASPSLMPQAQRAAIKDRDRALAPHADAIHASITQSGTLGALAFLDARDGRLVVLPGDSPADSWARHASSSQA
JGI25385J37094_1009984813300002558Grasslands SoilMMRQLLLVLLGRLRRGVPHVLSSGLPLVGLSLVVVGCASHSLMTAAQSEAIRDRERALASHADAIQAAVRQSGNVGALAFLDAKDGHLVVLPGDSPADAWARYAASPESGTSPVSVPVVVTFVYR
Ga0062593_10181161413300004114SoilMRQLLLGSLRRGVPHTLRSGLLLLSLALLAVGCASQTLMPKEQAVAIDIRDRALVPHAAAIQAAIRQSGNVGALAFLDAVDGNLIVLPGDSPADAWARYATSPESGTGRVSVPAVVTFVYRADVPKAPET
Ga0063356_10121236223300004463Arabidopsis Thaliana RhizosphereMMRQLLLLLLGRIWRGAPHMLSSGLPVVGLALVVVGCATPSLMPPAQAVAVRDRDRALAPHAAAIHAAIDRSGNVGALAFLDAKDGRLVVLPGDNPSDAWSRHARSPESGTAPVSVPPVLTFVHRADVPKAPETVTRSALQQQQAVAAL
Ga0062595_10220100813300004479SoilMRQLLLGRVRRGVSHALRSGLLVLSLALLIVGCASQTLMPKEQAVAIDERDRALVSHGAAIQAAIRQSGTIGALAFLDAKDGTLVVLPGDSPA
Ga0062592_10031375613300004480SoilMRQLPLALLERIRREAPHVWGSALLLMSFAWGVVGCASPSLMPETQAGAIKNRDRALAPHAAAIHAAIGQSGKAGALAFLDARDGRLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRA
Ga0062594_10027028823300005093SoilMRQLPLALLERIRREAPHVWGSALLLMSFAWGVVGCASPSLMPETQAGAIKNRDRALAPHAAAIHAAIGQSGKAGALAFLDARDGRLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHR
Ga0065705_1111002213300005294Switchgrass RhizosphereMRQLLLALFGRIGRGVPNVLGSGLPLMSLALVVVGCASPSLMPDAQAVAIKHRERALASHADAIHAAISQSGQVGALALLDAKDGRLVVLPGDSPADAWVRYTASPESATGRVSVPPVLTFVHRADVPKAPETVTQSGLQQQ
Ga0070690_10150883913300005330Switchgrass RhizosphereMRQLLLGSLRRGVPHTLRSGLLLLSLALLAVGCASQTLMPKEQAVAIDIRDRALVPHAAAIQAAIRQSGNVGALAFLDAVDGNLIVLPGDSPADAWARYATSPESGTGRVSVP
Ga0066388_10870901213300005332Tropical Forest SoilMRQRLLVLLRRTPRRVLHALGSCLVLVGLPLVVVGCAGHSLMPAEQSAAIRDRERTLAPHADEIQAAIRQSGNIGGLAFLDASDGHLVVLPGDSPAEAWGRYAASTDSATGRASVPVVVTFVHRADVPKAPETVTRSALQQLEGTRKSLAAMETD
Ga0070680_10183708013300005336Corn RhizosphereMCQLPLGRIRRGAPHTLRSGLPLLSLVMLVVGCASHTLMPKEQAVAIGDRDRALVSHADAIQAAIRQSGNIGALAFLDAKDGRLVVLPGDSPADAWARHA
Ga0070675_10163652223300005354Miscanthus RhizosphereMRQLPLGRIGRGAPHTLRSGLALLGLVLLVVGCASHTLMPKEQAGAIDNRDRALVSHADAIQAAIRPSGSLGGLAFLDAKDSRLVVLPGDSPADAWARYATSPESGAGR
Ga0070694_10102824613300005444Corn, Switchgrass And Miscanthus RhizosphereMRQLLLALFGRIGRGVPNVLGSGLPLMSLALVVVGCASPSLMPEAQAVAIKHRERALASHADAIHAAISQSGQVGALALLDAKDGRLVVLPGDSPADAWARYTASPESGTGRVSVPPVLTFVHRADVPKAPETVT
Ga0070708_10053866513300005445Corn, Switchgrass And Miscanthus RhizosphereMLRLLHVLLGRMRRGAPHGLGSGVPLVSLALVVVGCASPSLMPEAQAVAIKNRDRALASRADAIHAAISQSGHVGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGGVSVP
Ga0070698_10001952313300005471Corn, Switchgrass And Miscanthus RhizosphereMRQLLLVLLERIRRGAPHVLGSGLALMSLGVVVVGCASHSLMPETQAGAIKDRDRALASHAAAIHAAIGQSGNVGALAFLDAKDGRLIVLPGDSSADAWSRYIASPDGETGRVSVPPVLTFVHRADVPKAPETVTR
Ga0070698_10093016523300005471Corn, Switchgrass And Miscanthus RhizosphereMLQLLHVLLGRMRRGALHGLGSGVPLVSLALVVVGCASPSLMPEAQAVAIKNRDRALASRADAIHAAISQSGHVGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGGVSVP
Ga0070699_10214191313300005518Corn, Switchgrass And Miscanthus RhizosphereMRQLLLVLLGRIRRGAPHMLGSGLPLMSLAVVVVGCASHSLMPEAQAVAIKDRDRALASHADAIHTAISQSGKVGALAFLDARDGRLVVLPGDSPADAWAGYTTSPESGTGRVSVPPVLTFVHR
Ga0070697_10102611413300005536Corn, Switchgrass And Miscanthus RhizosphereMLQLLLVLLGRMRRGAPHVRGSGLPLMSLAVVVVGCASPSLLPEAQTVAIQDRDRALALHADAIHAAISQSAKVGALVFLDAKDSHLVVLPGESPADAWARY
Ga0070704_10182693723300005549Corn, Switchgrass And Miscanthus RhizosphereMCQLPLGRIRRGAPHTLRSGLPLLSLVMLVVGCASHTLMPKEQAVAIGDRDRALVSHADAIQAAIRQSGNIGALAFLDAKDGRLVVLPGDSPAD
Ga0068859_10056868623300005617Switchgrass RhizosphereMRQLLLGRIRRGAPHTLRSGLPLLSLVLLVVGCASHTLMPKEQAVAIDERDRALVAHAAAIQAAIRQSGNIGALVFLDAKDRRLVVLPGDSPADAWVRHATSPESGAGRISVPAVVTFVY
Ga0068859_10296882223300005617Switchgrass RhizosphereMRQLLLGCPRRRVPHARRSGILLLSLALLAVGCASHTMMPKEQAVAIDERDRALVSHAAAIQASIRQSGSMGALAFLDGKDGSLVVLPGDSPADAWARHATAPESQT
Ga0068861_10056857813300005719Switchgrass RhizosphereMCQLPLGRIRRGAPHTLRSGLPLLGLVLLVVGCASHTLMPKEQAGAIDNRDRALVSHADAIQAAIRPSGSLGGLAFLDAKDSRLVVLPGDSPAD
Ga0068863_10267425523300005841Switchgrass RhizosphereMRQLLLGRVRRGVSHALRSGLLVLSLALLIVGCASQTLMPKEQAVAIDERDRALVSHGAAIQAAIRQSGTIGALAFLDAKDGTLVVLPGDSPADAWARYATSPESG
Ga0068858_10065705523300005842Switchgrass RhizosphereMCQLPLGRIRRGAPHTLRSGLPLLSLVMLVVGCASHTLMPKEQAVAIGDRDRALVSHADAIQAAIRQSGNIGALAFLDAKDGRLVVLPGDSPADAWARHATSPETGAGRISMPAVVTFVYRADVPKAPETVTQHLLEQQQAFRRS
Ga0068858_10212789413300005842Switchgrass RhizosphereMRQLLLGSLRRGVPHTLRSGLLLLSLALLAVGCASQTLMPKEQAVAIDIRDRALVPHAAAIQAAIRQSGNVGALAFLDAVEGNLIVLPGDSPADAWARYATSPESGTGRVSVPAVVTFVYRADVPKAPETV
Ga0068862_10226681113300005844Switchgrass RhizosphereMRQLPLGRIRRGAPHTLRSGLPLLSLVLLVVGCASHTLMPKEQAGAIDNRDRALVSHADAIQAAIRQSGNIGALTFLDAKDSRLVVLPGDSPADAWARYATSPESGAGRISVPAVVTFVYRADVPKAPETV
Ga0079220_1108905713300006806Agricultural SoilMLQLLHVLLGRMRRGARHGLGSGVPLVSLALVVVGCASPSLMPAAQAVAIKDRDRALAARADAIHAAIGQSGHEGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGGVSVPPVLTFVHRADVAKAPETVTRSV
Ga0075424_10007186213300006904Populus RhizosphereMRQLLPVLLGRIRRGAPHVLGSGLPLMSLALLAVGCAGPSLMPRAQAAAIKDRDRALALHADAIQTATRQSGNVGALAFLDAKDGRLVVLPGDTPADAWARYTASPESRTGRVSVPVV
Ga0075436_10108148713300006914Populus RhizosphereMRQLLPVLLGRIRRGAPHVLGSGLPLMSLALLAVGCAGPSLMPRAQAVAIKDRDRALALHADAIQTATRQSGNVGALAFLDAKDGRLVVLPGDTPADAWARYTASPES
Ga0105107_1050718013300009087Freshwater SedimentMRQLRLVLLERIRRGAPRVLGSALLLVSLAWVVVGCASTPLIPETQAGAIKGRERALAPHAAAIHAAIGQSGNVGALAFLDAKDGRLVVLPGDTPADAWSRYTTSPEGETGRGSVPPVLTFVH
Ga0099828_1184205213300009089Vadose Zone SoilMLQLLLVLLGRMRRGAPHVLGSGLPLMSLAVVVVGCASPSLLPEAQAVAIQDRDRALALHADAIHAAISQSAKVGALVFLDAKDGQLVVLPGESPADAWARYTMLPENGTGRVSMPPVLTFVHRADVPKAPETIPR
Ga0105245_1156287413300009098Miscanthus RhizosphereMRQLPLALLERIRREAPHVWGSALLLMSFAWGVVGCASPSLMPETQAGAIKNRDRALAPHAAAIHAAIGQSGKAGALAFLDARDGRLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTRVVLQQQAQLGD
Ga0105247_1166695013300009101Switchgrass RhizosphereMPQLPLGSIRRGAPHTLGSGLLLLSLALLAVGCASHTLMPKEQAVAIDSRDRALVPHAATIQAVIRQSGTIGALAFLDAKDGALIALPGDSPADAWARYATSPESGTGRVSVP
Ga0105092_1036944523300009157Freshwater SedimentMSLQEPEESTMRQLLLGRLRRGVPHTLRSGVLLILTFLAVGCAGHTLMPKEQTGAIDERDRALASHAAAIQAAIRQSGNMGALAFLDARDGRLVVLPGDSPADAWARYATSPESGADRISVPAVVTFVYRADVPKAPETV
Ga0105100_1062724013300009166Freshwater SedimentMRQLRLVLLERIRRGAPHAVGSALLLMSLAWVVVGCASTSLMPETQAGAIKDRDRALAPHAAAIHAAIGQSGNAGALAFLDAKDGRLVVLPGDTPADAWSRYTTSPEGETGR
Ga0105241_1072933813300009174Corn RhizosphereMRQLLLGRVRRGVSHALRSGLLVLSLALLIVGCASQTLMPKEQAVAIDERDRALVSHGAAIQAAIRQSGTIGALAFLDAKDGTLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFV
Ga0105248_1067138423300009177Switchgrass RhizosphereMGLALMVVGCASASLMPPGQAQAIKDRERALAVHTDAIQTAISQSGQVGALAFLDAGDSHLVVLPGSSPADAWARFAASPEGGTGRGSVPPVLTFVHRADMPKAPEAVTLSDLEQQQALR
Ga0105070_111675513300009815Groundwater SandMSLALVVIGCTNPSLMPETQAVAIKDRDRALASHADAIHAAIRQSGNAGALAFLDAKDGRLVVLPGDSSADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPEPVTRSVLQQQQALAALGTE
Ga0126382_1106927613300010047Tropical Forest SoilMLGGTRRRVLHVLGSCLVLGGLPLLVVGCAGHSLVPAEQSAAIRDQERALAPHADAIQAAIRQSGNIGGLAFLDASDGRLVVLPGDSPVEAWGQYASSPKSETGRVSVPMVVTFV
Ga0134127_1059925913300010399Terrestrial SoilMLSSGLPVVGLALVVVGCATPSLMPPAQAVAVRDRDRALAPHAAAIHAAIDRSGNVGALAFLDAKDGRLVVLPGDNPSDAWSRHARSPESGTAPVSVPPVLTFVHRADVPKAPETVTRSALQQQQ
Ga0134122_1176093813300010400Terrestrial SoilMSLALLVVGCASPSLMPEAQALAIKHRDRALASHTAAIHAAISQSGHVGALALLDAKDGRLIVLPGDSPADAWSRYIASPEGGAGRVSVPPVLTFVHRADVPKAPETVTGSALQQQAQLRDAY
Ga0105246_1044171323300011119Miscanthus RhizosphereMRQLPLALLERIRREAPRVWGSALLLMSLAWGVVGCASPSLMPETQAGAIKNRDRALAPHAAAIHAAIGQSGKAGALAFLDARDGRLIVLPGDSPAEAWSRY
Ga0137393_1027973213300011271Vadose Zone SoilMSLAVVVVGCASPSLLPEAQAVAIQDRDRALAFHADAIHAAISQSGKVGALAFLDPKDSHLVVLPGDSPADAWARYIMLPESGTGRVSVPPVLTFVHRADVPKAPETIPRSVLQQQAHLRDAW
Ga0137456_123084013300011428SoilMGSALLLMSLAWVVAGCASTSLMPETQAGAIKDRDRALAPHAGAIHAAIGLSGNAGALAFLDAKDGRLIVLPGDSPADAWLRYLASPEGETGRGSVPPVLTFVHRADVPKAPETVTRGALQQQAQLRDGYRRF
Ga0137437_127250013300011442SoilMMSLQEPEESTMRQLLLGSIRRGAPHTLRSALLLLSLALLVVGCASHTLMPKEQTVAIETRDRALVSHAAAIQAAIRQSGHVGALAFLAPVDGRVIVLPGDSPADAWARYAASPESAAGAISVP
Ga0137363_1014449623300012202Vadose Zone SoilMSLAVVVVGCASPSLLPEAQAVAIQDRDRALALHADAIHAAISQSAKVGALVFLDAKDSHLVVLPGESPADAWARYTMLPENGTGRVSMPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWR
Ga0137361_1034391413300012362Vadose Zone SoilMSLAVVVVGCASPSLLPDVQAVAIQDRDRALALHADAIHAAISQSGKMGALVFLDATDSHLVVLPGESPADAWARYRRVSMPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEE
Ga0137361_1064136113300012362Vadose Zone SoilMSLAVVVVGCASPSLLPEAQAVAIQDRDRALALHADAIHAAISQSAKVGALVFLDAKDSHLVVLPGESPADAWARYTMLPENGTGRVSVPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEER
Ga0137390_1049781733300012363Vadose Zone SoilMSLALVVVGCASPSLMPETQAVAIKDRDRALASHAGAIHAAISQSGNVGALAFLDAKDGRLVVLPGDSSADAWSRYTSSPESGTGRVSVPPVLTFVHRADVPKAPEAITRSLLQQQQALSALETELRDAQRR
Ga0137359_1121162513300012923Vadose Zone SoilMSLAVVVVGCASPSLLPDVQAVAIQDRDRALALHADAIHAAISQSGKMGALVFLDATDSHLVVLPGESPADAWARYRRVSMPPVLTFVHRADVPKAPETIPRSVPQQQAQLRDAWRRIEERLSIVQSELAESKREADASLT
Ga0137404_1027080913300012929Vadose Zone SoilMRQLLLGLLGRIRRETPAVLGSGLPLMGLALVVVGCASPSLMPEAQAVAIKDRDRALAAHADAIHAAIRQSGNVGALAFLDAKDGRLVVLPGDSPADAWSRYTTSPESGSGRVSAPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDTWRRI
Ga0137407_1005508143300012930Vadose Zone SoilVGLALVVVGCASASLMPPGQAQAIKDRERALAVHADAIQTAISQSGQVGALAFLDTGDGHLVVLPGSSPADAWARFATSPESGTGRGSVPLVLTFVHRADMPKAPEAVTLTDLEQQQALR
Ga0153915_1234241313300012931Freshwater WetlandsMLQLLHVLFGRIRRGAPHGLGSGLPLMSLALVVVGCASPSLMPEAQVVAIKDRDRALASRADAIHAAISQSGHLGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGGVSVPPVLTFVHRADVAKAPETVTRGVLQQQEALRTSLAA
Ga0153915_1269834513300012931Freshwater WetlandsMLQLLHVLLGRMRRGAPHGLGSGVLLMSLALVAVGCASPSLMPEAQAVAIKDRDRALASRADAIHAAIGQSGHGGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGRVSVPPVLTFVHRAD
Ga0164304_1114084713300012986SoilMRQLLLVLLERIRRGAPHALGSGLALMSLAVVVVGCASHSLMPETQAGAIKDRDRALASHAAAIHAAIGQSGNVGALAFLDTKDGRLIVLPGDSSADAWSRYIASPEGETGRVSVPPVLTFVH
Ga0157378_1099477113300013297Miscanthus RhizosphereMGLALMVVGCASASLMPPGQAQAIKDRERALAVHTDAIQTAISQSGQVGALAFLDAGDSHLVVLPGSSPADAWARVAASPEGGTGRGSVPPVLTFVHRADVPKAPEAVTLTDLEQQQALRGS
Ga0157380_1013549513300014326Switchgrass RhizosphereMSLQEPEESAMRQLLLGRVRRGVSHALRSGLLVLSLALLIVGCASQTLMPKEQAVAIDERDRALVSHGAAIQAAIRQSGTIGALAFLDAKDGTLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFVYRADVLKAPELVTQNL
Ga0137418_1001304893300015241Vadose Zone SoilMRQLLLALLGRIRRGRRHVLGSGLSLIGLSLVLAGCASHSLMPAAQSAAIRERERALASHADAIQSTVRPSGKVGGLAFLDAKDGHLVVLPGDSPADAWAQYATSPESETSPVSMPP
Ga0132257_10112471913300015373Arabidopsis RhizosphereMRQRLLVLLGRTRRRVRHGLGSLLPLVGLPLLVVGCAGHSLVPAEQSAAITERERALAPHADAIQAAIRQSGNTGGLAFLDANDGHLVVLPGDTPAEAWGRYASSADSGTGQGSVPLVVTFVHRADVPKAPEAITRSALQQLEATRKSM
Ga0184610_109614613300017997Groundwater SedimentMRQLLLVLLGRIRRGVPHVVASGLPLMSLALVVVGCASPSLMPETQAVAIKDRARALASHADAIHAAIRQSGNVGALAFLDAKDGRLVVLPGDSSADAWSRYTSSPESGTGRVSVPPVLT
Ga0184608_1017534623300018028Groundwater SedimentMRHLLLVLLGRIGRGAPHVLGSGLALMSLALAVAGCASPSLMPEAQAVAIKDRDRALASHANAIHAAISQSGNVGALVFLDATDSHLVVLPGDSPVDAWSRYTSSPESGTGRASVPP
Ga0184623_1009514313300018056Groundwater SedimentMRQLLLVLLGRIRRGVPHVVASGLPLMSLALVVVGCASPSLMPETQAVAIKDRARALASHADAIHAAIRQSGNVGALAFLDAKDGRLVVLPGDSSADAWSRYTSSPESGTGRVSVPPVLTFVHRADVPKAPETITRS
Ga0184632_1019255213300018075Groundwater SedimentMRQLLLVLLGRIRRGAPHVVGSGLPLMSLALVVVGCASHSLMPETQAVAIKDRARALASHADAIHAAIRQSGNAGALAFLDAKDGRLVVLPGDSSADAWSRYTTSPESGTGRVSVPP
Ga0184609_1020040213300018076Groundwater SedimentMRQRLPVLLGRIRRGAPHVLGSGLPLMSLALLAVGCASPSLMPKAQAVAIKDRDRALALHADAIQTATRESGNLGALAFLDAKDGRLVVLPGDTPADAWARYATSPESGTGRVSVPAVVVFVHRADVPK
Ga0184609_1023822523300018076Groundwater SedimentMRQLLFVLLGRIGRGAPHVLGSGLPLMSLALVLVGCASPSLMPETQAVAIKDRDRALASHAGAIHAAIRQSGNAGALAFLDAKDGRLVVLPGDSSADAWSRY
Ga0184627_1037825813300018079Groundwater SedimentMRQLLLVLLGRMRRGAPHVLGLGLPLMSLALVVVGCASPSLMPETQAVAIKDRDRALASHADAIHAAISQSGNVGALAFLDAKDGRLVVLPGDSSADAWSRYTSSPESGTGRVSVPPVLTFVHRA
Ga0190272_1063119513300018429SoilMRQLLLVLPGRIRRGAPHVLGLALLLTSLALVVVGCASHSLIPVTQAGAIKDRDRALAPHAAAIHAAIGQSGNAGALAFLDAKDGRLVVLPGDTPADAWSRHTASPESATGRASAPPVLTFVHRADIPKAPETVT
Ga0190272_1121413713300018429SoilMRQLLLVLLGRIGRRVPHVLGSGLPLMSLALVVVGCASPSLMPEAQAVAIKHRDRALASHADAIHAAISQSGHVGALALLDAKDGRLVVLPGDSPADAWARYSASPESGAGRVSVPPVLTFVHRADVP
Ga0190272_1311379613300018429SoilMRQLLLVPVEPIRRGAPHVLGSALLLMSLAWVVVGCASTSLMPETQAGAIKDRDRALAPHAAAIHAAIGQSGNVGALAFLDAKDGRLIVLPGDSPADAWSR
Ga0190270_1327028513300018469SoilMRQLLLVLLGCPRREAPHVLGPGLSLVSLALVVVGCASPSLMPESQAAAIKNRERALAPHASAIHTAIAQSGNAGALVFLDAEDGRLVILPGDSPADAWTRHTAPPESGTGRVSAPPVLTYVHRADIPKAPE
Ga0193705_107552013300019869SoilMRQLLLALFGRIGRGVPNVLGSGLPLMSLALVVVGCASPSLMPEAQAVAIKHRERALASHADAIHAAISQSGQVGALALLDAKDGRLVVLPGDSPADAWAR
Ga0193747_111791713300019885SoilMRQLLLVLLGRIGRRVPHVLGSGLPLMSLALVVVGCASPSLMPEAQAVAIKHRDRALASHADAIHAAISQSGHVGSLALLDAKDGRLVVLPGDSPADAWARYTASPESGAGRVSVPPVLTFVHRADVPKAPETVTQSVLQQQAQLLDAHRRVAE
Ga0180113_130977813300020065Groundwater SedimentMRQLLCLMGLALVVAGCASPSLMPETQAVAVKERDRALAPHADAIHGAIRQSGHAGALAYLDAADGRLIVLPGDSPADAWGRYTASPESGTGLASVPPVLTFVHRADVPKAPDVVTRSALLQQQALRAAVATLESELRDAHRRIEG
Ga0210378_1016995713300021073Groundwater SedimentMRHLLLVLLGRIRRGAPHVLGSGLPLMSLALAVAGCASPSLMPEAQAVAIKDRDRALASHANAIHAAISQSDKVGALVFLDATDGRLVVLPGDSPVDAWSRYTSSPESGTGRASVPPVLTFVHP
Ga0210381_1034313213300021078Groundwater SedimentMRQLLLVLLGRIGRRVPHGLGSGLPLMSLALVVVGCASPSLMPEAQAVAIKHRDRALASHADAIHAAISQSGHVGALALLDAKDGRLVVLPGDSPADAWARYTASPESGTGRVSVPPVLT
Ga0222623_1014220213300022694Groundwater SedimentMRQLLLVLLGRIRWGARHVRGSGLPLMSLALVVVGCASPSLMPETQAVAVKDRDRALASHAGAIHAAISQSGNAGALAFLDAKDGRLVVLPGDSSADAWSRYTSSPESGTGRVSVPPVLTFVHRADVPKAPEPVT
Ga0247752_100811413300023071SoilMRQLLLGRVRRGVSHALRSGLLVLSLALLIVGCASQTLMPKEQAVAIDERDRALVSHGAAIQAAIRQSGTIGALAFLDAKDGSLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFVYRADVLKAPELVTQNLLQQQQAFRRSLT
Ga0207653_1013412623300025885Corn, Switchgrass And Miscanthus RhizosphereMCQLPLGRIRRGAPHTLRSGLPLLSLVMLVVGCASHTLMPKEQAVAIGDRDRALVSHADAIQAAIRQSGNIGALAFLDAKDGRLVVLPGDSPADAWARHATSPETGAGRISMPAVVTFVYRADVPKAPETVT
Ga0207684_1034849013300025910Corn, Switchgrass And Miscanthus RhizosphereMRQLLPVLLGRIRRGAPHVLGSGLPLMSLALLAVGCAGPSLMPKAQAAAIEDRDRALALHADAIQTATRQSGNLGALAFLDAKDGRLVVLPGDTPADAWAR
Ga0207684_1057302813300025910Corn, Switchgrass And Miscanthus RhizosphereMRQLLLVLLGRIRRGAPHMLGSRLPLMSLALVVVGCASHSLMPEAQAVAIKDRDRALASHAAAIHAAISQSGKVGALAFLDAKDGRLVVLPGDSPADAWAGYTTSPESGTGRVSVPPVLTFV
Ga0207646_1080049013300025922Corn, Switchgrass And Miscanthus RhizosphereMRQLLLGLLGRIRRETPAVLGSGLPLMGLALVVVGCASPSLMPEAQAVAIKDRDRALAAHADAIHAAIRQSGNVGALAFLDAKDGRLVVLPGDSPADAWSRYTTSPESETGRASVPPVLTFVHRADVPKAPETVTTSALQQQGAL
Ga0207689_1005618533300025942Miscanthus RhizosphereMCQLPLGRIRRGAPHTLRSGLPLLSLVMLVVGCASHTLMPKEQAVAIGDRDRALVSHADAIQAATRQSGNIGALAFLDAKDGRLVVLPGDSPADAWARHATSPETG
Ga0207712_1057105723300025961Switchgrass RhizosphereMRQLLLGSLRRGVPHTLRSGLLLLSLALLAVSCASQTLMPKEQAVAIDIRDRALVPHAAAIQAAIRQSGNVGALAFLDAVDGNLIVLPGDSPADAWARYATSPESGTGRVSVPA
Ga0207639_1101859923300026041Corn RhizosphereMPQLPLGSIRREAPHTLGSGLLLLSLALLAVGCASHTLMPKEQAVAIDSRDRALVPHAATIQAVIRQSGTIGALAFLDAKDGALIALPGDSPADAWARYTTSPDSGTGRVSVPAVVTFVYRADVPKAPETVTQTVLQ
Ga0207675_10206710323300026118Switchgrass RhizosphereMPQLLLGSIRRGAPHTLGSGLLLLSLALLAVGCASHTLMPKEQAVAIDSRDRALVPHAATIQAVIRQSGTIGALAFLDAKDGALIALPGDSPADAWARYTTS
Ga0209438_108124323300026285Grasslands SoilMRQLLLGLLGRIRRETPAVLGSGLPLMGLALVVVGCASPSLMPEAQAVAIKDRDRALAAHADAIHSAIRQSGNVGALAFLDAKDGRLVVLPGDSPADAWSRYTTSPESAAGRV
Ga0209438_122568513300026285Grasslands SoilMRQLLLGLLRRIGRGVPHGLGAGLPLMSLALVVAGCASHSLMPETQAAAIKHRERALASHADAIHAAIRQSGHAGALALLDAQDGRLVVLPGDSPADAWARYTASPESGTGRVSLPPVLTFVHRADVPQAPETITLSVLQQQAQLLD
Ga0257180_104856713300026354SoilMMRQLLLVLLGRLRRGVPHVLSSGLPLVGLSLVVVGCASHSLMPAAQSEAIRDRERALASHADAIQAAVRQSGNAGALAFLDAKDGHLVVLPGDSPADAWARYAASPESGTSPVSVPVVV
Ga0257177_105998013300026480SoilMLQLLLVLLGRMRRGAPHVLGSGLPLMSLAVVVVGCASPSLLPEAQAVAIQDRDRALALHADAIHAAISQSAKVGALVFLDAKDSHLVVLPGESPADAWARYTMLPENGTGRVSMPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDA
Ga0257168_108342713300026514SoilMLQLLLVLLGRMRRGAPHVLGSGLPLMSLAVVVVGCASPSLLPEAQAVAIQDRDRALALHADAIHAAISQSAKVGALVFLDAKDSHLVVLPGESPADAWARYTMLPENGTGRVSMPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEERLSIVQ
Ga0209899_102344423300027490Groundwater SandMRQLLLGLLGRIRSGRPHVLGSGLLLIGLSLVVVGCASPSLMPETQAVAIKDRDRALASHADAIHAAIRQSGNAGALAFLDAKDGRLVVLPGDSSADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPEPVTRSVLQQQQALAALGTE
Ga0209388_105867913300027655Vadose Zone SoilMMRQLLLVLLGRLRRGVPHVLSSGLPLVGLSLVVVGCASHSLMTAAQSEAIRDRERALASHADAIQAAVRQSGNVGALAFLDAKDGHLVVLPGDSPADAWARYAASPESGTSPVSVPVVV
Ga0209588_121035813300027671Vadose Zone SoilMLQLLLVLLGRMRRGAPHVLGSGLPLMSLAVVVVGCASPSLLPEAQAVAIQDRDRALALHADAIHAAISQSAKVGALVFLDAKDSHLVVLPGESPADAWARYTMLPENGTGRVSMPPVLTFVHRADVPKAPEAIPRS
Ga0209073_1010413813300027765Agricultural SoilMLQLLHVLLGRMRRGARHGLGSGVPLVSLALVVVGCASPSLMPAAQAVAIKDRDRALASRADAIHAAIGQSGHEGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGGVSVPPVLTFVHRADVAKA
Ga0209583_1039975213300027910WatershedsMLQLLLVLLGRMRRGAPHLRSSGLPLMSLAVVVVGCASPSLLPEAQAVAIQDRDRALALHADAIHAAISQSAKVGALVFLDATDSHLVVLPGESPADAWARYTLL
Ga0209069_1066094413300027915WatershedsMLQLLLVLLGRMRRGAPHVRSLGLPLMSLALVVVGCASPSLMPEAQAVAIKDRDRALASHADAIHAAISQSGNVGALAFLDAKDGRLIVLPGDSSGDAWSRYIASPEGETGRVSVPPVLTFVHRADVPKAPETVTRGVLQ
Ga0268264_1038295623300028381Switchgrass RhizosphereMRQLLLGSLRRGVPHTLRSGLLLLSLALLAVGCASQTLMPKEQAVAIDIRDRALVPHAAAIQAAIRQSGNVGALAFLDAKDGALIALPGDSPADAWARYTTSPDSGTGRVSVPAVVTFVYRADVPKAPETVTQNILQQQQAFRRSL
Ga0307296_1038110013300028819SoilMRQLLLALFGRIGRGVPNVLGSGLPLMSLALVVVGCASPSLMPEAQAVAIKHRERALASHADAIHAAISQSGQVGALALLDAKDGRLVVLPGDSPADAWARYTASPESGTGRVSVPP
Ga0302046_1005988033300030620SoilMRQLLLVLVGSTRRGALRALRSRVLLVSLALLVAGCASHSLMPKDQADAIKRRDRALAPHAEAIQAAIRQSGETGALTFLDANDGRLVVLPGETPADAWARHAAAPESEAGRVSVPAVVTFVHRADVARVP
(restricted) Ga0255310_1002045813300031197Sandy SoilMRQLLLVLLERIRRGAPHAVGSGLALMSLALVVVGCASHSLMPETQAGAIKDRDRALASHAAAIHAAIGPSGNVGALAFLDAKDGRLIVLPGDSSADAWSRYIASPEGETGRVSVP
Ga0307472_10269709813300032205Hardwood Forest SoilMGLALVVAGCASASLMPPGQAQAIKDRERALAVHVDAIQTAISQSGQVGALVFLDTGDGHLVVLPGSSPADAWARFATSPEGETGRRSVPPVLTFVHRADMPKAPEAVTLSDLEQQQAL
Ga0326726_1161624213300033433Peat SoilMCQLLLGRTRRGGPHALRSGLPLLSLVLLVVGCASHTLMPKEQAVAIDDRDRALVSHAAAIQAAIRQSGSIGALAFLDAKDGILAVLPGDSPADAWARYVTSPESVTGRVSVPAAL
Ga0316620_1106861113300033480SoilMESPAMLQLLHVLLGRMRRGAPHGLGSGVLLMSLALVAVGCASPSLMPEAQAVAIKDRDRALASRADAIHAAIGQSGHGGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGRVSV
Ga0316624_1164602413300033486SoilMLQLLHVLFGRIRRGAPHGLGSGLPLMSLALVVVGCASPSLMPEAQVVAIKDRDRALASRADAIHAAISQSGHLGALAFLDAADGRLVVLPGDSPADAWSRYTT
Ga0326731_116365113300033502Peat SoilMLQLLLVLLGRIRRGVPHVLGSGLPLMSLALVVGACASPSLMPEAQAVAIKDRDRALAPHADAIHAAISQSGNVGALAFLDAKDGRLVVLPGDSSAEAWSRYTTSPESGTGRVAVPPVLTFVHRADVPKAPETVT
Ga0364943_0352272_136_5613300034354SedimentMSLQEAKESTMRQLLLGRIRRGAPHTLRSGLLLLSLALLAVGCASHTLMPKEQAVAIDERDRALVSHAAGIQAAIRQSGNIGALAFLDAKDGSLVVLPGDSPADAWARYTTSPESGTGRVSVPAVLTFVYRSDVPKAPETVT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.