NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F039878

Metagenome Family F039878

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F039878
Family Type Metagenome
Number of Sequences 163
Average Sequence Length 221 residues
Representative Sequence MLQLWRDGALIVVLATLILPGCAPLPKVEYWDGFFRHLHPPQYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIAGLEVLGRSFRFF
Number of Associated Samples 135
Number of Associated Scaffolds 163

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 53.09 %
% of genes near scaffold ends (potentially truncated) 45.40 %
% of genes from short scaffolds (< 2000 bps) 62.58 %
Associated GOLD sequencing projects 123
AlphaFold2 3D model prediction Yes
3D model pTM-score0.73

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (98.773 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(22.086 % of family members)
Environment Ontology (ENVO) Unclassified
(26.380 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(38.650 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 34.92%    β-sheet: 28.57%    Coil/Unstructured: 36.51%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.73
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
d.107.1.1: Ran-binding protein mog1pd1jhsa_1jhs0.55589
d.107.1.2: PsbP-liked1v2ba_1v2b0.54925
d.107.1.3: PA0094-liked1tu1a11tu10.51527


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 163 Family Scaffolds
PF00072Response_reg 7.98
PF02567PhzC-PhzF 4.29
PF00884Sulfatase 4.29
PF01566Nramp 2.45
PF01590GAF 2.45
PF04238DUF420 1.84
PF02518HATPase_c 1.84
PF10442FIST_C 1.84
PF04392ABC_sub_bind 1.23
PF01936NYN 1.23
PF08241Methyltransf_11 1.23
PF02018CBM_4_9 1.23
PF12847Methyltransf_18 0.61
PF01329Pterin_4a 0.61
PF10754DUF2569 0.61
PF01569PAP2 0.61
PF00571CBS 0.61
PF13439Glyco_transf_4 0.61
PF13263PHP_C 0.61
PF08495FIST 0.61
PF09721Exosortase_EpsH 0.61
PF13646HEAT_2 0.61
PF00892EamA 0.61
PF16347DUF4976 0.61
PF14595Thioredoxin_9 0.61
PF01381HTH_3 0.61
PF00076RRM_1 0.61
PF13560HTH_31 0.61
PF00146NADHdh 0.61
PF07719TPR_2 0.61
PF12681Glyoxalase_2 0.61
PF13649Methyltransf_25 0.61
PF00781DAGK_cat 0.61
PF12974Phosphonate-bd 0.61
PF12833HTH_18 0.61

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 163 Family Scaffolds
COG0384Predicted epimerase YddE/YHI9, PhzF superfamilyGeneral function prediction only [R] 4.29
COG1914Mn2+ or Fe2+ transporter, NRAMP familyInorganic ion transport and metabolism [P] 2.45
COG2322Cytochrome oxidase assembly protein CtaM/YozB, DUF420 familyPosttranslational modification, protein turnover, chaperones [O] 1.84
COG1432NYN domain, predicted PIN-related RNAse, tRNA/rRNA maturationGeneral function prediction only [R] 1.23
COG1597Phosphatidylglycerol kinase, diacylglycerol kinase familyLipid transport and metabolism [I] 1.23
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 1.23
COG0650Formate hydrogenlyase subunit HyfCEnergy production and conversion [C] 0.61
COG1005NADH:ubiquinone oxidoreductase subunit 1 (chain H)Energy production and conversion [C] 0.61
COG2154Pterin-4a-carbinolamine dehydrataseCoenzyme transport and metabolism [H] 0.61
COG3287FIST domain protein MJ1623, contains FIST_N and FIST_C domainsSignal transduction mechanisms [T] 0.61
COG4398Small ligand-binding sensory domain FISTSignal transduction mechanisms [T] 0.61


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms98.77 %
UnclassifiedrootN/A1.23 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000363|ICChiseqgaiiFebDRAFT_11392575All Organisms → cellular organisms → Bacteria1772Open in IMG/M
3300000443|F12B_10050762All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1770Open in IMG/M
3300000559|F14TC_101002568All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium527Open in IMG/M
3300000787|JGI11643J11755_11476559All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium961Open in IMG/M
3300001431|F14TB_100698577All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2477Open in IMG/M
3300001431|F14TB_102388779All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1310Open in IMG/M
3300002561|JGI25384J37096_10146978All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium757Open in IMG/M
3300002562|JGI25382J37095_10065673All Organisms → cellular organisms → Bacteria1363Open in IMG/M
3300002908|JGI25382J43887_10010021All Organisms → cellular organisms → Bacteria4783Open in IMG/M
3300002908|JGI25382J43887_10169899All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1084Open in IMG/M
3300002912|JGI25386J43895_10057241All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1100Open in IMG/M
3300005167|Ga0066672_10204786All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1258Open in IMG/M
3300005174|Ga0066680_10044353All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2586Open in IMG/M
3300005178|Ga0066688_10098255All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1785Open in IMG/M
3300005180|Ga0066685_10314003All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1086Open in IMG/M
3300005181|Ga0066678_10074695All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1994Open in IMG/M
3300005186|Ga0066676_10116193All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1644Open in IMG/M
3300005289|Ga0065704_10332439All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium836Open in IMG/M
3300005295|Ga0065707_10114452All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2297Open in IMG/M
3300005445|Ga0070708_100236294All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1715Open in IMG/M
3300005445|Ga0070708_100324806All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1450Open in IMG/M
3300005554|Ga0066661_10337555All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium924Open in IMG/M
3300005558|Ga0066698_10083168All Organisms → cellular organisms → Bacteria2086Open in IMG/M
3300005598|Ga0066706_10154156All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1731Open in IMG/M
3300005598|Ga0066706_10577755All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium893Open in IMG/M
3300005713|Ga0066905_100698793All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium870Open in IMG/M
3300005764|Ga0066903_100070581All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4465Open in IMG/M
3300006032|Ga0066696_10526684All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium774Open in IMG/M
3300006034|Ga0066656_10042562All Organisms → cellular organisms → Bacteria2588Open in IMG/M
3300006049|Ga0075417_10086898All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1403Open in IMG/M
3300006796|Ga0066665_10070448All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2482Open in IMG/M
3300006797|Ga0066659_10084892All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2112Open in IMG/M
3300006800|Ga0066660_10035621All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3111Open in IMG/M
3300006844|Ga0075428_101045063All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium864Open in IMG/M
3300006845|Ga0075421_100031651All Organisms → cellular organisms → Bacteria → Proteobacteria6698Open in IMG/M
3300006852|Ga0075433_10000440All Organisms → cellular organisms → Bacteria26836Open in IMG/M
3300006852|Ga0075433_10014094All Organisms → cellular organisms → Bacteria → Proteobacteria6518Open in IMG/M
3300006854|Ga0075425_100363020All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1667Open in IMG/M
3300006854|Ga0075425_101004549All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium952Open in IMG/M
3300006904|Ga0075424_102117606All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium593Open in IMG/M
3300007076|Ga0075435_100068952All Organisms → cellular organisms → Bacteria2882Open in IMG/M
3300007076|Ga0075435_100104392All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2351Open in IMG/M
3300007076|Ga0075435_100126703All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2134Open in IMG/M
3300007076|Ga0075435_100387149All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1201Open in IMG/M
3300007255|Ga0099791_10021163Not Available2797Open in IMG/M
3300007255|Ga0099791_10120821All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1214Open in IMG/M
3300009094|Ga0111539_10248286All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2072Open in IMG/M
3300009100|Ga0075418_10569379All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1218Open in IMG/M
3300009137|Ga0066709_101561541All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium948Open in IMG/M
3300009147|Ga0114129_10003614All Organisms → cellular organisms → Bacteria21748Open in IMG/M
3300009147|Ga0114129_10133227All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3412Open in IMG/M
3300009156|Ga0111538_11498505All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium850Open in IMG/M
3300009162|Ga0075423_10000846All Organisms → cellular organisms → Bacteria25964Open in IMG/M
3300009162|Ga0075423_12281571All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium589Open in IMG/M
3300009792|Ga0126374_10295458All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1083Open in IMG/M
3300009792|Ga0126374_10463930All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium903Open in IMG/M
3300009810|Ga0105088_1067173All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium626Open in IMG/M
3300009816|Ga0105076_1008942All Organisms → cellular organisms → Bacteria1674Open in IMG/M
3300009819|Ga0105087_1055721All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium657Open in IMG/M
3300009821|Ga0105064_1022151All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1171Open in IMG/M
3300009837|Ga0105058_1014593All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1596Open in IMG/M
3300010047|Ga0126382_10837793All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium788Open in IMG/M
3300010303|Ga0134082_10038488All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1816Open in IMG/M
3300010362|Ga0126377_11031728All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium889Open in IMG/M
3300010366|Ga0126379_11325911All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium826Open in IMG/M
3300010398|Ga0126383_10495947All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1279Open in IMG/M
3300011269|Ga0137392_10027187All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4104Open in IMG/M
3300011271|Ga0137393_10026924All Organisms → cellular organisms → Bacteria4296Open in IMG/M
3300012096|Ga0137389_10023116All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4407Open in IMG/M
3300012199|Ga0137383_10125768All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1871Open in IMG/M
3300012200|Ga0137382_10118027All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1770Open in IMG/M
3300012202|Ga0137363_10717108All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium847Open in IMG/M
3300012203|Ga0137399_10254492All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1441Open in IMG/M
3300012203|Ga0137399_10806565All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium790Open in IMG/M
3300012207|Ga0137381_10751027All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium847Open in IMG/M
3300012211|Ga0137377_10006334All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium9561Open in IMG/M
3300012349|Ga0137387_10070723All Organisms → cellular organisms → Bacteria2377Open in IMG/M
3300012349|Ga0137387_10514898All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium868Open in IMG/M
3300012351|Ga0137386_10425453All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium956Open in IMG/M
3300012353|Ga0137367_10116408All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1964Open in IMG/M
3300012355|Ga0137369_10229963All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1415Open in IMG/M
3300012355|Ga0137369_10389012All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1008Open in IMG/M
3300012358|Ga0137368_10051595All Organisms → cellular organisms → Bacteria → Proteobacteria3483Open in IMG/M
3300012361|Ga0137360_10013850All Organisms → cellular organisms → Bacteria5211Open in IMG/M
3300012362|Ga0137361_10008624All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria7133Open in IMG/M
3300012362|Ga0137361_11028283All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium744Open in IMG/M
3300012532|Ga0137373_10275715All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1347Open in IMG/M
3300012925|Ga0137419_10034284All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3145Open in IMG/M
3300012930|Ga0137407_10176499All Organisms → cellular organisms → Bacteria1904Open in IMG/M
3300012944|Ga0137410_10306553All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1259Open in IMG/M
3300012944|Ga0137410_10466976All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1027Open in IMG/M
3300012948|Ga0126375_11261268All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium619Open in IMG/M
3300012972|Ga0134077_10219090All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium780Open in IMG/M
3300012972|Ga0134077_10244518All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium741Open in IMG/M
3300012972|Ga0134077_10568848All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium510Open in IMG/M
3300012975|Ga0134110_10204246All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium830Open in IMG/M
3300012976|Ga0134076_10028183All Organisms → cellular organisms → Bacteria2031Open in IMG/M
3300015264|Ga0137403_10801722All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium797Open in IMG/M
3300015358|Ga0134089_10230699All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium752Open in IMG/M
3300017656|Ga0134112_10103583All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1071Open in IMG/M
3300017659|Ga0134083_10076084All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1297Open in IMG/M
3300017997|Ga0184610_1030974All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1500Open in IMG/M
3300017997|Ga0184610_1187355All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium689Open in IMG/M
3300018028|Ga0184608_10029413All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2060Open in IMG/M
3300018052|Ga0184638_1022949All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2218Open in IMG/M
3300018053|Ga0184626_10019098All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2764Open in IMG/M
3300018063|Ga0184637_10016555All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4429Open in IMG/M
3300018075|Ga0184632_10025726All Organisms → cellular organisms → Bacteria2504Open in IMG/M
3300018075|Ga0184632_10027281All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2433Open in IMG/M
3300018076|Ga0184609_10021962All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2538Open in IMG/M
3300018076|Ga0184609_10092438All Organisms → cellular organisms → Bacteria1350Open in IMG/M
3300018077|Ga0184633_10018791All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3394Open in IMG/M
3300018078|Ga0184612_10015136All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3936Open in IMG/M
3300018084|Ga0184629_10060791All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1756Open in IMG/M
3300018431|Ga0066655_10134784All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1442Open in IMG/M
3300018433|Ga0066667_10445202All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1057Open in IMG/M
3300018468|Ga0066662_10762575All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium933Open in IMG/M
3300019789|Ga0137408_1106489All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium667Open in IMG/M
3300021073|Ga0210378_10002112All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium10195Open in IMG/M
3300021081|Ga0210379_10103545All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1186Open in IMG/M
3300021086|Ga0179596_10310198All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium788Open in IMG/M
3300022534|Ga0224452_1006031All Organisms → cellular organisms → Bacteria2987Open in IMG/M
3300022694|Ga0222623_10009503All Organisms → cellular organisms → Bacteria → Proteobacteria3516Open in IMG/M
3300025922|Ga0207646_10021313All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium5989Open in IMG/M
3300026297|Ga0209237_1007998All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium6340Open in IMG/M
3300026297|Ga0209237_1047021All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2200Open in IMG/M
3300026298|Ga0209236_1026309All Organisms → cellular organisms → Bacteria3170Open in IMG/M
3300026309|Ga0209055_1003936All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium8938Open in IMG/M
3300026315|Ga0209686_1025269All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2325Open in IMG/M
3300026325|Ga0209152_10037431All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1675Open in IMG/M
3300026332|Ga0209803_1002052All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium12825Open in IMG/M
3300026351|Ga0257170_1001334All Organisms → cellular organisms → Bacteria2421Open in IMG/M
3300026360|Ga0257173_1022276All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium802Open in IMG/M
3300026480|Ga0257177_1000147All Organisms → cellular organisms → Bacteria4045Open in IMG/M
3300026536|Ga0209058_1201294All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium821Open in IMG/M
3300026537|Ga0209157_1016388All Organisms → cellular organisms → Bacteria4731Open in IMG/M
3300026550|Ga0209474_10333409All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium864Open in IMG/M
3300027068|Ga0209898_1036706All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium630Open in IMG/M
3300027277|Ga0209846_1037198All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium765Open in IMG/M
3300027379|Ga0209842_1062362All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium663Open in IMG/M
3300027511|Ga0209843_1010482All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1964Open in IMG/M
3300027561|Ga0209887_1014975All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1943Open in IMG/M
3300027577|Ga0209874_1031167All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1461Open in IMG/M
3300027846|Ga0209180_10001313All Organisms → cellular organisms → Bacteria12303Open in IMG/M
3300027862|Ga0209701_10002177All Organisms → cellular organisms → Bacteria12752Open in IMG/M
3300027873|Ga0209814_10013557All Organisms → cellular organisms → Bacteria3242Open in IMG/M
3300027880|Ga0209481_10018756All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3042Open in IMG/M
3300027882|Ga0209590_10075464All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1953Open in IMG/M
3300027903|Ga0209488_10042821All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3321Open in IMG/M
3300027903|Ga0209488_10193465All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1533Open in IMG/M
3300027952|Ga0209889_1060439All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium783Open in IMG/M
3300027961|Ga0209853_1109304All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium695Open in IMG/M
3300028536|Ga0137415_10469430All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1065Open in IMG/M
3300028792|Ga0307504_10003831All Organisms → cellular organisms → Bacteria3024Open in IMG/M
(restricted) 3300031150|Ga0255311_1020901All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1348Open in IMG/M
3300031740|Ga0307468_100825636All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium794Open in IMG/M
3300031820|Ga0307473_10119872All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1441Open in IMG/M
3300031820|Ga0307473_10400984All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium899Open in IMG/M
3300031820|Ga0307473_11514259All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium509Open in IMG/M
3300032180|Ga0307471_100089537All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2745Open in IMG/M
3300032180|Ga0307471_100736151All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1152Open in IMG/M
3300032205|Ga0307472_100194705All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1534Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil22.09%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil14.72%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere12.88%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment7.98%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand7.98%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil7.36%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil5.52%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil4.29%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.29%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.07%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.84%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.84%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.23%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.23%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.23%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.61%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.61%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.61%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere0.61%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000363Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000443Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.2B clc assemlyEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000787Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005289Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2Host-AssociatedOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300009810Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30EnvironmentalOpen in IMG/M
3300009816Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10EnvironmentalOpen in IMG/M
3300009819Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_40_50EnvironmentalOpen in IMG/M
3300009821Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30EnvironmentalOpen in IMG/M
3300009837Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300027068Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027277Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027379Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300027511Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027561Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027577Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027880Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes)Host-AssociatedOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027952Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027961Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
ICChiseqgaiiFebDRAFT_1139257513300000363SoilRGARSCLVLARSRVSGGAKRSVLKGLEVKGRCSGFGGTGASIVVLATLMISGCAPVPKAEYWEGYFRHLHPPRYTFQVPEGWRQATISDYPSLGFNRRVFETLDAAGRSAAMQRAELEMQGRDTGLISSGGAWIQVASAAGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYHVKNVLRLRFTADGPRGSMHWTVLEFYGSSGVVNVAHVGIPEDSGEGIAGLEVLANSFRFE*
F12B_1005076223300000443SoilMLRFWRNWASIVVLATSALPGCAPLPKAEYWDGYFRHLHHPRYTFQVPDGWRPATISDYPSLGFNQRFFQTLDEAGRNAAMQRAELEMQGRDAALISSRGAWIQVQSAAGAGGWYTSRDLRFGLGDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVQDYRVKGVPRLRFTADGPRGSMHWTVLEFYGSSGVVNVAHVSIPEDSGEGIAGLEVLANSFRFE*
F14TC_10100256813300000559SoilYWDGYFRHLHHPRYTFQVPDGWRPATISDYPSLGFNQRFFQTLDEAGRNAAMQRAELEMQGRDAALISSRGAWIQVQSAAGAGGWYTSRDLRFGLGDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVQDYRVKGVPRLRFTADGPRGSMHWTVLEFYGSSGVVNVAHVGIP
JGI11643J11755_1147655913300000787SoilRGARSCLVLARSRVSGGAKRSVLKGLEVKGRCSGFGGTGASIVVLATLMISGCAPVPKAEYWEGYFRHLHPPRYTFQVPEGWRQATISDYPSLGFNRRXFETLDXAGRSAAMQRAEXEMQGRDTGLISSGGAWIQVASAAGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYHVKNVLRLRFTADGPRGSMHWTVLEFYGSSGVVNVAHVDRAPR*
F14TB_10069857713300001431SoilMLRFWRNWASIVVLATSALPGCAPLPKAEYWDGYFRHLHHPRYTFQVPDGWRPATISDYPSLGFNQRFFQTLDEAGRNAAMQRAELEMQGRDAALISSRGAWIQVQSAAGAGGWYTSRDLRFGLGDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVQDYRVKGVPRLRFTADGPRGSMHWTVLEFYGSSGVVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
F14TB_10238877913300001431SoilMGMVNAVLPCPARSRVSGGAKRSVLRFWRNWASIVVLATLMISGCAPVPKAEYWEGYFRHLHPPRYTFQVPEGWRQATISDYPSLGFNRRVFETLDEAGRSAAMQRAEREMQGRDTGLISSGGAWIQVASAAGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYHVKNVLRLRFTADGPRGSMHWTVLEFYGSSGVVNVAHVGIPEDSGEGIAGLEVLANSFRFE*
JGI25384J37096_1014697813300002561Grasslands SoilVPAARGPGEGESVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYG
JGI25382J37095_1006567323300002562Grasslands SoilMTWLRRQLEELGRRGVVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEPGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSXAGPGGWYSLRDRPGFGLSEREQQXVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIXGLEVIAQSFRFQ*
JGI25382J43887_1001002143300002908Grasslands SoilMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLEEPGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
JGI25382J43887_1016989913300002908Grasslands SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLXPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQXLDEAGRTAAMQRAELEMQSGDXGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
JGI25386J43895_1005724123300002912Grasslands SoilTLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQTVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
Ga0066672_1020478623300005167SoilMTATGQKGPLSDAGGMGNLHPPSHGDRSGRPGLRTRHGSAPRGRPGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0066680_1004435343300005174SoilVSRGGRRSVLEVLEVRERCSGFGGTGASIVVLATLMLPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRQATISDYPSLGFNRHVFETLDQAGRSAAMQRAELEMQGRDTGLISSRGAWIQVTSAGGVGGWYTFKDLRFGLSEQEKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYRVKSVLRLRFTADGPRGSMHWTVLEFYSSSGVVTVAHVGIPEVRGEGIACLEALAYSFRFE*
Ga0066688_1009825513300005178SoilYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0066685_1031400313300005180SoilMLQLWRDGALIAVLAMSILPGCAPLPKVEYWDGYFRHLQHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADKPKLSLESMELADYGENRVLRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDRDEGIAGLEVIAQSFRFR*
Ga0066678_1007469513300005181SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0066676_1011619323300005186SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELELQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0065704_1033243913300005289Switchgrass RhizosphereMVNAVLPCPARSRVSGGAKRSVLKGLEVKGRCSGFGGTGASIVVLATLMISGCAPVPKAEYWEGYFRHLHPPRYTFQVPEGWRQATISDYPSLGFNRRVFETLDEAGRSAAMQRAEREMQGRDTGLISSGGAWIQVASAAGAGGWYTFKDLRFGLGDREKRAIWQRLSTNLFQAVPPAEKPNLTLQSLDVVEDYHVKNVLRLRFTADGPRGSMHWTVLEFYGSSGVVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0065707_1011445233300005295Switchgrass RhizosphereMGMVNAVLPCPARSRVSGGAKRSVLKGLEVKGRCSGFGGTGASIVVLATLMISGCAPVPKAEYWEGYFRHLHPPRYTFQVPEGWRQATISDYPSLGFNRRVFETLDEAGRSAAMQRAEREMQGRDTGLISSGGAWIQVASAAGAGGWYTFKDLRFGLGDREKRAIWQRLSTNLFQAVPPAEKPNLTLQSLDVVEDYHVKNVLRLRFTADGPRGSMHWTVLEFYGSSGVVNVAHVGIPEDSGEGIAGLEVLANSFRFE*
Ga0070708_10023629443300005445Corn, Switchgrass And Miscanthus RhizosphereMGCAACRQLEGLGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYASSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0070708_10032480633300005445Corn, Switchgrass And Miscanthus RhizosphereVSRGGRRSVLEVLEVRERCSGFGGTGASIVVLATLMLPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRQATISDYPSLGFNRHVFETLDQAGRSAAMQRAELEMQGRDTGLISSRGAWIQVTSAGGVGGWYTFKDLRFGLSEQEKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYRVKSVLRLRFTADGPRGSMHWTVLEFYSSSGVVTVAHVGIPEDREEGIAGLEALAYSFRFE*
Ga0066661_1033755513300005554SoilMLPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRQATISDYPSLGFNRHVFETLDQAGRSAAMQRAELEMQGRDTGLISSRGAWIQVTSAGGVGGWYTFKDLRFGLSEQEKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYRVKSVLRLRFTADGPRGSMHWTVLEFYSSSGVVTVAHVGIPEDRGEGIAGLEALAYSFRFE*
Ga0066698_1008316813300005558SoilAMSILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDSVLISSQGAWIQVGSEVGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADKPKLSLESMELADYGENRVLRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDRDEGIAGLEVIAQSFRFR*
Ga0066705_1044477813300005569SoilQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0066706_1015415633300005598SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0066706_1057775513300005598SoilMANAWFRRQLEGLGKEGVRLRFWRDWAFIVVPVLASCAPLPKVEYWGGFFRHLHPPRYTFQVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSAAMQRAELEMRGSDTGLISSRGAWIQVMSRVGPGGWYTFKDLRFGLSDGEKQAIWQQLSTNLIQTAPAAEKPNLTLESIDVLEDYSVRSVLRLRFTADVPRGSMHWTVL
Ga0066905_10069879323300005713Tropical Forest SoilMFRFWRKWASILVLAMSALPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAAMQRAELEMQGRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTAEGTRGSMHWTVLEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0066903_10007058153300005764Tropical Forest SoilMFRFWRKWASILVLATSALSGCAPLPKTEYWEGYFRHLHPPRYSFQVPDGWRPATISDYPSLGFNRRFFATLDEAGRSAAMQRAELEMQGRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVLEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0066696_1052668413300006032SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFRRLDEAGRTAAVQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIP
Ga0066656_1004256223300006034SoilMLQLWRDGALIAVLAMSILPGCAPLPEIEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEVGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADRPKLSLESMELADYGENRVLRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDRDEGIAGLEVIAQSFRFR*
Ga0075417_1008689813300006049Populus RhizosphereVRDRRSLVALTTLMLAGCAPVPKAEYWQGYFRHLHPPRYMFQVPDGWREATISDYPSLGFNRGVFERLDEAGRSAAMQRGELDMQSRDTGLVSSRGAWIQVASTVGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLTQAAPAGEKPNLTLQSLDVVEDYWVKNVLRLRFTADVPRGSMRWTVLEFYGSSGVVTVAHVGIPEDSGEGIAGLDVLAQSFRFQ*
Ga0066665_1007044823300006796SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0066659_1008489233300006797SoilMGTDLDAPWAPHPTWICAACRPGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFRRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0066660_1003562153300006800SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFRRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0075428_10104506313300006844Populus RhizosphereVRDRRFSFGRTGAASIVLATLMLLGCAPVPKAEYWEGYFRHLHPPRYMFQVPDGWRQATISDYPSLGFNQRVFELLDEAGRSAAMQRAEVEMQSRDTGLISSRGAWIQVASAAGAGGWYTFRDLRFGLGDREKQAIWQRLSTNLIQAAPSAEKPNLTLQSLDVVEDYWVKNVLRLRFTADGARGSMHWTVLEFYGSSGVVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0075421_10003165163300006845Populus RhizosphereMWAEVCTHAGESASGRLVRKSEGGSLTLVGHPIRCQSRLRRSGARNVLGSGEGLNVRDRRSLVALTTLMLAGCAPVPKAEYWQGYFRHLHPPRYMFQVPDGWREATISDYPSLGFNRGVFERLDEAGRSAAMQRGELDMQSRDTGLVSSRGAWIQVASTVGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLTQAAPAGEKPNLTLQSLDVVEDYWVKNVLRLRFTADVPRGSMRWTVLEFYGSSGVVTVAHVGIPEDSGEGIAGLDVLAQSFRFQ*
Ga0075433_1000044033300006852Populus RhizosphereMLAGCAPVPKAEYWQGYFRHLHPPRYMFQVPDGWREATISDYPSLGFNRGVFERLDEAGRSAAMQRGELDMQSRDTGLVSSRGAWIQVASTVGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLTQAAPAGEKPNLTLQSLDVVEDYWVKNVLRLRFTADVPRGSMRWTVLEFYGSSGVVTVAHVGIPEDSGEGIAGLDVLAQSFRFQ*
Ga0075433_1001409453300006852Populus RhizosphereMFRFWRKWASILVLATSALPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAALQRAELEMQSRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLMQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVFEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0075425_10036302023300006854Populus RhizosphereMFRFWRKWASILVLATSALPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAALQRAELEMQSRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLMQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVFEFYGSSGAVNVAHVGIPGDSGEGIAGLEVLARSFRFE*
Ga0075425_10100454913300006854Populus RhizosphereCCRWRKGLRRVPTARGPGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTSAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0075424_10211760613300006904Populus RhizosphereVKGRCSGFGGTGASIVVLATLMISGCAPVPKAEYWEGYFRHLHPPRYTFQVPDGWRQATISDYPALGFNRRLFQTLDEAGRSAAMQGAEQEMQSPDTGLISCRGAWIQVTSAGGVGGWYTSKDLRFGLSDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYRVKNVLRLRFTADGPRGSMHWTVLEFYGSS
Ga0075435_10006895213300007076Populus RhizosphereVPKAEYWQGYFRHLHPPRYMFQVPDGWREATISDYPSLGFNRGVFERLDEAGRSAAMQRGELDMQSRDTGLVSSRGAWIQVASTVGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLTQAAPAGEKPNLTLQSLDVVEDYWVKNVLRLRFTADVPRGSMRWTVLEFYGSSGVVTVAHVGIPEDSGEGIAGLDVLAQSFRFQ*
Ga0075435_10010439263300007076Populus RhizosphereMFRLPSGRVPIVVLATLILPACASLPKAEYWGGFFRHLYPPRYTFQVPDGWRQATISDYPSLGFNRRLFQALDEAGRSAAMQRAELEMQSPDAGLISSRGAWIQVMSRAGPGGWYTFKDLRFGLSDREKQAIWQQLSTNLIQTAPAAEKPNLTVESIDVIEDYYVRSVLRLRFTANAPRGSMHWTVLEFYGSSGAVTVAHLGTPEDRDEGIAGLEEIARWFRFD*
Ga0075435_10012670333300007076Populus RhizosphereMKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTVAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTLKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0075435_10038714933300007076Populus RhizospherePRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAALQRAELEMQSRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLMQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVFEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0099791_1002116343300007255Vadose Zone SoilMTWLRRQLEELGRRGVVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQTVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
Ga0099791_1012082143300007255Vadose Zone SoilVRLRFWRDWAFIVVPVLAGCAPFPKAEYWDGFFRHLYPPQYTFQVPDGWRQATMSDYPWLGFNRRLFQTLDKAGRTAAMQRAELEMQSVDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGLAGLEVLARSFRFF*
Ga0111539_1024828643300009094Populus RhizosphereFWRKWASILVLATSALPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAALQRAELEMQSRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVFEFYGSSGAVNVAHVGIPGDSGEGIAGLEVLARSFRFE*
Ga0075418_1056937923300009100Populus RhizosphereLPKAEYWDGYFRHLHPPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAAMQRAELEMQGRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVFEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0066709_10156154123300009137Grasslands SoilLNRTRFRWSPRIPPSKRISKLGKKFKEAMANAWFRRQLEGLGKEGVRLRFWRDWAFIVVPVLASCAPLPKVEYWGGFFRHLHPPRYTFQVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSAAMQRAELEEMRGSDTGLISSRGAWIQVMSRVGPGGWYTFKDLRFGLSDGEKQAIWQQLSTNLIQTAPAAEKPNLTLESIDVVEDYSVRSVLRLRFTADVPRGSMHWTVLEFYDSSGVVTVAHVGTPEDRDEGIAGLEVIARWFRFD*
Ga0114129_10003614233300009147Populus RhizosphereVLGSGEGLNVRDRRSLVALTTLMLAGCAPVPKAEYWQGYFRHLHPPRYMFQVPDGWREATISDYPSLGFNRGVFERLDEAGRSAAMQRGELDMQSRDTGLVSSRGAWIQVASTVGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLTQAAPAGEKPNLTLQSLDVVEDYWVKNVLRLRFTADVPRGSMRWTVLEFYGSSGVVTVAHVGIPEDSGEGIAGLDVLAQSFRFQ*
Ga0114129_1013322713300009147Populus RhizosphereMFRFWRKWASILVLATSALPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAAMQRAELEMQGRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVFEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0111538_1149850513300009156Populus RhizosphereMFRFWRKWASILVLATSALPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAALQRAELEMQSRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTV
Ga0075423_1000084623300009162Populus RhizosphereMFQVPDGWREATISDYPSLGFNRGVFERLDEAGRSAAMQRGELDMQSRDTGLVSSRGAWIQVASTVGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLTQAAPAGEKPNLTLQSLDVVEDYWVKNVLRLRFTADVPRGSMRWTVLEFYGSSGVVTVAHVGIPEDSGEGIAGLDVLAQSFRFQ*
Ga0075423_1228157113300009162Populus RhizospherePPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAALQRAELEMQSRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLMQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVFEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0126374_1029545813300009792Tropical Forest SoilGCAPLPKAEYWEGYFRHLHPPRYSFQVPDGWRPATISDYPSLGFNRRFFATLDEAGRSAAMQRAELEMQGRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLTQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVLEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0126374_1046393013300009792Tropical Forest SoilVRRWRDWAFVVVLTTLTQHGCAPLPKAEYWDGFFRHLHPPKYSFQVPDGWRPATSSDYPLLGFNRHLFQTLDEAGRSAAMQRAELEMQRQDAVLVSSRGAWIQVRSQVGAGRWYAFGNLRFGLGEREKQAIWQSLSTSLSQSAPAADKPKLTLESIGVVDYSLNRVVRLSFQSEGARGPMHWTLLAFDRSSDLVTIAHVGIPEDRNEGIAGLDVIARTFRFD*
Ga0105088_106717313300009810Groundwater SandGEAGRVMLQLWRDGALIIVLATLILSGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSFGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYGLKDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDITRGSMHWTVLGFYGSSGMVTVAHVG
Ga0105076_100894213300009816Groundwater SandMLQLWRDGALIIVLATLILPGCAPLPKVEYWDGFFLHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPTLSLESMELADYGGNTVLRLRFRSDIRRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLAVIAQSFRFQ*
Ga0105087_105572113300009819Groundwater SandWRDGALIIVLATLILSGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSFGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGGNTVLRLRFRSDIRRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFR
Ga0105064_102215123300009821Groundwater SandMLQLWRDGALIIGLATLILPGCAPLPKVEYWDGFFLHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDLRRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
Ga0105058_101459323300009837Groundwater SandMLQLWRDGALIIVLATLILSGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYGLKDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDLRRGSMHWTGLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
Ga0126382_1083779313300010047Tropical Forest SoilWRDWAFVVVLTTLTQHGCAPVPKKEYWDGYFRHLHPPRYSFQVPDGWRPATSSDYPLLGFNRHLFQTLDEAGRSAAMQRAELEMQRQDAVLVSSRGAWIQVRSQVGAERWYAFGNLRFGLGERERQAIWQSLSTSLSRSAPAADKPKLTLESIDVVDYSLNRVVRLSFQSEGTRGPMHWTVLAFDRSFDLVTIAHVGIPEDRNEGLAGLDAIALTFRFD*
Ga0134082_1003848833300010303Grasslands SoilMGTDLDALGSAPDMDLRRVPAARWPGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0126377_1103172813300010362Tropical Forest SoilMFRFWRKWASILVLAMSALPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAAMQRAELEMQGRDASLISSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVLEFYGSSGAVNVAHVGIPEDSGEGTAGLEVLARSFRFE*
Ga0126379_1132591123300010366Tropical Forest SoilMFRFWRKWASILVLATSALSGCAPLPKTEYWEGYFRHLHPPRYSFQVPDGWRPATISDYPSLGFNRRFFATLDEAGRSAAMQRAELEMQGRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVLEFYGSSGAVNVAHVGIPEDSGEGIAGLE
Ga0126383_1049594733300010398Tropical Forest SoilMFRFWRKRASILVLATSALSGCAPLPKTEYWEGYFRHLHPPRYSFQVPDGWRPATISDYPSLGFNRRFFATLDEAGRSAAMQRAELEMQGRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVLEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE*
Ga0137392_1002718743300011269Vadose Zone SoilMTWLRRQLEELGRRGVVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
Ga0137393_1002692463300011271Vadose Zone SoilGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
Ga0137389_1002311633300012096Vadose Zone SoilMSPRVRVGEHGQGATWLCRQLEGLGEGKGVKLWRNWALTVVLSTLMLPGCAPVPKAEYWDGYFRHLHPPRYTFQVPDGWRQATTSDYPLLGFNRRLFQTLDEAGRTAAMQRAEPEMQSGDTGLISSRGAWIQVASAGGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYRVKSVLRLRFTADGSRGSMHWTVLEFYGSSGVVTVAHVGIPEDRGEGIAGLEVLANSFRFE*
Ga0137383_1012576833300012199Vadose Zone SoilVKLGRNWALTVVMLTTLTLPGCAPLPKAEYWGGFFRHLHPPQYMFQVPDGWRQVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLVRSFRFF*
Ga0137382_1011802723300012200Vadose Zone SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0137363_1071710813300012202Vadose Zone SoilMGCAACRQLEGLGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQGAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF*
Ga0137399_1025449213300012203Vadose Zone SoilVKLRRDWALLVVLSTLTLPGCAPLPKAEYWDGFFRHLDPPRYTFQVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSAAMQRAELEMQGPDTGLISSRGAWIQVMSRVGPGGWYTFKDLRFGLSDGEKQAIWQQLSTSLIQTAPAAEKPNLTLESIDVVEDYYVRSVLRLRFTADVPRGSMHWTVLEFYGSSGVATVAHVGTPEDRDEGIAGLEVIARWFRFD*
Ga0137399_1080656513300012203Vadose Zone SoilMLAGCAPFPKAEYWEGFFRHLYPPQYTFQVPDGWRQATMSDYPLLGFNRRVFQTLDQAGRTAAMQRAELEMQSVDTGLISSRGAWIQVRSQSGPGGWYTFRDLRFGLSDREKQAIWQRLAARLIQTAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMRWTALEFYGSSGVVTVAHVGIPEDRDEGIAGLEAIARSFRF*
Ga0137381_1075102713300012207Vadose Zone SoilPRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRIAAMQRAELEMQNGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIVGLEVLGRSFRFF*
Ga0137377_10006334113300012211Vadose Zone SoilMLTTLTLPGCAPLPKAEYWGGFFRHLHPPQYMFQVPDGWRQVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLVRSFRFF*
Ga0137387_1007072313300012349Vadose Zone SoilSRQESGVREIRLRRSMRWGLETDRAGTAPVLDPTSDPVVMILCLTIDPLRRQLEGLGEAGRAMLQLWWDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDSVLISSQGAWIQVGSEVGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADKPKLSLESMELADYGENRVLRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDRDEGIAGLEVIAQSFRFR*
Ga0137387_1051489813300012349Vadose Zone SoilGWRQATTSDYPSLGFNRRLFQTLDEVGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDRRKVLSALKCSDARSDFSESRGPSLAVGARLAVSFN*
Ga0137386_1042545313300012351Vadose Zone SoilRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRIAAMQRAELEMQNGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIVGLEVLGRSFRFF*
Ga0137367_1011640863300012353Vadose Zone SoilVDGVKLWRNWALTVVLSTLTLPGCAPLPKAEYWNGFFRHLHPPQYTFQVPDGWRQATMSDYPLLGFNRRLFQTLDEAGRTAALQRAELEMQSGDTGLISSRGAWIQVRSQIGPGGWNTFKDLRFGLSDREKQAIWQRLAASLIHTAPPAEKPNLTLESLDVVEDYRVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARSFRFF*
Ga0137369_1022996333300012355Vadose Zone SoilMSGKRVMLQLWRDWALIIVLATLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMHGRDTVLISSRGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPAYKPKLSLESMGLADYGENRVLRLRFRSDIRRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEEIAQSFRFQ*
Ga0137369_1038901213300012355Vadose Zone SoilVDGVKLWRNWALTVVLSTLTLPGCAPLPKAEYWNGFFRHLRPPQYTFQVPDGWRQATMSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQIGPGGWNTFKDLRFGLSDREKQAIWQRLAASLIHTAPPAEKPNLTLESLDVVEDYRVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGL
Ga0137368_1005159553300012358Vadose Zone SoilMTRPRTAWHRRVMAKLEMGHLQLASRERMSICATSSRAGEVDGVKLWRNWALTVVLSTLTLPGCAPLPKAEYWNGFFRHLRPPQYTFQVPDGWRQATMSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQIGPGGWNTFKDLRFGLSDREKQAIWQRLAASLIQTAPPAEKPNLTLESLDVVEDYRVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARSFRFF*
Ga0137360_1001385033300012361Vadose Zone SoilMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQTVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
Ga0137361_1000862413300012362Vadose Zone SoilRRLRGAMTWLRRQLEELGRRGVVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQTVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
Ga0137361_1102828313300012362Vadose Zone SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDRE
Ga0137373_1027571523300012532Vadose Zone SoilVDGVKLWRNWALTVVLSTLTLPGCAPLPKAEYWNGFFRHLHPPQYTFQVPDGWRQATMSDYPLLGFNRRLFQTLDEAGRTAALQRAELEMQSGDTGLISSRGAWIQVRSQIGPGGWNTFKDLRFGLSDREKQAIWQRLAASLIQTAPPAEKPNLTLESLDVVEDYRVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLDVLARSFRFF*
Ga0137419_1003428443300012925Vadose Zone SoilMLAGCAPFPKAEYWEGFFRHLYPPQYTFQVPDGWRQATMSDYPLLGFNRRVFQTLDQAGRTAAMQRAELEMQSVDTGLISSRGAWIQVRSQSGPGGWYTFRDLRFGLSDREKQAIWQRLAARLIQTAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMRWTALEFYGSSGVV
Ga0137407_1017649933300012930Vadose Zone SoilMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ*
Ga0137410_1030655313300012944Vadose Zone SoilVPLLAGCAPFPKAEYWDGFFRHLYPPQYTFQVPDGWRQATMSDYPWLGFNRRLFQTLDKAGRTAAMQRAELEMQSVDTGLISSRGAWIQVRSDSGPGGWYTFKDLRFGLSDREKQAIWQRLAARLIQTAPPAEKPDLTLESLDVVEDYEVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVT
Ga0137410_1046697613300012944Vadose Zone SoilMILCLSIDPLRRQLEGLGEARRAMLRLWRDGALIVVLATLILPGCAPLPKVEYWDGFFRHLHPPRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDRERRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIAGLEVLGRSFRFF*
Ga0126375_1126126813300012948Tropical Forest SoilGCAPVPKKEYWDGYFRHLHPPRYSFQVPDGWRPATSSDYPLLGFNRHLFETLDEAGRSAAMQRAEREMQGQDAVLVSSRGAWIQVGSQVGAKRWYAFRNLRFGLDEREKQAIWDSLSTRLSQSAPPADKPKLSLESIDTVDYAWHRAVRLSFQSEGTRGPMHWTVLAFDRSFDLVTIAHVGIPEDRNEGIEGLDAIALTFRFE*
Ga0134077_1021909023300012972Grasslands SoilWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDSVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADKPKLSLESMELADYGENRVLRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFR*
Ga0134077_1024451813300012972Grasslands SoilMLQLWRDGALIAVLAMSILPGCAPLPKVEYWDGFFRHLHPPQYTFQVPDGWRQATTSDYPSLGFNRRLFQTLDEAGRTAATQRAEAEMQSGDTGLISSKGAWIQVRSQVGAGGWYTFKDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYRVKNVLRLRFAADVPRGSMHWTVLEFYGSSGVVTVAH
Ga0134077_1056884813300012972Grasslands SoilKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGV
Ga0134110_1020424613300012975Grasslands SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFRRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRF
Ga0134076_1002818333300012976Grasslands SoilLWRDGALIAVLAMSILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADRPKLSLESMELADYGENRVVRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFR*
Ga0137403_1080172213300015264Vadose Zone SoilMILCLSIDPLRRQLEGLGEAGRAMLRLWRDGALIVVLATLILPGCAPLPKVEYWDGFFRHLHPPRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMRWTALEFYGSSGVVTVAHVGIPE
Ga0134089_1023069913300015358Grasslands SoilAPLPKVEYWDGYFRHLHPPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADKPKLSLESMELADYGENRVLRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDRDEGIAGLEVIAQSFRFR*
Ga0134112_1010358323300017656Grasslands SoilMLQLWRDGALIAVLAISILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADKPKLSLESMELADYGENRVLRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDRDEGIA
Ga0134083_1007608423300017659Grasslands SoilFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSLFLQRAELEMQGRDSVLISSQGAWIQVGSEVGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADKPKLSLESMELADYGENRVVRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDRDEGIAGLEVIAQSFRFR
Ga0184610_103097443300017997Groundwater SedimentPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEQEQQAVWQRLSTGLIQAAPPADKPKLSLESMELADYSENRVLRLRFRSDIRRGSMHWTVLGFYGSSGMVTFAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0184610_118735513300017997Groundwater SedimentPLPKVEYWDGFFRHLHPPRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDRKKRAIWQRLAARLIQAAPPAERPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIASLEVLGRSFRFF
Ga0184608_1002941333300018028Groundwater SedimentMYSSLCLSIDPLRRQLEGLGEAGRAMLRLWRDGALIVVLATLILPGCAPFPKVEYWDGFFRHLHPPRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIHAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIAGLEVLGRSFRFF
Ga0184638_102294913300018052Groundwater SedimentMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGFFRHLHPPQYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIAGLEVLGRSFRFF
Ga0184626_1001909813300018053Groundwater SedimentMLQLWRDGALIVVLAMLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSSRDRPGFGLSEREQQAVWQRLSTGLIQAAPPADKPKLSLESMELADYSENRVLRLRFRSDIRRGSMRWTVLGFYGSSGMVTVAHVGVPEDKEEGIAGLAVIAQSFRFQ
Ga0184637_1001655593300018063Groundwater SedimentMLQLWRDGALIVVLAMLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSSRDRPGFGLSEREQQAVWQRLSTGLIQAAPPADKPKLSLESMELADYSENRVLRLRFRSDIRRGSMHWTVLGFYGSSGMVTFAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0184632_1002572623300018075Groundwater SedimentMLQLWRDGALIVVLATLILPGCAPLPKVEYWGGFFRHLHHPRYSFRVPDGWRQATISDYLSFGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSSRDRPGFGLSEREQQAVWQRLSTGLIQAAPPADKPKLSLESMELADYSENRVLRLRFRSDIRRGSMHWTVLGFYGSSGMVTVAHVGVPEDKEEGIAGLEVIARSFRFQ
Ga0184632_1002728113300018075Groundwater SedimentRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRAAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEDIAGLEVLGRSFRFF
Ga0184609_1002196223300018076Groundwater SedimentMILCLSIDSLSIDPFRRQLEGLGGAGRAMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGFFRHLHPPRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIHAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIAGLEVLGRSFRFF
Ga0184609_1009243833300018076Groundwater SedimentMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSSRDRPGFGLSEREQQAVWQRLSTGLIQAAPPADKPKLSLESMALADYGENRVLRLRFRSDIRRGSMHWTVLGFYGSSGMVTVAHVGVPED
Ga0184633_1001879193300018077Groundwater SedimentMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEQEQQAVWQRLSTGLIQAAPPADKPKLSLESMELADYSENRVLRLRFRSDIRRGSMHWTVLGFYGSSGMVTFAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0184612_1001513643300018078Groundwater SedimentMLQLWRDGALIVVLAMLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSSRDRPGFGLSEREQQAVWQRLSTGLIQAAPPADKPKLSLESMELADYSENRVLRLRFRSDIRRGSMHWTVLGFYGSSGMVTVAHVGVPEDKEEGIAGLAVIAQSFRFQ
Ga0184629_1006079123300018084Groundwater SedimentGALIVVLATLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSAQEQQAVWQRLSTGLIQTAPPGDKPKLSLESMELVDYSENRVLRLRFRSDITRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0066655_1013478433300018431Grasslands SoilVKFGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF
Ga0066667_1044520223300018433Grasslands SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTIKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF
Ga0066662_1076257513300018468Grasslands SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF
Ga0137408_110648913300019789Vadose Zone SoilRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFRFDF
Ga0210378_10002112143300021073Groundwater SedimentMLQLWRDGALIVVLATLILPGCAPFPKVEYWDGFFRHLHPPRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIAGLEVLGRSFRFF
Ga0210379_1010354523300021081Groundwater SedimentIVVLAMLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDTVLISSRGAWIQVGSEAGPGGWYSLRYRPGFGLSEREQQAVWQRLSTGLIQTAPPGDKPKLSLESMELVDYSENRVLRLRFRSDITRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0179596_1031019813300021086Vadose Zone SoilAMTWLRRQLEELGRRGVVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQTVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0224452_100603133300022534Groundwater SedimentMILCLSIDPLRRQLEGLGEAGRAMLRLWRDGALIVVLATLILPGCAPFPKVEYWDGFFRHLHPPRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIQAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIAGLEVLGRSFRFF
Ga0222623_1000950313300022694Groundwater SedimentEPLLEYRPTPPATRGAGEAGRVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGFFRHLHPPRYTFQVPNGWRQATTSDYPSLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQVGAGGWYTFQDLRFGLSDREKRAIWQRLAARLIHAAPPAEKPNLTLESLDVVEDYHVKNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPEDREEGIAGLEVLGRSFRFF
Ga0207646_1002131353300025922Corn, Switchgrass And Miscanthus RhizosphereMGCAACRQLEGLGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYASSGVVTVAYVGIPEDREEGLAGLEVLARTFRF
Ga0209237_100799883300026297Grasslands SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQCLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF
Ga0209237_104702133300026297Grasslands SoilMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQTVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0209236_102630953300026298Grasslands SoilGWSRPSSSAALARRLRGAMTWLRRQLEELRRRGVVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQTVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGITGLEVIAQSFRFQ
Ga0209055_1003936123300026309SoilVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF
Ga0209686_102526943300026315SoilLPGCAPLPKVEYWGGFFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFRRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF
Ga0209152_1003743133300026325SoilMGTDLDAPWAPHPTWICAACRPGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFRRLDEAGRTAAMQRAEAEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF
Ga0209803_100205263300026332SoilMGCAACRQLEGLGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAHVGIPEDREEGLAGLEVLARTFRF
Ga0257170_100133433300026351SoilMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIA
Ga0257173_102227613300026360SoilYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0257177_100014753300026480SoilMTWLRRQLEELGRRGVVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRF
Ga0209058_120129413300026536SoilHDLRGVPAARGAGKGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRF
Ga0209157_101638833300026537SoilMLQLWRDGALIAVLAMSILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAERSVFLQRAELEMQGRDSVLISSQGAWIQVGSEVGPGGWYSLRDRPGFGLSEREQQAVWQRFSTGLIQNAPPADRPKLSLESMELADYGENRVLRLRFRSDIKRGSMHWTVLGFYGSSGMVTVAHVGVPEDRDEGIAGLEVIAQSFRFR
Ga0209474_1033340913300026550SoilEEWVIYIRRVMGTDLDAPWAPHPTWICAACRPGEGEGVKLGRNWALTVVVLTTLTLPGCAPLPKVEYWDGFFRHLHPPKYMFQVPDGWRPVTTSDYPLLGFNRRLFQRLDEAGRTAAMQRAEAEMQGGDTGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARTFRFF
Ga0209898_103670613300027068Groundwater SandAEYWDGYFRHLHPPRYTFQVPNGWRQATISDYPSLGFNRRFFETLDEAGRNAAMQRAELDMQSGDIGLISSRGGWIQVRSQGSPGGWYTFRDLRFGVGDREKQAIWQRLSTNLIQTAPPAEKPNLTLASIDVVEAYRVGSVLRVRFTADVPRGSMHWTVLEFYSSSGVVTVAHVGIPEDREEGIAGFEQLARSFRFE
Ga0209846_103719813300027277Groundwater SandMLQLWRDGALIIVLATLILSGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSFGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYGLKDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDITRGSMHWTVLGFYGS
Ga0209842_106236213300027379Groundwater SandMLQLWRDGALIIGLATLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDLRRGSMHWTVLGFYGSSGMVTVAHVGV
Ga0209843_101048233300027511Groundwater SandMLQLWRDGALIIVLATLILSGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSFGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDLRRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0209887_101497523300027561Groundwater SandMLQLWRDGALIIGLATLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYGLKDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDITRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0209874_103116723300027577Groundwater SandMLQLWRDGALIIVLATLILSGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSFGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYGLKDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDLRRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0209180_1000131313300027846Vadose Zone SoilMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSEGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0209701_1000217763300027862Vadose Zone SoilMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0209814_1001355743300027873Populus RhizosphereMLAGCAPVPKAEYWQGYFRHLHPPRYMFQVPDGWREATISDYPSLGFNRGVFERLDEAGRSAAMQRGELDMQSRDTGLVSSRGAWIQVASTVGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLTQAAPAGEKPNLTLQSLDVVEDYWVKNVLRLRFTADVPRGSMRWTVLEFYGSSGVVTVAHVGIPEDSGEGIAGLDVLAQSFRFQ
Ga0209481_1001875623300027880Populus RhizosphereVRDRRSLVALTTLMLAGCAPVPKAEYWQGYFRHLHPPRYMFQVPDGWREATISDYPSLGFNRGVFERLDEAGRSAAMQRGELDMQSRDTGLVSSRGAWIQVASTVGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLTQAAPAGEKPNLTLQSLDVVEDYWVKNVLRLRFTADVPRGSMRWTVLEFYGSSGVVTVAHVGIPEDSGEGIAGLDVLAQSFRFQ
Ga0209590_1007546433300027882Vadose Zone SoilLARRLRGAMTWLRRQLEELGRRGVVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQTVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0209488_1004282163300027903Vadose Zone SoilMQPPKPHAVETDPDSSLPLRGARVGGMSPRVRVGEHGQGATWLCRQLEGLGEGKGVKLWRNWALTVVLSTLMLPGCAPVPKAEYWDGYFRHLHPPRYTFQVPDGWRQATTSDYPLLGFNRRLFQTLDEAGRTAAMQRAEPEMQSGDTGLISSRGAWIQVASAGGAGGWYTFKDLRFGLGDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYRVKSVLRLRFTADGSRGSMHWTVLEFYGSSGVVTVAHVGIPEDRGEGIAGLEVLANSFRFE
Ga0209488_1019346513300027903Vadose Zone SoilRMEDSRVFADRSLFSSAGFVVAAGLHGLSGDRFARVIALRTPKKTADCRRSQAGARRLRGAMTWLRRQLEELGRRGVVMLQLWRDGALIVVLATLILPGCAPLPKVEYWDGYFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDIQRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0209889_106043923300027952Groundwater SandMLQLWRDGALIIVLATLILSGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDLRRGSMHWTVLGFYGSSGMVTVAHVGVPEDREEGIAGLEVIAQSFRFQ
Ga0209853_110930413300027961Groundwater SandMLQLWRDGALIIGLATLILPGCAPLPKVEYWDGFFRHLHHPRYSFRVPDGWRQATISDYPSLGFNRRLFQTLDEAGRSVFLQRAELEMQGRDTVLISSQGAWIQVGSEAGPGGWYSLRDRPGFGLSEREQQAVWQRLSTGLIQTAPPADKPKLSLESMELADYGENRVLRLRFRSDLRRGSMHWTVLGFYGSSGM
Ga0137415_1046943023300028536Vadose Zone SoilRLRFWRDWAFIVVPVLAGCAPFPKAEYWDGFFRHLYPPRYTFQAPDGWRQARMSDYPLLGFNRRLFQTLDEAGRKAAMQRAELEMQSGDTGLISSRGAWIQVRSQSSPGGWYTFRDLRFGLGDREKQAIWQRLAARLIQTAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMRWTALEFYGSSGVVTVAHVGIPEDRDEGIAGLEAIARSFRF
Ga0307504_1000383113300028792SoilVKLWPNWALTVVVLSTLTLPGCAPLPKVEYWDGFFRHLHPPQYMFQVPDGWRQATMSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQAQSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQTAPPGEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAHVGIPED
(restricted) Ga0255311_102090133300031150Sandy SoilVKLWRNWALTVVVLSTLTLPGCAPLPKVEYWDGFFRHLHPPQYMFQVPDGWRQATMSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDTGLISSRGAWIQAQSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQTAPPGEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTVLEFYGSSGVVTVAYVGIPEDREEGLAGLEVLARSFRFF
Ga0307468_10082563613300031740Hardwood Forest SoilLSGLGIHRRASAGELCALTQGRVLGWILPPPPPRYTFQVPDDWRQATISDYPSLGFNRHLFQTLDEAGRSATRQRAELEMQGRDTGLISSGGAWIQVASAAGAGGWYRFKDLRFGLGDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYRVKNVLRLRFTADGARGSMHWTVLEFYGSSGVVNVAH
Ga0307473_1011987233300031820Hardwood Forest SoilMFRFWRKWASILVLATSALPGCAPLPKAEYWDGYFRHLHPPRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAAMQRAELEMQGRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVFEFYGSSGAVNVAHVGIPEDSGEGIAG
Ga0307473_1040098423300031820Hardwood Forest SoilPRYTFQVPDGWRQATISDYPSLGFNRHVFETLDQAGRSAAMQRAELEMQGRDTGLISSRGAWIQVTSAGGVGGWYTFKDLRFGLSEQEKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYRVKSVLRLRFTADGPRGSMHWTVLEFYSSSGVVTVAHVGIPEDRGEGIAGLEALAYSFRFE
Ga0307473_1151425913300031820Hardwood Forest SoilFRHLHPPQYMFQVPDGWRPVTTSDYPLLGFNRRLFQTLDEAGRTAAMQRAELEMQSGDAGLISSRGAWIQVRSQSGPGGWYTFKDLRFGLSDREKQAIWQRLAASLIQAAPPAEKPNLTLESLDVVEDYDVRNVLRLRFTADVPRGSMHWTILEFYGSSGVVTVAHVGI
Ga0307471_10008953713300032180Hardwood Forest SoilRYTFQVPDGWRPATISDYPSLGFNRRFFETLDEAGRSAAMQRAELEMQGRDAGLVSSRGAWIQVQSAAGAGGWYTFRDLRFGLGDREKQAIWQRVSTKLIQAAPPGERPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVFEFYGSSGAVNVAHVGIPEDSGEGIAGLEVLARSFRFE
Ga0307471_10073615123300032180Hardwood Forest SoilMFQSWRKWASIVVLAASTLPGCAPLPRAEYWDGYFRHLDPPRYTFQVPDDWRPATISDYPSLGFNQRFFQTLDETSRSAAMQRAELEIQGRDAALISSRGAWIQVQSAVGVGGWYTFRDLRFGLSDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSLDVVEDYRVKGVLRLRFTADGPRGSMHWTVLEFYGSSGVVNVAHVGIPEDSGEGIAGLEVLARSFRFE
Ga0307472_10019470543300032205Hardwood Forest SoilMLRIRRDWASIVLATLMLPGCAPLPKGEYWDGYFRHLHPPRYTFQVPDGWRQATISDYPALGFNRRLFQTLDEAGRSAAMQGAEQEMQSPDTGLISSRGAWIQVTSAGGVGGWYTSKDLRFGLSDREKQAIWQRLSTNLIQAAPPAEKPNLTLQSMDVVEDYRVKSALRLRFTADGPRGSMHWTVLEFYSSSGVVTVAHVGIPEDSGEGIAGLEVLARSFRFE


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.