NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F047739

Metagenome Family F047739

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F047739
Family Type Metagenome
Number of Sequences 149
Average Sequence Length 170 residues
Representative Sequence DRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Number of Associated Samples 120
Number of Associated Scaffolds 149

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.75 %
% of genes near scaffold ends (potentially truncated) 89.26 %
% of genes from short scaffolds (< 2000 bps) 84.56 %
Associated GOLD sequencing projects 102
AlphaFold2 3D model prediction Yes
3D model pTM-score0.39

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (89.933 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(28.859 % of family members)
Environment Ontology (ENVO) Unclassified
(46.980 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(57.047 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 28.13%    β-sheet: 12.50%    Coil/Unstructured: 59.37%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.39
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 149 Family Scaffolds
PF00912Transgly 0.67
PF03741TerC 0.67
PF08450SGL 0.67
PF12695Abhydrolase_5 0.67
PF00753Lactamase_B 0.67

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 149 Family Scaffolds
COG0744Penicillin-binding protein 1B/1F, peptidoglycan transglycosylase/transpeptidaseCell wall/membrane/envelope biogenesis [M] 0.67
COG0861Tellurite resistance membrane protein TerCInorganic ion transport and metabolism [P] 0.67
COG3386Sugar lactone lactonase YvrECarbohydrate transport and metabolism [G] 0.67
COG3391DNA-binding beta-propeller fold protein YncEGeneral function prediction only [R] 0.67
COG4953Membrane carboxypeptidase/penicillin-binding protein PbpCCell wall/membrane/envelope biogenesis [M] 0.67
COG5009Membrane carboxypeptidase/penicillin-binding proteinCell wall/membrane/envelope biogenesis [M] 0.67


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms89.93 %
UnclassifiedrootN/A10.07 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001305|C688J14111_10007286All Organisms → cellular organisms → Bacteria3092Open in IMG/M
3300002568|C688J35102_118241108All Organisms → cellular organisms → Bacteria541Open in IMG/M
3300002907|JGI25613J43889_10153820All Organisms → cellular organisms → Bacteria598Open in IMG/M
3300002911|JGI25390J43892_10121424All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Saprospiria → Saprospirales → Lewinellaceae → Lewinella → Lewinella cohaerens597Open in IMG/M
3300004479|Ga0062595_100195409All Organisms → cellular organisms → Bacteria1244Open in IMG/M
3300005171|Ga0066677_10653272All Organisms → cellular organisms → Bacteria592Open in IMG/M
3300005176|Ga0066679_10260079All Organisms → cellular organisms → Bacteria1120Open in IMG/M
3300005177|Ga0066690_10207158All Organisms → cellular organisms → Bacteria1306Open in IMG/M
3300005177|Ga0066690_10700344All Organisms → cellular organisms → Bacteria670Open in IMG/M
3300005179|Ga0066684_10036550All Organisms → cellular organisms → Bacteria2723Open in IMG/M
3300005187|Ga0066675_10609737All Organisms → cellular organisms → Bacteria820Open in IMG/M
3300005187|Ga0066675_11155022All Organisms → cellular organisms → Bacteria576Open in IMG/M
3300005332|Ga0066388_108386165All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300005340|Ga0070689_101514862All Organisms → cellular organisms → Bacteria608Open in IMG/M
3300005345|Ga0070692_11344296All Organisms → cellular organisms → Bacteria514Open in IMG/M
3300005450|Ga0066682_10028822All Organisms → cellular organisms → Bacteria3230Open in IMG/M
3300005451|Ga0066681_10673013All Organisms → cellular organisms → Bacteria632Open in IMG/M
3300005454|Ga0066687_10245834All Organisms → cellular organisms → Bacteria996Open in IMG/M
3300005454|Ga0066687_10603923All Organisms → cellular organisms → Bacteria652Open in IMG/M
3300005458|Ga0070681_10636023All Organisms → cellular organisms → Bacteria982Open in IMG/M
3300005459|Ga0068867_102017152All Organisms → cellular organisms → Bacteria545Open in IMG/M
3300005537|Ga0070730_10584684All Organisms → cellular organisms → Bacteria713Open in IMG/M
3300005540|Ga0066697_10127601All Organisms → cellular organisms → Bacteria1494Open in IMG/M
3300005549|Ga0070704_100852422All Organisms → cellular organisms → Bacteria817Open in IMG/M
3300005554|Ga0066661_10781660All Organisms → cellular organisms → Bacteria559Open in IMG/M
3300005555|Ga0066692_10113342All Organisms → cellular organisms → Bacteria1625Open in IMG/M
3300005557|Ga0066704_10835280All Organisms → cellular organisms → Bacteria571Open in IMG/M
3300005558|Ga0066698_10363405All Organisms → cellular organisms → Bacteria999Open in IMG/M
3300005559|Ga0066700_10935208All Organisms → cellular organisms → Bacteria574Open in IMG/M
3300005561|Ga0066699_10399136All Organisms → cellular organisms → Bacteria984Open in IMG/M
3300005575|Ga0066702_10515597All Organisms → cellular organisms → Bacteria726Open in IMG/M
3300005575|Ga0066702_10846754All Organisms → cellular organisms → Bacteria545Open in IMG/M
3300005575|Ga0066702_10938151All Organisms → cellular organisms → Bacteria517Open in IMG/M
3300005587|Ga0066654_10667096All Organisms → cellular organisms → Bacteria580Open in IMG/M
3300005615|Ga0070702_101141877All Organisms → cellular organisms → Bacteria625Open in IMG/M
3300005993|Ga0080027_10196159All Organisms → cellular organisms → Bacteria787Open in IMG/M
3300006237|Ga0097621_101438092All Organisms → cellular organisms → Bacteria653Open in IMG/M
3300006755|Ga0079222_11218120All Organisms → cellular organisms → Bacteria676Open in IMG/M
3300006791|Ga0066653_10253465All Organisms → cellular organisms → Bacteria896Open in IMG/M
3300006800|Ga0066660_11488594All Organisms → cellular organisms → Bacteria532Open in IMG/M
3300006806|Ga0079220_10573270All Organisms → cellular organisms → Bacteria794Open in IMG/M
3300006852|Ga0075433_10001687All Organisms → cellular organisms → Bacteria16447Open in IMG/M
3300006854|Ga0075425_100578634All Organisms → cellular organisms → Bacteria1292Open in IMG/M
3300006880|Ga0075429_101252003All Organisms → cellular organisms → Bacteria647Open in IMG/M
3300006903|Ga0075426_10175949All Organisms → cellular organisms → Bacteria1548Open in IMG/M
3300006904|Ga0075424_101646415All Organisms → cellular organisms → Bacteria680Open in IMG/M
3300007255|Ga0099791_10341385All Organisms → cellular organisms → Bacteria717Open in IMG/M
3300007265|Ga0099794_10513634All Organisms → cellular organisms → Bacteria631Open in IMG/M
3300007265|Ga0099794_10686256All Organisms → cellular organisms → Bacteria545Open in IMG/M
3300009012|Ga0066710_100713187All Organisms → cellular organisms → Bacteria1530Open in IMG/M
3300009012|Ga0066710_103655079All Organisms → cellular organisms → Bacteria579Open in IMG/M
3300009012|Ga0066710_104542500All Organisms → cellular organisms → Bacteria519Open in IMG/M
3300009038|Ga0099829_11593512All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300009137|Ga0066709_100657650All Organisms → cellular organisms → Bacteria1501Open in IMG/M
3300009137|Ga0066709_101005795All Organisms → cellular organisms → Bacteria1221Open in IMG/M
3300009137|Ga0066709_101387391All Organisms → cellular organisms → Bacteria1023Open in IMG/M
3300009137|Ga0066709_101986585All Organisms → cellular organisms → Bacteria806Open in IMG/M
3300009137|Ga0066709_103472914All Organisms → cellular organisms → Bacteria572Open in IMG/M
3300009137|Ga0066709_104414043All Organisms → cellular organisms → Bacteria513Open in IMG/M
3300009143|Ga0099792_10335479All Organisms → cellular organisms → Bacteria908Open in IMG/M
3300009162|Ga0075423_11600999All Organisms → cellular organisms → Bacteria700Open in IMG/M
3300010046|Ga0126384_10815740All Organisms → cellular organisms → Bacteria836Open in IMG/M
3300010159|Ga0099796_10581339All Organisms → cellular organisms → Bacteria504Open in IMG/M
3300010303|Ga0134082_10129330All Organisms → cellular organisms → Bacteria1013Open in IMG/M
3300010320|Ga0134109_10447698All Organisms → cellular organisms → Bacteria524Open in IMG/M
3300010321|Ga0134067_10048211All Organisms → cellular organisms → Bacteria1365Open in IMG/M
3300010336|Ga0134071_10519341All Organisms → cellular organisms → Bacteria616Open in IMG/M
3300010358|Ga0126370_12016211All Organisms → cellular organisms → Bacteria564Open in IMG/M
3300010373|Ga0134128_13233697All Organisms → cellular organisms → Bacteria500Open in IMG/M
3300012202|Ga0137363_10344863All Organisms → cellular organisms → Bacteria1231Open in IMG/M
3300012203|Ga0137399_10061595All Organisms → cellular organisms → Bacteria2778Open in IMG/M
3300012203|Ga0137399_10776662All Organisms → cellular organisms → Bacteria807Open in IMG/M
3300012205|Ga0137362_10669399All Organisms → cellular organisms → Bacteria893Open in IMG/M
3300012361|Ga0137360_10915868All Organisms → cellular organisms → Bacteria756Open in IMG/M
3300012362|Ga0137361_10978892All Organisms → cellular organisms → Bacteria765Open in IMG/M
3300012917|Ga0137395_10230183All Organisms → cellular organisms → Bacteria1298Open in IMG/M
3300012917|Ga0137395_10878412All Organisms → cellular organisms → Bacteria649Open in IMG/M
3300012922|Ga0137394_11229320All Organisms → cellular organisms → Bacteria613Open in IMG/M
3300012923|Ga0137359_11062933All Organisms → cellular organisms → Bacteria693Open in IMG/M
3300012927|Ga0137416_11279878All Organisms → cellular organisms → Bacteria662Open in IMG/M
3300012927|Ga0137416_11850269All Organisms → cellular organisms → Bacteria552Open in IMG/M
3300012929|Ga0137404_10144927All Organisms → cellular organisms → Bacteria1969Open in IMG/M
3300012964|Ga0153916_12453918All Organisms → cellular organisms → Bacteria587Open in IMG/M
3300012975|Ga0134110_10279825All Organisms → cellular organisms → Bacteria716Open in IMG/M
3300012975|Ga0134110_10520377All Organisms → cellular organisms → Bacteria543Open in IMG/M
3300014154|Ga0134075_10184502All Organisms → cellular organisms → Bacteria895Open in IMG/M
3300014157|Ga0134078_10081753All Organisms → cellular organisms → Bacteria1178Open in IMG/M
3300015053|Ga0137405_1408760All Organisms → cellular organisms → Bacteria2611Open in IMG/M
3300015264|Ga0137403_10683078All Organisms → cellular organisms → Bacteria887Open in IMG/M
3300015356|Ga0134073_10297200All Organisms → cellular organisms → Bacteria575Open in IMG/M
3300018468|Ga0066662_10214027All Organisms → cellular organisms → Bacteria1538Open in IMG/M
3300018468|Ga0066662_11715286All Organisms → cellular organisms → Bacteria657Open in IMG/M
3300020170|Ga0179594_10064454All Organisms → cellular organisms → Bacteria1265Open in IMG/M
3300020170|Ga0179594_10091721All Organisms → cellular organisms → Bacteria1083Open in IMG/M
3300025899|Ga0207642_10868408All Organisms → cellular organisms → Bacteria577Open in IMG/M
3300025912|Ga0207707_10223087All Organisms → cellular organisms → Bacteria1640Open in IMG/M
3300025931|Ga0207644_10651158All Organisms → cellular organisms → Bacteria877Open in IMG/M
3300026089|Ga0207648_11989717All Organisms → cellular organisms → Bacteria542Open in IMG/M
3300026274|Ga0209888_1061861All Organisms → cellular organisms → Bacteria692Open in IMG/M
3300026308|Ga0209265_1046287All Organisms → cellular organisms → Bacteria1346Open in IMG/M
3300026308|Ga0209265_1077608All Organisms → cellular organisms → Bacteria956Open in IMG/M
3300026313|Ga0209761_1207001All Organisms → cellular organisms → Bacteria850Open in IMG/M
3300026315|Ga0209686_1085407All Organisms → cellular organisms → Bacteria1112Open in IMG/M
3300026328|Ga0209802_1232633All Organisms → cellular organisms → Bacteria658Open in IMG/M
3300026330|Ga0209473_1110647All Organisms → cellular organisms → Bacteria1143Open in IMG/M
3300026332|Ga0209803_1074748All Organisms → cellular organisms → Bacteria1430Open in IMG/M
3300026334|Ga0209377_1053246All Organisms → cellular organisms → Bacteria1801Open in IMG/M
3300026335|Ga0209804_1108062All Organisms → cellular organisms → Bacteria1272Open in IMG/M
3300026527|Ga0209059_1218935All Organisms → cellular organisms → Bacteria639Open in IMG/M
3300026542|Ga0209805_1141703All Organisms → cellular organisms → Bacteria1115Open in IMG/M
3300026547|Ga0209156_10403828All Organisms → cellular organisms → Bacteria571Open in IMG/M
3300026548|Ga0209161_10024882All Organisms → cellular organisms → Bacteria4226Open in IMG/M
3300026548|Ga0209161_10143237All Organisms → cellular organisms → Bacteria1388Open in IMG/M
3300026550|Ga0209474_10214773All Organisms → cellular organisms → Bacteria1216Open in IMG/M
3300026552|Ga0209577_10425105All Organisms → cellular organisms → Bacteria942Open in IMG/M
3300026557|Ga0179587_11037299All Organisms → cellular organisms → Bacteria540Open in IMG/M
3300027603|Ga0209331_1102267All Organisms → cellular organisms → Bacteria700Open in IMG/M
3300027643|Ga0209076_1209637All Organisms → cellular organisms → Bacteria532Open in IMG/M
3300027667|Ga0209009_1060157All Organisms → cellular organisms → Bacteria952Open in IMG/M
3300027671|Ga0209588_1017867All Organisms → cellular organisms → Bacteria2203Open in IMG/M
3300027671|Ga0209588_1034939All Organisms → cellular organisms → Bacteria1615Open in IMG/M
3300027671|Ga0209588_1212459All Organisms → cellular organisms → Bacteria600Open in IMG/M
3300027857|Ga0209166_10673067All Organisms → cellular organisms → Bacteria522Open in IMG/M
3300027909|Ga0209382_11418030All Organisms → cellular organisms → Bacteria697Open in IMG/M
3300028381|Ga0268264_10836457All Organisms → cellular organisms → Bacteria921Open in IMG/M
3300031720|Ga0307469_10631885All Organisms → cellular organisms → Bacteria962Open in IMG/M
3300031720|Ga0307469_10860778All Organisms → cellular organisms → Bacteria836Open in IMG/M
3300031720|Ga0307469_11511988All Organisms → cellular organisms → Bacteria643Open in IMG/M
3300031740|Ga0307468_102256792All Organisms → cellular organisms → Bacteria528Open in IMG/M
3300031820|Ga0307473_10150651All Organisms → cellular organisms → Bacteria1322Open in IMG/M
3300031938|Ga0308175_103213861All Organisms → cellular organisms → Bacteria507Open in IMG/M
3300032205|Ga0307472_100263255All Organisms → cellular organisms → Bacteria1361Open in IMG/M
3300032954|Ga0335083_10687905All Organisms → cellular organisms → Bacteria832Open in IMG/M
3300033480|Ga0316620_11259159All Organisms → cellular organisms → Bacteria726Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil28.86%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil22.15%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil10.07%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil6.71%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere5.37%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.03%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil2.68%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.01%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere2.01%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.34%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.34%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.34%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.34%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.34%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere1.34%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.34%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.67%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.67%
Permafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost Soil0.67%
Prmafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Prmafrost Soil0.67%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.67%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.67%
Miscanthus RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere0.67%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.67%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001305Grasslands soil microbial communities from Hopland, California, USAEnvironmentalOpen in IMG/M
3300002568Grasslands soil microbial communities from Hopland, California, USA - 2EnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005340Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaGEnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005365Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3H metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005615Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaGEnvironmentalOpen in IMG/M
3300005993Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 1 DNA2013-046EnvironmentalOpen in IMG/M
3300006237Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)Host-AssociatedOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012964Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 4 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025931Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026274Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil DNA_2013-059 (SPAdes)EnvironmentalOpen in IMG/M
3300026308Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027603Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031938Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032954Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2EnvironmentalOpen in IMG/M
3300033480Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_BEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
C688J14111_1000728643300001305SoilKERREQAGLPAQRLPDDPTSLLGNSSRATAEQVGAYLHRKLFAHDGTCMLSDTGALLAQRRKEGTLRWLAQRWPKLLFAGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLDGPLPIGLRGSVLLRGIDAYLRELGRLQRSPTPALLPEWAQEPDPTVTAEVRP*
C688J35102_11824110813300002568SoilLFRYLKTRREEAGLPAQRLPDDPTSLLGNSSRATAEQIGTYLHRKLFAGDGTCALSDTGALLALHRKEGTLRWLAFRWPKLVFAGKTGSSPHDDSAVAALALCLDARPVVLVAALRPIEGALPTGLRGSVLLRGMDAYLKELSRLQRRPTAALLPAWTQPPDSSIPVEMNP*
JGI25613J43889_1015382023300002907Grasslands SoilLLGNSSRATAEQIGGYLHRKLLAGEGACALTDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIALCLDARPVVLVAALRPLQGPLPDGLHGSVLLRGLDAYLRELRRLDRRVSSAALPAWAEEIDAPAEGRDPISVAATPVEKEER*
JGI25390J43892_1012142413300002911Grasslands SoilEAIAPELSYSAAGVALFRYLRDRRELAGLPXDRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP*
Ga0062595_10019540933300004479SoilPELSYSAAGVALFRYLRDRRERAGLPAGRLPEDPTSLLGNSSRATAEEIGAYLHRKLFRNDGTCALSDTGALLALHRKEGTLRWLAQRWPKLVFSGKTGSSPHDDSALAAVGLCLDARPVVLIAALRPLQAPLPDGLQGSVLLRGIDAYLRELSRLDRKPAPAILPAWADPEAQLPAEGNP*
Ga0066677_1065327213300005171SoilTVQPDEIASDLSYSAASVALFRHLKSRRERGGLPASMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLALRWPKLVFTGKTGSSPHDDSAVAAVALCLDARPVVAVSALRALAGPLPEGLRGSVLLRGIDAYLKQLARLERKPSSTLWPSFVTDEAPAEELTVEAKR*
Ga0066679_1026007913300005176SoilLLGNSSRATAEQVGGYLHRKLFVNDGSCTLSDTGALLALRRRDGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAAGICLDARPVILVAALRPLQDQLPTGLRGSVLLRGMDAYLRELSRLERRLTSAVLPGWTEEEMAGTAAAVPLASEARVAEEKR*
Ga0066690_1020715833300005177SoilGEIVPPDQVGPDLSYTAAGVALFRFLKARREEAGLPAALLPDDPTSLLGNSSRATAEQIGHYLHRKLFSDSSCVLSDAGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSALAAVGICLDARPVVLVAALRPLQPPLPKGLHGSVLLRGLDAYLRELVRLERRPTSLMLPAWAEENIAEPSAVAAGGIPSFAGDGEKR*
Ga0066690_1070034413300005177SoilVSARERASLDSPADRALAGDLLAQLGGAMPPEAIAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP*
Ga0066684_1003655043300005179SoilLPAERLPDDPTSLLGNSSRATPEQIAAYLHRKLFLKDGSCTLTDTGALLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLLGALPLGLRGSVVLRGLDSYLKALVRLDRRPASALWPAWVEEELAAKQASAPLEPSRVESALATKGKP*
Ga0066675_1060973713300005187SoilEQIGAYLHRKLFAGDGTCTLSDTGALLALHRREGTLRWLALRWPKLVFSGKTGSSPHDDSAVAAVALCLDARPVVLAAGLRPLQGTLPTGLRGSVLLRGLDAYLHELVRLQRPPTPALWPAWAEEDLAAAAQPPETPSLAAEEKR*
Ga0066675_1115502213300005187SoilERAGLPSERLPDDPTSLLGNSSRATVEQIGAYLHRKLFAGDGTCTLSDTGALLALHRREGTLRWLALRWPKLVFSGKTGSSPHDDSAVAAVALCLDARPVVLAAGLRPLQGSLPTGLRGSVLLRGLDAYLRELRRLDRRVTSVALPAWAEEIEAPAAIPATISAAAMPVEKEER*
Ga0066388_10838616513300005332Tropical Forest SoilIGGYLHRKLFVQDGSCTLSDTGALLALHRREGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAIAVCLDARPVVLVAALRPLEGTLPVGLRGSIVLRGLDAYMKELVRLQRRPTSALLPDWAQDETQAGLLPIAPLTLATGGSP*
Ga0070689_10151486213300005340Switchgrass RhizosphereEVAAEVGREERAALDSPADRALAGDLLAQLGAPVPPDAIPEELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAA
Ga0070692_1134429613300005345Corn, Switchgrass And Miscanthus RhizosphereSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPSWAQADPQTPVEALP*
Ga0070688_10154205413300005365Switchgrass RhizosphereSRATAEQIGTYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPSWAQADPQTPVEALP*
Ga0066689_1100962413300005447SoilSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLALRWPRLVFTGKTGSSPHDDSAVAAVALCLDARPVVAVSALRALAGPLPEGLRGSVLLRGIDAYLKELSRLERKPSSALWPSFIADEAPAEELTVEAKR*
Ga0066682_1002882213300005450SoilAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0066681_1067301313300005451SoilIASDLSYSAASVALFRHLKSRRERGGLPASMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLALRWPKLVFTGKTGSSPHDDSAVAAVALCLDARPVVAVSALRALAGPLPEGLRGSVLLRGIDAYLKELSRLERKPSSALWPSFIADEAPAEELTVEAKR*
Ga0066687_1024583413300005454SoilLDSPADRALAGNLLGQLGEVVRPEEVPPEISYSSASIALFRHLKMRRERAGLPSERLPDDPTSLLGNSSRATVEQIGAYLHRKLFAGDGTCTLSDTGALLALHRREGTLRWLALRWPKLVFSGKTGSSPHDDSAVAAVALCLDARPVVLAAGLRPLQGTLPTGLRGSVLLRGLDAYLHELVKLQRPPAPALWPAWAEEELAAAAQPPETPSLAVEEKR*
Ga0066687_1060392313300005454SoilAAVALFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLTDTGALLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVGICLDQRPVVLVGALRPLLGALPLGLRGSVVLRGLDSYLKALVRLDRRPASALWPAWVEEELAAKQASAPLEPSRVESALATKGKP*
Ga0070681_1063602323300005458Corn RhizosphereRDRRERAGLPAGRLPEDPTSLLGNSSRATPEEIGAYLHRKLFRNDGTCALSDTGALLALHRKEGTLRWLAQRWPKLVFSGKTGSSPHDDSALAAVGLCLDARPVVLIAALRPLQAPLPDGLQGSVLLRGIDAYLRELSRLDRKPAPAILPAWADPEAQLPAEANP*
Ga0068867_10201715213300005459Miscanthus RhizosphereADRALAGDLLAQLGAPVPPDAIPGELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLD
Ga0070730_1058468423300005537Surface SoilALFRYLKARRERAGLPAQGLPDDPTSLLGNSSRATVEQIGAYLHRKLFSGDGTCTLSDTGALLSLHRRDGTLRWLAQRWPKLVFSGKTGSSPHDDSAIAAVALCLDARPVVLVAGLRPPHGALPEGLRGSLVLRGLDAYLRELQRLERAPTSALWPGWAEEELAKAQPTTEPQLISTAPLAAKEQP*
Ga0066697_1012760113300005540SoilNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALVGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPAEKEER*
Ga0070704_10085242223300005549Corn, Switchgrass And Miscanthus RhizosphereLAGDLLAQLGAPVPPDAIPGELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPAWAEADPQAPVEASP*
Ga0066661_1078166013300005554SoilLSYSAAGVALFRYLRDRRERVGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP*
Ga0066692_1011334213300005555SoilSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKLVFSGKTGSSPHDDAALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0066704_1083528023300005557SoilDRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP*
Ga0066698_1036340513300005558SoilVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP*
Ga0066700_1093520813300005559SoilPDDPTSLLGNSSRATAEQVGGYLHRKLFVNDGSCTLSDTGALLALRRRDGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAAGICLDARPVILVAALRPLQDQLPTGLRGSVLLRGMDAYLRELSRLERRLTSAVLPGWTEEEMAGTAAAVPLASEARVAEEKR*
Ga0066670_1085290613300005560SoilSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLALRWPKLVFTGKTGSSPHDDSAVAAVALCLDARPVVAVSALRALAGPLPEGLRGSVLLRGIDAYLKELSRLERKPSSALWPSFIADEAPAEELTVEAKR*
Ga0066699_1039913613300005561SoilDLSYSAAGVALFRYLKKSREAAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLEGSLPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVEEEIAAKQAALPIDPPRVESALATKGKP*
Ga0066703_1050673823300005568SoilSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP*
Ga0066702_1051559723300005575SoilERLPDDPTSLLGNSSRATGEQIGAYLMTKLFAGDGSCALSDTGALLGLHRRDGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVAICLDARPVVLVAALRPLQPPLPKGLHGSVLLRGLDAYLRELVRLERRPTSLMLPAWAEENIAEPSAVAAGGIPSFAGDGEKR*
Ga0066702_1084675413300005575SoilAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP*
Ga0066702_1093815113300005575SoilGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0066654_1066709613300005587SoilAQLGETLAPDDVPQDLSYSAAGVALFRYLKKSREAAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLEGSLPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVEEEIAAKQAA
Ga0070702_10114187713300005615Corn, Switchgrass And Miscanthus RhizosphereFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPSWAQADPQTPVEALP*
Ga0080027_1019615923300005993Prmafrost SoilAPDAISPALSYSAAGVALFRTLKQRRELAGLPANSLPDDPTQLLGNDSRATVEQIGSYLHKKLFLGDGSCTLSDTGALLALHRREGTLRWLAQRWPKLIFTGKTGSSPHDDSAVAAVAICLDTRPVVLVAALRPLQGALPQGLRGSVVLRGLDSYLRELSRLERRPNSAELPPWAVLATAVEAVQ*
Ga0097621_10143809223300006237Miscanthus RhizosphereLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPAWAEADPQAPVEASP*
Ga0079222_1121812013300006755Agricultural SoilERAGLPAARLPDDPTSLLGNSSRATAEQIGMYVHRKLFENDGTCSLRDTGALLALHRSEGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPIQGELPPGLHGSLLLRGMDAYLKELAHLDRLVTPALLPAWTEQGRGSGAEEATQ*
Ga0066653_1025346523300006791SoilRASLDSPADRALAGDLFAQVGQTMSPDEVASDLSYSAAGVALFRYLKARRERAGLPAERLPDDPTSLLGNSSRAAAEQIGGYLHRKLFANDGSCALSDTGALIALHRREGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAVGICLDARPVLLVAALRPVQGTLPAGLHGSVLLRGLDAYLRELGRLERRPTSLQLPAWAEEEEAPSIPAVAAEEKR*
Ga0066660_1148859413300006800SoilPEEVPPEISYSAASIALFRHLKMRRERAGLPSERLPDDPTSLLGNSSRATVEQIGAYLHRKLFAGDGTCTLSDTGALLALHRREGTLRWLALRWPKLVFSSKTGSSPHDDSAVAAVALCLDARPVVLAAGLRPLQGTLPTGLRGSVLLRGLDAYLHELVKLQRPPAPALWPAWTEEE
Ga0079220_1057327023300006806Agricultural SoilATAEQIGAYVHRKLFAHDGTCALRDTGALLAMHRFEGTLRWLAQRWPKLVLSGKTGSSPHDDSAVAAVGVCLDARPVVLVAALRPIEGHLPMGLHGSILLRGIDAYLRELGRLQRRAASALLPSWAEEPEPALTAQVKP*
Ga0075433_10001687143300006852Populus RhizosphereLGETVAPDQVAPELSYSAAGVALFRHLRARRELAGLPAARLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLIFAGKTGSSPHDDSAVAGIALCVDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRDLRRLDRRVTSLALPAWAEEIVAPQSVSVAATPLEEER*
Ga0075425_10057863413300006854Populus RhizospherePEQVPPELSYTAAGVALFRHLRARRQLAGLPADRLPDDPTSLLGNSSRATAEQIGGYLHRKLLASDGSCTLSDTGGLLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAVCVDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRLTPVALPAWAAETLAPAAAPEAISAPAAPVEEER*
Ga0075429_10125200313300006880Populus RhizosphereQLGETLRPEEVAPELSYSTAGVALFRHLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAACLDERPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVASLALPAWAEDTVAPPPVPETISTAAMPAEKEER*
Ga0075426_1017594913300006903Populus RhizospherePELSYSAAGVALFRHLRARRELAGLPAARLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLIFAGKTGSSPHDDSAVAGIALCVDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRDLRRLDRRVTSLALPAWAEEIVAPQSVSVAATPLEEER*
Ga0075424_10164641513300006904Populus RhizosphereAGVALFRHLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAACFDERPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVASLVLPAWAEDTVAPPPVPETISTAAMPAEKEER*
Ga0099791_1034138513300007255Vadose Zone SoilREQAGLPAERLPEDPTSLLGNSSRATPEQIGAYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLQAPLPDGLQGSVLLRGIDAYLKELARLQRTPTSALLPPWADAAAELSAEARP*
Ga0099793_1069387123300007258Vadose Zone SoilADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIALCLDARPVVLVAALRPLQGTLPDGLHGSVLLRGLDAYLRELRRLDRRVSSAALPAWAEEIDAPAGGRDPISVAATPAEKEER*
Ga0099794_1051363413300007265Vadose Zone SoilDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGTCALSDTGALLALRRKEGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAVGVCLDARPVVLVAALRPLSGVLPTGLRGSVVLRGIDAYLKEMVRLERPPTPALMPSWVEEENGPEHPLAAEVGP*
Ga0099794_1068625613300007265Vadose Zone SoilELSYSAAAVALFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGGYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVFSGKTGSSPHDDSALAAVGLCLDARPVVLIGALRPIEPPLPDGLQGSVLLRGIDAYFKELVRLQRKPGSALLPAWAEPQPALAVEAKP
Ga0066710_10071318733300009012Grasslands SoilFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0066710_10365507913300009012Grasslands SoilLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDAALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0066710_10454250013300009012Grasslands SoilSLLGNSSRATAEQIGGYLHRKLLAGDGSCTLSDTGALLALHRRVGTLRWLAWRWPKVVFAGKTGSSPHDDSAVAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYFKELRRLDRRVASVALPAWADEFQAPAAAPESVSVAATPAEEER
Ga0099829_1159351213300009038Vadose Zone SoilPAERLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGTCALSDTGALLALRRKEGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLSGVLPTGLRGSVVLRGIDAYLKEMVRLERPPTPALMPSWVEEENGPEHPLAAEVGP*
Ga0075418_1301039613300009100Populus RhizosphereKLFVNDGTCTLSDTGALLALHRREGTLRWLAYRWPKIVFSGKTGSSPHDDSALTAVGLCLDSRPVVLVAALRPIQPPLPDGLHGSVLLRGIDAYLKELVRLQRRPTSALLPQWACAADPSQSCPVETVEARP*
Ga0066709_10065765033300009137Grasslands SoilALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDAALAAIGLCLDARPVILVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQESPIALEAKP*
Ga0066709_10100579513300009137Grasslands SoilLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP*
Ga0066709_10138739133300009137Grasslands SoilFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPPWAEQDAQLAVEARP*
Ga0066709_10198658523300009137Grasslands SoilRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFTGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0066709_10347291413300009137Grasslands SoilLGQTLRPDQVAPELSYSTAGVALFRHLRARREMAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAGDGSCTLSDTGALLALHRRVGTLRWLAWRWPKVVFAGKTGSSPHDDSAVAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYFKELRRLDRRVASVALPAWADEFQAPAAAP
Ga0066709_10441404313300009137Grasslands SoilARRERAGLPSERLPDDPTSLLGNSSRATAEQIGAYLHRRLFAGDGSCALSDTGALIALHRSVGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAALGICLDARPVIMVAALRPVQAPLPMGLHGSILLRGLDAYLRELGRLERRPTSLMLPPWATEEEAPPAPAVAAEEKR*
Ga0099792_1033547923300009143Vadose Zone SoilQLGETLAPDDVPQDLSYSAAGVALFRYLKERRELAGLPAERLPDDPTSLLGNSSRATPEQIAAYLHRKLFLKDGSCTLTDTGALLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAVCLDQRPVVLVGALRPLEGTLPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVEEEIAAKQAALLVDSPRVEGALATKGKP*
Ga0075423_1160099923300009162Populus RhizospherePDAVSPELSYSAAGVSLFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGAYIHRKLFVNDGTCTLSDTGALLALHRREGTLRWLAYRWPKIVFSGKTGSSPHDDSALTAVGLCLDSRPVVLVAALRPIQPPLPDGLHGSVLLRGIDAYLKELVRLQRRPTSALLPQWACAADPSQSCPVETVEARP*
Ga0126384_1081574013300010046Tropical Forest SoilADRALAGDLLAQLGEPLPPEAIPAELSYSAAGVALFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATAEQIGSYLHRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVLSGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLQAPLPDGLQGSVLLRGIDAYLKELARLQRTPASALLPPWTEPAAELSAEARP*
Ga0099796_1058133913300010159Vadose Zone SoilSGLFAQLGETQRPDQVAPELSYSAAGVALFRHLRARRELAGLPAGRLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAACIDERPVVLVAALRPLHAPLPDGLHGSVLLRGLDAYL
Ga0134082_1012933033300010303Grasslands SoilLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0134109_1044769813300010320Grasslands SoilPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPPWAEQDAQLAVEARP*
Ga0134067_1004821113300010321Grasslands SoilLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0134063_1046038513300010335Grasslands SoilAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPPWAEQDAQLAVEARP*
Ga0134071_1051934123300010336Grasslands SoilYLRDRRERAGLPADRLPEDPTSLLGNSWRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDAALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0126370_1201621113300010358Tropical Forest SoilAEVPPGERAALDSPADRALAAGLLAQLGDMVTPDQVASELSYTAAGVALFRYLRARRQLAGLPADRLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGVAVCLDARPVVLVAALRPLQAPLPEGLHGSVLLRGLDLYL
Ga0134128_1323369713300010373Terrestrial SoilEPDLSYSAAGVALFRYLKQRREAAGLPAERLPDDPTSLLGNSSRATVEQIGTYLSRKLFLGDGTCTLSDTGALLALHRSEGTLRWLAQRHPGLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRSASGPLPEGLHGSMLLRGIDEYLRELVRLERKPSSALM
Ga0137364_1126461723300012198Vadose Zone SoilATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEPETPVALEAKP*
Ga0137363_1034486313300012202Vadose Zone SoilATPEQIGAYLHFKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0137399_1006159513300012203Vadose Zone SoilQIGGYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFTGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0137399_1077666223300012203Vadose Zone SoilYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVFPAWAEQDAQLAVEARP*
Ga0137399_1116440823300012203Vadose Zone SoilPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRAVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSALLPPWVEQETRLAAEANP*
Ga0137362_1066939923300012205Vadose Zone SoilPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETRLAAEAKP*
Ga0137360_1091586823300012361Vadose Zone SoilLAQLGDAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP*
Ga0137361_1097889223300012362Vadose Zone SoilLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVFPAWAEQDAQLAVEARP*
Ga0137395_1023018313300012917Vadose Zone SoilGELLSQVGEAVAPDAVSPELSYSTAAIGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0137395_1087841213300012917Vadose Zone SoilVAPELSYSAAGVALFRHLRTRRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAGDGSCTLSDTGALLALHRRAGTLRWIAWRWPKLVFAGKTGSSPHDDSAVAGIAVCLDARPVVLVAALRPLQPPLPEGLHGSVLLRGLDAYLRELRRLDRRVTSVALPAWAEEMETPAAIPETISAAAMPVEKEER*
Ga0137394_1122932013300012922Vadose Zone SoilAIPAELSYSAAGVALFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATAEQVGSYLHRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVLSGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLQAPLPDGLQGSVLLRGIDAYLKELARLQRTPTSALLPPWADAAAELSAEARP*
Ga0137359_1106293313300012923Vadose Zone SoilPAEERAALDSPADRALAGDLLSQLGEVVPPDAVAPELSYSAAAVALFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGGYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVFSGKTGSSPHDDSALAAVGLCLDARPVVLIGALRPIEAPLPDGLQGSVLLRGIDAYFKELVRLQRKPGSALLPAWAEPQPALAVEAKP*
Ga0137359_1142368213300012923Vadose Zone SoilLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPKIVFSGKTGSSPHDDSALAAVGLCLDSRPVVLIGALRPIEPPLPDGLQGAVLLRGIDAYLKELVRLQRKPGSALLPPWADPQPTLAVEAKP*
Ga0137416_1127987813300012927Vadose Zone SoilDPTSLLGNSSRATPEQIGGYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVFSGKTGSSPHDDSALAAVGLCLDARPVVLIGALRPIERPLPDGLQGSVLLRGIDAYLRELAKLQRKPSSALLPAWAEPPAPAEIAAEATP*
Ga0137416_1185026913300012927Vadose Zone SoilGLPAERLPDDPTSLLGNSSRATTEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAACVDERPVVLVAALRPLHAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSVALPAWAEETYAPSPAAETVSTAAVPADKEER*
Ga0137404_1014492713300012929Vadose Zone SoilLSQVGEAVPPDAVSPELSYSTAAVGLFRYLRDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPLQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0153916_1245391813300012964Freshwater WetlandsGLPAAALPDDPTSLLGNSSRATAEQIGGYLHRKLFGGSGSCALSDTGALLALRRRDGTLRWLAARWPKLVFAGKTGSSPHDDSAVAAVAICLDARPVVLVAALRALSGSLPEGLRGSVLLRGIDGYMRELVRLERRPQSALWPEWASEEAAPPGPPGPAAKPAAKKGVQP*
Ga0134110_1027982513300012975Grasslands SoilKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRAVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSALLPPWVEEETRLAAEAKP*
Ga0134110_1052037713300012975Grasslands SoilEAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPPWAEQDAQLA
Ga0134075_1018450213300014154Grasslands SoilLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSALAGIAVCLDARPVVLIAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPVEKEER*
Ga0134078_1008175313300014157Grasslands SoilAGDLLAQLGETIPPGAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0137411_115748513300015052Vadose Zone SoilTAEQVGSYLHRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVLSGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLQAPLPDGLQGSVLLRGIDAYLKELARLQRTPTSALLPPWADAAAELSAEARP*
Ga0137405_140876043300015053Vadose Zone SoilVPPDERASLDSPLDRALAGELLSQVGEAVPPDAVSPELSYSTAAVGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETRLAAEAKP*
Ga0137403_1068307823300015264Vadose Zone SoilDRLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETRLAAEAKP*
Ga0134073_1029720023300015356Grasslands SoilMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLALRWPKLVFTGKTGSSPHDDSAVAAVALCLDARPVVAVSALRALAGPLPEGLRGSVLLRGIDAYLKELSRLERKPSSALWPSFIADEAPAEELTVEAKR*
Ga0066662_1021402713300018468Grasslands SoilRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPAEKEER
Ga0066662_1171528613300018468Grasslands SoilLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRAPAEQIGSYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQESPIALEAKP
Ga0066669_1218911923300018482Grasslands SoilSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKLVFSGKTGSSPHDDAALAAVGLCLDARPVVFVAALRPLQPPLPDGLQGSVLLRGIDAYLRELVRLDRRPGSAVLPAWAEQQTEIALEAKP
Ga0179594_1006445433300020170Vadose Zone SoilNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPLQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP
Ga0179594_1009172113300020170Vadose Zone SoilAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP
Ga0207642_1086840813300025899Miscanthus RhizosphereDRALAGDLLAQLGAPVPTDAIPGELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIGQ
Ga0207707_1022308713300025912Corn RhizosphereLEFNLYYLRGINYVIQRGSFGSMEEGSLSIGFVTSAGVALFRYLRDRRERAGLPAGRLPEDPTSLLGNSSRATPEEIGAYLHRKLFRNDGTCALSDTGALLALHRKEGTLRWLAQRWPKLVFSGKTGSSPHDDSALAAVGLCLDARPVVLIAALRPLQAPLPDGLQGSVLLRGIDAYLRELSRLDRKPAPAILPAWADPEAQLPAEGNP
Ga0207644_1065115823300025931Switchgrass RhizosphereVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPAWAEADPQAPVEASP
Ga0207648_1198971713300026089Miscanthus RhizosphereEGAPVPPDAIPGELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPSWA
Ga0209888_106186123300026274Permafrost SoilLSYSAAGVALFRTLKQRREQAGLPATSLPDDPTSLLGNDSRATVEQIGAYLHKKLFLGDGSCTLSDTGALLALHRREGTLRWLAQRWPKLIFTGKTGSSPHDDSAVAAVAICLDTRPVVLVAALRPLQGALPQGLRGSVVLRGLDSYLRELSRLERRPNSAELPPWAVLATAVEAVQ
Ga0209265_104628733300026308SoilADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0209265_107760823300026308SoilAQLGETLAPDDVPQDLSYSAAGVALFRYLKKSREAAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLEGSLPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVEEEIAAKEAALPIDPPRVESALATKGKP
Ga0209055_105670813300026309SoilAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209761_120700123300026313Grasslands SoilPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209686_108540733300026315SoilGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209802_123263323300026328SoilEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDAALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0209473_111064713300026330SoilLAYSAAGVALFRYLKERREQAGLPAERLPDDPTSLLGNSSRATPEQIAAYLHRKLFLKDGSCTLTDTGALLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLLGALPLGLRGSVVLRGLDSYLKALVRLDRRPASALWPAWVEEELAAKQASAPLEPSRVESALATKGKP
Ga0209803_107474813300026332SoilDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209377_105324613300026334SoilDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209804_110806233300026335SoilSYSAAGVALFRYLKKSREAAGLPAERLPDDPTSLLGNSSRATAEQIGHYLHRKLFSDSSCVLSDAGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSALAAVGICLDARPVVLVAALRPLQPPLPKGLHGSVLLRGLDAYLRELVRLERRPTSLMLPAWAEENIAEPSAVAAGGIPSFAGDGEKR
Ga0209804_130505423300026335SoilAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPPWAEQDAQLAVEARP
Ga0209059_121893513300026527SoilAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP
Ga0209805_114170333300026542SoilLAGDLLAQLGEPIPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKLVFSGKTGSSPHDDAALAAVGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPASAVLPAWAEQETQVALEAKP
Ga0209156_1040382813300026547SoilLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209161_1002488253300026548SoilIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0209161_1014323733300026548SoilALDSPADRALSATLFAQLGQTVQPDEIASDLSYSAASVALFRHLKSRRERGGLPASMLPDDPTSLLGNSSRATVEQIGAYLHRKLFAGDGSCTLSDTGTLLGLHRREGKLRWLALRWPKLVFTGKTGSSPHDDSAVAAVALCLDARPVVAVSALRALAGPLPEGLRGSVLLRGIDAYLKELSRLERKPSSALWPSFIADEAPAEELTVEAKR
Ga0209474_1001596613300026550SoilGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPPWAEQDAQLAVEARP
Ga0209474_1021477313300026550SoilRATVEQIGAYLHRKLFAGDGTCTLSDTGALLALHRREGTLRWLALRWPKLVFSGKTGSSPHDDSAVAAVALCLDARPVVLAAGLRPLQGTLPTGLRGSVLLRGLDAYLHELVKLQRPPAPALWPAWAEEELAAAAQPPETPSLAVEEKR
Ga0209577_1042510523300026552SoilAGVALFRYLKIRREQAGLPAQRLPDDPTSLLGNSSRATAEQIGGYLHRKLFVNDGSCTLSDTGALLALHRREGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAAGICLDARPVVLVAALRPLQDPLPTGLRGSVLLRGMDAYLRELSRLERRLTSAVLPGWTEEGGTPKPPPGLSPMANDVRSGEEKP
Ga0179587_1103729913300026557Vadose Zone SoilRDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGAYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGLCLDARPVVLIGALRPLHPPLPDGLQGSVLLRGIDAYLRELAKLQRKPSSALLPAWAEPPAPAEIAAEATP
Ga0209331_110226713300027603Forest SoilLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP
Ga0209076_120963713300027643Vadose Zone SoilDLLAQLGDAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPA
Ga0209009_106015723300027667Forest SoilSAAGVALFRYLRERRERAGLPAGRLPEDPTSLLGNSSRATAEEIGAYLHRKLFANDGTCTLSDTGALLALRRKEGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGVCLDARPVVLIAALRPLQPPLPDGLQGSVLLRGIDAYLREISRLDRKPAPAILPAWADPEAQLAVEGKR
Ga0209588_101786713300027671Vadose Zone SoilPDDPTSLLGNSSRATAEQIGGYLHRKLLAGDGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELVRLDRRVTSAALPAWAEEEIAAPVAAPEAISIAATPAEKEER
Ga0209588_103493933300027671Vadose Zone SoilDPTSLLGNSSRATPEQIGAYLHLKVFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP
Ga0209588_121245923300027671Vadose Zone SoilFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGGYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVFSGKTGSSPHDDSALAAVGLCLDARPVVLIGALRPIEPPLPDGLQGSVLLRGIDAYFKELVRLQRKPGSALLPAWAEPQPALAVEAKP
Ga0209166_1067306713300027857Surface SoilASVALFRYLKARRERAGLPAQGLPDDPTSLLGNSSRATVEQIGAYLHRKLFSGDGTCTLSDTGALLSLHRRDGTLRWLAQRWPKLVFSGKTGSSPHDDSAIAAVALCLDARPVVLVAGLRPPHGALPEGLRGSLVLRGLDAYLRELQRLERAPTSALWPGWAEEELAKAQPTT
Ga0209382_1141803023300027909Populus RhizosphereDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAACLDERPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVASLALPAWAEDTVAPPPVPETISTAAMPAEKEER
Ga0268264_1083645713300028381Switchgrass RhizosphereGAPVPPDAIPGELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPSWAQADPQTPVEALP
Ga0307469_1063188513300031720Hardwood Forest SoilDPTSLLGNSSRATAEQIGGYLHRKLLAPDGSCTLSDTGALIAVHRRAGTLRWLAGRWPKLVFAGKTGSSPHDDSAVAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLKELQRLDRRLTSVALPDWAAETLAPAAAPETISPAAARVEEER
Ga0307469_1086077813300031720Hardwood Forest SoilDLLAQLGEPLPPDAIPAELSYSAAGVALFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATAEQVGSYLHRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVLSGKTGSSPHDDSAVAAIGLCLDARPVVLVAALRPLQAPLPDGLQGSVLLRGIDAYLKELARLQRTPTSALLPPWADAAAELSAEARP
Ga0307469_1151198823300031720Hardwood Forest SoilGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAMLPAWAEQDAQLAVEARP
Ga0307468_10225679213300031740Hardwood Forest SoilAIPAELSYSAAGVALFRYLKDRREQAGRPADRLPEDPTSLLGNSSRATAEQVGSYLHRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVLSGKTGSTPHDDSAVAAIGLCLDARPVVLVAALRPLQAPLPDGLQGSVLLRGIDAYLKELARLQRTPTSALLPPWADAAA
Ga0307473_1015065133300031820Hardwood Forest SoilLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLQAPLPDGLQGSLLLRGIDGYLRELAGLDRKPSSALLPPWAEQEAQPAVVEARP
Ga0308175_10321386113300031938SoilARREQAGLPAQRLPDDPTSLLGNSSRATAEQVGAYLHRKLFAPDGSCTLSDTGALLALHRREGTLRWLAQRFPKLVFAGKTGSSPHDDSAVAAVALCLDARPVLLVAALRPIEGHLPMGLHGSILLRGMDAYLRELNRLQRKAASALLPSWAEEPEPALTAQVKP
Ga0307472_10026325513300032205Hardwood Forest SoilEQIGGYLHRKLLAPDGSCTLSDTGALLALHRRAGTLRWLAGRWPKLVFAGKTGSSPHDDSAVAGIAVCLDARPVVLVAALRPVQAPLPDGLHGSVLLRGLDAYLKELRKLDRRLTSVALPDWAAEIEAPAAAPETISAAAARVEEER
Ga0335083_1068790523300032954SoilAELSYSAAGVALFRYLRERREQAGLPASRLPEDPTSLLGNSSRATAEEIGSYLHRKLFANDGTCALSDTGALLALHRKEGTLRWLAARWPKLVFSGKTGSSPHDDSALAAVGICLDARPVVLIAALRPLHPPLPDGLQGSVLLRGIDAYLRELSRLDRRPAPAPLPAWAMSEGAPPVEAQ
Ga0316620_1125915913300033480SoilLSYRAAGGALFRDLTDRRARAGLPAGRLPDDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLAIHRKEGTLRWLAGRWPNLVFSGKTGSSPHDDSAVAAVALCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRKPAPALLPAWAETETQLAVEAKP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.