NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F068653

Metagenome Family F068653

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F068653
Family Type Metagenome
Number of Sequences 124
Average Sequence Length 178 residues
Representative Sequence MPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIGGRARSWELTGAQGIVA
Number of Associated Samples 106
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 7.26 %
% of genes near scaffold ends (potentially truncated) 97.58 %
% of genes from short scaffolds (< 2000 bps) 86.29 %
Associated GOLD sequencing projects 97
AlphaFold2 3D model prediction Yes
3D model pTM-score0.81

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(27.419 % of family members)
Environment Ontology (ENVO) Unclassified
(43.548 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(53.226 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 24.26%    β-sheet: 29.21%    Coil/Unstructured: 46.53%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.81
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
f.23.24.1: PetL subunit of the cytochrome b6f complexd4pv1e_4pv10.69004
f.23.9.1: Bacterial ba3 type cytochrome c oxidase subunit IIad3s8gc_3s8g0.68884
a.53.1.0: automated matchesd4cz5a_4cz50.6768
a.4.5.47: C-terminal part of PCI (proteasome COP9/signalosome eIF3) domains (PINT motif)d4lcta24lct0.67406
f.23.10.1: Photosystem II reaction centre subunit H, transmembrane regiond1rzhh21rzh0.66404


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 124 Family Scaffolds
PF00781DAGK_cat 15.32
PF02754CCG 8.87
PF00476DNA_pol_A 1.61
PF00437T2SSE 0.81
PF01709Transcrip_reg 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 124 Family Scaffolds
COG1597Phosphatidylglycerol kinase, diacylglycerol kinase familyLipid transport and metabolism [I] 30.65
COG0247Fe-S cluster-containing oxidoreductase, includes glycolate oxidase subunit GlcFEnergy production and conversion [C] 8.87
COG2048Heterodisulfide reductase, subunit BEnergy production and conversion [C] 8.87
COG0749DNA polymerase I, 3'-5' exonuclease and polymerase domainsReplication, recombination and repair [L] 1.61
COG0217Transcriptional and/or translational regulatory protein YebC/TACO1Translation, ribosomal structure and biogenesis [J] 0.81


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001661|JGI12053J15887_10181427All Organisms → cellular organisms → Bacteria1083Open in IMG/M
3300002568|C688J35102_120325324All Organisms → cellular organisms → Bacteria991Open in IMG/M
3300003324|soilH2_10300958All Organisms → cellular organisms → Bacteria1342Open in IMG/M
3300004153|Ga0063455_100418274All Organisms → cellular organisms → Bacteria798Open in IMG/M
3300005167|Ga0066672_10012660All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales4142Open in IMG/M
3300005174|Ga0066680_10316039All Organisms → cellular organisms → Bacteria997Open in IMG/M
3300005179|Ga0066684_10874000All Organisms → cellular organisms → Bacteria589Open in IMG/M
3300005187|Ga0066675_10403222All Organisms → cellular organisms → Bacteria1010Open in IMG/M
3300005355|Ga0070671_101428838All Organisms → cellular organisms → Bacteria611Open in IMG/M
3300005440|Ga0070705_101284746All Organisms → cellular organisms → Bacteria606Open in IMG/M
3300005440|Ga0070705_101659838All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300005446|Ga0066686_10188858All Organisms → cellular organisms → Bacteria1376Open in IMG/M
3300005459|Ga0068867_101877315All Organisms → cellular organisms → Bacteria564Open in IMG/M
3300005518|Ga0070699_100205584All Organisms → cellular organisms → Bacteria1752Open in IMG/M
3300005536|Ga0070697_100022273All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales5028Open in IMG/M
3300005544|Ga0070686_101701775All Organisms → cellular organisms → Bacteria535Open in IMG/M
3300005546|Ga0070696_100988609All Organisms → cellular organisms → Bacteria702Open in IMG/M
3300005547|Ga0070693_100944765All Organisms → cellular organisms → Bacteria649Open in IMG/M
3300005554|Ga0066661_10873234All Organisms → cellular organisms → Bacteria526Open in IMG/M
3300005556|Ga0066707_10719544All Organisms → cellular organisms → Bacteria624Open in IMG/M
3300005575|Ga0066702_10158310All Organisms → cellular organisms → Bacteria1345Open in IMG/M
3300005575|Ga0066702_10773686All Organisms → cellular organisms → Bacteria571Open in IMG/M
3300005575|Ga0066702_10857561All Organisms → cellular organisms → Bacteria541Open in IMG/M
3300005576|Ga0066708_10020570All Organisms → cellular organisms → Bacteria → Proteobacteria3366Open in IMG/M
3300005587|Ga0066654_10543032All Organisms → cellular organisms → Bacteria640Open in IMG/M
3300005843|Ga0068860_101604533All Organisms → cellular organisms → Bacteria672Open in IMG/M
3300005895|Ga0075277_1053848All Organisms → cellular organisms → Bacteria627Open in IMG/M
3300006796|Ga0066665_10102036All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales2106Open in IMG/M
3300006796|Ga0066665_11588260All Organisms → cellular organisms → Bacteria514Open in IMG/M
3300006797|Ga0066659_10191242All Organisms → cellular organisms → Bacteria1490Open in IMG/M
3300006797|Ga0066659_10956764All Organisms → cellular organisms → Bacteria716Open in IMG/M
3300006800|Ga0066660_10165358All Organisms → cellular organisms → Bacteria1654Open in IMG/M
3300006800|Ga0066660_11618430All Organisms → cellular organisms → Bacteria514Open in IMG/M
3300006854|Ga0075425_100740250All Organisms → cellular organisms → Bacteria1128Open in IMG/M
3300006904|Ga0075424_100703432All Organisms → cellular organisms → Bacteria1079Open in IMG/M
3300006954|Ga0079219_12115147All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300007076|Ga0075435_100556441All Organisms → cellular organisms → Bacteria994Open in IMG/M
3300007255|Ga0099791_10054900All Organisms → cellular organisms → Bacteria1787Open in IMG/M
3300009143|Ga0099792_10388970All Organisms → cellular organisms → Bacteria851Open in IMG/M
3300009143|Ga0099792_10513658All Organisms → cellular organisms → Bacteria752Open in IMG/M
3300009143|Ga0099792_10694430All Organisms → cellular organisms → Bacteria658Open in IMG/M
3300009792|Ga0126374_11596068All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300010304|Ga0134088_10414431All Organisms → cellular organisms → Bacteria658Open in IMG/M
3300010320|Ga0134109_10304189All Organisms → cellular organisms → Bacteria615Open in IMG/M
3300010321|Ga0134067_10142624All Organisms → cellular organisms → Bacteria850Open in IMG/M
3300010322|Ga0134084_10377158All Organisms → cellular organisms → Bacteria547Open in IMG/M
3300010326|Ga0134065_10092513All Organisms → cellular organisms → Bacteria993Open in IMG/M
3300010333|Ga0134080_10455452All Organisms → cellular organisms → Bacteria601Open in IMG/M
3300010335|Ga0134063_10030179All Organisms → cellular organisms → Bacteria2296Open in IMG/M
3300010335|Ga0134063_10728541All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300010336|Ga0134071_10745107All Organisms → cellular organisms → Bacteria521Open in IMG/M
3300010364|Ga0134066_10283722All Organisms → cellular organisms → Bacteria587Open in IMG/M
3300010399|Ga0134127_12194472All Organisms → cellular organisms → Bacteria631Open in IMG/M
3300010401|Ga0134121_10306358All Organisms → cellular organisms → Bacteria1408Open in IMG/M
3300012200|Ga0137382_10723455All Organisms → cellular organisms → Bacteria714Open in IMG/M
3300012202|Ga0137363_10604145All Organisms → cellular organisms → Bacteria926Open in IMG/M
3300012203|Ga0137399_10055164All Organisms → cellular organisms → Bacteria2907Open in IMG/M
3300012203|Ga0137399_10223235All Organisms → cellular organisms → Bacteria1537Open in IMG/M
3300012203|Ga0137399_11265206All Organisms → cellular organisms → Bacteria621Open in IMG/M
3300012205|Ga0137362_10096373All Organisms → cellular organisms → Bacteria2485Open in IMG/M
3300012205|Ga0137362_10313171All Organisms → cellular organisms → Bacteria1359Open in IMG/M
3300012205|Ga0137362_11064480All Organisms → cellular organisms → Bacteria688Open in IMG/M
3300012208|Ga0137376_11543929All Organisms → cellular organisms → Bacteria555Open in IMG/M
3300012211|Ga0137377_11529176All Organisms → cellular organisms → Bacteria593Open in IMG/M
3300012362|Ga0137361_10197843All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales1817Open in IMG/M
3300012469|Ga0150984_106305490All Organisms → cellular organisms → Bacteria758Open in IMG/M
3300012469|Ga0150984_106875865All Organisms → cellular organisms → Bacteria537Open in IMG/M
3300012582|Ga0137358_10312578All Organisms → cellular organisms → Bacteria1066Open in IMG/M
3300012683|Ga0137398_10935016All Organisms → cellular organisms → Bacteria603Open in IMG/M
3300012917|Ga0137395_10563733All Organisms → cellular organisms → Bacteria822Open in IMG/M
3300012922|Ga0137394_11102013All Organisms → cellular organisms → Bacteria657Open in IMG/M
3300012924|Ga0137413_10253656All Organisms → cellular organisms → Bacteria1209Open in IMG/M
3300012929|Ga0137404_10380264All Organisms → cellular organisms → Bacteria1242Open in IMG/M
3300012930|Ga0137407_11311442All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → unclassified Mesorhizobium → Mesorhizobium sp. LSHC420B00688Open in IMG/M
3300012931|Ga0153915_11961903All Organisms → cellular organisms → Bacteria685Open in IMG/M
3300012944|Ga0137410_10489725All Organisms → cellular organisms → Bacteria1003Open in IMG/M
3300012971|Ga0126369_12797833All Organisms → cellular organisms → Bacteria571Open in IMG/M
3300012977|Ga0134087_10129012All Organisms → cellular organisms → Bacteria1083Open in IMG/M
3300013297|Ga0157378_10535838All Organisms → cellular organisms → Bacteria1174Open in IMG/M
3300015052|Ga0137411_1022879All Organisms → cellular organisms → Bacteria908Open in IMG/M
3300015245|Ga0137409_10085437All Organisms → cellular organisms → Bacteria2941Open in IMG/M
3300015245|Ga0137409_10184650All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria1887Open in IMG/M
3300015356|Ga0134073_10244356All Organisms → cellular organisms → Bacteria617Open in IMG/M
3300018433|Ga0066667_10577439All Organisms → cellular organisms → Bacteria935Open in IMG/M
3300018468|Ga0066662_12359734All Organisms → cellular organisms → Bacteria559Open in IMG/M
3300018482|Ga0066669_10106650All Organisms → cellular organisms → Bacteria1960Open in IMG/M
3300025988|Ga0208141_1021793All Organisms → cellular organisms → Bacteria598Open in IMG/M
3300026078|Ga0207702_11551979All Organisms → cellular organisms → Bacteria655Open in IMG/M
3300026297|Ga0209237_1289996All Organisms → cellular organisms → Bacteria502Open in IMG/M
3300026300|Ga0209027_1163606All Organisms → cellular organisms → Bacteria744Open in IMG/M
3300026301|Ga0209238_1243832All Organisms → cellular organisms → Bacteria533Open in IMG/M
3300026312|Ga0209153_1284052All Organisms → cellular organisms → Bacteria529Open in IMG/M
3300026314|Ga0209268_1140817All Organisms → cellular organisms → Bacteria594Open in IMG/M
3300026316|Ga0209155_1037694All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales1931Open in IMG/M
3300026328|Ga0209802_1301844All Organisms → cellular organisms → Bacteria530Open in IMG/M
3300026330|Ga0209473_1116291All Organisms → cellular organisms → Bacteria1111Open in IMG/M
3300026335|Ga0209804_1007710All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales5943Open in IMG/M
3300026523|Ga0209808_1003036All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales8608Open in IMG/M
3300026524|Ga0209690_1007216All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales6128Open in IMG/M
3300026527|Ga0209059_1159449All Organisms → cellular organisms → Bacteria813Open in IMG/M
3300026529|Ga0209806_1019799All Organisms → cellular organisms → Bacteria3468Open in IMG/M
3300026537|Ga0209157_1191869All Organisms → cellular organisms → Bacteria874Open in IMG/M
3300026542|Ga0209805_1294663All Organisms → cellular organisms → Bacteria617Open in IMG/M
3300026548|Ga0209161_10052162All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales2667Open in IMG/M
3300026548|Ga0209161_10057764All Organisms → cellular organisms → Bacteria2500Open in IMG/M
3300026550|Ga0209474_10062639All Organisms → cellular organisms → Bacteria → Proteobacteria2613Open in IMG/M
3300026550|Ga0209474_10138315All Organisms → cellular organisms → Bacteria1597Open in IMG/M
3300026555|Ga0179593_1153480All Organisms → cellular organisms → Bacteria2355Open in IMG/M
3300027583|Ga0209527_1108657All Organisms → cellular organisms → Bacteria621Open in IMG/M
3300027643|Ga0209076_1046237All Organisms → cellular organisms → Bacteria1229Open in IMG/M
3300027667|Ga0209009_1127919All Organisms → cellular organisms → Bacteria646Open in IMG/M
3300027669|Ga0208981_1094447All Organisms → cellular organisms → Bacteria763Open in IMG/M
3300027671|Ga0209588_1075467All Organisms → cellular organisms → Bacteria1089Open in IMG/M
3300027748|Ga0209689_1071168All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales1870Open in IMG/M
3300027903|Ga0209488_10922272All Organisms → cellular organisms → Bacteria611Open in IMG/M
3300028799|Ga0307284_10294523All Organisms → cellular organisms → Bacteria651Open in IMG/M
3300031720|Ga0307469_10411813All Organisms → cellular organisms → Bacteria1158Open in IMG/M
3300031740|Ga0307468_101463452All Organisms → cellular organisms → Bacteria631Open in IMG/M
3300031820|Ga0307473_10677802All Organisms → cellular organisms → Bacteria721Open in IMG/M
3300031820|Ga0307473_11259632All Organisms → cellular organisms → Bacteria552Open in IMG/M
3300032205|Ga0307472_100688855All Organisms → cellular organisms → Bacteria917Open in IMG/M
3300032782|Ga0335082_10165056All Organisms → cellular organisms → Bacteria2131Open in IMG/M
3300033432|Ga0326729_1066042All Organisms → cellular organisms → Bacteria552Open in IMG/M
3300034090|Ga0326723_0213103All Organisms → cellular organisms → Bacteria857Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil27.42%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil24.19%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil9.68%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.84%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.84%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.03%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil3.23%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil2.42%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.42%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.61%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.61%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil1.61%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil1.61%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere1.61%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.81%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.81%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.81%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.81%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.81%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.81%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.81%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.81%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.81%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.81%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.81%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002568Grasslands soil microbial communities from Hopland, California, USA - 2EnvironmentalOpen in IMG/M
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005544Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005547Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300005895Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_0N_101EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300025988Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_0N_101 (SPAdes)EnvironmentalOpen in IMG/M
3300026078Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026312Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300027583Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027669Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028799Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032782Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1EnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300034090Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00NEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12053J15887_1018142723300001661Forest SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAASRWYRVEVPAKGFAFGRYRAQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPEQVTAWNQEKKAKTLKLALVWKPAGERCAGSAAAESWRLAAHARSWELVGAQGAVAAANEEGDPVGGGPRQVQVEKVAIDSDDA
C688J35102_12032532413300002568SoilMVLIALLLAASLPQDAVVLTQESQLKELCEALRARPPESGLDPAELVTARKVAQTRREEALSRWYQVEIPSKAFSLGRYREQDRQLELDGDRPVRALDDMLSLDLEGIDDVAFRARREDVSAWSREKKAGTLRLLVVWKPTA
soilH2_1030095823300003324Sugarcane Root And Bulk SoilMILAILLAASSQDVVQLTQKAQLKELCEALRAQPSETDLDPAQLDAARKAAQARREEATGRWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAVDDTLSLDLEGIDDVAFSAKPEQVTAWSKEKKAKALRLAVVWKPTGERCAGSSAAESWRIAGRARSWELLGAQGAVAAANEDGEPV
Ga0063455_10041827423300004153SoilMTVLIALLLAALPPQDTVALTQMSQIKELCDVLRAQPATADLDPAAEVAARKTAVARRDEALARWYRVEIPSKAFNFGQYREQDRQLELDGKKPVRALDDALSLDLEGIDDVAFNARPEQVSAWSRDKKAGQLQLVVIWKPTGDRCGGSAAAESWRIAGRVHTWQLVTPQGVVAAANEDGEPSKGG
Ga0066672_1001266053300005167SoilMPLLAILLVAAAPGQDVVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKGKALKLVVVWKPSGARCAGSAAAE
Ga0066680_1031603913300005174SoilMPLLAILLVAAAPGQDVVQLTSKTQLKELCEALRAQPSETDLDPAQIAEARKAAQARREEAAGRWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAIDDTLSLDLDGIDDVSFAAGSEQVTAWTQEKKAKALKLVLVWKPSGASCAGSAAAEAWRIAGHARSWELLGAQGTLASANEE
Ga0066684_1087400013300005179SoilKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVAAWTQEKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSWELTGAQGIVAAANEDGEPVGGNGPRQVRIEKVTLDSDNTPQDNEGRSRLSSTQRALD
Ga0066675_1040322223300005187SoilMLLITLLIAAAPVSDVVQLTQESQVKELCEALRAKPVLEGADPAQEEAGHQAAQARREQAASQWYRLEVPSKGFAFGRYRDRERQIELDGDWPLRAVDGVLSLDLEGIDDVAFNARPEQVTVWTAEKKAGSLKLVVVWKPTGERCAGSAAAEAW
Ga0070671_10142883813300005355Switchgrass RhizosphereMALLAILLAVSSQDVVQLTQKAQLKELCEALRAQPSEADLDPAQVATARKTAQARHDEAASHWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNQEKKAKTLRLAVVWKPTGERCAGSSAAESWRIPGRARSWELLGAQGTLASANEDGEPIGSTGTHQ
Ga0070705_10128474613300005440Corn, Switchgrass And Miscanthus RhizosphereHYPRGMALLAILLAVSSQDVVQLTQKAQLKELCEALRAQPSEADLDPAQVATARKTAQARHDEAASRWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNQEKKAKTLRLAVVWKPTGERCAGSSAAESWRIPGRARSWELLGAQGTLASANEDGEPIGSTGTHHLRVDKVTL
Ga0070705_10165983813300005440Corn, Switchgrass And Miscanthus RhizosphereKAQLKELCEALRAQPSESGLDPAQVSAARKAAQARRDEAASRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDDMLALDLDGIDDVAFNAKPEQVTAWSQQKKGKTLRLAVVWKPAGDRCAGSAAAESWRIAGRARSWELVGPQGVLAAANEDGDPVGGGPRQVQVEKVTLDSDE
Ga0066686_1018885823300005446SoilMPLLAILLLAVAPGQDVLQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQEKKAKALKLVVVFKPTGDRCAGSAAAESWRIAGRARSWELT
Ga0068867_10187731513300005459Miscanthus RhizospherePSEADLDPAQVASARKTAQARHDEAASHWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNQEKKAKTLRLAVVWKPTGERCAGSSAAESWRIPGRARSWELLGAQGTLASANEDGEPIGSTGTHHLRVDKVTLDSDEQPPENDGRGRLLAAQGALERCA
Ga0070699_10020558433300005518Corn, Switchgrass And Miscanthus RhizosphereMPLLAILLVAAAPGQDVVQLTAKAQLKELCEALRAQPSETDLDPAQIAEARKAAQALREEAAARWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAIDDMLSLDLEGVDEVSFAARPEQVTAWAQEKKAKALKLVVVWKPAGTRCAGSAAAEAWRIAAHARSWELVGAQGTLAWANEEGDPIGGGPRQVRIEKVTLDSDD
Ga0070697_10002227353300005536Corn, Switchgrass And Miscanthus RhizosphereMPLIAILVLVAASNPDVVQLTDKAQLKELCEALRAQPSESGLDPAQVSAARKAAQARRDEAASRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDDMLALDLDGIDDVAFNAKPEQVTAWSQQKKGKTLRLAVVWKPAGDRCAGSAAAESWRIAGRARSWELVGPQGVLAAANEDGDPVGGG
Ga0070686_10170177513300005544Switchgrass RhizospherePSEADLDPAQVATARKTAQARHDEAASHWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNHEKKAKTLRLAVVWKPTGERCAGSSAAESWRIPGRARSWELLGAQGTLASANEDGEPIGSTGTHQLRVDKVTLDSDEQPPENDGRGRLL
Ga0070696_10098860913300005546Corn, Switchgrass And Miscanthus RhizosphereMALLAILLAVSSQDVVQLTQKAQLKELCEALRAQPSEADLDPAQVATARKTAQARHDEAASHWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNHEKKAKTLRLAVVWKPTGERCAGSSAAESWRIPGRARSWELLGAQGTLASANEDGEPIGSTGTHQLRVD
Ga0070693_10094476513300005547Corn, Switchgrass And Miscanthus RhizosphereMALLAILLAVSSQDVVQLTQKAQLKELCEALRAQPSEADLDPAQVATARKTAQARHDEAASRWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNHEKKAKTLRLAVVWKPTGERCAGSSAAESWRIPGRARSWELLGAQGTLASANEDGEPIGST
Ga0066661_1087323413300005554SoilMPLLAILLVAAAAGQDVVQLTSKTQLKELCEALRAQPSETDLDPAQIAEARKAAQARREEAAGRWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAIDDTLSLDLDGIDDVSFAAGSEQVTAWTQEKKAKALKLVLVWKPSGARC
Ga0066707_1071954413300005556SoilCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQEKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSWELTGAQGIVAAANEDGEPVGGNGPRQVRIEKVTLDSDDTPQDNEGRSRLSSTQRALDRCATGAHRAGKLLVS
Ga0066702_1015831013300005575SoilMPLLAILLFAAAPGQDVVQLTSKAQLKELCEVLRAHPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKAKALKLVVVWKPSGARCAG
Ga0066702_1077368613300005575SoilRLAVGELRFCGAGERPYPRGMLLITLLIAAAPVSDVVQLTQESQVKELCEALRAKPVLEGADPAQEEAGHKAAQARREQAASQWYRLEVPSKGFAFGRYRDRERQIELDGDWPLRAVDGVLSLDLEGIDDVAFNARPEQVTAWTAEKKAGSLKLVVVWKPTGERCAGSAAAEAWRVGGKVRSWELIGTQG
Ga0066702_1085756113300005575SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKAL
Ga0066708_1002057043300005576SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAA
Ga0066654_1054303213300005587SoilMPLLAILLFAAAPGQDVVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARRDEMAGRWYKVEVPSKGFALGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGVDEVSFTARSEQVTAWAQEKKAKALKLVVVWRPSGARCAGSAAAEAWRIAGHARSWELVGAQGTLASANEEGDPVGGGPRQ
Ga0068860_10160453323300005843Switchgrass RhizosphereMALLAILLAVSSQDVVQLTQKAQLKELCEALRAQPSEADLDPAQVATARKTAQTRHDEAASRWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNQEKKAKTLRLAVVWKPTGERCAGS
Ga0075277_105384813300005895Rice Paddy SoilMVLIALLLAASQPQDALVLTQESQLKELCQALRAQPAEADLDPAERVAARKAALARREAALSRWYEVEIPSKGFALGRYRARDLALELDGDRPVRALDDMLSLDLQGIDDVAFSARPEQVSAWSREKKAGTLRLRVVWKPAAERCAGSAAAEAWRIAGRVSS
Ga0066665_1010203613300006796SoilMPLLAILLFAAAPGQDVVQLTSKAQLKELCEVLRAQPSETDRDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKAKALKLVVVWRPSGARCAGSAAAEAWRIAGHARSWELVGAQGTLASANEEGDPVGGGPRQVRVEKVTLD
Ga0066665_1158826013300006796SoilYYPRSMLLLAILLLAVAPGQDVLQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQEKKAKALKLVVVFKPTGDRCAGSAAAESWRIAGRARSWE
Ga0066659_1019124223300006797SoilMPLLAILVLAAASNQDAVQLTQKPQLQELCDALRAQPSESDLDPAQAAEARKAAQARRDDAAGRWYRVEVPSKGFAFGRYRSQDHQLELDGDRPLRALDDMLALDLDGTDEVAFNARPEQVTAWSAEKRAKTLRLVVVFQPKGERCAGSAAAESWRIAGRARSWELVGTQGALATANEDGDPVGGGPRRLSVEKVSLESDGAPQENDGRARL
Ga0066659_1095676413300006797SoilMPLLAILLVAAAPGQDVVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGVDEVSFTARSEQVTAWAQEKKAKALKLVVVWKPSGARCAGSAAAEAWRIAGHARSWELVGTQGTLASANEEGDPVGGGPRQVRVEKVTLDADEAPLENEGRSRLASSQAA
Ga0066660_1016535833300006800SoilAAAPGQDVVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARREEMAGRWYKVEVPSKGFALGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGVDEVSFTARSEQVTAWAQEKKAKALKLVVVWKPSGARCAGSAAAEAWRIAGHARS*
Ga0066660_1161843013300006800SoilMPLLAILLLAASPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAQARRDEAVERWYRVEVPSKGFGFGRYRAQDQQIELDGDRPVRAIDDMLVLDLEGIDDVSFNARTEQVAAWNQEKKAKALKLVVVFKPTGDRCAGSAAAESWRIAGRARSWELTGAQG
Ga0075425_10074025023300006854Populus RhizosphereMPLIALLVLAAASNPDVVPLTQKSQLKELCDALRAQPSESGLDPAQVSAARKAAQARRDEAASRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDDMLALDLDGIDDVAFNAKPEQVTAWSQQKKGKTLRLAVVWKPAGDRCAGSAAAESWRIAGRARSWELVGPQGVLAAANEDGDPV
Ga0075424_10070343223300006904Populus RhizosphereMPLIALLVLAAASNPDVVPLTQKSQLKELCDALRAQPSESGLDPAQLADARRAAQARREEAASRWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAVEDMLSLDLDGIDDVSFNARPAQVSAWSEEKKARALRLAVVFRPAGDRCAGSAAAESWRIAGHARSWELLDAQGIVAAANEEGEPVG
Ga0079219_1211514713300006954Agricultural SoilLPPCISSADRFVRRAERLILAGMVLIALVLAASQPQDALVLTQESQLKELCQALRAQPAEADLDPAERVAARKAAQARREEALSRWYEVEIPSKGFAFGRYRAQDRALELDGDRPVRALEDMLFLDLEGIDDVAFNARPDQVSEWSREKKAGTLRLRVVWKPAADRCAGSAAAESWRIA
Ga0075435_10055644123300007076Populus RhizosphereMPLIALLVLAAASNPDVVPLTQKSQLKELCDALRAQPSESGLDPAQLADARRAAQARREEAASRWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAVEDMLSLDLDGIDDVSFNARPAQVSAWSEEKKARALRLAVVFRPAGDRCAGSAAAESWRIAGHARSWELLDAQGLVAAANEEGEPVGGGSSRLAQVEKVTLDSD
Ga0099791_1005490033300007255Vadose Zone SoilMPLIAILVLAAASSQDVVQLSDKAQVKELCEALRAQPSESGLDPAQVSAARKAAQARREEAASRWYRVEVPAKGFAFGRYRTQDQQLELDGDRPLRSIDDMLALDLDGIDEVGFNAKPEQVTAWSQQKKGKTLRLAVVWKPTGDRCAGSAAAESWRIAGHARSWELVGPQGIVAAANED
Ga0099792_1038897023300009143Vadose Zone SoilMPLIAILVLAAASSQDVVQLSDKAQLKELCEALRAQPSESGLDPAQVSAARRAAQARREEAASRWYRVEVPAKGFAFGRYRTQDQQLELDGDRPLRAIDDMLVLDLDGVDEVAFNGKPEQVTAWSQQKKGKTLRLAVVWKPAGDRCAGSVAAESWRIAGHARSWELVGTQ
Ga0099792_1051365823300009143Vadose Zone SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAGARWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPEQVTGWNQEKKAKTLKLVVVWKPAGERCAGSAAAESWRLAGHARSWELVGAQGVVAAANEEGEPVGGGPRQVQVEKVAIDSDDAPPQNDGRA
Ga0099792_1069443013300009143Vadose Zone SoilMPLLAILVLAAASNQDAVQLTDKSQIKELCEALRAQPSEADLDPAQVAAARKAAQTRREEFASQWYRVEMPSKGFAFGRYRSHDQQIELDGDRPLRALDDVLALDLDGTNDVAFNARPEQVMAWSAEKRSKTLRLLVVFKPSGERCAGNDAAESWRIAGHARSWELVGARGSLAAANEEGEPVG
Ga0126374_1159606813300009792Tropical Forest SoilMSLLAILVLAAASNQDAVQLADKSQIKDLCEALRAQPPEADLDPAQLSAARKAAQARREDAAGRFYRVEIPSKGFSFGRYRAHDKQLELDGDHPLRAIDDTLALDIEGTNDVSFNARPEQVTAWSAEKRGRTLRLLVVFKPSG
Ga0134088_1041443113300010304Grasslands SoilPFGGAGAPHYARGMPLIAILVLAAASSQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSWELTGAQGIVAAANEDGEPVGGNGPRQVRIEKVTLDSDDTPQD
Ga0134109_1030418913300010320Grasslands SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAEHWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAA
Ga0134067_1014262423300010321Grasslands SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAEHWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSWELTGAQGIVAAANEDGEPVGGNGPRQVRVEKVTLDSDDTP
Ga0134084_1037715813300010322Grasslands SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAEHWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSWELTGAQGI
Ga0134065_1009251313300010326Grasslands SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAEHWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSWELTGAQGIVAAANEDGEPVGGNGPRQVRVEKVTLDSDDTPQDNEGRSRLSSTQRAL
Ga0134080_1045545213300010333Grasslands SoilMPLLAILLVAAAPGQDVVQLTSKAQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQEKKAKALKLVVVFKPMGDRCAGSAAAESWRIAGRARSWEL
Ga0134063_1003017923300010335Grasslands SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARRAAKARRDEAAEHWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSWELTGAQGIVAAANEDGEPVGGNGPRQVRIEKVTLDSDDTPQDNEGRSRLSSTQRALDR*
Ga0134063_1072854113300010335Grasslands SoilGQDVVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARREEMAGRWYKVEVPSKGFALGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKAKALQLVVVWRPSGARCAGSAAAEAWRIAGHARSWELVGAQGTLASANEEGDPVG
Ga0134071_1074510713300010336Grasslands SoilTAKAQLKELCEALRAQPSETDLDPAQIAEARKAAQALREEAAARWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAIDDMLSLELEGVDEVSFAARPEQVTAWAQEKKAKALKLVVVWKPAGTRCAGSAAAEAWRIAAHARSWELVGAQGTLASANEEGDPVGGGPRQVRIE
Ga0134066_1028372213300010364Grasslands SoilMPLLAILVLAAASNQDAVQLTQKSQLQELCDALRAQPSESGLDPAQTAEARKTAQARRDEAAGRWYRVEVPSKGFAFGRYRSQDQQLELDGDRPLRALDDMLALDLDGTDEVAFNGRPEQVTAWSAEKKAKTLRLVVVFKPTGERCAGSAAAESWRIAGHARSWELFGSQGAVATANED
Ga0134127_1219447213300010399Terrestrial SoilMALLAILLAVSSQDVVQLTQKAQLKELCEALRAQPSEADLDPAQVATARKTAQARHDEAASRWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNQEKKAKTLRLAVVWKPTGERCAGSSAAESWRIPGRARSWELLGAQGTLASANED
Ga0134121_1030635823300010401Terrestrial SoilMALLAILLAVSSQDVVQLTQKAQLKELCEALRAQPSEADLDPAQVATARKTAQARHDEAASHWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNHEKKAKTLRLAVVWKPTGERCAGSSAAESWRIPGRARSWELLGAQGTLASANEDGEPNPSPAPGQPPTGAPTGSSTSN
Ga0137382_1072345513300012200Vadose Zone SoilMPVLAILVRAAASNQDALQLTQKPQLQELCDALRAQPSESDLDPAQAAEARKAAQARRDDAAGRWYRVEVPSKGFAFGRYRSQDHQLELDGDRPLRALDDMLALDLDGTDEVAFNARPEQVTAWSAEKRAKTLRL
Ga0137363_1060414513300012202Vadose Zone SoilMPLLAILLVAAAPGQDVVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDVEDIDEVSFTARPEQVTAWAQEKKGKALKLVVVWKPSGARCAGSAAAEAWRIAGHARSWELVG
Ga0137399_1005516413300012203Vadose Zone SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPAKGFAFGRYRAQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPEQVTAWNQEKKAKTLKLIVVWKPAGERCAGSAAAESWRLAGHTRSWELVGAQGTVAAANEEGEPVGGGPRQVQVEKVAIDSDDAPAQNDGRARLASAQAALDRCASG
Ga0137399_1022323513300012203Vadose Zone SoilMPLLAILVLAAASNQDAVQLTDKSQIKELCEALRAQPSEADLDPAQVAAARKAAQTRREEFASQWYRVEVPSKGFSFGRYRSHDQQIELDGDRPLRALDDTLAIDLDVTNEVAFNARPEQVTAWSGEKRSKTLRLVVVFKPSGERCAGNDAAESWRIA
Ga0137399_1126520613300012203Vadose Zone SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPAKGFAFGRYRAQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPEHVTAWNQEKKAKTLKLVVVWKPAGERCAGSAAAESWRLAGHARSWELVGAQGAVAAANEEGEPVGGGPRQVQVEKVAIDSDDAPAQN
Ga0137362_1009637313300012205Vadose Zone SoilMPLLAILLVAAAPGQDVVQLTAKAQLKELCETLRAQPSETDLDPAQIAEARKEAQARREEAAARWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAIEDMLSLDVEGVDEVSFTAGPEQVTAWAREKKAKALKLVVVWKPTGTRCAGSAAAEAWRIAG
Ga0137362_1031317123300012205Vadose Zone SoilMPLIAILVLAAASSQDVVQLSDKAQVKELCEALRAQPSESGLDPAQVSAARKAAQARREEAASRWYRVEVPAKGFVFGRYRTQDQQLELDGDRPLRSIDDMLALDLDGIDDVGFNAKPEQVAAWSQQKKGKTLRLAVVWKPTGDRCAGSAAA
Ga0137362_1106448023300012205Vadose Zone SoilMPLIAILVLAAASNQDVVQLTDKAQLKELCEALRAQPSESGLDPAQVSAARRTAQARREEAASRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRALDDMLALDLDGIDEVGFNAKPEQVAAWSQQKKGKTLRLAVVWKPTGDRCAGSAAA
Ga0137376_1154392913300012208Vadose Zone SoilQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRVQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVAAWNQEKKAKALKLAVVFKPTGDRCAGSAAAESWRIAGRARSWELTGAQGLVAAANEDGEPVGGNGPRQVRIEKVTLDSDDTPQDNEGRSRLSSTQRA
Ga0137377_1152917613300012211Vadose Zone SoilMPLLAILLVAAAPGQDVVQLTSKTQLKELCEALRAQPSESNLDPAQLAAARKTAQARRDEAVERWYRVEVPSKGFGFGRYRAQDQQIELDGDRPVRAIDDMLVLDLEGIDDVSFNARTEQVAAWNQEKKAKALKLVVVFKPT
Ga0137361_1019784313300012362Vadose Zone SoilMPLLAILLVAAAPGQDVVQLTSKTQLKELCEALRAQPSETDLDPAQIAEARKAAQARREEAVGRWYRVEVSSKGFAFGRYRAQDQQLELDGDRPLRAIDDMLSLDLDGIDDVSFAGSSEQVTAWTQEKKAKALKLVVVWKPSGARCAGSGAAEAWRIAGHARSWEVLGAQGTLASANEEGDPVGAGPRQMRVEKVTLDSDDAP
Ga0150984_10630549013300012469Avena Fatua RhizosphereMPLLAILVLVAAPSQDVVQLTQKSQVKELCEALRAQPSEKGLDPAQLAAARKAAQARRDEAASRWYRVEVPAKGFALGRYRAQDQQLELDGDRPLRAIDDTLSLDFDGIDEVSFNARSAQVTEWSEQKKAKALRLAVVWKPSGEPCAGSAAAESWRIAAK
Ga0150984_10687586513300012469Avena Fatua RhizosphereLAALPPQDTVALTQMSQIKELCDALRAQPATADLDPAAEVAARKAAVARREEALARWYRVEIPSKAFNFGQYREQDRQLELDGKKPVRALDDALSLDLEGIDDVAFNARPEQVSAWSRDKKPGQLQLVVIWKPTGDRCGGSAAAESWRIAGRVHTWQLVTPQGVVAAANEDGEPSKGG
Ga0137358_1031257813300012582Vadose Zone SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAASRWYRVEVPAKGFAFGRYRTQDQQLELDGDRPLRAVDDTISLDLDSADDVAFNARPEQVTAWNQEKKAKTLKLVLVWKPAGERCAGSAAAESWRLAGHARSWELVGAQGTVAAANEEGEPVGGGP
Ga0137398_1093501613300012683Vadose Zone SoilMPLIAILVLAAASSQDVVQLSDKAQLRELCEALRAQPSESGLDPAQVSAARKAAQARREEAASRWYRVEVPAKGFAFGRYRTQDQQLELDGDRPLRSIDDMLALDLDGIDEVGFNAKPEQVTAWSQQKKGKTLRLAVVWKPTGDRCAGSAAAESWRIAGHARSWELV
Ga0137395_1056373313300012917Vadose Zone SoilMPLLAILLVAAAPGQDVVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDVEDIDEVSFTARPEQVTAWAQEKKGKALKLVV
Ga0137394_1110201313300012922Vadose Zone SoilPLIAILVLAAASSQDVVQLTQKSQVKEICDALREQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPAKGFAFGRYRAQDQQLELDGDRPLRALDDTLSLDLDGADDVAFNARPQQVTAWHQEKKAKTLKLVVVWKPAGERCAGSAAAESWRLAGHARSWELVGAQGAVAAANEEGEPVGGGPRQVPVEKVAIDSDDAPPQNDGRARLAAAQAAL
Ga0137413_1025365623300012924Vadose Zone SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCEALRAQPSESDLDPAQVAAARKVAQARRDEAASRWYRVEVPAKGFAFGRYRAQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPEQVATWNQEKKAKTLKLVLVWKPAGERCAGS
Ga0137404_1038026423300012929Vadose Zone SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCEALRAQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAVDDALSLDLDGADDVAFNARPEQVTAWNQEKKAKTLKLVLV
Ga0137407_1131144213300012930Vadose Zone SoilMPLIAILVLAAASNQDVVQLTDKAQLKELCEALRAQPSESGLDPAQVSAARRTAQARREEAASRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRALDDMLALDLDGIDDVSFNARPEQVTAWSQQKKGKTLRLAVVWKPAGDRCAGSAAAESWLAGEMSDVSAMPDMASRFARLTDAWDRLRPA*
Ga0153915_1196190313300012931Freshwater WetlandsFLAAAPSPEIVDLTQKAQLKELCEALRAQPSERDLDPAQLAAARKAAQARRAEAAARWYRVEVPSKGFAFGRYDAQDQQLELDGDRPLRAIDDMLSLDLDGVDEVAFNARPEQVSAWSREKKAKTLRLVVVWKPAGERCAGSAAAEAWRIAGHARSWELLGAEGTVAAANEEGEPVGGGPRQVRVEKVSVDSDGAPQENEGRDRLASSQQALDRCASGAQRTGRLLIT
Ga0137410_1048972513300012944Vadose Zone SoilMPLIAILVLAAASSQEVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPAKGFAFGRYRGQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPQQVTAWHQEKKAKTLKLVVVWKPAGER
Ga0126369_1279783313300012971Tropical Forest SoilLAILVLAAASSQDAVQLTDKSQIKDLCEALRAQPPEADLDPAQVSAARKAAQARREEAAGRFYRVEIPSMGFSFGRYRSHDRQLELDGDHPLRAIDDTLALDIEGTNDVSFNARPEQVTAWSAERRAKTLRLVIVFKPTGLLGERVAERA*
Ga0134087_1012901213300012977Grasslands SoilVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARRDEMAGRWYKVEVPSKGFALGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGVDEVSFTARSEQVTAWAQEKKGKALKLVVVWKPSGARCAGSAAAEAWRIAGHARSWDLVGAQGTLASANEEGDHVGGGPRQVRIEKVTL
Ga0157378_1053583823300013297Miscanthus RhizosphereMALLAILLAVSSQDVVQLTQKAQLKELCEALRAQPSEADLDPAQVATARKTAQARHDEAASHWYRVEVPSKGFGFGRYRSQEQQLELDGDRPLRAVDDTLALDLEGIDDVAFSARPEQVTAWNQEKKAKTLRLAVVWKPTGERCAGSSAAESWRIPGRARSWELLGAQGTLASANEDGEPIGSTGTH
Ga0137411_102287923300015052Vadose Zone SoilMPLIAILVLAAASSQEVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPAKGFAFGRYRGQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPEQVTAWNQEKKAKTLKLVVVWKPAGERCAGSAAAESWRLAGHARSWELVGAQGAVAAANEEGEPVGGGPRQVQVEKVAIDSDDAPPQNEGRSRLTSAQAALDRCASGA
Ga0137409_1008543713300015245Vadose Zone SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPAKGFAFGRYRAQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPEQVTAWNQEKKAKTLKLIVVWKPAGERCAGSAAAESWRLAGHTRSWELVGAQGTVAAANEEGEPVGGGPRQVQVEKVAID
Ga0137409_1018465033300015245Vadose Zone SoilMPLIAILVLAAASSQEVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPAKGFAFGRYRGQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPQQVTAWHQEKKAKTLKLVVVWKPAGERCAGSAAAESWRLAGHARSWELVGAQGAVAAANEEGEPVGGGPRQVQVEKVAID
Ga0134073_1024435613300015356Grasslands SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAEHWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGD
Ga0066667_1057743913300018433Grasslands SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIGGRARSWELTGAQGIVA
Ga0066662_1235973413300018468Grasslands SoilMPLLAILLLAVAPGQDVLQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLILDLEGIDDVSFNARTEQVAAWNQEKKAKALKLVVVFKPTG
Ga0066669_1010665033300018482Grasslands SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCEALRAQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRVVDDALSLDLDGADDVAFNARPEQVTAWNQEKKAKTLKLVLVWKPAGERCAGSAAAE
Ga0208141_102179313300025988Rice Paddy SoilMVLIALLLAASQPQDALVLTQESQLKELCQALRAQPAEADLDPAERVAARKAALARREAALSRWYEVEIPSKGFALGRYRARDLALELDGDRPVRALDDMLSLDLQGIDDVAFSARPEQVSAWSREKKAGTLRLRVVWKPAAERCAGSAAA
Ga0207702_1155197913300026078Corn RhizosphereALRAQPAEADLDPAERVAARKAAQARREEALSRWYEVEIPSKGFAFGRYRAQDRALELDGDRPVRALEDMLFLDLEGIDDVAFNARPDQVSEWSREKKAGTLRLRVVWKPSADRCAGSAAAESWRIAGRVRSWQLVGTEGVLAMANEDGEPVGTGPRQAKVEKVTLDSDRAPQEDEGRERLSAHIRGGGSLIVAVGPDVDGQLVADVLGSGIAIARSP
Ga0209237_128999613300026297Grasslands SoilVQLTSKAQLKELCEVLRAHPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKAKALKLVVVWKPSGARCAGSAAAEAWRIAGHARSWELVGTQGTLASANEEGDPVG
Ga0209027_116360613300026300Grasslands SoilMPLLAILLVAAAPGQDVVQLTSKTQLKELCEALRAQPSETDLDPAQIAEARKAAQARREEAAGRWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAIDDTLSLDLDGIDDVSFAAGSEQVTAWTQEKKAKALKLVLVWKPSGARCAGSAPAEAWRIAGHARSWELLGAQGTLASAN
Ga0209238_124383213300026301Grasslands SoilYPRSMPLLAILLFAAAPGQDVVQLTSKAQLKELCEMLRAHPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKAKALKLVVVWKPSGARCAGSAAAEAWRIAGHARSWELVGTQGTL
Ga0209153_128405213300026312SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAEHWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIGG
Ga0209268_114081713300026314SoilQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAEHWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSWELTGAQGIVAAANEDGEPVGGNGPRQVRVEKVTLDSDDTPQDNEGRSRLSS
Ga0209155_103769433300026316SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCEALRAQPSESDLDPAQVAAARKAAQARRDEAAARWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRVVDDALSLDLDGADDVAFNARPEQVTAWNQEKKAKTLKLVLVWKPAGERCAGSAAAESWRLAGHARSWELVGAQGAVAAANEEGDPVGGGPRQVQIEKVAIDSDEAPPQNDGRARLASAQGA
Ga0209802_130184413300026328SoilLAVLLVAAAPGQDVVQLTSKTQLKELCEALRAQPSETDLDPAQIAEARKAAQARREEAAGRWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAIDDTLSLDLDGIDDVSFAAGSEQVTAWTQEKKAKALKLVLVWKPSGARCAGSAAAEAWRIAGHARSWELLGAQGTLASANEE
Ga0209473_111629123300026330SoilMPLLAILLLAAAPGKDVVQLTQKSQVKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVAAWTQEKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSWELTGAQGIVAAANEDGEPVGGNGPRQVR
Ga0209804_100771013300026335SoilMPLLAILLVAAAAGQDVVQLTSKTQLKELCEALRAQPSETDLDPAQIAEARKAAQARREEAAGRWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAIDDTLSLDLDGIDDVSFAAGSEQVTAWTQEKKAKALKLVLVWKPSGTRCAGSAAAEAWRIAGHARSWELLGAQGTLASANEEGDPVGAAPRQMRVEKVTLDSDDAPLEN
Ga0209808_100303653300026523SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAGN
Ga0209690_100721673300026524SoilMPLLAILLVAAAPGQDVVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDVEDIDEVSFTARPEQVTAWAQEKKGKALKLVVVWKPSGARCAGSAAAEAWRIAGHARSWELVGTQG
Ga0209059_115944923300026527SoilMPLLAILLFAAAPGQDVVQLTSKAQLKELCEALRAQPSETDLDPAQIAEARKEAQARRDEMAGRWYKVEVPSKGFALGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKAKALKLVVVWKPSGARCAG
Ga0209806_101979943300026529SoilMPLLAILLFAAAPGQDVVQLTSKAQLKELCEMLRAHPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKAKALKLVVVWKPSGARCAGSAAAEAWRIAGHARSWELVGAQGTLASANEEGDPVGG
Ga0209157_119186913300026537SoilMPLLAILLLAVAPGQDVLQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPT
Ga0209805_129466313300026542SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPA
Ga0209161_1005216213300026548SoilMPLLAILLFAAAPGQDVVQLTSKAQLKELCEVLRAQPSETDRDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKAKALKLVVVWRPSGARCAGSAAAEAWRIAGHARSWELVGAQGTLASANEEGDPVGGGPRQVRGEKVTLDADEAPLENEGR
Ga0209161_1005776433300026548SoilMPLLAILLLAAAPGQDVVQLTQKSQLKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAEHWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVATWNQQKKAKALKLVVVFKPAGDRCAGSAAAESWRIAGRARSW
Ga0209474_1006263933300026550SoilMLLIILLIAAAPVSDVVQLTQESQVKELCEALRAKPALEGADPAQEEAGHKAAQARREQAASQWYRLEVPSKGFAFGRYRDRERQIELDGDWPLRAVDGELSLDLEGIDDVAFNARPEQVTAWTAEKKAGSLKLVVVWKPTGERCAGSAAAEAWRVAGKVRSWE
Ga0209474_1013831533300026550SoilLRLFPRLYYPRCMPLLAILLLAAAPGKDVVQLTQKSQVKELCEALRAQPSESNLDPAQLAAARKAAKARRDEAAERWYRVEVPSKGFAFGLYRAQDQQIELDGDRPLRAIDDMLVLDLEGIDDVSFNARAEQVAAWTEEKKAKVLKLVVVFKPAGDRCAGSAAAESWRI
Ga0179593_115348033300026555Vadose Zone SoilMPLIAILVLAAASSPDVVQLSDKAQLKELCEALRAQPSESGLDPAQVSAARKAAQARREEAASRWYRVEVPAKGFAFGRYRTQDQQLELDGDRPLRAIDDMLALDLDGIDEVGFNAKPEQVAAWSEQKKGKTLRLAVVWKPAGDRCAGSAAAESWRIAGHARSWELVGPQG
Ga0209527_110865723300027583Forest SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQVRRDEAAARWYRVEVPAKGFAFGRYRAQDQQLELDGDRPLRAVDDALSLDLDGADDVAFNARPEQVTAWNQEKKAKTLKLVLVW
Ga0209076_104623713300027643Vadose Zone SoilMPLLAILLVAAAPGQDVVQLTSKTQLKELCEALRAQPSETDLDPAQIAEARKAAQARREEAAGRWFRVEVPSKGFAFGRYRAQDQQLELDGDRPLRAIDDTLSLDLDGIDDVSFAAGSEQVTAWTQEKKAKALKLVVVWKPSGARCAGSAAAEAWRIAGHARSWELLGAQGTLASANEEGDPVGAGPRQMRV
Ga0209009_112791923300027667Forest SoilMSPLIAILLLAAAPDPEVVQLTQKAQLKELCEALRAEPSERDLDPAQVEAARKAAQARREEAAARWYRVEVPSKGFAFGRYRAQDQQLELDGDRPLLAIDDMLSLDLDGIDDVAFNARGDQVAAWSKEKKAKTLRLA
Ga0208981_109444713300027669Forest SoilMPLIAILVLAAASSQDVVQLTQKSQVKELCDALRAQPSESDLDPAQVAAARKAAQARRDEAASRWYRVEVPAKGFAFGRYRAQDQQLELDGDRPLRAVDDTLSLDLDGADDVAFNARPEQVTAWNQEKKAKTLKLVLVWKPAGERCAGSAAAESWRLAGHARSWELVGAQGTVAAANEEGDPVGG
Ga0209588_107546713300027671Vadose Zone SoilMPLIAILVLAAASSQDVVQLSDKAQVKELCEALRAQPSESGLDPAQVSAARKAAQARREEAASRWYRVEVPAKGFAFGRYRTQDQQLELDGDRPLRSIDDMLALDLDGIDDVGFNAKPEQVAAWSQQKKGKTLRLAVVWKPTGDRCAGSAAAESWRIAGHARSWELVGPQGIVAAANEDGDPVGGGPRQVQVEKVTLDSDEAPQQNEGRGRL
Ga0209689_107116813300027748SoilMPLLAILLFAAAPGQDVVQLTSKAQLKELCEMLRAHPSETDLDPAQIAEARKEAQARREEVAGRWYRVEVPSKGFAFGRYRTQDQQLELDGDRPLRAIDNMLSLDLEGIDEVSFTARSEQVTAWAQEKKAKALKLVVVWKPSGARCAGSAAAEAWRIAGHARSWEL
Ga0209488_1092227213300027903Vadose Zone SoilPSEADLDPAQVAAARKAAQTRREEFASQWYRVEMPSKGFAFGRYRSHDQQIELDGDRPLRALDDVLALDLDGTNDVAFNARPEQVTAWSGEKRSKTLRLVVVFKPSGERCAGNDAAESWRIAGHARSWELVGARGSLAAANEEGEPVGGGPRRMQVEKVSLESDGAPQENEGRARLSSAKAALDRCAFGAQRSGKLVLTFSVQ
Ga0307284_1029452323300028799SoilMPLLAILLAAAASGQDVVQLTQKAQLKELCEALRAQPSETDLDPAQVAAARKTAQARREEAAGLWYRIEVPSKGFAFGRYRAQDQQLELDGDRPLRAMDDMLSLDLEGIDDVAFAARPEQVTAWSKEKKAKTLRLLVVWKPTGERCGGSAAAESWRVPGR
Ga0307469_1041181323300031720Hardwood Forest SoilVLAAPHYPRGMSLLAILVLAAASSQDAVQLADKSQIKDLCEALRAQPPEADLDPAQLSAARKAAQARREEAVGRFYRVEIPSRGFSLGRYRSHDRQIELDGDHPLRAIDDMLVLDIEGTNDVSFNARPEQVTAWSAEKRAKTLRLVVVFKPSTERCAGNDAAESWRISGQARSWELVGAQGVLAAANEDGEPVGGGPRQMRVEKVTLESDGAPQENE
Ga0307468_10146345213300031740Hardwood Forest SoilLRRGAARRFLAVLAAPHYPLGMSLLAILVLAAASNQDAVQLTDKSQVKELCEALRAQPSEADLDPAQVAAARKAAETRREEFASQWYRVELPSKGFAFGRYRSHDQQIELDGDRPLRALDDMLALDLDGTNDVAFNARPEQVTAWSAEKRAKTLRLLVVFKPSGERCAGNDAAESWRIAGHARSWELVGARGSLAAANEEGDPVGGGPRR
Ga0307473_1067780213300031820Hardwood Forest SoilPRRFLAVLAAPHYPRGMSLLAILVLAAASSQDAVQLADKSQIKDLCEALRAQPPEADLDPAQLSAARKAAQARREEAVGRFYRVEIPSRGFSLGRYRSHDRQIELDGDHPLRAIDDMLVLDIEGTNDVSFNARPEQVTAWSAEKRAKTLRLVVVFKPSTERCAGNDAAESWRISGQARSWELVGAQGVLAAANEDGEPVGGGPRQMRVEKVTLESDGAPQENEGRARLSSARAALDRCAY
Ga0307473_1125963213300031820Hardwood Forest SoilLCLRAVATLSTLTRGSGEFCRGTPRHYPRGMPLIAILVLAAASNQDVVQLTDKAQLKELCEALRAQPSESGLDPAQVSAARRTAQARREEAASRWYGVEVPSKGFAFGRYRTQDQQLELDGDRPLRALDDMLALDLDGIDDVSFNARPEQVTAWSQQKKGKTLRLAVVWKPAGDRCAGSAAAE
Ga0307472_10068885523300032205Hardwood Forest SoilMLLLTLLLAAAQAQDAVVLTQESQIKELCEALRAQPAERDLDPAQQVAARKAAQARRDEALARWYEVEVPSKGFAFGRYREPEHQLELDGDRPVRALDNMLSLDLEGIDDVAFEARPDQVTSWSRQKKAGVLRLLVVWKPAADRCAGSAAAESWRIAGRVRSWRLLGDEGVVASANEEGEPDSGGPKQVRVEKVALDSDGPPQENEGRDRLSGAQRALDRCAAGA
Ga0335082_1016505633300032782SoilLAGQAGSHYPRSMLLIAILLAAAPDPEVQPLTEKAQLRELCDALRAQPSERDLDPAQAAAARKAAQKRREEAAARWYRVEVPSKGFSFGRYRAQDQQLELDGDRPLRAVDDMLSLDLEGIDEVAFNARPEQVTAWSEEKKARSLRLIVVWKPGGDRCAGSAAAESWRIAGKARSWELVGPEGTV
Ga0326729_106604213300033432Peat SoilALAPDPDVLQLTEKAQLKELCDALRAQPSERDLDPAQVAAARKAAQARREEAASRWYRVEVPSKGFVFGRYRAQDQQLELDGDRPLRALDDMLTLDLDGIDEVAFNARPEQVTAWSEEKKAKELRLVVVWKPRGDRCAGSAAAEAWRIAGKARSWELVGPEGTVAAANEDGEPVGGGPRQVLV
Ga0326723_0213103_2_4483300034090Peat SoilMPLIAILLLALAPDPDVLQLTEKAQLKELCDALRAQPSERDLDPAQVAAARKAAQARREEAASRWYRVEVPSKGFVFGRYRAQDQQLELDGDRPLRALDDMLTLDLDGIDEVAFNARPEQVTAWSEEKKAKELRLVVVWKPRGDRCAGS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.