NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F045847

Metagenome / Metatranscriptome Family F045847

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F045847
Family Type Metagenome / Metatranscriptome
Number of Sequences 152
Average Sequence Length 110 residues
Representative Sequence LLEDIAVDTFVHSSFGTVVMTLFQGIFTAPSWHTFTSLACGWALASDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWGAVIRLAAQFVPEGE
Number of Associated Samples 132
Number of Associated Scaffolds 152

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 58.74 %
% of genes near scaffold ends (potentially truncated) 86.84 %
% of genes from short scaffolds (< 2000 bps) 91.45 %
Associated GOLD sequencing projects 131
AlphaFold2 3D model prediction Yes
3D model pTM-score0.34

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (71.053 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil
(11.184 % of family members)
Environment Ontology (ENVO) Unclassified
(23.026 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(45.395 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 62.50%    β-sheet: 0.00%    Coil/Unstructured: 37.50%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.34
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 152 Family Scaffolds
PF13701DDE_Tnp_1_4 1.32
PF03400DDE_Tnp_IS1 1.32
PF04255DUF433 0.66
PF05378Hydant_A_N 0.66
PF01979Amidohydro_1 0.66
PF05016ParE_toxin 0.66
PF02384N6_Mtase 0.66
PF03050DDE_Tnp_IS66 0.66
PF01656CbiA 0.66
PF13586DDE_Tnp_1_2 0.66
PF13565HTH_32 0.66
PF13518HTH_28 0.66
PF13551HTH_29 0.66
PF00239Resolvase 0.66
PF13384HTH_23 0.66
PF13751DDE_Tnp_1_6 0.66
PF01526DDE_Tnp_Tn3 0.66
PF13358DDE_3 0.66
PF02518HATPase_c 0.66
PF13374TPR_10 0.66
PF01266DAO 0.66
PF00884Sulfatase 0.66
PF13185GAF_2 0.66
PF02515CoA_transf_3 0.66
PF00561Abhydrolase_1 0.66

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 152 Family Scaffolds
COG0145N-methylhydantoinase A/oxoprolinase/acetone carboxylase, beta subunitAmino acid transport and metabolism [E] 1.32
COG1662Transposase and inactivated derivatives, IS1 familyMobilome: prophages, transposons [X] 1.32
COG1804Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferasesLipid transport and metabolism [I] 0.66
COG1961Site-specific DNA recombinase SpoIVCA/DNA invertase PinEReplication, recombination and repair [L] 0.66
COG2442Predicted antitoxin component of a toxin-antitoxin system, DUF433 familyDefense mechanisms [V] 0.66
COG2452Predicted site-specific integrase-resolvaseMobilome: prophages, transposons [X] 0.66
COG3436TransposaseMobilome: prophages, transposons [X] 0.66
COG4644Transposase and inactivated derivatives, TnpA familyMobilome: prophages, transposons [X] 0.66


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A71.05 %
All OrganismsrootAll Organisms28.95 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000787|JGI11643J11755_11842590Not Available516Open in IMG/M
3300005187|Ga0066675_10099194All Organisms → cellular organisms → Bacteria1918Open in IMG/M
3300005332|Ga0066388_106694311All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Chromatiales → Chromatiaceae → Nitrosococcus → Nitrosococcus halophilus → Nitrosococcus halophilus Nc 4580Open in IMG/M
3300005343|Ga0070687_101010075Not Available603Open in IMG/M
3300005347|Ga0070668_101765377Not Available569Open in IMG/M
3300005447|Ga0066689_10182490Not Available1265Open in IMG/M
3300005450|Ga0066682_10922767Not Available519Open in IMG/M
3300005455|Ga0070663_100204388All Organisms → cellular organisms → Bacteria1543Open in IMG/M
3300005552|Ga0066701_10707022All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Methylobacteriaceae → Microvirga → Microvirga lupini605Open in IMG/M
3300005557|Ga0066704_10405210Not Available905Open in IMG/M
3300005558|Ga0066698_10979702Not Available538Open in IMG/M
3300005558|Ga0066698_10998351Not Available532Open in IMG/M
3300005564|Ga0070664_101427248All Organisms → cellular organisms → Bacteria → Terrabacteria group → Deinococcus-Thermus → Deinococci → Deinococcales → Deinococcaceae → Deinococcus → Deinococcus marmoris654Open in IMG/M
3300005576|Ga0066708_11049373Not Available506Open in IMG/M
3300005764|Ga0066903_103683977Not Available824Open in IMG/M
3300005764|Ga0066903_104631618Not Available732Open in IMG/M
3300005764|Ga0066903_105627082All Organisms → cellular organisms → Bacteria659Open in IMG/M
3300005764|Ga0066903_107473287Not Available564Open in IMG/M
3300006034|Ga0066656_10567091Not Available737Open in IMG/M
3300006049|Ga0075417_10365594All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → Steroidobacter709Open in IMG/M
3300006194|Ga0075427_10118130Not Available503Open in IMG/M
3300006796|Ga0066665_11119132Not Available600Open in IMG/M
3300006796|Ga0066665_11365650Not Available547Open in IMG/M
3300006797|Ga0066659_11165172Not Available644Open in IMG/M
3300006800|Ga0066660_10786298Not Available775Open in IMG/M
3300006844|Ga0075428_102422785Not Available538Open in IMG/M
3300006852|Ga0075433_11470302All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria589Open in IMG/M
3300006853|Ga0075420_100976574All Organisms → cellular organisms → Bacteria729Open in IMG/M
3300006853|Ga0075420_101338557Not Available615Open in IMG/M
3300006854|Ga0075425_101251323Not Available842Open in IMG/M
3300006969|Ga0075419_10552202Not Available805Open in IMG/M
3300006969|Ga0075419_10822430Not Available666Open in IMG/M
3300009012|Ga0066710_104225394Not Available537Open in IMG/M
3300009090|Ga0099827_11671187Not Available555Open in IMG/M
3300009148|Ga0105243_13016148Not Available511Open in IMG/M
3300009156|Ga0111538_13827217Not Available521Open in IMG/M
3300009162|Ga0075423_12173107Not Available603Open in IMG/M
3300009165|Ga0105102_10837630Not Available526Open in IMG/M
3300009444|Ga0114945_11071747Not Available501Open in IMG/M
3300009455|Ga0114939_10281181All Organisms → cellular organisms → Bacteria709Open in IMG/M
3300009553|Ga0105249_12368736All Organisms → cellular organisms → Bacteria603Open in IMG/M
3300009792|Ga0126374_11360047Not Available576Open in IMG/M
3300009792|Ga0126374_11424994Not Available565Open in IMG/M
3300009792|Ga0126374_11779251Not Available515Open in IMG/M
3300009793|Ga0105077_115272All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella620Open in IMG/M
3300009803|Ga0105065_1078955Not Available519Open in IMG/M
3300009813|Ga0105057_1063275Not Available630Open in IMG/M
3300009837|Ga0105058_1047559All Organisms → cellular organisms → Bacteria955Open in IMG/M
3300009837|Ga0105058_1099835All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium681Open in IMG/M
3300010029|Ga0105074_1112258Not Available523Open in IMG/M
3300010038|Ga0126315_10812658Not Available617Open in IMG/M
3300010041|Ga0126312_10025569All Organisms → cellular organisms → Bacteria3941Open in IMG/M
3300010042|Ga0126314_10440273All Organisms → cellular organisms → Bacteria942Open in IMG/M
3300010047|Ga0126382_10274723All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1250Open in IMG/M
3300010047|Ga0126382_11821945Not Available573Open in IMG/M
3300010074|Ga0127439_125241Not Available608Open in IMG/M
3300010100|Ga0127440_1014885Not Available554Open in IMG/M
3300010113|Ga0127444_1155255All Organisms → cellular organisms → Bacteria790Open in IMG/M
3300010124|Ga0127498_1083237All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria639Open in IMG/M
3300010125|Ga0127443_1016758Not Available561Open in IMG/M
3300010303|Ga0134082_10563277Not Available503Open in IMG/M
3300010358|Ga0126370_12295978Not Available534Open in IMG/M
3300010359|Ga0126376_12224994Not Available593Open in IMG/M
3300010360|Ga0126372_10148129All Organisms → cellular organisms → Bacteria1870Open in IMG/M
3300010362|Ga0126377_11176679All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella → Candidatus Entotheonella gemina837Open in IMG/M
3300010362|Ga0126377_12026129All Organisms → cellular organisms → Bacteria652Open in IMG/M
3300010362|Ga0126377_13595563Not Available501Open in IMG/M
3300010366|Ga0126379_13170186Not Available551Open in IMG/M
3300010400|Ga0134122_10998564All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexi incertae sedis → SAR202 cluster → SAR202 cluster bacterium AD-804-J14_MRT_500m820Open in IMG/M
3300012199|Ga0137383_10258473All Organisms → cellular organisms → Bacteria → Proteobacteria1276Open in IMG/M
3300012205|Ga0137362_11671205Not Available524Open in IMG/M
3300012209|Ga0137379_11690959Not Available530Open in IMG/M
3300012349|Ga0137387_10753928Not Available704Open in IMG/M
3300012349|Ga0137387_10869308Not Available652Open in IMG/M
3300012354|Ga0137366_10161507All Organisms → cellular organisms → Bacteria1683Open in IMG/M
3300012354|Ga0137366_10763350Not Available687Open in IMG/M
3300012376|Ga0134032_1084483Not Available500Open in IMG/M
3300012391|Ga0134035_1040460Not Available502Open in IMG/M
3300012393|Ga0134052_1112265Not Available534Open in IMG/M
3300012406|Ga0134053_1160088Not Available542Open in IMG/M
3300012410|Ga0134060_1194074Not Available657Open in IMG/M
3300012917|Ga0137395_11167769Not Available541Open in IMG/M
3300012948|Ga0126375_12084284All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Isosphaerales → Isosphaeraceae → Singulisphaera → unclassified Singulisphaera → Singulisphaera sp. GP187503Open in IMG/M
3300012971|Ga0126369_10783407Not Available1035Open in IMG/M
3300012971|Ga0126369_12748657All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Sorangiineae → Polyangiaceae → Chondromyces → Chondromyces crocatus576Open in IMG/M
3300012971|Ga0126369_13543770Not Available511Open in IMG/M
3300012972|Ga0134077_10226393All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium768Open in IMG/M
3300014150|Ga0134081_10285555Not Available588Open in IMG/M
3300014267|Ga0075313_1033778All Organisms → cellular organisms → Bacteria1177Open in IMG/M
3300015245|Ga0137409_10811265Not Available771Open in IMG/M
3300015373|Ga0132257_102959228Not Available619Open in IMG/M
3300015374|Ga0132255_102763302All Organisms → cellular organisms → Bacteria750Open in IMG/M
3300015374|Ga0132255_105250655Not Available548Open in IMG/M
3300016270|Ga0182036_10321019All Organisms → cellular organisms → Bacteria1184Open in IMG/M
3300016357|Ga0182032_11605288Not Available566Open in IMG/M
3300017659|Ga0134083_10326308Not Available656Open in IMG/M
3300018078|Ga0184612_10427636All Organisms → cellular organisms → Bacteria662Open in IMG/M
3300018431|Ga0066655_10819697All Organisms → cellular organisms → Bacteria633Open in IMG/M
3300018466|Ga0190268_11478994Not Available588Open in IMG/M
3300018468|Ga0066662_12408582All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Isosphaerales → Isosphaeraceae → Singulisphaera → unclassified Singulisphaera → Singulisphaera sp. GP187554Open in IMG/M
3300018468|Ga0066662_12775070Not Available519Open in IMG/M
3300021560|Ga0126371_11643103All Organisms → cellular organisms → Bacteria768Open in IMG/M
3300024516|Ga0209980_10466134Not Available531Open in IMG/M
3300025157|Ga0209399_10442905Not Available501Open in IMG/M
3300025939|Ga0207665_10116170All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → unclassified Cyanobacteria → Cyanobacteria bacterium 13_1_40CM_2_61_41886Open in IMG/M
3300026023|Ga0207677_10781746All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria853Open in IMG/M
3300027056|Ga0209879_1034613Not Available834Open in IMG/M
3300027561|Ga0209887_1095012Not Available606Open in IMG/M
3300027576|Ga0209003_1106979Not Available530Open in IMG/M
3300027616|Ga0209106_1076784All Organisms → cellular organisms → Bacteria748Open in IMG/M
3300027874|Ga0209465_10127096Not Available1259Open in IMG/M
3300027910|Ga0209583_10013077Not Available2497Open in IMG/M
3300028592|Ga0247822_11287684Not Available612Open in IMG/M
3300028608|Ga0247819_11074098Not Available513Open in IMG/M
3300028796|Ga0307287_10237918Not Available690Open in IMG/M
3300030547|Ga0247656_1244358Not Available505Open in IMG/M
3300030551|Ga0247638_1195209Not Available513Open in IMG/M
3300031421|Ga0308194_10207519Not Available637Open in IMG/M
3300031424|Ga0308179_1054265Not Available527Open in IMG/M
3300031545|Ga0318541_10469089Not Available704Open in IMG/M
3300031561|Ga0318528_10171899All Organisms → cellular organisms → Bacteria → Proteobacteria1157Open in IMG/M
3300031720|Ga0307469_11290493Not Available693Open in IMG/M
3300031720|Ga0307469_12539632Not Available501Open in IMG/M
3300031724|Ga0318500_10499661Not Available611Open in IMG/M
3300031751|Ga0318494_10830662Not Available541Open in IMG/M
3300031768|Ga0318509_10156602All Organisms → cellular organisms → Bacteria → Acidobacteria1258Open in IMG/M
3300031820|Ga0307473_11051217Not Available597Open in IMG/M
3300031879|Ga0306919_11302235Not Available550Open in IMG/M
3300032002|Ga0307416_102263281All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Paraburkholderia644Open in IMG/M
3300032051|Ga0318532_10102885Not Available1007Open in IMG/M
3300032060|Ga0318505_10581931Not Available526Open in IMG/M
3300032065|Ga0318513_10411001Not Available661Open in IMG/M
3300032075|Ga0310890_11709611Not Available522Open in IMG/M
3300032159|Ga0268251_10502785Not Available543Open in IMG/M
3300032180|Ga0307471_100169499All Organisms → cellular organisms → Bacteria2134Open in IMG/M
3300032205|Ga0307472_101686495Not Available626Open in IMG/M
3300032261|Ga0306920_102158566Not Available776Open in IMG/M
3300033004|Ga0335084_11525941Not Available660Open in IMG/M
3300033550|Ga0247829_11172558Not Available637Open in IMG/M
3300034172|Ga0334913_010310All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria2258Open in IMG/M
3300034664|Ga0314786_013750All Organisms → cellular organisms → Bacteria1209Open in IMG/M
3300034667|Ga0314792_122138Not Available670Open in IMG/M
3300034676|Ga0314801_125147Not Available600Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil11.18%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil9.87%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil9.21%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere7.89%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil7.24%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil7.24%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil5.26%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand5.26%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil3.95%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.29%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.63%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.63%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.63%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil1.97%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.97%
Thermal SpringsEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs1.32%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.32%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.32%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere1.32%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment0.66%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater0.66%
Deep SubsurfaceEnvironmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface0.66%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.66%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.66%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.66%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.66%
Sub-Biocrust SoilEnvironmental → Terrestrial → Soil → Unclassified → Desert → Sub-Biocrust Soil0.66%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.66%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.66%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.66%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.66%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere0.66%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.66%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.66%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.66%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.66%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.66%
AgaveHost-Associated → Plants → Phylloplane → Unclassified → Unclassified → Agave0.66%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000787Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005343Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaGEnvironmentalOpen in IMG/M
3300005347Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaGHost-AssociatedOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005455Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaGHost-AssociatedOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005564Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaGHost-AssociatedOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006194Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1Host-AssociatedOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006853Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006969Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009165Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 1-3cm September2015EnvironmentalOpen in IMG/M
3300009444Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3EnvironmentalOpen in IMG/M
3300009455Groundwater microbial communities from Big Spring, Nevada to study Microbial Dark Matter (Phase II) - Ash Meadows Crystal SpringEnvironmentalOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300009793Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_30_40EnvironmentalOpen in IMG/M
3300009803Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50EnvironmentalOpen in IMG/M
3300009813Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20EnvironmentalOpen in IMG/M
3300009837Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30EnvironmentalOpen in IMG/M
3300010029Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_10_20EnvironmentalOpen in IMG/M
3300010038Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot106EnvironmentalOpen in IMG/M
3300010041Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot104AEnvironmentalOpen in IMG/M
3300010042Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105BEnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010074Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010100Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010113Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010124Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010125Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012376Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012391Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012393Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012406Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012410Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014267Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailC_D1EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016357Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018466Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 TEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300024516Deep subsurface microbial communities from Mariana Trench to uncover new lineages of life (NeLLi) - CR02 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025157Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026023Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300027056Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027561Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027576Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027616Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027874Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300028592Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30EnvironmentalOpen in IMG/M
3300028608Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Xylose_Day6EnvironmentalOpen in IMG/M
3300028796Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141EnvironmentalOpen in IMG/M
3300030547Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Db9 (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030551Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Cb3 (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031424Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_150 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031545Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26EnvironmentalOpen in IMG/M
3300031561Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f26EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031724Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f20EnvironmentalOpen in IMG/M
3300031751Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f24EnvironmentalOpen in IMG/M
3300031768Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300032002Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3Host-AssociatedOpen in IMG/M
3300032041Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f22EnvironmentalOpen in IMG/M
3300032051Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f26EnvironmentalOpen in IMG/M
3300032060Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f18EnvironmentalOpen in IMG/M
3300032065Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f20EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032159Agave microbial communities from Guanajuato, Mexico - As.Ma.e (v2)Host-AssociatedOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033550Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4EnvironmentalOpen in IMG/M
3300034172Sub-biocrust soil microbial communities from Mojave Desert, California, United States - 9HMSEnvironmentalOpen in IMG/M
3300034664Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R3 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034667Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034675Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48R1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034676Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48R2 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI11643J11755_1184259013300000787SoilMTLFQGIFTAPSWQTFTYLAQGWALARDRHTITTYLWLTGATAVKHFSRFYVFLGCPLYERRWQLWGAIIRRAAAVVPDGEVIRVLVDDSTKKKAGRQIEGI
Ga0066675_1009919443300005187SoilMQCHGNLNSRKMSHLPRQTLSLEDIAVDTLVHSSFGTVVMTLFQGFFTAPSWHTFTLLARGWALATDRHTITTYLWLTGATTVKHFSRFYIFLGCPLYQHRWHFWGAVIRLAAQFVPEGAVIRVS
Ga0065705_1113923013300005294Switchgrass RhizosphereMKCHGNLNSGKMRHLPGQTLLLEDIAVDTFVHSSFGTVVMPLLQGILTAPSWHTFTALTCGWALATDRHPITTYLWLTGATAVKHFSRFYVF
Ga0066388_10669431113300005332Tropical Forest SoilMQCHGNLSSSKMSHLSGQTRLLEDIAVDTFVHSSFGTIVMTLFQGSFTTPSWHTFTALACGWALAGDRHTITTYMWLTGAATVKHFSRFYVFLGCPLYHHRWQLWGAVIRLAAQFVPAGAVIRVS
Ga0070687_10101007513300005343Switchgrass RhizosphereVDTFVVSSFGMFVMGLFHGLFTALSWQSVALLACGWALATDRHTITPYLWLTGATTVKHFSRFYVFLGGPLYTQRWHLWGAIIRQAAQFVPEGG*
Ga0070668_10176537713300005347Switchgrass RhizosphereVDAFFVSSFGMFVMGLFHGLFTAPSWQSFVLLACGWALTTDRHTITTYVWLTGAITVKHFSRFYVCLGCPLYSQRWHLWGAVIRQAARVVP
Ga0066686_1072112613300005446SoilMQCHGNLNSSKMSHLSEPTLLLEEIAVDTFVHSSFGTVVMTLFHGLFTAPSWHTFTLLACGWAVATDRHTITTYMWLTGATAVKHCSRFYV
Ga0066689_1018249023300005447SoilMQCHGNLNSRKMSHLPRQTLSLEDIAVDTLGHSSCGTVVMTLFQGFFTAPSWHTCTLLARGWALATERHTITTYVWLTGATTVKHFSRCSIFLGCPLSQHRWHLWGAGIRLAAQGLSPRARSCGSAASF*
Ga0066682_1092276713300005450SoilLLEDIAVDTFMHSSFGTVVMTLFQGLFTAPSWQTFTVVACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWQLWGAVIRL
Ga0070663_10020438823300005455Corn RhizosphereMQCHGNLNSSKMSHLPRQTLSLEDIAVDTFVHSSFGTVVMTLFQGFFTVPSWHTFTLLARGWALATDRHTITTYLWRTGATTVKHFSRFYVFLGCPLYQQRWHLWGAGIRLAAQFVPPGA
Ga0066701_1070702213300005552SoilMQVPRQPQLSKMSHLVGQTLLLEDIAVDTFVDSSFGTVVMTLFQGLFTGPSWQTFTYLACGWALAGDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWQLWGAVIRLAAQFVPAGAVIRVSFDDTT
Ga0066704_1040521013300005557SoilMQCHGNLNSRKMSHLPRQTLSLEDIAVDTLGHSSCGTVVMTLFQGFFTAPSWHTCPLLARGGALATERHTITTYVWLTGATTVKHFSRCSIFLGCPLSQHRWPLWGAGIRLAAQVVPEGAVMR
Ga0066698_1097970213300005558SoilMQCHGNLNSSKMSHLPRQTLSLEDIAVDTLVHSSCGTVVMTLFQGFFPAPSWHTFTLLARGWALATDRHPIPTYVWLTGATTVKHFSRFSLFLGCPLYQHRWH
Ga0066698_1099835123300005558SoilMQCHGNLNSSKMSHLSEPTLLLEEIAVDTFVHSSFGTVVMTLFHGLFTAPSWHTFTLLACGWALATDRHTITTYMWLTGATAVKHFSRFYVFLGCPLY
Ga0070664_10142724813300005564Corn RhizosphereMKCHGNLHSGKMSHLPGQTHLLEEIAVDTFLQSSFGTVVAMLFHEFFTEPSWHTFTSLACGWALASDRHTMTTYMWLTGAATLKHFSRFYVFLGCPLYHQRWPLWGAVIRMAAPLVPAGEIIRVSFDDTTKKKAGTHIAGSRTCGLRILFRDLSHVVCVV*
Ga0066708_1104937313300005576SoilMKCHGNLNSGKMSHLPRQTRLLEDIAVDTFVHSSFGTIVMTLFQGLFTAPSWHTCTALACGWALAGDRHTMTMYLWLTGAATVKHFSRFYVFLGCPLYQHRWQLWGAVIRLAAQCVPAGEVIRVSF
Ga0066903_10368397723300005764Tropical Forest SoilMSYLPRQTRLLEDIAVDTFVHSSFGTLVMTLFQGLFTTPSWQTFTALACGWALASDRHTITMYLWLTGAATVKHFSRFSVCLGCPLYQQRWQLWGAVIRLAAQFVPAGAVIRVSFDDTTK
Ga0066903_10463161823300005764Tropical Forest SoilMKCHGNLNSGKMSHLPGQTRLLEDIVVATFLYSSFGTVVAMLCHEFFTTPSWHTFTSLACGWALAGDRHTITTYMWLTGAATMKHFSRFYVFLGCPLYHKRWPLWGAVIRAAAPLVPAGEIIRVSFDDTTKKKAGTHIAGM
Ga0066903_10562708223300005764Tropical Forest SoilMEDIAVDPLVPSSFGTLVMALFHDLFTARSWHTFSALACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGGPLYHKRCQLWGAVIRLAAQFVPENA
Ga0066903_10747328713300005764Tropical Forest SoilLREDIAVDPFVHSSFGIVVMTLFQGLFTNPSWHTFTSLACGWALATDRHTITTYLWLTGATALKHFSRFYVFLGCPLYQQRWQLWGAVIRLAAQFVPEG
Ga0081455_1033762223300005937Tabebuia Heterophylla RhizosphereMKCHGNLNSGKMSHLQGQTLFLEDIAVDTFVHSSFGTVVMTLFQGFFSTPSWHTCTYLACGWAFASDRPTITTYMWLTGATTIKHFSRFYSPFAQIWG*
Ga0066656_1056709123300006034SoilMQCHGNLNSSKMSHLSEPTLLLEEIAVDTFVHSSFGTVVMTLFHGLFTAPSWHTFTLLACGWALATDRHTITTYMWLTGATAVKHFSRFYVFLGCPLYHHR
Ga0075417_1036559413300006049Populus RhizosphereLLEEIAVDTFVHSSFGTVVMTLFQGIFTTPSWQTFTALACGWALAGERHTITTYMWLTGAATVKHFSRFYVFLGCPLYQQRWQLWGAVIRLAAQFVPAGEVIRVSFDDTTKKKAGTHIEGLARYRNGAG
Ga0075427_1011813013300006194Populus RhizosphereVDTFVHSSFGTVVMTLFQGIFTAPSWQTFTALACGWALAGERHTITTYMWLTGAATVKHFSRFYVFLGCPLYHKRWQLWGAVIRLAAQWVPAGEVIR
Ga0066665_1111913223300006796SoilMQCHGNLNSSKMNHLSEPTLLLEEIAVDTFVHSSFGTVVMTLFHGLFTAPSWHTFTLLACGWALATDRHTITTSLWLTGATAVKHFSRFSVFLGCPLYHHRWHLWGAVIRLAAQ
Ga0066665_1136565013300006796SoilMSHLSGQTLLLEDMTVDTFVHSSFGTVVMTLLHSLFTAPSRETFTALACGWALATDRHTITTYVWLTGASAVKHFSQFYVFLGCPLYHQRWHLWGAVIRLAVQFVPAGEVIRVSFDDTTKKKAGRH
Ga0066659_1116517223300006797SoilVAEFVQTVFGEDVAVDTFVVSSFGTFVMGLFHGLFTAPSWQSFALLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYTQRWHLW
Ga0066660_1078629813300006800SoilVDTFVVSSFGTFVMGLFHGLFTAPSWQSFALLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYTQRWHLWGAVIRQAAQFVPEGESI
Ga0075428_10242278523300006844Populus RhizosphereVDTFFVSSFGTFVMGLFHGLFTAPSWQSFSLLACGWALATDRHTITTYVWLTGATTVKHFSRFYGFLGGPLYNKRWQLWGAVIRHAAHSVPEDEVIQVS
Ga0075421_10218282123300006845Populus RhizosphereMQCHGNLNSGKMSPLPGQTLLLEDIAVDTFGHSSFGTVVMTLFQGIFTAPSWQTFTFLACGWALATDRHTITTYLWLTGATAVKHFSRFY
Ga0075433_1147030213300006852Populus RhizosphereMKCHGNLNSSKMSHLPRPTRLLEDIAVDTFVHSSFGTVVMTLFQGLFTTPSWQTFTSLACGWALAGDRHTITMYLWLTGAATVKHFSRFYVFLGCPLYQQRWQLWGAVIRETHRSVKSA
Ga0075420_10097657423300006853Populus RhizosphereVFNLYKCHGNLTSGKMSHLPGQTLLLEDIAVDTFGHSSFGTVVMTLFQGIFTAPSWHTFTSLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHKRWQLWGAVIRLAAQYVPESEVIQVSFDDTTKKKA
Ga0075420_10133855723300006853Populus RhizosphereLLEDIAVDTFVHSSFGTVVMTLFQGLFTAPSWQTFTSLACGWALATDRHTITTYLWLTGATALKHFSRFYVFLGCPLYNRRWQLWGAVIRLAGPCVPA
Ga0075425_10125132323300006854Populus RhizosphereMAGFPSHYSRAFSPLWGMQSMKCYGNLNASKMSPLSVQTLLLEDIAVDTFVLSSFGTVVMPLFQGLFTASSWQAFMSLACGWALAMDRHTITTYVWLTGATAVKHFSRFYVFLGCPLYHQRWRLWGAVMRVSFDDATKKKAGRHIEGL
Ga0075419_1055220223300006969Populus RhizosphereLPLQTVLGEDVAVDTFVVSSFGTFVMGLLHGLFPAPSWQSCTLLACGWALATARHTLTTYLWRTRATTVKHFSQFSVFLGCPLYPQRW
Ga0075419_1082243013300006969Populus RhizosphereLLEDIAVDTFVHSSFGTVVMTLFHGLFTGPSWHTLTSLACGWALATNRHTITTYLWLTGAAAIKHFSRFYVFLGGPLYEQRWHLWGAVIRFAAQFVPPGAVVRVSFDAA
Ga0066710_10422539413300009012Grasslands SoilEDIAVDTFVHSSFGTVVMTLFQGFFTAPSWQTFPSLACGWSVATDRHTLTTYVWLTGAATVKHFSRFSVFLGGPLSHRRWQLWGAVIRLAVQCVPAGEVMRVLFDATTKKKAGTPIEGLARYRHGAGSARQA
Ga0099828_1033647723300009089Vadose Zone SoilMLFQGIFTTPSWQTFTYLACGWALASDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWGAVIRLAAQFVPEGEVIRVIFDDTTKKKAGRQIEGLDRYRNGAGSARQEYRTLRG
Ga0099827_1167118713300009090Vadose Zone SoilMTLFQGFFTAPSWHTFTLLACGWALATDRHTITTYLWRTGVTTVKHFSRFYIFLGCPLYQHRWHLWGAVIRLAAQFVPEGAVIRVSFDDTTKKK
Ga0105243_1301614823300009148Miscanthus RhizosphereMKCHGNLNSGKMSHLPGQTLLLEDIAVDTFVHSSFGTVVMTLFQGLFTAPSWQTFISLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLDNRRWQLW
Ga0111538_1382721713300009156Populus RhizosphereMTLFQGIFTAPSWQTFTYLAQGWALARDRHTITTYLWLTGATAVKHFSRFYVFLGCPLYERRWQLWGAIIRRAAAVVPDGEVIRVLVDDSTKKKAGRQ
Ga0075423_1217310713300009162Populus RhizosphereMWGIQLMKCHGNLNSSKMSHLPRPTRLLEDIAVDTFVHSSFGTVVMTLFQGLFTTPSWQTFTSLACGWALAGDRHTITMYLWLTGAATVKHFSRFYVFLGCPLYQQRWQLWGAVIRLAAQFVPAGEVNRVSFDDTTKKKAGTHIE
Ga0105102_1083763013300009165Freshwater SedimentMAVDTFLASSFGLFMKQLFSGLLTARSWQSFALLACGGALAPRQHTITTYLWLTGAATLKHFSQFYVFLGCPFYDARWRVWACIIRHAAQLVPAE
Ga0114945_1107174713300009444Thermal SpringsMTLFQGLFTTPSWHTFTSLACGWALAIDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWQLWGAVIRLAVQFVPEGEVIRVSFDDTTKKKAGTHIEG
Ga0114939_1028118123300009455GroundwaterMTLFQGVFTAPAWQTFTALACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWHLWGAVVRLAARYVPEGEVILVSFDDTTKKKAGTHIEGLARYRNGAGSARQEYRT
Ga0105249_1236873613300009553Switchgrass RhizosphereMKCHGNLNSGKMSHLPGQTLLLEDIAVDTFVYSSFGTIVMTLFQGIFTAPSWHTFTSLARGWALAGDRHTITTYLWLTGAATVKHFSRFYVFLG
Ga0126374_1136004723300009792Tropical Forest SoilMQCHGNLNSSKMSHLPRQTLSLEDIAVDTFVYSSFGTVVMTLFQGLFTHPSWHTLTSLACGWALATDRHTITTYMWLTGATAVKHFSRFYVFLGCPLYQQRWHLWRAVIRLAVQFVPLGEVMRVSFDDT
Ga0126374_1142499413300009792Tropical Forest SoilLLEDIAVDTFGHSSYGTVLMTLFPGIFTAPSWQTFTALACGWALASDRHTITTYVWLTGAAALKHFSRFYVCLGCPLYHKRWQLWGAVIRLAAQYVPEGAAIRVSFDDTTKKKAGTHIEGLARYRHGA
Ga0126374_1177925113300009792Tropical Forest SoilMSHLRGQTLLLEDIAVDTFVHSSFGTVVMTLFQGLFTTPSWHTFTSLTCGWALAGDRHTITTYMWLTGAATIKHFSRFYVFLGCPLYQHRWQLWGAVIRFAAQFIPAGEVIRVSFD
Ga0105077_11527223300009793Groundwater SandMKTLFPEDIAVDTFLASSFGFYVMTLFQGLFTAPSWQTFTSLACGWALARDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYDRRWQLWGAVIRRAAQCVPDNEVIRVIFDDTTKKKAGRQIEG
Ga0105065_107895513300009803Groundwater SandMSHLTGQTLFLEDIAVDTFVHSSFGTLVMTLFQGFFSAPSWHTFTYLACGWSLATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRCWQLWGAVIRRAVQCVPAGAVIRVILTIPPR
Ga0105057_106327513300009813Groundwater SandMSHLTGQTLLLEDIAVDTFMHSSFGTVVMTLLHRLLTVPLGHTFTSLACGWALATERHTITTYLWLTGATTVKHFSQFYVFLGCPLYNRRWQLWGAVIRFAAQFVPEDEVIRVAFDETTKKKAGTHIEGLDRYRNGAGSAR
Ga0105058_104755923300009837Groundwater SandMSYATRNTLFLEDIAVDTFLHSSFGTVVMTLLQGFFTVPSWQTFTSLACGWTLATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWQLW
Ga0105058_109983523300009837Groundwater SandMSHLSGQTLLLEDIAVDTLMHSSFGTVVMTLFQGLFTASSWQTFPALACGWSLATDRHTITTSLWLTGATTVKHFSRFYVFLGCPLYNRRWQLWGAV
Ga0105074_111225813300010029Groundwater SandMSYLTGQTLFLKDIAVDTFVHSSFGTVVMTLFQGLFTAPSWHTFTSLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHKRWQLWGAVIRLAAQFVPEGEV
Ga0126315_1081265813300010038Serpentine SoilMSHLPGQTLLLKDFAVDTFMHSSFGTVVMTLFQGLFTTPSWQTFTSLACGWALASDRHTITTYLWLTGATAVKHFSRFYVFLGCPLYHQRWHLWGAVIRLAAQLVPEGEVIRVSFDDTTKKKAGRHIEGLARYRNGAGSARQE
Ga0126312_1002556913300010041Serpentine SoilLLEDIAVAPFVHSSFAMVVMTLFQGFFSAPSGQTFTYLACGWSLATDRHTITTYIWLTGATAIKHFSRFYVFLGAPLYHKRCQLWGAIIRLAARFVPAGVVIRVVFDDTTKKKTGTHIEGLGRYRNGAGSAGKNTARCGA*
Ga0126314_1044027313300010042Serpentine SoilMKCHGNLNSGKMSHLPGQTHLREDIAVDTFLHSSFGTVVTTLFHDLFTAPSWHTFTYLACGWALAGDRHTITTYMWLTGAATVKHFSRFSVFLGRSIYRTL*
Ga0126382_1027472313300010047Tropical Forest SoilMVSCQYRLSSLEDIAVAPFLISSFGTFVMGLFQGLFTAPSWQSFSFLACGRALTTDRHTITNYLWLTGATTVKHCSQFYVFLGCPLYTQRWPLWGA
Ga0126382_1182194513300010047Tropical Forest SoilMALFHGLLTAPAWQSFLLLACGWAMATDRHTITTSLWLTGATSVKHFSQFSVFLGCPLYHRRWPLWGAVIRRAAQYVPESEVIQGAFDGTTKKKAGRYIEGLARYRHGAGSA
Ga0127439_12524113300010074Grasslands SoilMSHLPGQTLLLEDIAVDTFVHSSFGTMVMTLFHSLFTAPSWETFTSLTCGWALATDRHTITTYLWRTGATTVKHFSRFYIFLGCPLYNQRWHLWGAVIR
Ga0127440_101488513300010100Grasslands SoilMSHLTRQTLLLEDIAVDTFMHSSFGTVVMTLFQGLFTAPSWQTFTALACGWSLATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWQLWGA
Ga0127444_115525513300010113Grasslands SoilMSHLPGQTLLLEDIAVDTFVHSSFGTMVMTLFHSLFTAPSWETFTSLTCGWALATDRHTITTYLWLTGASAVKHFSRFYVFLGCPLYDRRGQLWGAVIRLAVQFVPAGEV
Ga0127498_108323713300010124Grasslands SoilLLEDIAVDTFVHSSFGTVVMRLFQGLFTAPSWQTFTSLACGWALATDRHTITTYLWLTGATAVKHFSRFSVFLGCPLSQQRWHLWGAVIRL
Ga0127443_101675813300010125Grasslands SoilMSHLPGQTLLLEDIAVDTFVHSSFGTMVMTLFHSLFTAPSWETFTSLTCGWALATDRHTITTYLWLTGASAVKHFSRFYVFLGCPLYDRRGQLWGAVIR
Ga0134082_1056327713300010303Grasslands SoilMSHLPGQTLLLEDIAVDTFVHSSFGTMVMTLFHSLFTAPSWETFTSLTCGWALATDRHTITTYLWLTGASAGKHFSRFYVFLGCPLYDRRGQLW
Ga0126370_1229597813300010358Tropical Forest SoilMPLFQGIFTAPSWQTFTSLACGWALATDRHTITTSLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWGAVIHCAAQLVPASEVIRVSFD
Ga0126376_1222499413300010359Tropical Forest SoilLLEDIAVDTFGHSSFGTVLMTLFQGIFTAPSWQTCTALACGWALASDRHTITTYVWLTGAAALKHFSRFYVCLGCPLYHKRWQLWGAVIRLAAQCVPEGAVIRVSFDDTTK
Ga0126372_1014812913300010360Tropical Forest SoilMSHLSGQTLLLEDIAVDTFVYSSFGTVVMTLFDDLFTGPSWRTFTLLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNQRWHLWGAVIRLAVQFVP
Ga0126377_1117667913300010362Tropical Forest SoilLLEDIAVDTFGHSSYGTVLMTLFPGIFTAPSWQTFTALACGWALASDRHTITTYVWLTGASALKHFSRFYVFLGCPLYHKRWQLWGAVIRLAAPLVPAGEVIRVSFDDTTKKKAGLHIEGLARYRNG
Ga0126377_1202612923300010362Tropical Forest SoilMEDIAVDTFVHSSFGTLVMTRCHGLFTARSWHTFIALACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGGPLYHKRCQLWGAVIRLAAQFVPENAIIQIGFDDTTKKKAGTQIEGLARYRNGAGS
Ga0126377_1359556313300010362Tropical Forest SoilMWSTQSLQCHGNLHSRKMRHLPGQTLLREDITVDTCGPSSCGPVVMPLLQGIFTVPSWHTCTSLACGWAWATDRHPMPPYGWRPGAATVKPFARFYVCLGGPLYDHRGPRWGAVRRLAAQGSPEGAGMRGSVDETTKK
Ga0126379_1317018613300010366Tropical Forest SoilMQCHGNLNSGKMSHLPRPTRLLEDIAVDTFIPSSLGTVVMSLFPGVFRTPSWRTLTVLACGWALATDRHTITTSLWLTGATALKHFSRCYGFLGCPLYQQRWHLWGAVIRLAAQFVPEGAVIHVSFDDTTKKKAGR
Ga0134122_1099856413300010400Terrestrial SoilMKCHGNLNSSKISHLSGQTYLLEDIAVDTFVHSSFGTVVMTLFHGLFTGPSWHTFTSLACGWALATDRHTITTYLWLTGAAAIKHFSCFYVFLG
Ga0137383_1025847333300012199Vadose Zone SoilMALFQGLFTVPSWHTFTALACGWTLATDRHTITTYMWLTGATAVKHFSRFYVFLGCPLYQHRWPLWGAVIRRAAQAVPEGEVIRVSFDDTTKK
Ga0137362_1167120523300012205Vadose Zone SoilMTLFQGFFTAPSWHTFTLLARGWALATDRHTITTYLWLTGATTVKHFSRFYIFLGCPLYQHRWHLWGAVIRLAAQFVPEGAV
Ga0137379_1169095913300012209Vadose Zone SoilMQCHGNPNSSKMSHLPRQTLLLEDIAVDTFVHSSFGTVVMTLFQGIFTAPSWQTFTYLACGWALAGDRHTITTYVWLTGATTVKHFSRFYVFLGCPRLCTKSIP*
Ga0137387_1075392813300012349Vadose Zone SoilLLEDIAVDTFINSSFGTVVMMLFQGIFTTPSWQTFTYLACGWALASDRHTITTSLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWGAVIRLAA
Ga0137387_1086930813300012349Vadose Zone SoilMTVDTFVHSSFGTVVMTLLHSLFTAPSRETFTALACGWALATDRHTITTYVWLTGASAVKHFSQFYVFLGCPLYHQRWHLWGAVIRLAVQFVPAGEVIRVSFDDTTKKKAGRHIEGL
Ga0137366_1016150713300012354Vadose Zone SoilMQCHGNLNSSKMSHLPRQTLSWEDIAVDTFVHSSFGTVVMPLFQGFFPAPSWHTFTLLARGWALATDRHTITTYLWLTGATTVKHFSRFYIFLGCPLYQHRWHLWGAVIRLAAQCVPEGAVIRVSFDETTKKKA
Ga0137366_1076335013300012354Vadose Zone SoilMTLFQGLFTAPSWQTFTSLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLSHRRWQLWSAVIRLAVQCVPAGEVIRVIFDDTTKKK
Ga0134032_108448313300012376Grasslands SoilMSHLPGQTLLLEDIAVDTFVHSSFGTMVMTLFHSLFTAPSWETFTSLTCGWALATDRHTITTYLWLTGASAVKHFSRFYVFLGCPLYDRRGQLWGAVIRLAVQFV
Ga0134035_104046013300012391Grasslands SoilMVMTLFHSLFTAPSWETFTSLTCGWALATDRHTITTYLWLTGASAVKHFSRFYVFLGCPLYDRRGQLWGAVIRLAV
Ga0134052_111226513300012393Grasslands SoilVDPFVHSSFGTVVMTLFQGIFTAPSWHTFTSLACGWALASDRHTLTTYVWLPGATTVTHLSRFSGFLGWSLDHRRWQLWGAVSRLAAPFHGGCQRLASCEC*
Ga0134053_116008813300012406Grasslands SoilLLEDIAVDTFVHSSFGTVVMRLFQGLFTAPSWQTFTSLACGWALATDRHTITTYLWLTGATAVKHFSRFSVFLGCPLSQQRWHLWGAVIRLVAQFVPAGEVIRVSFD
Ga0134060_119407413300012410Grasslands SoilLLEDIAVDTFVHSSFGTVVMRLFQGLFTAPSWQTFTSLACGWALATDRHTITTYLWLTGATAVKHFSRFSVFLGCPLSQQRWHLWGAVIRLAAQFV
Ga0137395_1116776913300012917Vadose Zone SoilLLEDIAVDTFVHSSFGTVVMTLFQGIFTAPSWHTFTSLACGWALASDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWGAVIRLAAQFVPEGE
Ga0126375_1208428413300012948Tropical Forest SoilMTLFQGLFTNPSWHTFTSLACGWALATDRHTITTYLWLTGATALKHFSRFYVFLGCPLYHQRGHLWGAMIRLEVQFVPAGEVMRVSFDDTTKTQAGRHLEGLARYRNGAGSAR
Ga0126369_1078340713300012971Tropical Forest SoilMDTFVVSSFGTFVMGLFHGVFTTPSRQSFGLLACGWALATDRHTITTYLWLTGATTMKHFSRFSVFLGCPLYTQRWHLWGAVIRQAARFVPEGESIQVSFDDTTKKKAGRHIEGLATAMARAQPGKNIGRCGA*
Ga0126369_1274865713300012971Tropical Forest SoilMQCHGNLNSSMMSHLSGQTLLLEDIAVDTFVYSSFGTVVMTLFDDLLTAPSRRTFTLLACGWALATDRHTITTYVWLTGATAIKHFSRFYVFLGCPLYHQRWRLWGAVIRMAAQFVPTGEVIRVSFDDTTKKK
Ga0126369_1354377013300012971Tropical Forest SoilMSHLPRQTRLLEDIAVDTFLHSSFGTVVTMLFHDVFTAPSWHTCTYLACGWALAGDRHTITTYMWLTGAATVKHFSRFYVFLGCPLYHKRWQLWG
Ga0134077_1022639313300012972Grasslands SoilMTLFQGLFTAPSWQTFTSLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWQLWGAVIRLAAPLVPEGEVIRV
Ga0134081_1028555513300014150Grasslands SoilMSHLPGQTLLLEDIAVDTFVHSSFGTVVMALFQDFFTVPSWHTFTYLACGWALATDRHTITTYVWLTGASAVKHFSQFYVFLGCPLYNRRWQLWGAVIRLAAQCVPEGEVIRASVDDTTKKKAGTHIEGLDRYRN
Ga0075313_103377813300014267Natural And Restored WetlandsMVLFQGIFTAPSWQTFTYLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWLLWGAVIGLAAQLIPNDQVIR
Ga0137409_1081126513300015245Vadose Zone SoilMTDSLAEDIAVDPFVHSSFGTVVMGLFQGRFTVPSWHTFTALACGWALATDRHTITTSMWLTGATAIKHFSRFYVFLGCPLYQQRWPLWGAVIRRAAQAVPEGEVIRVRFDDTTKKKAGTHIEGLARYRNGAGSARQA
Ga0132256_10165819723300015372Arabidopsis RhizosphereMQCHGNLNSSKMSHLPRQTLSLEDIAVDTFVHSSFGTVVMTLFQGIFTVPSWPTFTSLACGWALATDRHTITTYVWLTGAATVKHFSRFYVPNQK*
Ga0132257_10295922823300015373Arabidopsis RhizosphereMVVMTLFQGFFSAPSWQTFTYLACGWSLATDRHTITTYVWLTGATTIKHFSRFYVFLGGPLYHKRWQLWGAIIRLAVRFVPEGVVLRV
Ga0132255_10276330213300015374Arabidopsis RhizosphereMQCHGNLNSSKMSHLPRQTLSLEDIAVDTFVHSSFGTVVMTLFQGIFTVPSWPTFTSLACGWALATDRHTITTYVWLTGAATVKHFSRFYVFLGCPLYDHRWPRWGAVMRLAAQ
Ga0132255_10525065513300015374Arabidopsis RhizosphereMKCHGNLNSSKISHLSGQTYLLEDIAVDTFVHSSFGTVVMTLFHGLFTGPSWHTLTSLACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHHRWHLWGAVIRLAAQFVPEGAVIRVSFDDTTKKKAGHHIEG
Ga0182036_1032101913300016270SoilMSHLLGQTLLLEDIAVDTFLHWSFGTGVMTLFQGLFTAPSWQTFTALACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWQLWGAVI
Ga0182032_1160528813300016357SoilMSHLSEQTLLLEEMVVDTFVHSSFGTVVMTLCRSLFTAPSRETFTTLACGWALATNRHTITTYLWLTGASAVKHFSRFYVFLGCPLYQQRWHLWGAGIRLAASFVPADEVMRVSFDDTTKKKAGTQIEGLARYRNG
Ga0134083_1032630813300017659Grasslands SoilMAVDTFLASAFGTLVMAVFQGLFTTPSWHTFTALACGWALATDRHTITTYMWLTGATAVKHCSRFYVFLGCPLYHHRWHLWGAVIRLAAQFVPEGEVIRVSFDDTTKKKAGHHSDGLARYRNGAGSARQEYR
Ga0184612_1042763613300018078Groundwater SedimentMSHLTGQTLFLEDIAVDTFVHSSFGTVVMTLFQGFFSAPSWQTFTYLACGWALATERHTITTYLWLTGATTVKHFSQFYVFLGCPLYNRRWQLWGAVIRFAAQFVPEDDVIR
Ga0066655_1081969723300018431Grasslands SoilMQCHGNLNSSKMSHLPRQTLSLEDIAVDTLVHSSCGTVVMTLFQGFFPAPSWHTFTLLARGWALATDRHPITTYVWLTGATTVKHFSRFSLFLGCPLYQHRWH
Ga0190268_1147899413300018466SoilMQCHGNLNSSKMSHQTGQTLLLEDIAVDTFVHSSFGTVVMTLFQGIFTAPSWQTFTSLACGWALASDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWG
Ga0066662_1240858213300018468Grasslands SoilMQCHGNLNSSKMNHLSEPTLLLEEIAVDTFVHSSFGTVVMTLFHGLFTAPSWHTFTLLACGWALATDRHTITTYLWRTGATTVKHFSRFYIFLGCPLYNQRWHLWGAVIRLAVQFVPAGE
Ga0066662_1277507013300018468Grasslands SoilMSHLSGQTLLLEDMTVDTFVHSSFGTVVMTLLHSLFTAPSRETFTALACGWALATDRHTITTYVWLTGASAVKHFSQFYVFLGCPLYHQRWHLWGAVIRLAVQFVPAGEVIRVSFDDTTKKKAG
Ga0126371_1164310313300021560Tropical Forest SoilMECHGNLTASKMSHLSGQTLWLEDVAVDTFVNSSFGTVVMTLFQGLFTAPSWQTFTSLACGWTLARDRHTITTYLWLTGATAVKRFARFYVFLGCPLYHQRWRLWGAVLRL
Ga0209980_1046613413300024516Deep SubsurfaceMGSNTALFVEDIAVDTFMSISFGTLLMHLFQGLFTTRSWQSFTYLACGWALTTDRHTITTYLWLSGASTAKHFSRFYAFLGCPLYRQRWHLWGAVIRLADQFVPEGEVIQVLFDDTTK
Ga0209399_1044290513300025157Thermal SpringsVDTFVHSSFGTVVMTLFQGLFTTPSWHTFTSLACGWALAIDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYNRRWQLWGAVIRLAVQFVPEGEVIRVSFDDTTKKKAGTHIEG
Ga0207665_1011617013300025939Corn, Switchgrass And Miscanthus RhizosphereMSHLPGQTLLLEDIAVDTFVHSSFGTVVMTLFQGFFTAPSWQTFTSLACGWALAGDRHTITTYMWLTGAATMKHFSRFYVFLGCPLYHKRWHLWGAVIRMAAPLVPAGAIIRVSFDD
Ga0207689_1176069423300025942Miscanthus RhizosphereMQCHGNLNSSKMSHLPRQTLSLEDIAVDTFVHSSFGTVVMTLFQGLFTAPSWQTFISLACGWALATDRHTITTYLWLTGATTVKHFSRFYV
Ga0207677_1078174623300026023Miscanthus RhizosphereLLEDIAVDTFVHSSFGTVVMTLFHGLFTGPSWHTFTSLACGWALATDRHTITTYLWLTGAVTIKHFSRFYVFLGCPLYQQRWHLWGAVIRLAAQFVPPGAGVRV
Ga0209879_103461323300027056Groundwater SandLLEDIAVDTLMHSSFGTVVMTLFQGLFTASSWQTFPALACGWSLATDRHTITTSLWLTGATTVKHFSRFYVFLGCPLYNRRWQLWGAVIRLAVPFVPAGEVIRVIFDDTTKKKAGTHSEGRARSRHGAGSARQAY
Ga0209887_109501223300027561Groundwater SandMSHLTRQTLFLEDIAVDTFLSLSFGTLLMNLFQGLFTAPSWHTFTSLACGWALATDRHTITTYLWLTGATTVKHFSQFSVFLGCPLYNRRWQLWGAVIRLAARFVPEGESI
Ga0209003_110697913300027576Forest SoilVDPFVHSSFGAIVVALFQGCLSAPSWHTFTLLACGWALATDRHTITTYLWLTGATAVKHFSRFYVFLGCPLYHQRWHLWGAVIHLAAQFVPEG
Ga0209106_107678423300027616Forest SoilLLEDIAVDTFVHSSFGTVVMTLFQGIFTAPSWHTFTSLACGWALASDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWGAVIHLA
Ga0209465_1012709623300027874Tropical Forest SoilMSYLPRQTRLLEDIAVDTFVHSSFGTLVMTLFQGLFTTPSWQTFTALACGWALASDRHTITMYLWLTGAATVQHFSRFSVCLGCPLYQQRWQLWGAVIRLAAQFVPAGAVIRVSFDDTTKKKAGTHSEGLARY
Ga0209583_1001307713300027910WatershedsMKCHGNLNSGKMRHLPGQTLLLEDIAVDTFGHSSGETVVMTLLQGLFLAPSWHTCSYLACGWALAGDRHTMTTYLSLTGAATVKHFSHFYVFLGCPPSTKRWRLWGAVIRLAAQY
Ga0247822_1128768413300028592SoilMTDSLVEDIAVDPFVHSSFGTVVMALFQGLFTVPSWHTFTALACGWALATDQHTITTYMWLTGATAIKHFSRFYVFLGCPLYQQRWHLWGAVIRRAAQAVPEGEVI
Ga0247819_1107409823300028608SoilMKCHGNFNSGKMSHLSGQTLLLEDIAVDTFVHSSFGTVVMTLFQGFFSAPSRQTFTYLACGWALASDRHTITTYLWLTGATAVKHFSRFYVFLGCPLYH
Ga0307287_1023791813300028796SoilMTDSLAEDIAVDPFVHSSFGTVVMGLFQGLFTVPSWHTFTALACGWALATDRHTITTYMWLTGATAIKHFSRFYVFLGCPLYQQRWHLWGAVIRRAAQAVP
Ga0247656_124435813300030547SoilVDTFVHSSFGTLVMTLFQGIFTAPSWQTFTALACGWALAGERHTISTYMWLTGAATVKHFSRFYVFLGCPLYHKRWQLWGAVIRLAAQWVPEGEGIRVSFDDTTKKKAGTHIEGLARYRNGAGSARQEYRT
Ga0247638_119520913300030551SoilVDTFVHSSFGTLVMTLFQGIFTAPSWQTFTALACGWALAGERHTISTYMWLTGAATVKHFSRFYVFLGCPLYHKRWQLWGAVIRLAAQWVPEGEVIRV
Ga0308194_1020751913300031421SoilVDTVFVSSFGTFVMGLFRGLFTVPSWQSFALLACGWALTTDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYTQRWHLWGAVIRQAARCVPEGEVIAVAFDDTTKKKAGRHVEGLGRY
Ga0308179_105426513300031424SoilMSHLPGQTLLLEDIAVDTCVQSSCGPVVMTLLQGLLTAPSWQAFIYLACGWALATDRHTLTTAVWLPGTTTVKHLSRCSVFLGGPFSNRRWQRWGAVIRVAVQFVPQGEVIRVLCDEPTKTKA
Ga0318541_1046908913300031545SoilLLEDIAVDTFLHSSFGTVVMTLFQGLFTAPSWQTFTALACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWGAVIRLAVQFVPAGEAMRVL
Ga0318528_1017189913300031561SoilLLEDIAVDTFLHSSFGTVVMTLFQGLFTAPSWQTFTALACGWALATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWGAVIRLAVQF
Ga0307469_1129049323300031720Hardwood Forest SoilVDTCGHSSFGTVVMTLFQGFFTAPSWQTFTSLACGWSLATDRHTITTSVWLTGAATVKHFSRFSVCLGCPLYNRRWQLWGAVIRLAVPYVPAGE
Ga0307469_1253963213300031720Hardwood Forest SoilMTDSLAEDIAVDPFVHSSFGTVVIGLFQGLFTVPSWHTFTALACGWALATDQHTITTYMWLTGATAIKHFSRFYVFLGCPLYQQRWHLWGAVIRRAAQAVPEGEVIRVSFDDTTKKKAGTHIEG
Ga0318500_1049966113300031724SoilVDTFVHSSFGTVVMPLFPGLFPAPSWQTFTCLACGWSLATDRHTITTSLWLTGATTVKHFARVSVFLGGPLSHRRWQLWGAVIRLAVQFIPAGEVMRVLCDDTTKKKAGTPIEGLARYRN
Ga0318494_1083066213300031751SoilVDTFVHSSFGTVVMPLFPGLFPAPSWQTFTCLACGWSLATDRHTITTSLWLTGATTVKHFARVSVFLGGPLSHRRWQLWGAVIRLAVQFIPAGEVMRVLCDDTTKKKAGTPI
Ga0318509_1015660223300031768SoilMQCHGNINSSKMSHLSGQTLLLEDIAVDTFVYSSFGTVVMTLFYDLFTAPSWRTFILLACGWALATDRHTITTYLWLTGATTVKHFSQFSPLRKG
Ga0307473_1105121713300031820Hardwood Forest SoilMSHLTGQTLLLEDIAVDTFVHSSFGTVVMPLFHGLFTTPSWQNFTYLACGWALTTDRHTITTYVWLTGATAVKHFSRFYIFLGCPLYDKRWQLWGAGIRLAVQFVPEGEVIRVVFDDTTKKKAGTHIEGLGRYRNGAG
Ga0306919_1130223513300031879SoilMQCHGNINSSKMSHLSGQTLLLEDIAVDTFVYSSFGTVVMTLFYDLFTAPSWRTFILLACGWALATDRHTITTYLWLTGATTVKHFAQFYVFLGCPLYKGIRFRRDTQLPQ
Ga0307416_10226328113300032002RhizosphereMNAQVISFAIFHPCRVFNRCKCHGNLNSGKMSHLTGQTLLLEDIAVDTFVHSSFGTVVMTLFHSLFTAPSWHTFTSLACGWALATDRHTITTSVWLTGATTIKHFSRFYVFLGCPLYNRRWHLWGAVIRLAVQFVPEGEVIRVIFDD
Ga0318549_1046146423300032041SoilMQCHGNINSSKMSHLSGQTLLLEDIAVDTFVYSSFGTVVMTLFYDLFTAPSWRTFILLACGWALATDRHTITTYLWLTGATTVKHFAQFY
Ga0318532_1010288523300032051SoilMSHLPGQTHLREDIAVDTFVHSSFGTVVMPLFPGLFPAPSWQTFTCLACGWSLATDRHTITTYLWLTGATTVKHFSRFYVFLGCPLYHRRWQLWGAVIRLAVQFVPAGEAMRVLFD
Ga0318505_1058193113300032060SoilVDTFVHSSFGTVVMPLFPGLFPAPSWQTFTCLACGWSLATDRHTITTSLWLTGATTVKHFARVSVFLGGPLSHRRWQLWGAVIRLAVQFI
Ga0318513_1041100113300032065SoilMSHLPGQTHLREDIAVDTFVHSSFGTVVMPLFPGLFPAPSWQTFTCLACGWSLATDRHTITTSLWLTGATTVKHFSRFYVFLGCPLYHRRWQLW
Ga0310890_1170961113300032075SoilMTDSLVEDIAVDPFVHSSFGTVVMALFQGLFTVPSWHTFTALACGWALATDQHTITTYMWLTGATAIKHFSRFYVFLGCPLYQQRWHLWGAVIRRAAQAVPEG
Ga0268251_1050278513300032159AgaveMKCHGNLSSGKMSHLPRQTHLPEDIAVDTFLHSSFGAVVTMLFHDLFTAPSWHTFTYLACGWALASDRHTITTYVWLTGAATVKHFSRFYVFLGCPLYHKRWQLWGAVIRWACLLYT
Ga0307471_10016949933300032180Hardwood Forest SoilMTDSLAEDIAVDPFVHSSFGTVVMALFQGLFTVPSWHTFTALACGWALATDRHTITTYMWLTGATAVKHFSRFYVFLGCPLYQQRWHLWGAVIRRAAQAIPEGEVMRVSFDDTTKKKAGTHI
Ga0307472_10168649513300032205Hardwood Forest SoilMTDSLAEDIAVDPFVHSSFGTVVMALFQGLFTVPSWHTFTALACGWALATDRHTITTYMWLTGATAVKHCSRFSVFLGGPLYQQRWHLWGAVIRRAAQAIPEGEVMRVSFDDTTKKKAGTHIEGL
Ga0306920_10215856613300032261SoilMSYLPRQTRLLEDIAVDIFVHSSFGTVVMTLFQGLFTTPSWQTFTALACGWALAGDRHTITMYLWLTGAATVKHFSRFYVFLGCPLYQQRWQL
Ga0335084_1152594113300033004SoilVDTLVHSSCGTVVMRRFQGLLTAPSWQTFTSLACGWAVATDRHTLTTSLWLTGATAVQHLSRFSLCLGCPLYQQRWPLWGAVIRLAAPFVP
Ga0247829_1117255813300033550SoilMQCHGNLNSGRMSHLSGQTLWLEEIAVDTFVYSSFGTVIMTLFQDLFTAPSWQTFTSLACGWALATDRHTITTYMWLTGAAAVKHFSRFYVFLGCPLYH
Ga0334913_010310_118_4533300034172Sub-Biocrust SoilMFVMELFHGLFTVPSWQSFSLLACGWALATDRHTITTYMWLTGATTVKHFSQFYVFLGCPLYHRRWQLWGAVIRRAAQCVPEGRVIQVAFDDTTKKKAGHHIDSGFTMLSE
Ga0314786_013750_3_3413300034664SoilMSHLIGQTLLLEDIAVDTFVHSSFGTVVMTLFHGLFTTPSWQNFTCLACGWALTTDRHTITTYLWLTGATAVKHFSRFYIFLGCPLYDKRWQLWGAVIRLAVQFVPEGEVIRV
Ga0314792_122138_273_6683300034667SoilMQCHGNLNSSKMSHLPRQTLSLEGIAVDTFVHSSFGTVVMTLFQGIFTVPSWHTFTSLACGWALATDRHTITTYLWLTGAATVKHFSRFYVFLGCPLYDQRWHLWGAVIRLAAQFIPEGVVIRVSFDDTTKK
Ga0314800_004952_3_2513300034675SoilMSHLSGQTLWLEEIAVDTFVYSSFGTVIMTLFQDLFTAPSWQTFTSLACGWALAGERHTITTYMWLTGAATVKHFSRFYVFLG
Ga0314801_125147_337_6003300034676SoilMGLFQGLFTAPSWQSFSFLACGWALTTDRHTITTYLWLTGATTVKHFSQFYVFLGCPLYTQRWHLWGAVIRQAARFVPESEVLVVTFD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.