NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F040403



Overview

Basic Information
Family ID: F040403
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 162
Average Sequence Length: 115 residues
Representative Sequence: MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKDPILKPW
Number of Associated Samples: 122
Number of Associated Scaffolds: 162

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 93.79 %
% of genes near scaffold ends (potentially truncated): 96.91 %
% of genes from short scaffolds (< 2000 bp): 95.06 %
Associated GOLD sequencing projects: 111
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.25


Most Common Taxonomy
Group: Unclassified (63.580 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (18.518 % of family members)
Environment Ontology (ENVO): Unclassified (27.778 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (62.963 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 26.95%, β-sheet: 4.26%, Coil/Unstructured: 68.79%

Predicted 3D Structure

pTM-score: 0.25

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 162 Family Scaffolds
PF04909 | Amidohydro_2 | 2.47
PF08450 | SGL | 1.85
PF13701 | DDE_Tnp_1_4 | 1.23
PF11843 | DUF3363 | 0.62
PF17172 | GST_N_4 | 0.62
PF13570 | PQQ_3 | 0.62
PF00486 | Trans_reg_C | 0.62
PF02635 | DrsE | 0.62
PF14226 | DIOX_N | 0.62
PF12697 | Abhydrolase_6 | 0.62
PF01008 | IF-2B | 0.62
PF01292 | Ni_hydr_CYTB | 0.62
PF13189 | Cytidylate_kin2 | 0.62
PF00378 | ECH_1 | 0.62
PF13924 | Lipocalin_5 | 0.62
PF01019 | G_glu_transpept | 0.62
PF00174 | Oxidored_molyb | 0.62

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 162 Family Scaffolds
COG3386 | Sugar lactone lactonase YvrE | Carbohydrate transport and metabolism [G] | 1.85
COG3391 | DNA-binding beta-propeller fold protein YncE | General function prediction only [R] | 1.85
COG0182 | 5-methylthioribose/5-deoxyribulose 1-phosphate isomerase (methionine salvage pathway), a paralog of eIF-2B alpha subunit | Amino acid transport and metabolism [E] | 0.62
COG0405 | Gamma-glutamyltranspeptidase | Amino acid transport and metabolism [E] | 0.62
COG1184 | Translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family | Translation, ribosomal structure and biogenesis [J] | 0.62
COG1969 | Ni,Fe-hydrogenase I cytochrome b subunit | Energy production and conversion [C] | 0.62
COG2041 | Molybdopterin-dependent catalytic subunit of periplasmic DMSO/TMAO and protein-methionine-sulfoxide reductases | Energy production and conversion [C] | 0.62
COG2864 | Cytochrome b subunit of formate dehydrogenase | Energy production and conversion [C] | 0.62
COG3038 | Cytochrome b561 | Energy production and conversion [C] | 0.62
COG3658 | Cytochrome b subunit of Ni2+-dependent hydrogenase | Energy production and conversion [C] | 0.62
COG3915 | Uncharacterized conserved protein | Function unknown [S] | 0.62
COG4117 | Thiosulfate reductase cytochrome b subunit | Inorganic ion transport and metabolism [P] | 0.62
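
The "% Frequency" values in the Pfam and COG tables are consistent with small whole-number counts out of the 162 family scaffolds. The sketch below converts a reported percentage back to an approximate scaffold count. It assumes each percentage was computed as count / 162 × 100 and then rounded, which the page does not state explicitly; the function name `percent_to_count` is illustrative only.

```python
TOTAL_SCAFFOLDS = 162  # "Number of Associated Scaffolds" reported in the Overview


def percent_to_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Convert a reported percentage back to the nearest whole scaffold count.

    Assumption: the percentage was computed as count / total * 100 and rounded.
    """
    return round(percent / 100 * total)


if __name__ == "__main__":
    # The four distinct frequency values that appear in the Pfam and COG tables.
    for pct in (2.47, 1.85, 1.23, 0.62):
        print(f"{pct:.2f} % -> {percent_to_count(pct)} scaffold(s)")
```

Under that assumption, a frequency of 0.62 % corresponds to a single scaffold, so most neighboring domains listed here were observed only once.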



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 63.58 %
All Organisms | root | All Organisms | 36.42 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2070309004|prs_FIHLEPW02RU8XA | Not Available | 528
3300001372|YBBDRAFT_1200598 | Not Available | 618
3300002907|JGI25613J43889_10125910 | Not Available | 651
3300005175|Ga0066673_10743305 | Not Available | 563
3300005178|Ga0066688_10644092 | All Organisms → cellular organisms → Bacteria | 679
3300005332|Ga0066388_101253168 | All Organisms → cellular organisms → Bacteria | 1273
3300005332|Ga0066388_102494348 | Not Available | 940
3300005332|Ga0066388_105078898 | All Organisms → cellular organisms → Bacteria | 668
3300005332|Ga0066388_105261512 | Not Available | 656
3300005332|Ga0066388_105760471 | Not Available | 627
3300005332|Ga0066388_106079908 | All Organisms → cellular organisms → Bacteria | 610
3300005332|Ga0066388_108256465 | Not Available | 519
3300005363|Ga0008090_15805973 | All Organisms → cellular organisms → Bacteria | 790
3300005451|Ga0066681_10543752 | Not Available | 717
3300005554|Ga0066661_10528419 | Not Available | 709
3300005559|Ga0066700_10995004 | Not Available | 552
3300005560|Ga0066670_10381500 | All Organisms → cellular organisms → Bacteria | 862
3300005764|Ga0066903_106339645 | Not Available | 617
3300005764|Ga0066903_108974292 | Not Available | 506
3300006028|Ga0070717_10102396 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2433
3300006032|Ga0066696_10812203 | Not Available | 597
3300006052|Ga0075029_100843211 | Not Available | 626
3300006057|Ga0075026_100314193 | All Organisms → cellular organisms → Bacteria | 859
3300006057|Ga0075026_100502190 | All Organisms → cellular organisms → Bacteria | 698
3300006354|Ga0075021_10401688 | Not Available | 859
3300006797|Ga0066659_10967020 | Not Available | 712
3300007265|Ga0099794_10044602 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2119
3300009143|Ga0099792_10596579 | All Organisms → cellular organisms → Bacteria | 703
3300009826|Ga0123355_10652714 | All Organisms → cellular organisms → Bacteria | 1227
3300010046|Ga0126384_10153467 | All Organisms → cellular organisms → Bacteria | 1776
3300010046|Ga0126384_11835347 | Not Available | 576
3300010047|Ga0126382_12272907 | Not Available | 523
3300010048|Ga0126373_11091096 | All Organisms → cellular organisms → Bacteria | 864
3300010048|Ga0126373_11999582 | Not Available | 642
3300010049|Ga0123356_10973452 | Not Available | 1019
3300010159|Ga0099796_10112313 | All Organisms → cellular organisms → Bacteria | 1038
3300010159|Ga0099796_10252681 | Not Available | 732
3300010321|Ga0134067_10263620 | Not Available | 653
3300010325|Ga0134064_10486966 | Not Available | 509
3300010339|Ga0074046_10910859 | Not Available | 510
3300010358|Ga0126370_12411618 | Not Available | 523
3300010359|Ga0126376_11344187 | Not Available | 737
3300010359|Ga0126376_12779180 | Not Available | 539
3300010360|Ga0126372_12952456 | Not Available | 527
3300010361|Ga0126378_10260751 | Not Available | 1829
3300010361|Ga0126378_11547487 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 752
3300010362|Ga0126377_12068159 | Not Available | 646
3300010364|Ga0134066_10430301 | Not Available | 511
3300010376|Ga0126381_100968789 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1226
3300010376|Ga0126381_103908891 | Not Available | 581
3300010376|Ga0126381_105144804 | Not Available | 501
3300011270|Ga0137391_11250170 | Not Available | 590
3300011429|Ga0137455_1186169 | Not Available | 620
3300012096|Ga0137389_11475135 | Not Available | 576
3300012199|Ga0137383_10312569 | All Organisms → cellular organisms → Bacteria | 1151
3300012200|Ga0137382_11182789 | Not Available | 544
3300012202|Ga0137363_10549624 | All Organisms → cellular organisms → Bacteria | 973
3300012203|Ga0137399_11519374 | Not Available | 557
3300012207|Ga0137381_10713047 | All Organisms → cellular organisms → Bacteria | 872
3300012208|Ga0137376_10906672 | Not Available | 757
3300012232|Ga0137435_1281342 | Not Available | 503
3300012285|Ga0137370_10137657 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1404
3300012356|Ga0137371_10991992 | Not Available | 637
3300012357|Ga0137384_10956431 | Not Available | 689
3300012357|Ga0137384_10982310 | Not Available | 679
3300012683|Ga0137398_10145375 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1535
3300012917|Ga0137395_11251942 | Not Available | 516
3300012925|Ga0137419_10211297 | All Organisms → cellular organisms → Bacteria | 1442
3300012927|Ga0137416_10355993 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1229
3300012971|Ga0126369_10408985 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1397
3300012971|Ga0126369_11896091 | Not Available | 684
3300012971|Ga0126369_12380963 | Not Available | 616
3300014968|Ga0157379_10960462 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 813
3300015241|Ga0137418_10071779 | Not Available | 3156
3300016270|Ga0182036_10745037 | All Organisms → cellular organisms → Bacteria | 795
3300016270|Ga0182036_10790154 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 773
3300016270|Ga0182036_10801147 | All Organisms → cellular organisms → Bacteria | 768
3300016270|Ga0182036_11721118 | Not Available | 530
3300016294|Ga0182041_10081807 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2311
3300016341|Ga0182035_10941509 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 763
3300016357|Ga0182032_11087427 | All Organisms → cellular organisms → Bacteria | 685
3300016371|Ga0182034_10749956 | All Organisms → cellular organisms → Bacteria | 833
3300016371|Ga0182034_11610746 | Not Available | 570
3300016387|Ga0182040_11154212 | Not Available | 650
3300016387|Ga0182040_11293666 | Not Available | 615
3300016404|Ga0182037_11017533 | Not Available | 722
3300016404|Ga0182037_11063187 | All Organisms → cellular organisms → Bacteria | 707
3300016404|Ga0182037_11891200 | Not Available | 534
3300016422|Ga0182039_11046943 | Not Available | 733
3300016445|Ga0182038_10195939 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 1585
3300016445|Ga0182038_10567223 | Not Available | 976
3300016445|Ga0182038_11917486 | Not Available | 536
3300017654|Ga0134069_1379879 | Not Available | 510
3300017942|Ga0187808_10251433 | Not Available | 791
3300017947|Ga0187785_10764684 | Not Available | 511
3300017948|Ga0187847_10619429 | Not Available | 605
3300017955|Ga0187817_11020117 | Not Available | 530
3300017970|Ga0187783_10586451 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 807
3300017970|Ga0187783_10788079 | Not Available | 686
3300017970|Ga0187783_11256371 | Not Available | 533
3300017972|Ga0187781_11279728 | Not Available | 541
3300017975|Ga0187782_11351922 | Not Available | 559
3300018054|Ga0184621_10184145 | Not Available | 750
3300018071|Ga0184618_10369740 | Not Available | 609
3300018431|Ga0066655_11149374 | Not Available | 546
3300018433|Ga0066667_10276239 | Not Available | 1290
3300018468|Ga0066662_10646660 | Not Available | 999
3300019887|Ga0193729_1234600 | Not Available | 596
3300020004|Ga0193755_1175662 | Not Available | 632
3300020018|Ga0193721_1084635 | Not Available | 821
3300020140|Ga0179590_1154830 | Not Available | 627
3300020582|Ga0210395_11021474 | Not Available | 612
3300021086|Ga0179596_10046150 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1781
3300021406|Ga0210386_11167691 | Not Available | 652
3300021420|Ga0210394_10180349 | Not Available | 1839
3300021475|Ga0210392_10931816 | All Organisms → cellular organisms → Bacteria | 650
3300021477|Ga0210398_11441990 | Not Available | 537
3300022715|Ga0242678_1065269 | All Organisms → cellular organisms → Bacteria | 554
3300022726|Ga0242654_10377478 | All Organisms → cellular organisms → Bacteria | 539
3300024331|Ga0247668_1130186 | Not Available | 512
3300026319|Ga0209647_1126493 | Not Available | 1154
3300026340|Ga0257162_1012786 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 985
3300026341|Ga0257151_1025119 | All Organisms → cellular organisms → Bacteria | 631
3300026481|Ga0257155_1050726 | Not Available | 648
3300026482|Ga0257172_1000513 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3782
3300026482|Ga0257172_1001219 | Not Available | 3026
3300026482|Ga0257172_1026471 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1038
3300026490|Ga0257153_1111871 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 538
3300027528|Ga0208985_1084074 | All Organisms → cellular organisms → Bacteria | 613
3300027671|Ga0209588_1055169 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1284
3300027894|Ga0209068_10540582 | All Organisms → cellular organisms → Bacteria | 675
3300027903|Ga0209488_10120947 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1970
3300027903|Ga0209488_11156088 | Not Available | 524
3300027910|Ga0209583_10130613 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium | 1005
3300028808|Ga0302228_10348420 | Not Available | 660
3300031090|Ga0265760_10272056 | Not Available | 593
3300031545|Ga0318541_10461238 | All Organisms → cellular organisms → Bacteria | 711
3300031572|Ga0318515_10593089 | Not Available | 589
3300031708|Ga0310686_115601106 | Not Available | 651
3300031719|Ga0306917_10870293 | Not Available | 705
3300031771|Ga0318546_11350973 | Not Available | 501
3300031781|Ga0318547_10893719 | Not Available | 554
3300031793|Ga0318548_10503569 | Not Available | 592
3300031805|Ga0318497_10507105 | Not Available | 676
3300031879|Ga0306919_11144080 | Not Available | 592
3300031879|Ga0306919_11472097 | Not Available | 513
3300031910|Ga0306923_10849959 | All Organisms → cellular organisms → Bacteria | 1004
3300031910|Ga0306923_11373625 | Not Available | 745
3300031912|Ga0306921_10166761 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2579
3300031941|Ga0310912_10430255 | All Organisms → cellular organisms → Bacteria | 1028
3300031946|Ga0310910_10810144 | All Organisms → cellular organisms → Bacteria | 737
3300031946|Ga0310910_10846076 | Not Available | 719
3300031947|Ga0310909_10563546 | Not Available | 953
3300031947|Ga0310909_10668176 | Not Available | 865
3300031954|Ga0306926_11201847 | All Organisms → cellular organisms → Bacteria | 891
3300031954|Ga0306926_11376838 | Not Available | 820
3300031954|Ga0306926_11662969 | Not Available | 730
3300032076|Ga0306924_10519350 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 1355
3300032076|Ga0306924_11179966 | All Organisms → cellular organisms → Bacteria | 830
3300032180|Ga0307471_103231909 | All Organisms → cellular organisms → Bacteria | 578
3300032261|Ga0306920_101575160 | Not Available | 935
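
The scaffold identifiers above appear to join a numeric prefix and a scaffold name with a `|` character, and the prefixes match values in the Taxon OID column of the Associated Samples table. The following is a minimal sketch of splitting such identifiers and counting scaffolds per sample. The prefix-equals-Taxon-OID interpretation is an assumption drawn from this page's data, and the names `ScaffoldRef`, `split_scaffold_id`, and `scaffolds_per_sample` are hypothetical helpers, not part of any NMPFamsDB or IMG/M tooling.

```python
from collections import Counter
from typing import Iterable, NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # numeric prefix, assumed to be the sample's Taxon OID
    scaffold_name: str  # remainder of the identifier after the first "|"


def split_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split an identifier such as '3300005332|Ga0066388_101253168'."""
    taxon_oid, sep, name = scaffold_id.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, name)


def scaffolds_per_sample(scaffold_ids: Iterable[str]) -> Counter:
    """Count how many scaffolds share each numeric prefix."""
    return Counter(split_scaffold_id(s).taxon_oid for s in scaffold_ids)


if __name__ == "__main__":
    # Three identifiers copied from the Associated Scaffolds table.
    ids = [
        "3300005332|Ga0066388_101253168",
        "3300005332|Ga0066388_102494348",
        "2070309004|prs_FIHLEPW02RU8XA",
    ]
    for taxon_oid, count in scaffolds_per_sample(ids).items():
        print(taxon_oid, count)
```

Grouping by prefix in this way gives a quick check of how many family members come from the same sample, for example the several scaffolds that share the prefix 3300005332.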




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 18.52%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 16.05%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 16.05%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 11.11%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 6.17%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.32%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 3.70%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 3.70%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.09%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.09%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.47%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.23%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.23%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.23%
Termite Gut | Host-Associated → Arthropoda → Digestive System → Gut → Unclassified → Termite Gut | 1.23%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 0.62%
Marine Estuarine | Environmental → Aquatic → Marine → Intertidal Zone → Sediment → Marine Estuarine | 0.62%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.62%
Green-Waste Compost | Environmental → Terrestrial → Soil → Unclassified → Tropical Rainforest → Green-Waste Compost | 0.62%
Bog Forest Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil | 0.62%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.62%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.62%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.62%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 0.62%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.62%
Tropical Rainforest Soil | Environmental → Terrestrial → Soil → Unclassified → Tropical Rainforest → Tropical Rainforest Soil | 0.62%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2070309004 | Green-waste compost microbial communities at University of California, Davis, USA, from solid state bioreactor - Luquillo Rain Forest, Puerto Rico | Environmental
3300001372 | YB-Back-sed | Environmental
3300002907 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm | Environmental
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005363 | Tropical rainforest soil microbial communities from the Amazon Forest, Brazil, analyzing deforestation - Metatranscriptome F II A100 (Metagenome Metatranscriptome) | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006052 | Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013 | Environmental
3300006057 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012 | Environmental
3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009826 | Embiratermes neotenicus P1 segment gut microbial communities from Petit-Saut dam, French Guiana - Emb289 P1 | Host-Associated
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010049 | Embiratermes neotenicus P3 segment gut microbial communities from Petit-Saut dam, French Guiana - Emb289 P3 | Host-Associated
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental
3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental
3300010339 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM3 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010364 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011429 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012232 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT100_2 | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300014968 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaG | Host-Associated
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016357 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 | Environmental
3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental
3300016387 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 | Environmental
3300016404 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300017942 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_3 | Environmental
3300017947 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_4_20_MG | Environmental
3300017948 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_4_10 | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300017970 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MG | Environmental
3300017972 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP02_20_MG | Environmental
3300017975 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP15_20_MG | Environmental
3300018054 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019887 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300020018 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s2 | Environmental
3300020140 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly) | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300022715 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O (Metagenome Metatranscriptome) (v2) | Environmental
3300022726 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2) | Environmental
3300024331 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK09 | Environmental
3300026319 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes) | Environmental
3300026340 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-A | Environmental
3300026341 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-A | Environmental
3300026481Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-AEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026490Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-AEnvironmentalOpen in IMG/M
3300027528Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_O1 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300028808Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Palsa_N1_2EnvironmentalOpen in IMG/M
3300031090Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031545Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26EnvironmentalOpen in IMG/M
3300031572Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f19EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031719Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2)EnvironmentalOpen in IMG/M
3300031771Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19EnvironmentalOpen in IMG/M
3300031781Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f20EnvironmentalOpen in IMG/M
3300031793Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f21EnvironmentalOpen in IMG/M
3300031805Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f23EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
prs_01093590 | 2070309004 | Green-Waste Compost | MSSFAAHILVSASRSRLRPCLAGFLAAALFPFVSVAAPVENTIPQFGGRDFGWNANFWDFQLDPPQGSAHGPIKTDPRYPYNSQIQFGRFLRRESF
YBBDRAFT_12005981 | 3300001372 | Marine Estuarine | MSSYAARVMVSALRNRLLGAGVVSGLLAAALVPAASGAAPVANSIPQLGSLDSGWNVNFWDFQLDPPPGSGHGPMKTDPNFPYNSQIQNGGFFADGELQP
JGI25613J43889_101259102 | 3300002907 | Grasslands Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLVPSFSMAGPVEVTIPQLGSGDFGWNANFWDFQLDPPPGSGHGPMKTDPKYPYTSQCQNGGCVGGR
Ga0066673_107433051 | 3300005175 | Soil | MSPFASHMWVPTLRSRLLVSGVVAGFLVVTLIPPFSVAAFGTDSIPQLGGRDVGWNANFWDFQLNPPPGSAHGPMQTDPRYPYTSQCQNGGCNTRGDLK
Ga0066688_106440921 | 3300005178 | Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEESIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKDPILKPWAAEQMQATNEEVLSGK
Ga0066388_1012531681 | 3300005332 | Tropical Forest Soil | MSSFAAHILVSRLWSRLLANGVVVGFLAVTLISSFSAAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCISVVDPSTRPPIVNTNDPILKPWAAKQMQATNEEVL
Ga0066388_1024943481 | 3300005332 | Tropical Forest Soil | MSSFAARDLVSTLRNRLLAGGVVAGFLAVTLIPTFSVAAPAGDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPKYPYNSQCQNGGCSIEQDLRPPIVNTKDPILKPWA
Ga0066388_1041746961 | 3300005332 | Tropical Forest Soil | MWRRDFIAGLGSAAAWPLAARAFAAPILVSTLRNHLLAGGAVTGFLAITLIPSFSAAAPAKDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCIA
Ga0066388_1050788982 | 3300005332 | Tropical Forest Soil | MSSFAAHILVSTLRSRLLAGRVVAGFLAVALIPSFSAAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCIAVVDPSTRPPIVNTSD
Ga0066388_1052615121 | 3300005332 | Tropical Forest Soil | MSPFASHMWVPTLRSRLLLSGVVAGFLLVTLIPPFCVAAFGTDSIPQLGGRDFGWNANFWDFQLDPPPGSAHGPMKTDPRYPYTSECQNGRCIPVVNPTTRPP
Ga0066388_1057604712 | 3300005332 | Tropical Forest Soil | MSSSAARILASTLRSGLLAGGVVAGFLALTLIPSFSVAAPAEDAIPQLGSRDFGWNVNFWDFQLDPPQGSGHGPIKTDPSYPYNSQIQNGGFFSGGELRPPIVNAKDPILKPWAA
Ga0066388_1060799081 | 3300005332 | Tropical Forest Soil | MSSFAAHISISTSRRGLLAGSVLAGLFAFILDPSFSVAAPAEDSIPQLGSRDFGWNANFWDFQLAPPPGSAHGAMKTDPNYPYTSECQNGRCISVVDPST
Ga0066388_1082564652 | 3300005332 | Tropical Forest Soil | MSPFASHMWVPILRSRLLLSGVVAGFLVVTLIPPFCVAASGTDSIPQLGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYTSQCQNGGCNTRGDLRPPIVDTKDPILKPWAAK
Ga0008090_158059731 | 3300005363 | Tropical Rainforest Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLILSFSAAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKIDPNYPYTSECQNGRCLAVVDPSTRPPIVNTHDPILKPWAAKQMQATNEE
Ga0066681_105437521 | 3300005451 | Soil | MSPFASHMWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAAFGTDSIPQLGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYVSQCQNGGWTERLRRRG
Ga0066661_105284191 | 3300005554 | Soil | MSPFASHMWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAASGTDSIPQLGPRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYVSQCQNGGCNTRGDLRPPIVDTKDPILKPWAAK
Ga0066700_109950041 | 3300005559 | Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSAAAPAEDSIPQLGSRDFGWNANFWDFQLEPPPGSAHGPMKTDPNYPYTSECQNGRCIAVVDPSTRPP
Ga0066670_103815001 | 3300005560 | Soil | MSSFAAHIFVSPLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFTGGELRPPIVNTKDPILKPWAAEQMQATNEEVLSGKR
Ga0066903_1063396452 | 3300005764 | Tropical Forest Soil | MVTECDVEKTPMSSFAAQILVSTLRSRLLAGGVAAGFLAVILVPSFSVAVEDSVPTLGSRDFGWNANFWDFQLAPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGRDLRPPIVDTKDPILKPW
Ga0066903_1089742921 | 3300005764 | Tropical Forest Soil | MSSFATQIFLSTLRSRLLLGGIVGGFLAITLLPSFSVAGPVEDSIPQLGSRDFGWNANFWDFQLAPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGRDLRPPIVDTKDPILKPW
Ga0070717_101023961 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MSSIATHILVSTLRSRLLAGSVVAGFLAVTLIPSFSVAALAEDSIPRLGSLDSGWNANFWDFQLDPPQGSGHGPIKTDPKYPYTSQCQNGGCSGDRDLRPPIVDTKDPILKPWAAQEMQATNEEV
Ga0066696_108122031 | 3300006032 | Soil | MSPFASHMWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAASGTDSIPQPGPRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYVSQCQNGGWTERLRRRGGGVVEGCGLVYLLG
Ga0075029_1008432111 | 3300006052 | Watersheds | MSSFAGHILVSTLRSRLLAGGVVAGFLAVTPIPSFSAAAPVTDSIPQLGSRDVGWNANFWDFQLDPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGSDLRPPIVNTKDPILKPWAAEEMQATNEEVLSGKMAI
Ga0075026_1003141931 | 3300006057 | Watersheds | MSSFAAHILVSTLWSRLLAGGVVAGFLAVTLVPSFSAAVLVGDSIPQLDSRDFGWNANFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGLFAGGELRPPIVNTKDPVLKPWAAEQMQATNEEV
Ga0075026_1005021901 | 3300006057 | Watersheds | MSSSAAHILVSTLRSHLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPNHPYNSQIQNGRFFTGGELRPPIVN
Ga0075021_104016881 | 3300006354 | Watersheds | MSSFAACVLVATLRSLLLAGGVAAGFLAVILVPSFSVAEPAEDTIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPKYLYNSQIQNGGLFRGGELRPPIVNTKDPILKPWAAEHMQATNDEVLSGK
Ga0066659_109670202 | 3300006797 | Soil | MWSFAAHILVSTLRSRLLAGGVVAGFLAVTLVPTLSAAARAGDSIPQLGSRDFGWNVNFWDFQLDPPKGSAHGPMKTDPNYLDNSQIQNGGFCTGCELRPRIVNTKDP
Ga0099794_100446021 | 3300007265 | Vadose Zone Soil | MSSFAAPILVSTLRSHLLAGGVVAGFLAVTLIPSFSVVARAEDSTPQLGSRDSGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSQCQNGGCFGDRDLRPPIVNTKDPILKPWAAKHMQATNEEVLSGKMAIPFTSQS
Ga0099792_105965792 | 3300009143 | Vadose Zone Soil | MWSFAAHILVSTLRSRLLAGGVVAGFLAVTLVPSFSVAAPVEDSTPQLGSRDFGWNVNFWDFQLAPPQGSAHGPIKTDPNYPYNSQIQNGVFLRVGSSG
Ga0123355_106527142 | 3300009826 | Termite Gut | MSSFAADILVSTLRSRMLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSREFGWNANFWDFQLDPPQGSAHGPMKTDPNYPYNSQIQNGRLFSGGELRPAIVNAKDPILKPWAAEQMQATN
Ga0126384_101534672 | 3300010046 | Tropical Forest Soil | MSSFAAHILVSTLRSRLLAGSVVAGFLAVTLIPSFSAAAPADDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCISVVDPSTRPPIVNTNDPILKPWAAKHMQAT
Ga0126384_118353471 | 3300010046 | Tropical Forest Soil | MSSFATHILVSTLRGRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPMKTDPNYPYTSQCQNGGCSGDRDLRPPIVNTKDPILKPW
Ga0126382_122729071 | 3300010047 | Tropical Forest Soil | MSLFASHVWVPTLRSRLRVSGVVAGFLVATLIPPFCVAASGNDSIPQFGPRDVGWNANFWDFELDPPPGSAHGPMKIDPRYPYVSQCRNGGCNTRGDLKPPIVDTKNPILKPWAAKHMQETNEEVLSGKMAIPF
Ga0126373_110910961 | 3300010048 | Tropical Forest Soil | MSSFATHILISTLRSRLLAGGVVAGFLAVTLIPSFSMAAPLEDSIPQLGSRDFGWSANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGRLFTGGELRPPIVNAKDPILKPWAAEQMQATNEEVLSGKRPLPFI
Ga0126373_119995822 | 3300010048 | Tropical Forest Soil | MSSFAAHILESTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGRFFTGG
Ga0123356_109734521 | 3300010049 | Termite Gut | MSSFPAHIVFSTLQSRVLTGVVAGLLAVALIPSVCVAGPVEDTVPQLGSRDFGWNVNFWDFQLDPPPGSGHGPMKTDPRYPYRSQCQNGGCSGGSDLRPPIANAKDPILKPWAAAHIQ
Ga0099796_101123131 | 3300010159 | Vadose Zone Soil | MSSFAAPILVSTLRSHLLAGGVVAGFLAVTLIPSFSVVARAEDSVPQLGSRDSGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSQCQNGGCFGDRDLRPPIVNTKDPILKPWAAKHMQATNEEVLSGKM
Ga0099796_102526811 | 3300010159 | Vadose Zone Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLVPSFSMAGPVEVTIPQLGSGDFGWNANFWDFQLDPPPGSGHGPMKTDPKYPYTSQCQNGGCVGGRDLRPPIVNTKDPILKPWAAKHMQATNEEVLSGKM
Ga0134067_102636202 | 3300010321 | Grasslands Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEESIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFTGGELRPPIVNTKDPILKPWAA
Ga0134064_104869661 | 3300010325 | Grasslands Soil | MWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAAFGTDSIPQLGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYTSQCQNGGCNTRGDLRPPIVDTKDPILKPWAAKHMQETNEEVLS
Ga0074046_109108591 | 3300010339 | Bog Forest Soil | MLSFAAHVLVSTLSSRLLAGGVVAGAVTLIPSFSAAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCLALVDPSTRPPIVNTNDPILKPWAAKQMQATNEEV
Ga0126370_124116182 | 3300010358 | Tropical Forest Soil | MSSFAAHILVPTLRGRFLAGGVVAGFLAVTLIPSFSVAAPVEDSVPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGRFFTGGELRPPIVNTKDPILKPWAAEQMQATNE
Ga0126376_113441872 | 3300010359 | Tropical Forest Soil | MSSFAAHILESTLRSRLLAGGVAAGFLAVTLIPSFSVAATVEDSIPQLGSRDFGWNVNFWDFQLDPPPGSAHGPIKTDPNYPYNSQIQNGRFFTGGEL
Ga0126376_127791802 | 3300010359 | Tropical Forest Soil | MSSFAADIWVSTLRSRLLAGGAVAGFLAVTLIPSFSVAGSVEDTIPQLGSRDFGWNANFCDFQLAPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGRDLRPPIVDTKDAILKPWAADHMQATNEE
Ga0126372_129524562 | 3300010360 | Tropical Forest Soil | MSSSAAHILASTLRSRLLAGGVVAGFLALTLIPSFSVAAPVEDAIPQLGSRDFGWNVNFWDFQLDPPQGSGHGPIKTDPSYPYNSQIQNGGFFSGG
Ga0126378_102607511 | 3300010361 | Tropical Forest Soil | LPWKSRLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNANFWDFQLDPPQGSAHGPMKTDPNYSYNSQIQNGGFCTGCELQPAIVNTK
Ga0126378_115474871 | 3300010361 | Tropical Forest Soil | MSSFAAPILVPTSRNHLLAGGVVAGFLAVTLVPSFPVAARAEDRIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECENGRCLAVVDPSTRPPIVNTNDSILKPWAAKHMQTTNEEVLS
Ga0126377_120681592 | 3300010362 | Tropical Forest Soil | MSSFAADILVSTLRSRLLAGGVVAGSLAVTLIPSFSVAAPVEDSILQIGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGRFFTGGELRP
Ga0134066_104303011 | 3300010364 | Grasslands Soil | MWVSTLRSRLRVSGVVAGFLVVTLIPPFCVAASGTDSIPQLGPRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYTSQCQNGGCNTRGELRPPIVDTKDP
Ga0126381_1009687891 | 3300010376 | Tropical Forest Soil | MSSFAAHTLVSTLRSCLLAGGVIAGVLAVALIPSFSLAAPERDSIPQLGSRDFGWNANFWDFQLDPPPGSGHGPIKTDPNYPYNSQIQNGRLFTGGELRPPIVNTKDPILKPWAAEQMQATN
Ga0126381_1039088912 | 3300010376 | Tropical Forest Soil | MSSYAVHILLSTLRSRLLAGSVVAGFLAATLVPSFSLAADSIPQLGSWAANFWDFQLDPPQGSGHGPMKTDPKYPYVSQCQNGGCVAGKDYRPPIVDTKDPILKPWAAKEMQATNDEVLSGRMSIPFTS
Ga0126381_1051448042 | 3300010376 | Tropical Forest Soil | MSSFAAQILVSTLRSRLLAGSVVASFLAVTLVPSFSVAGPVEDRIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKADPSYPYTSQCQNGGCFGGADLRPPIGDTKDRILKPLAAEHMQATNEEVLSGK
Ga0137391_112501701 | 3300011270 | Vadose Zone Soil | VTVGKFWVRCDVEKTPMSSSAAHILVSTLRSHLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGGFFAGGELRPPI
Ga0137455_11861691 | 3300011429 | Soil | MSPFVSHMWVPTLRSRLLVSGVVGGFLVVTLISPFCVAAFGTDSIPQLGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPTYPYVSQCQNGGCAGGNYRPPIVDT
Ga0137389_114751352 | 3300012096 | Vadose Zone Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSAAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDTNYPYTSECQNGRCIAVVDPSTRPPIVNTNDPIL
Ga0137383_103125692 | 3300012199 | Vadose Zone Soil | MSSFAAHILISTLRSRLLAGGVVAGFLAVTLVPSFSVAAPAGDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKDPILKPWAAEQMQATNEE
Ga0137382_111827892 | 3300012200 | Vadose Zone Soil | MSSFAARILVSTLRSRLLAGGVVAGFLAVTLIPCLSVSALAEDSIPRLGSRDSGWNANFWDFQLDPLPGSGHGPMKTDSKYPYTSQCQNGGCSGDRDLWPPIVDTKD
Ga0137363_105496241 | 3300012202 | Vadose Zone Soil | MSSFAAPILVSTLRSHLLAGGVVAGFLAVTLIPSFSVVARAEDSVPQLGSRDSGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSQCQNGGCFGDRDLRPPIVNTKDP
Ga0137399_115193741 | 3300012203 | Vadose Zone Soil | MSSFAAHILVSTLRSRLLAGSVVAGFLAITLIPSFSVAAPVENSIPQLGSRDFGWNVNFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIV
Ga0137381_107130472 | 3300012207 | Vadose Zone Soil | MSSFAAHILVSTLRSHLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSGHGPVKTDPNYPYNSQIQNGRFCTGCELRPPIVNTKDPILKPWAAEQMQATNEEVLNG
Ga0137376_109066721 | 3300012208 | Vadose Zone Soil | MSGENAMSPFASHMWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAAFGTNSIPQLGGRDVGWNANFWDFQLDPPPSSAHGPMKTDPRYPYTSQCQNGGCNTRGDLRPPIVDTKDPVLKPWAAKHMQETNEEVLSGKMAIPFTS
Ga0137435_12813421 | 3300012232 | Soil | MWVPTLRSRLLVSGVVAGFLVVTLISPFCVAAFGTDSIPQLGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPTYPYVSQCQNGGCAGGNYRPPIVDTKDPILKPWAAKHM
Ga0137370_101376572 | 3300012285 | Vadose Zone Soil | MWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAAFGTDSIPQLGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYTSQCQNGGCNTRGDLRPPIVDTKDPILKP*
Ga0137371_109919921 | 3300012356 | Vadose Zone Soil | MWVPTSRSRLLVSGVVAGFLVVTLIPPFCVAKFGTDSIPQLGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYTSQCQNGGCNTRGDLRPPIVDTKDP
Ga0137384_109564311 | 3300012357 | Vadose Zone Soil | MSSFAARILVSTLRTRLLAGGVVAGFLALTVIPFFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKDPILKPWAAEQMQA
Ga0137384_109823101 | 3300012357 | Vadose Zone Soil | MWVPTLRSRLLVSGVVAGFLAVTLIPLFCVAAFGTDSIPQLGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYTSQCQNGGCNTRGDL
Ga0137398_101453752 | 3300012683 | Vadose Zone Soil | MSSFAAPILVSTLRSHLLAGGVVAGFLAVTLIPSFSVVARAEDSVPQLGSRDSGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSQCQNGGCFGDRDLRPPIVNTKDPILKPWAAKHMQA
Ga0137395_112519421 | 3300012917 | Vadose Zone Soil | VEKTPMSSFAAHIWVSTLRNRLLAGGVVAGFLAVTLIPSFSVADSVPQLGSQDFGWNANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGGFFAGGELRPPI
Ga0137419_102112973 | 3300012925 | Vadose Zone Soil | MSSFSAPILVSTLRGRLLAGGVVAGFLAVTLVPSFSVAGPVEDTIPQLGSRDFGWNANFWDFQLDPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGKDLRPPIVNTKDPILKPWAAKHMQATNEEVLSGKMA
Ga0137416_103559933 | 3300012927 | Vadose Zone Soil | MSSFAAHILVSTLRSRLLAGSVVAGFLAITLIPSFSVAAPVENSIPQLGSRDFGWNVNFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKDPILKPWAA
Ga0126369_104089854 | 3300012971 | Tropical Forest Soil | MLTVSKFWTPVPCDVEKTPMSSFAAHILVSTLRRGLLAGGVVAGFLAVALIPSFSVAAPVDDSIPQLGSRDFGWNANFWDFQLDPPQGSPHGPIKTDPNYPYNSQIQNGRLFTGGELRPPIVNTKDPILKPWAAEQMQAT
Ga0126369_118960911 | 3300012971 | Tropical Forest Soil | MSSFAAHISISTSRRGLLAGSVLAGLFAFILDPSFSVAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDHNYPYTSECQNGRCISVVDPSTRPPIVNTNDP
Ga0126369_123809631 | 3300012971 | Tropical Forest Soil | MSSFAARDLVSTLRNRLLAGGVVAGFLAATLVPSFSVAAPAGDSIPQLGSLDSGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGRLFTGGELRPPIVNT
Ga0157379_109604622 | 3300014968 | Switchgrass Rhizosphere | MWEPTLRGRLLLSGVVAGFLVVTLIPPFCGAASGTDSIPQLGGRGVGWNANFWDFQLDPPPGSAHGPMKTDPRHPYTSQCQNGGCNTRGDLRPPIVDTKDPVLKPWAAKHMQETNEEVLSGKM
Ga0137418_100717791 | 3300015241 | Vadose Zone Soil | MSSFSAPILVSTLRSRLLAGGVVAGFLAVTLVPSFSVAGPVEDTIPQLGSRDFGWNANFWDFQLDPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGKDLRPPIVNTKDPILKPWAAKHMQATNEEVLSGKMAIPFTS
Ga0182036_107450372 | 3300016270 | Soil | MSSFAAQILVPTLRSRLLAGGVAAGFLAVTLVPSFSVGGPVEDTIPQLGSRDFGWNANFWDFQLAPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGRDLRPPIVDTKDPILKPWAAE
Ga0182036_107901542 | 3300016270 | Soil | MSSFAAHISISTSRRGLLAGSVLAGLFAFILDPSFSVAAPAEDSILQLGSRDFGWNANFWDFQLDPPPGSAHGPLKTDPNYPYTSECQNGRCISVVDPSTRPPIVNTNDPILKPWAA
Ga0182036_108011471 | 3300016270 | Soil | MSSFAARDLVSTLRNRLLAGGVVAGFLAVTLIPTFSVAAPAGDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPKYPYNSQCQNGGCSVEQDLRPPIVNTKDPILKPWAAAQMQATNEEVLSG
Ga0182036_117211181 | 3300016270 | Soil | MSSFAAHISVSTSRRGLLAGSVLAGLFAFILDPSFSVAAPGEDSIPQLGSRNFGWNVNFWDFQLDAPPGSAHGPIKTDTNYPYNSQIQNGGFFSGGELRPPIVNTKDPIFVVRDAVEHRRLVPFAGE
Ga0182041_100818071 | 3300016294 | Soil | MSSFAAHISISTSRRGLLAGSVLAGLSAFILDPSFSVAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDHNYPYTSECQNGRCISVVDPSTRPPIVNTNDPILKPWAAEQMQATNEEVL
Ga0182035_109415091 | 3300016341 | Soil | MSSFAAHISISTSRRGLLAGSVLAGLSAFILDPSFSVAAPAEDSTPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCISVVDPSTRPPIVNTNDPILKPWAAKQMQATNEEVLS
Ga0182032_110874271 | 3300016357 | Soil | MSSFAGRDLVSTLRNRLLACGVVAGFLAVTLIPTFSVAAPAGDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPKYPYNSQCQNGGCSIEQDLRPPIVNTKDPILKPWAAAQMQATNEEVLSGRMAIPFASQSRCW
Ga0182034_107499562 | 3300016371 | Soil | MWSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGRLFTGGELRPPIVNTQDPILKPWAAEQMRATNEEVLSGKM
Ga0182034_116107461 | 3300016371 | Soil | MSSFAAHILVSTLRGRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPLGSAHGPIKTDPNYPYNSQIQNGDFFCRWGAPSPDREHEG
Ga0182040_111542121 | 3300016387 | Soil | MSSSAAHILVSTLRGRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSGHGPMKTDPRYPYTSQCENGGCVPGRDYRPPIVNTKDPILKPWAAEQMQATNDEVLS
Ga0182040_112936661 | 3300016387 | Soil | MSSFAAHILPSTSRSRLLAGRVVAGFLAVTPIPSFSVAALAGDAIPQLGSGNFGWNVNFWDFQLDPPQGSAHGPIKTDPNYSYNSQIQNGGFCTGCELRPAIVNTKDPILKTWAAEEMQATNEEVLSGKRPLPFIS
Ga0182037_110175332 | 3300016404 | Soil | MWSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGRLFTGGELRPPIVNT
Ga0182037_110631872 | 3300016404 | Soil | MSSFAAPILVSILPSRSLAGGVVAGFVAVTLIAPFSPVARADSIPQLGSLNSGWNANFWDFQLDPPAGSSHGPIKTDPKYPYVSQCQNGGCLVDRDFQ
Ga0182037_118912001 | 3300016404 | Soil | MSSLQAQILVPTLRSRVLAGAVAAGFLVVTLVPSFSVAGPAENTIPRLGSRDFGWSVNFWDFQLDPPPGSGHGPMKTDPKYPYNSQIQNGGLFRGGELRPPIVNAKDPILKPWAAEQMQATNDEV
Ga0182039_110469431 | 3300016422 | Soil | MSSLAAHILISTLRSRLLAGAVVAGSLAVPLIPSCSVAAPDKESVPQLGSLDSGWNVNSWDFQLDPPQGSGHGPIKTDPKYPYNSQIQNGGFFADGELRPPIVNTKDPILKPWAAEQM
Ga0182038_101959392 | 3300016445 | Soil | MSSFAAHISISTSRRGLLAGSVLAGLSAFILDPSFSVAAPAEDSTPQLGSRDFGWNANFWDFQLAPPPGSAHGPMKTDPNYPYTSECQNGRCISVVDPSTRPPIV
Ga0182038_105672233 | 3300016445 | Soil | MSSFAAHILVSTLRNRLLAGGVVAGILAVTLIPSFSAAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCISVVDPSTRPPIV
Ga0182038_119174862 | 3300016445 | Soil | MSSFAACDLASTLRDRLLAGGVVAGFLAATLVPSFSVAAPAGDSIPQLGSLDSGWNVNFWDFQLDPPQGSGHGPIKTDPKYPYNSQIQNGGFFGGGELRPPIVNA
Ga0134069_13798791 | 3300017654 | Grasslands Soil | MSPFASHMWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAASGTDSIPQLGPRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYVSQCQNGGWTERLRRRGGGVVEGC
Ga0187808_102514331 | 3300017942 | Freshwater Sediment | MSSFAAHNRVPTWRRHWLTYGVVAAFLAVTFLPSFSARAENNIPQLGSRDFGWNVNFWDFQLDPPPGSGHGPMKTDPNYPYNSQIQNGGFCTGCELRPPIVNAKDPI
Ga0187785_107646841 | 3300017947 | Tropical Peatland | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVVARAEDRVLQLGSRDLGWNANFWDFQLDPPPGSAHGPMKTDPDYPYTSQCQNGGCSGDRDLRPPIVNTKDPILKPWAAKHMQATNEEVLSGKMAIPFTSQSRC
Ga0187847_106194291 | 3300017948 | Peatland | MSSFAAHILVSTLRSRLLAGGIVAGFLAVTVIPSFSVAAPEEKTIPQLGGREFGWNVNFWDFQLDPPPGSGHGPIKTDPNYPYNSQIQNGGFFGGGELRPPIVNAKDPILKPWAAAQ
Ga0187817_110201171 | 3300017955 | Freshwater Sediment | MSSFAAHNRVPTWRRHWLTYGVVAAFLAVTFLPSFSARAENNIPQLGSRDFGWNVNFWDFQLDPPPGSGHGPMKTDPNYPYNSQIQNGGFCTGCELRPPIVNAKDPIL
Ga0187783_105864512 | 3300017970 | Tropical Peatland | MSSFGAHILVSTLRSHLLTSGVVAGFLAGTLIPSFSVAGPIEDTIPQLGSRDVGWNVNFWDFQLDPPPGSGHGPIKTDPKYPYRSQCQNGGCSGGSDLRPPI
Ga0187783_107880792 | 3300017970 | Tropical Peatland | VSSFAAHMLVSTLRRRPLAGAVVAGFLAVSLVRSFSVAGPAEGTVPQLGSRDFGWSANFWDFQLDPPAGSGHGPMKTDPEYPYVSQCQNGGCSTDRDLRPPIVNTQDPILKPWAAKQ
Ga0187783_112563712 | 3300017970 | Tropical Peatland | MSSFAAHILISTLRSRLVAGGVVAGFLAVTLIPSFSAAAPTEDSTPQLGSRDFGWNANFWDFQLAPPPGSAHGPMKTDPNYPYTSECQNGRCIALVDPSTRPPIVNTNDPILKPWAAKQMQA
Ga0187781_112797281 | 3300017972 | Tropical Peatland | MAGISTANIECSSEMTDCDVEKTPMPWFEAHILVSTLRRRLLAGGVVAGFLAVTLVPSFSVAAPVKDSIPQLGDSGWNVNFWDFQLDPPPGSAHGPIKTDPNYPYNSQIQNGGFFSGGPLRPPIVNAKD
Ga0187782_113519222 | 3300017975 | Tropical Peatland | VSSFAAHILVSTLRSHLLTSGVVVGFLAVTLIPSFSVVGPVEDTIPQLGSRGVGWNVNFWDFQLDPPPGSGHGPMKTDPKYPYRSQCQYGGCSGGSDLRPPIVNAKDPILKPWAAEHMQATNEEVPN
Ga0184621_101841451 | 3300018054 | Groundwater Sediment | MAPFASHMWVSTVRSRLLVSGVVAGFLVVTPIPPFCVAAFGTDSIPQLGGRNVAWNANFWDFQLDPPPGSAHGPMKTDPNYPYVSQCQNGGCALGDYRPPIVDTKDPILKPWAAKHMQETNE
Ga0184618_103697401 | 3300018071 | Groundwater Sediment | MSPFASHMWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAASGTDSIPQLGPRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYVSQCQNGGCNTR
Ga0066655_111493741 | 3300018431 | Grasslands Soil | MSSFAAHILVSTLGSRLLARGVVAGFLAVTLIPSFSVAAPVEDSIPQLSSRDFGWNVNFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGGFFAGGKLRPPIVNTKDPIL
Ga0066667_102762392 | 3300018433 | Grasslands Soil | MSPFASHLWVPTLRSRLLLSAVVAGFLVVTLIPPFCVAASGTDSIPQLGPRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYVSQCQNGGWTERLRRRGGGVVEGCGLVYLLGCRDQAPGV
Ga0066662_106466601 | 3300018468 | Grasslands Soil | MSPFASHMWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAAFGTDSIPQLGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPSYPYASQCQNGVCNTRGDLRPPIVDTKDPILKPWAAKHMQETNEEVLSGK
Ga0193729_12346001 | 3300019887 | Soil | MSSFAAHILVSTLPSRLLAGGVVAGFLAVTLIPFVSVAAPVEESIPQLGSRDFGWSVNFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGGFFAGGELRP
Ga0193755_11756621 | 3300020004 | Soil | MSPFASHMWVPTLRNRLLIRGVVAGFLVVTLIPPFCVAAFGTDSVPQFGGRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYTSQCQNGGCNTRGDLRPPIVDTKDPILKPWAAKHMQETNEEV
Ga0193721_10846351 | 3300020018 | Soil | MSPFASHMWVPTLRSRLLVSGVVAGFLVVTLIPPFCVAASGTDSIPQLGPRDVGWNANFWDFQLDPPPGSAHGPMKTDPRYPYVSQCQNGGCNTRGDLRPPIVDTKDPILKPWAAKHMQETNEEVLSGKMA
Ga0179590_11548301 | 3300020140 | Vadose Zone Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKD
Ga0210395_110214741 | 3300020582 | Soil | MSSFAAHTLVSTLRSRLLAGGVVAGFLAVTFVPSFSVAAPVEDSIPRLGSRAFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPP
Ga0179596_100461501 | 3300021086 | Vadose Zone Soil | MSSFSAPILVSTLRGRLLAGGVVAGFLAVTLVPSFSVAGPVEDTIPQLGSRDFGWNANFWDFQLDPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGEDLRP
Ga0210386_111676911 | 3300021406 | Soil | MSSLVSSLPAHLLVTLRGRLLAGGVVATLVAVALVPSPSAAAPESIPQLGSGDFGWNANFWDFQLDPPPGSGHGPMRTDPKYPYTSQCQNGGCFGGPDLRPPIVDTKDP
Ga0210394_101803491 | 3300021420 | Soil | MWSFGARISVSTLRSRLVAGGVVAGFLAATLAPSFSVAAPAEESIPQLGSRDFGWNVNFWDFQLDAPPGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVKT
Ga0210392_109318161 | 3300021475 | Soil | MSSFAARILVSTLRRHLLAGGVVAGFVAVTLIPSFSVAASAADSIPQLGSRDFGWNANFWDFQLDPPQGSAHGPIKTDPRYPYNSQIQNGGLFTGGELRPPIVNTKDPILKPWAAEQMQATNEEVLSGKMALPFV
Ga0210398_114419901 | 3300021477 | Soil | MSSFATHILASTSRSRLLAGGVVAGFLAVTLIPSLSVAGPVEDTIPQLGSRDFGWNVNFWDFQLDPPPGSGHGPMKTDPKYPYRSQCQNGGCSGGSDLRPPIVNAKDPILKPWAAEHMQATNEEVL
Ga0242678_10652692 | 3300022715 | Soil | MSSFAAPILVSTLRSRLLAGGVVAGWLAVTLVPSFSVAAPAEDSIPQLGSRDSGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFLRVGSSGPRS
Ga0242654_103774781 | 3300022726 | Soil | MWSFAAHTLVSTFLAPTWRSRLLAGGAVAGFLAVTLVPSFSAAAPAEDSVPQLGSRDFGWNANFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGRFFTGGELRPPIVNTKDPILKPWAAEQMQATNEEVLG
Ga0247668_11301861 | 3300024331 | Soil | MSSFAAHILGSTLRSRLLAGGAVAGFLALTLIPSLSVAAPVEDSIPQLGSRVFGWNVNFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGGFFSGGELRP
Ga0209647_11264931 | 3300026319 | Grasslands Soil | MSSFAPRILVSTLRNCLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKDPILKPWA
Ga0257162_10127861 | 3300026340 | Soil | MSSFAAPILVSTLRSHLLAGSVVAGFLAVTLIPSFSVVARAEDSVPQLGSRDSGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSQCQNGGCFGDRDLRPPIVNTKDPILKPWAAKHMQATNEEVLSGKMAIPFTSQSR
Ga0257151_10251192 | 3300026341 | Soil | MSSSAAHILVSTLRSHLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWSANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGRFFTGG
Ga0257155_10507262 | 3300026481 | Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLVPSSSVAAPVGDSIPQLGSGDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKDPILKPWAAEQIQ
Ga0257172_10005134 | 3300026482 | Soil | MSSFATHLSVSTLRSPLLAGGVVVGCLAVILIPSFSMAAPVKDSIPQLGSRDFGWNANFWDFQLDPPQGSAHGPIKTHPNYPYNSQIQNGRLFTGGELRPPIVNTKDPILKPWAAEQMQATNEEVLSG
Ga0257172_10012191 | 3300026482 | Soil | MSSFAAHILVSTLRSRLLAGSVVAGFLAITLIPSFSVAAPVENSIPQLGSRDFGWNVNFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKDP
Ga0257172_10264713 | 3300026482 | Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELRPPIVNTKDPILKPW
Ga0257153_11118712 | 3300026490 | Soil | MSSFAAHILVSTLRSRFLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGVFLRVGSSGPRS
Ga0208985_10840741 | 3300027528 | Forest Soil | MSSFAAHISVSTLRSRLLAGGVVAGFLAVTLVLSFSEAARAEDTIPQFGSRDFGWNVNFWDFQLDPPLGSGHGPMKTDPKYPYRSQCQNGGCSGDSDLRPPIVNTKDPILKGWAAEHMQATNEEVLNGRMAIPFTSQSRCWPGGVPGQ
Ga0209588_10551691 | 3300027671 | Vadose Zone Soil | MSSFAAPILVSTLRSHLLAGGVVAGFLAVTLIPSFSVVARAEDSTPQLGSRDSGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSQCQNGGCFGDRDLRPPIVNTKDPILKPWAAKHMQA
Ga0209068_105405822 | 3300027894 | Watersheds | MSSSAAHILVSTLRSHLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPPGSGHGPIKTDPNYPYNSQIQNGRFFTGGELRPPIVNTKDPILKPWAAEQMQATN
Ga0209488_101209471 | 3300027903 | Vadose Zone Soil | VSSFAAHILVSTFWSRLLAGGVVAGFLAVSLVASFSVAGPVEESIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYNSQIQNGRFFTGGELLPPIVNAKDPILKP
Ga0209488_111560882 | 3300027903 | Vadose Zone Soil | MSSFAAHILVSALRSRLLTGGVVAGFLAVTLVPSFSVAGPVEDTIPRFGSRDFGWNANFWDFQLDPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGRDLRPPIVDTKDPILKPWAA
Ga0209583_101306132 | 3300027910 | Watersheds | MSSSAAHILVSTLRSRLLAGGVVAGFLAVTLVPPFSVAAPVGDSIPQLGSRDFGWNANFWDFQLDPPQGSAHGPIKTDPKYPYNSQIQNGGFFADGEL
Ga0302228_103484201 | 3300028808 | Palsa | MSSFAAHILVSTLRSRLLAGGIVAGFLAVTVIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGGFFAGGELR
Ga0265760_102720561 | 3300031090 | Soil | MSSFAAHILVSTLRNRLLAGGVVAGFLAVTLPSFSVAAPVEDSIPRLGSRDFGWNANFWDFQLDPPLGSAHGPIKTDPNYPYNSQIQYGRFFADGELRPPIVNAKD
Ga0318541_104612381 | 3300031545 | Soil | MSSFAVHILVSTLRSRLLAGGLVAGFLAVTLIPSFLVAAAAEDSTPRLGSGDFGWNANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGRLFTGGELRPPIVNTKDPILKTWAAEQMQATNEEVLSGKMALPFVSQSRCWPGGVPGQLLF
Ga0318515_105930891 | 3300031572 | Soil | MSSFAAHISISTSRRGLLAGSVLAGLSAFILDPSFSVAAPAEDSTPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCISVVDPSTRPPIVNTNDPIL
Ga0310686_1156011062 | 3300031708 | Soil | MSSSFAAHILLSTLRSRLLARGVVARFVAVTLLPSFSVAAPVEDSIPRLGSRDSGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQYGHRFTGGELRPPIVNAKDPILKPWAAERMQATNEEVLSGKKAI
Ga0306917_108702932 | 3300031719 | Soil | MSSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSAAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCIALVDPSTRPPIVNTNDPILKPWAA
Ga0318546_113509731 | 3300031771 | Soil | MSSFAAHILVSTLRSHLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYSYNSQIQNGGFCTGCELRPAIVNTKDP
Ga0318547_108937191 | 3300031781 | Soil | MSSFAVHTLVSILRSRLLAGGAVAGLLAVTLALSFSVAAPAADSIPQLGSGDFGWNANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGRLFTGGELQPPIVNTKDPILKPWAAAQM
Ga0318548_105035692 | 3300031793 | Soil | MKNTVAGGVLAGSLALTVIPSFSVAAPVGDSIPQLGSLDSGWNVNFWDFQLDPPQGSGHGPIKTDPKYPYNSQIQNGGFFAGGELRPPIVNTKDPILKPWAAAQMQA
Ga0318497_105071051 | 3300031805 | Soil | MSSFAAHISVSTSRRGLLAGSVLAGLFAFILDPSFSVAAPGEDSIPQLGSRNFGWNVNFWDFQLDPPPGSAHGPIKTDPRYPYNSQIQNGGFFSGGELRPPIVNTKDPILKPWAAEQMQATND
Ga0306919_111440801 | 3300031879 | Soil | MSSFAACDLASTLRDRLLAGGVVAGFLAATLVPSFSVAAPAGDSIPQLGSLDSGWNVNFWDFQLDPPQGSGHGPIKTDPKYPYNSQIQNGGFFAGGELRP
Ga0306919_114720971 | 3300031879 | Soil | MSSFAAHISISTSRRGLLAGSVLAGLSAFILDPSFSVAAPAEDSTPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCISVVDPSTRPPIVNTNDPILKPW
Ga0306923_108499593 | 3300031910 | Soil | MSSLAAHILISTLRSRLLAGAVASGSLAVTLIPSCSVAAPDKESVPQLGSLDSGWNVNFWDFQLDPPPGSGHGPIKTDPKYPYNRQIQNGGFFAGGELRPPI
Ga0306923_113736251 | 3300031910 | Soil | MSSFAAHILPSTSRSRLLAGGVVAGFLAVTPIPSFSVAALAGDAIPQLGSGNFGWNVNFWDFQLDPPQGSAHGPIKTDPNYSYNSQIQNGGFCTGCELRP
Ga0306921_101667613 | 3300031912 | Soil | MSSFAVHTLVSILRSRLLAGGAVAGLLAVTLALSFSVAAPAADSIPQLGSGDFGWNANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGRLFTGGELQPPIVNTKDPILKPWAAAQMQATNDEVLSGKTALPFV
Ga0310912_104302552 | 3300031941 | Soil | MWSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGRLFTGGELRPPIVNTQDPILKPWAAEQMQATNEEVLSGKMA
Ga0310910_108101441 | 3300031946 | Soil | MSSFAACDLASTLRDRLLAGGVVAGFLAATLVPSFSVAALAGDSIPQLGSLDSGWNVNFWDFQLDPPQGSGHGPIKTDPKYPYNSQIQNGGFFAGGELRPPIVNSKDPILKPWAAEQMQATNEEVL
Ga0310910_108460762 | 3300031946 | Soil | MWSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPQGSAHGPIKTDPNYPYNSQIQNGRFFTGGELRPP
Ga0310909_105635462 | 3300031947 | Soil | MSSFAVHILVSTLRSRLLAGGLVAGFLAVTLIPSFLVAAAAEDSTPRLGSGDFGWNANFWDFQLDPPQGSGHGPMKTDPNYPYNSQIQNGRLFTGGEL
Ga0310909_106681761 | 3300031947 | Soil | MSSFAAHILVSTLRGRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNVNFWDFQLDPPLGSAHGPIKTDPNYPYNSQIQNGDFFCRWGAPSP
Ga0306926_112018472 | 3300031954 | Soil | MSSLQAQILVSTLRSRLLAGRVAAGFLAVTLVPSFSVGGPVEDTIPQLGSRDFGWNANFWDFQLAPPPGSGHGPMKTDPKYPYTSQCQNGGCVPGRDYRPPIVNTKDPILKPWAAEEMQATNE
Ga0306926_113768381 | 3300031954 | Soil | MSSFAARDLVSTLRNRLLAGGVVAGFLAVTLIPTFSVAAPAGDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPKYPYNSQCQNGGCSVEQDLR
Ga0306926_116629691 | 3300031954 | Soil | MSSFAAHILPSTLRGRLVVAGFLAVALVPPCSVAAPAADSIPQLGSLNSGWNANFWDFQLDPPTGSGHGPMKTDPKYPYTSQCQNGGCFGGKDLRPPIVDTKDPILKPWAAGEMQATNEEVLS
Ga0306924_105193501 | 3300032076 | Soil | MSSFAAHISISTSRRGLLAGSVLAGLSAFILDPSFSVAAPAEDSTPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCITVVDPSTRPPIV
Ga0306924_111799662 | 3300032076 | Soil | MSSLQAQILVSTLRSRLLAGRVAAGFLAVTLVPSFSVAGPVEDSTPQLGSRDFGWNANFWDFQLAPPPGSGHGPMKTDPKYPYTSQCQNGGCFGGRDLRPPIVDTKDPILKPWAAEHMQATN
Ga0307471_1032319091 | 3300032180 | Hardwood Forest Soil | MSSFAAHLLVSTLWSRLLAGGVVAGFLAVTLIPSFSAAAPAEDSIPQLGSRDFGWNANFWDFQLDPPPGSAHGPMKTDPNYPYTSECQNGRCIAVVDPSTRPPIVNTNDPILKPWAAKQMQATNEEVLSGK
Ga0306920_1015751601 | 3300032261 | Soil | MWSFAAHILVSTLRSRLLAGGVVAGFLAVTLIPSFSVAAPVEDSIPQLGSRDFGWNANFWDFQLDPPQGSGHGPIKTDPNYPYNSQIQNGRLFTGGELRPPIVNTQDPIL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice