NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F071426


Overview

Basic Information
Family ID F071426
Family Type Metagenome / Metatranscriptome
Number of Sequences 122
Average Sequence Length 182 residues
Representative Sequence AQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTVSSAFPTAKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVASTRTGYANRPNTGN
Number of Associated Samples 118
Number of Associated Scaffolds 122

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 10.66 %
% of genes near scaffold ends (potentially truncated) 79.51 %
% of genes from short scaffolds (< 2000 bps) 98.36 %
Associated GOLD sequencing projects 116
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.13

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
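
The percentages in the Quality Assessment block can be related back to member counts. The following is a minimal sketch, assuming each percentage is taken over the family's 122 sequences (the page does not state the denominator explicitly); `pct_to_count` is a hypothetical helper, not part of NMPFamsDB.

```python
FAMILY_SIZE = 122  # "Number of Sequences" from Basic Information


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage back to the nearest whole count."""
    return round(pct / 100 * total)


# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motifs": 10.66,
    "near scaffold ends": 79.51,
    "short scaffolds (< 2000 bps)": 98.36,
}

# Assumption: every percentage uses the same denominator (FAMILY_SIZE).
counts = {label: pct_to_count(pct) for label, pct in reported.items()}
print(counts)
```

Under that assumption, 98.36 % corresponds to 120 of 122 genes, which leaves two family members on scaffolds of 2000 bps or longer.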
Hidden Markov Model

Most Common Taxonomy
Group Unclassified (83.607 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(36.066 % of family members)
Environment Ontology (ENVO) Unclassified
(26.230 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(53.279 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: No
Secondary Structure distribution: α-helix: 1.96%, β-sheet: 0.00%, Coil/Unstructured: 98.04%

Predicted 3D Structure

pTM-score: 0.13

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 122 Family Scaffolds
PF00496	SBP_bac_5	0.82
PF00078	RVT_1	0.82
PF13408	Zn_ribbon_recom	0.82
PF02518	HATPase_c	0.82




Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	83.61 %
All Organisms	root	All Organisms	16.39 %


Associated Scaffolds


Scaffold	Taxonomy	Length (bp)
3300001086|JGI12709J13192_1010764	Not Available	745
3300001089|JGI12683J13190_1005289	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Granulicella → Granulicella aggregans	1467
3300001593|JGI12635J15846_10358818	Not Available	891
3300001593|JGI12635J15846_10600089	Not Available	640
3300002245|JGIcombinedJ26739_101336773	Not Available	608
3300002677|Ga0005475J37263_108736	Not Available	899
3300002917|JGI25616J43925_10063857	All Organisms → cellular organisms → Bacteria → Proteobacteria	1567
3300004281|Ga0066397_10051911	Not Available	743
3300004608|Ga0068924_1307658	Not Available	618
3300005180|Ga0066685_10825721	Not Available	626
3300005332|Ga0066388_104545823	Not Available	706
3300005471|Ga0070698_100078335	All Organisms → cellular organisms → Bacteria → Proteobacteria	3304
3300005552|Ga0066701_10322677	Not Available	957
3300005598|Ga0066706_11289601	Not Available	553
3300005938|Ga0066795_10258714	Not Available	515
3300006050|Ga0075028_100858768	Not Available	556
3300006057|Ga0075026_100665218	Not Available	619
3300006176|Ga0070765_101470431	Not Available	641
3300006796|Ga0066665_10975822	Not Available	652
3300006864|Ga0066797_1119212	Not Available	923
3300006914|Ga0075436_100131583	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1754
3300007255|Ga0099791_10140353	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium lablabi	1126
3300009029|Ga0066793_10686595	Not Available	582
3300009143|Ga0099792_10223123	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium lablabi	1083
3300009168|Ga0105104_10685358	Not Available	588
3300009792|Ga0126374_10713380	Not Available	756
3300010147|Ga0126319_1107547	Not Available	542
3300010376|Ga0126381_104597693	Not Available	532
3300010856|Ga0126358_1205431	Not Available	925
3300010862|Ga0126348_1090845	Not Available	516
3300010862|Ga0126348_1195971	Not Available	579
3300011271|Ga0137393_11478632	Not Available	569
3300011305|Ga0138532_1079833	Not Available	646
3300012096|Ga0137389_11072077	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	690
3300012189|Ga0137388_11472886	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	618
3300012203|Ga0137399_11402828	Not Available	585
3300012357|Ga0137384_11438103	Not Available	538
3300012359|Ga0137385_10853377	Not Available	755
3300012925|Ga0137419_11410366	Not Available	588
3300012971|Ga0126369_10670675	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Rhodovulum → unclassified Rhodovulum → Rhodovulum sp. PH10	1112
3300015052|Ga0137411_1275576	Not Available	1203
3300016270|Ga0182036_10761615	Not Available	787
3300016294|Ga0182041_10493168	Not Available	1061
3300016341|Ga0182035_10712057	Not Available	875
3300016341|Ga0182035_11560871	Not Available	595
3300016357|Ga0182032_10944480	Not Available	735
3300016387|Ga0182040_11008096	Not Available	694
3300017654|Ga0134069_1295253	Not Available	573
3300017970|Ga0187783_10989614	Not Available	606
3300018027|Ga0184605_10152828	Not Available	1039
3300018051|Ga0184620_10089765	Not Available	933
3300018067|Ga0184611_1199706	Not Available	711
3300018073|Ga0184624_10378776	Not Available	631
3300018081|Ga0184625_10503887	Not Available	610
3300019789|Ga0137408_1251793	All Organisms → cellular organisms → Bacteria → Proteobacteria	1179
3300020022|Ga0193733_1068653	Not Available	996
3300020062|Ga0193724_1056605	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium lablabi	826
3300020581|Ga0210399_11027406	Not Available	663
3300021088|Ga0210404_10347874	Not Available	822
3300021403|Ga0210397_10191488	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. 139	1456
3300021418|Ga0193695_1089839	Not Available	662
3300021420|Ga0210394_10920096	Not Available	760
3300021432|Ga0210384_11741749	Not Available	528
3300022498|Ga0242644_1005139	Not Available	1038
3300022503|Ga0242650_1007300	Not Available	768
3300022504|Ga0242642_1033103	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Aquabacter → Aquabacter spiritensis	755
3300022523|Ga0242663_1017791	Not Available	1038
3300022525|Ga0242656_1134548	Not Available	511
3300022528|Ga0242669_1002936	Not Available	1769
3300022718|Ga0242675_1032723	Not Available	799
3300022718|Ga0242675_1095410	Not Available	566
3300022724|Ga0242665_10151493	All Organisms → cellular organisms → Bacteria → Proteobacteria	733
3300026340|Ga0257162_1015478	Not Available	904
3300026351|Ga0257170_1034152	Not Available	691
3300026360|Ga0257173_1018902	Not Available	852
3300026494|Ga0257159_1021862	All Organisms → cellular organisms → Bacteria → Proteobacteria → Acidithiobacillia → Acidithiobacillales → Acidithiobacillaceae → Acidithiobacillus → unclassified Acidithiobacillus → Acidithiobacillus sp. MC6.1	1046
3300026551|Ga0209648_10421956	Not Available	854
3300027587|Ga0209220_1117401	Not Available	695
3300027633|Ga0208988_1030050	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	1399
3300027667|Ga0209009_1155894	Not Available	581
3300027862|Ga0209701_10523255	Not Available	641
3300027910|Ga0209583_10437743	Not Available	632
3300028792|Ga0307504_10199312	Not Available	708
3300028876|Ga0307286_10230828	Not Available	675
3300028880|Ga0307300_10081724	Not Available	955
3300029636|Ga0222749_10429840	Not Available	704
3300030548|Ga0210252_10162487	Not Available	521
3300030741|Ga0265459_12653409	Not Available	620
3300030945|Ga0075373_11663348	Not Available	939
3300030967|Ga0075399_11369303	Not Available	595
3300031094|Ga0308199_1016726	All Organisms → cellular organisms → Bacteria → Proteobacteria	1193
3300031096|Ga0308193_1055195	Not Available	605
3300031152|Ga0307501_10026280	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei	1153
(restricted) 3300031197|Ga0255310_10170203	Not Available	603
3300031543|Ga0318516_10386667	Not Available	807
3300031564|Ga0318573_10472846	Not Available	674
3300031572|Ga0318515_10513034	Not Available	639
3300031679|Ga0318561_10715790	Not Available	550
3300031680|Ga0318574_10276873	Not Available	973
3300031720|Ga0307469_10923681	Not Available	810
3300031724|Ga0318500_10177998	Not Available	1011
3300031744|Ga0306918_10494103	Not Available	958
3300031747|Ga0318502_10578958	Not Available	675
3300031763|Ga0318537_10014621	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Rhizobiales bacterium GAS113	2685
3300031770|Ga0318521_10761739	Not Available	589
3300031771|Ga0318546_10633007	Not Available	752
3300031781|Ga0318547_10903175	Not Available	551
3300031796|Ga0318576_10192342	Not Available	959
3300031798|Ga0318523_10397873	Not Available	684
3300031835|Ga0318517_10142350	Not Available	1069
3300031879|Ga0306919_11284360	Not Available	554
3300031896|Ga0318551_10883413	Not Available	521
3300031910|Ga0306923_10820345	Not Available	1025
3300031942|Ga0310916_10479572	Not Available	1059
3300031954|Ga0306926_12431660	Not Available	576
3300032052|Ga0318506_10280076	Not Available	739
3300032063|Ga0318504_10503198	Not Available	580
3300032066|Ga0318514_10610099	Not Available	581
3300032076|Ga0306924_10689152	All Organisms → cellular organisms → Bacteria	1148
3300032090|Ga0318518_10560457	Not Available	584
3300033290|Ga0318519_10598909	Not Available	669
3300034417|Ga0364941_184694	Not Available	536
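
Each scaffold identifier in the table above appears to combine a numeric IMG/M taxon OID and a scaffold name, joined by `|`; the numeric prefix matches the Taxon OID column of the Associated Samples table. The following is a minimal Python sketch for splitting such identifiers, assuming that layout holds for every row. `ScaffoldRef` and `parse_scaffold_id` are hypothetical names, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric dataset ID, e.g. "3300001086"
    scaffold: str     # scaffold name within that dataset
    restricted: bool  # True when the row carries a "(restricted)" prefix


def parse_scaffold_id(raw: str) -> ScaffoldRef:
    """Split 'TAXON_OID|SCAFFOLD' into its parts (assumed layout)."""
    text = raw.strip()
    marker = "(restricted)"
    restricted = text.startswith(marker)
    if restricted:
        text = text[len(marker):].strip()
    taxon_oid, sep, scaffold = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldRef(taxon_oid, scaffold, restricted)


# Identifiers copied from the Associated Scaffolds table.
open_ref = parse_scaffold_id("3300001086|JGI12709J13192_1010764")
restricted_ref = parse_scaffold_id("(restricted) 3300031197|Ga0255310_10170203")
```

Grouping the parsed records by `taxon_oid` gives one possible way to join scaffolds to their sample descriptions.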

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	36.07%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	9.84%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	9.02%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	7.38%
Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil	7.38%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	4.10%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	3.28%
Watersheds	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds	2.46%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	2.46%
Boreal Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil	2.46%
Peatlands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil	1.64%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	1.64%
Soil	Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil	1.64%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	1.64%
Freshwater Sediment	Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment	0.82%
Soil	Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil	0.82%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil	0.82%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	0.82%
Tropical Peatland	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland	0.82%
Prmafrost Soil	Environmental → Terrestrial → Soil → Wetlands → Permafrost → Prmafrost Soil	0.82%
Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil	0.82%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	0.82%
Sandy Soil	Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil	0.82%
Sediment	Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment	0.82%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	0.82%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID	Sample Name	Habitat Type
3300001086	Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3	Environmental
3300001089	Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3	Environmental
3300001593	Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2	Environmental
3300002245	Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)	Environmental
3300002677	Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF124 (Metagenome Metatranscriptome, Counting Only)	Environmental
3300002917	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm	Environmental
3300004281	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 30 MoBio	Environmental
3300004608	Peat soil microbial communities from Weissenstadt, Germany - Metatranscriptome 9 (Metagenome Metatranscriptome)	Environmental
3300005180	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134	Environmental
3300005332	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)	Environmental
3300005471	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG	Environmental
3300005552	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150	Environmental
3300005598	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155	Environmental
3300005938	Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 2 DNA2013-191	Environmental
3300006050	Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014	Environmental
3300006057	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012	Environmental
3300006176	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5	Environmental
3300006796	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114	Environmental
3300006864	Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 3 DNA2013-193	Environmental
3300006914	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5	Host-Associated
3300007255	Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1	Environmental
3300009029	Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 1 DNA2013-189	Environmental
3300009143	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2	Environmental
3300009168	Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015	Environmental
3300009792	Tropical forest soil microbial communities from Panama - MetaG Plot_12	Environmental
3300010147	Soil microbial communities from California, USA to study soil gas exchange rates - BB-CA-RED metaT (Metagenome Metatranscriptome)	Environmental
3300010376	Tropical forest soil microbial communities from Panama - MetaG Plot_28	Environmental
3300010856	Boreal forest soil eukaryotic communities from Alaska, USA - W4-2 Metatranscriptome (Eukaryote Community Metatranscriptome)	Environmental
3300010862	Boreal forest soil eukaryotic communities from Alaska, USA - C4-4 Metatranscriptome (Eukaryote Community Metatranscriptome)	Environmental
3300011271	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG	Environmental
3300011305	Peat soil microbial communities from Weissenstadt, Germany - Metatranscriptome 10 (Metagenome Metatranscriptome) (version 2)	Environmental
3300012096	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG	Environmental
3300012189	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG	Environmental
3300012203	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG	Environmental
3300012357	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG	Environmental
3300012359	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG	Environmental
3300012925	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)	Environmental
3300012971	Tropical forest soil microbial communities from Panama - MetaG Plot_1	Environmental
3300015052	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)	Environmental
3300016270	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080	Environmental
3300016294	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178	Environmental
3300016341	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170	Environmental
3300016357	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000	Environmental
3300016387	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176	Environmental
3300017654	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG	Environmental
3300017970	Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MG	Environmental
3300018027	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex	Environmental
3300018051	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1	Environmental
3300018067	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_coex	Environmental
3300018073	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_b1	Environmental
3300018081	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1	Environmental
3300019789	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)	Environmental
3300020022	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s2	Environmental
3300020062	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a1	Environmental
3300020581	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M	Environmental
3300021088	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M	Environmental
3300021403	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O	Environmental
3300021418	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3s2	Environmental
3300021420	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M	Environmental
3300021432	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M	Environmental
3300022498	Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-32-O (Metagenome Metatranscriptome) (v2)	Environmental
3300022503	Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-2-O (Metagenome Metatranscriptome) (v2)	Environmental
3300022504	Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-2-M (Metagenome Metatranscriptome) (v2)	Environmental
3300022523	Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-O (Metagenome Metatranscriptome) (v2)	Environmental
3300022525	Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-M (Metagenome Metatranscriptome) (v2)	Environmental
3300022528	Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O (Metagenome Metatranscriptome) (v2)	Environmental
3300022718	Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O (Metagenome Metatranscriptome) (v2)	Environmental
3300022724	Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)	Environmental
3300026340	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-A	Environmental
3300026351	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-B	Environmental
3300026360	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-B	Environmental
3300026494	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-A	Environmental
3300026551	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)	Environmental
3300027587	Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)	Environmental
3300027633	Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M1 (SPAdes)	Environmental
3300027667	Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)	Environmental
3300027862	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)	Environmental
3300027910	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)	Environmental
3300028792	Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S	Environmental
3300028876	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_140	Environmental
3300028880	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_181	Environmental
3300029636	Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)	Environmental
3300030548	Metatranscriptome of forest soil microbial communities from Boreal Montmorency Forest, Quebec, Canada - FO133-ANR016SO (Eukaryote Community Metatranscriptome)	Environmental
3300030741	Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada ANR Co-assembly	Environmental
3300030945	Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA10 EcM (Eukaryote Community Metatranscriptome)	Environmental
3300030967	Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA11 Emin (Eukaryote Community Metatranscriptome)	Environmental
3300031094	Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_203 (Metagenome Metatranscriptome)	Environmental
3300031096	Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_194 (Metagenome Metatranscriptome)	Environmental
3300031152	Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_S	Environmental
3300031197 (restricted)	Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1	Environmental
3300031543	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f20	Environmental
3300031564	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f21	Environmental
3300031572	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f19	Environmental
3300031679	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23	Environmental
3300031680	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22	Environmental
3300031720	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515	Environmental
3300031724	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f20	Environmental
3300031744	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)	Environmental
3300031747	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f22	Environmental
3300031763	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f29	Environmental
3300031770	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f17	Environmental
3300031771	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19	Environmental
3300031781	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f20	Environmental
3300031796	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f24	Environmental
3300031798	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f19	Environmental
3300031835	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f21	Environmental
3300031879	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)	Environmental
3300031896	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f19	Environmental
3300031910	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)	Environmental
3300031942	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176	Environmental
3300031954	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)	Environmental
3300032052	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f19	Environmental
3300032063	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f17	Environmental
3300032066	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f18	Environmental
3300032076	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)	Environmental
3300032090Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f22EnvironmentalOpen in IMG/M
3300033290Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f15EnvironmentalOpen in IMG/M
3300034417Sediment microbial communities from East River floodplain, Colorado, United States - 17_s17EnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI12709J13192_101076413300001086Forest SoilLVRVHPRHSGLPSASPPELSWRHAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPXELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKERMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRPNTGN*
JGI12683J13190_100528923300001089Forest SoilLPSASPPELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKERMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRPNTGN*
JGI12635J15846_1035881823300001593Forest SoilVALRRFYQAGRHSDEEPAKVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSCTPAVHRVLSPVSSTMSSAFPKRRMGRLPAFCPTNYDFSQVRYFEDADFLDVPASKLLSPQIVPTAAHTAAGQPG
JGI12635J15846_1060008913300001593Forest SoilHLLRGRYPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHHVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRPNAGN*
JGIcombinedJ26739_10133677313300002245Forest SoilLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHHVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRPNAGN*
Ga0005475J37263_10873613300002677Forest SoilVALRRFYQAGRHTAKKPPSVQSPFAQLRCYLRWGDVSAISSEGVAPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISESLSLDAGSPTPAVYRVLSPVSSTVSSAFPTRKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN*
JGI25616J43925_1006385723300002917Grasslands SoilLPSASPPELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKERMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAAHTAAGQPGILRPGLSCFVASTRTGYANRPNTGN*
Ga0066397_1005191113300004281Tropical Forest SoilPWLGEVQTLVRVHPCHSWFAHSLAPLSRHGVGAGFIKPAATRPSRPPSVQSPFAQLGCNLRWGDVVPVSSEGITPPSSLILAHSPLPLGSLLLRLLASFGESLQVVTSPCCPRQLPDVMSENLSLDAGPHSPAVRRVLAPVSSTASSAFPTEGCGSASRVSSANNDFSQGMFRGRRYFVMFRPPSSLAPRIVPTAATTPAGQLGLLRPGLSCFVAAARTGYANRPTSGNWRYGDLHPARLSALSAAP
Ga0068924_130765813300004608Peatlands SoilQLRCYLRWGDVGAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSHTPAVHRVLSPVSSTMSSAFPT*RLGRLPAFIPRITTSRRALSRGCRYSFMFRPPSLRAPQIVPTAASTFAGQPGLLRPGLSCFVTSARTGYANRLTQAIDGTGTFTLSDSQPCRLLTSWKSFAHGCCR*
Ga0066685_1082572113300005180SoilPFRLRVSHHLDRATFPASAKSRAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTMSSAFPTGKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAAHTAAGQPGLLHPGLSCSFTRTGYANRPTQAIDGTGTFTLSDSQPCRLLPSHSRTLYSAREREPGRGSRWRP
Ga0066388_10454582313300005332Tropical Forest SoilVPLPLSSLFLRHLASFRESLQVVSSPCCPRQLPDVISESPSLDAGSRSPAVHRVLAPVPSTASSAFPQRRWGRLPALIPRTTSRRGLFRGCRYFVMFRPPSLLTPQVVPTAANTLAGQPGLLHPGLLCFVTSAHTGHANRPKTGN*
Ga0070698_10007833533300005471Corn, Switchgrass And Miscanthus RhizosphereRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISESLSLDAGSHTPAVHRVLSPVSSTMSSAFPKQRWVGCPRMPYELRLLVEGDSRGCRYSFMFRPPSLLAPQIVPTAANTAAGQPGLLHPGLSCFVTSTRTGYANRPNTGN*
Ga0066701_1032267713300005552SoilLGRCECHLLRGRYPSVIAPTGSCATPVGRSPASALASFEESLQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTMSSTFPTGKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCSFTRTGYANRPNPGN*
Ga0066706_1128960113300005598SoilQVSRAPLPNVGVTSVGETCTVSSEGVTPPSSLLLAHVPLPLGSLLLRHLTSFEESWQVVRSPCCPWELPDVISENLSLDAGSPTPAVHRVLSPVSSTMSSAFPTGKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAANTTAGQPGLLHPGLSCSVTSARTGYANRPNTGN*
Ga0066795_1025871413300005938SoilLLRAHVPLPLGSPLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTMSSAFPTGKVGRLSASVLRITTSRRPLFRGCRYSLMFRPPSLLAPQIVPTAAHTDAGQPGLLHPGLSCSFTRTGYANRPTQAIDGTGTFTLSDSQPCRLLTSWKS
Ga0075028_10085876813300006050WatershedsGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSHTPAVHRVLSPVSSTMSSAFPTCGLGRLPASIPRTYDFSQEGSRGYRYSLMFRPPSLLAPQIVPTAAHTAAGQPGLLRPGLSCFVTSTRTGYANRPNTGN*
Ga0075026_10066521813300006057WatershedsWRYAGFIKPAAIRPRRPPSVQSPFAQLGCYLRWGDVTVSWEGVTPPSSLVLAHVPLPLSSLFLRHSASFRESLQVVHSPCCSRQLPDVISESPSLDAGSRSPAVHRVLAPVTSTVSSAFPKRRVGRLPASTPRMTTSRRFQFRGCRYFVMFRPPSLLAPRVVPTAANTAAGQLGLLRPGLLCFVTSAHTGYANRPNTGN*
Ga0070765_10147043113300006176SoilTPPSLLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISENLSLDAGSPTPAVHHVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRPNAGN*
Ga0066665_1097582213300006796SoilHLLRGRYPSGHRSYRLMCQLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSHTPAVHRVLSPVSSTMSSAFPIYGLGRLPASIPRITTSRRGGSRGCRYSLMFRPPSLLAPQIVPTAANTAAGQPGLLRPGLSCFVTSARTGYANRPNTGN*
Ga0066797_111921213300006864SoilLPPASPPDLSWRLAGFIKPAAIRPRKPPSVLSPFAQRRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTMSSAFPTGKVGRLSASVLRITTSRRPLFRGCRYSLMFRPPSLLAPQIVPTAAHTDAGQPGLLHPGLSCSFTRTGYANRPTQAIDGTGTFTLSDSQPCRLLPSMPTTRPCQCWPRARRARAGCG
Ga0075436_10013158313300006914Populus RhizosphereSSLILAHVPLPLSSLFLRHLASFRESLQVVSSPCCSRQLPDVISESPSLDAGSRSPAVHRVLVPVSSTASSAFPQRRWSRLPARIPRTTSRRGLFRGCRYSLMFRPPSLLTPQIVPTAANTPAGQPGLLHPGLLCFVTSAHTGYANRPNTGN*
Ga0099791_1014035313300007255Vadose Zone SoilLPSASPPELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACSVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRPNTGN*
Ga0066793_1068659513300009029Permafrost SoilLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRGLPDVISENLSFDAGSPTPAVHRVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAAYTAAGQPGLLRPGLSCFVTSARTGYANRPNAGN*
Ga0099792_1022312313300009143Vadose Zone SoilLSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRPNTGN*
Ga0105104_1068535813300009168Freshwater SedimentFGESLQVVTSPCCPRQLPDVSSENLSLDAGPHTPAVRRVLAPVSSTASSAFPTEGCGSASRVSSANNDFSQGMFRGCRYFVMFRPPSSLAPRIVPTAATTPAGQLGLLRPGLSCFVASARTGYANRPKTGNWRYGDLHPARLSALSAAPLPGMPSSMTTGSSTPISSRAAVSTLAFAEI*
Ga0126374_1071338023300009792Tropical Forest SoilAHVPLPLSSLFLRHLASFRESLQVVSSPCCPRQLPDVISESPSLDAGSRSPAVHRVLAPVPSTASSAFPQRRWGRLPALIPRTTSRRGLFRGCRYFVMFRPPSLLTPQVVPTAANTLAGQPGLLHPGLLCFVTSAHTGHANRPKTGN*
Ga0126319_110754713300010147SoilGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVHSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKRRMGRLPAFVPRITTSRRLVFRGCRYSFMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVTSARTGYANRPNTGN*
Ga0126381_10459769313300010376Tropical Forest SoilTSWEGITPPSSLVLAHSPLPLGSLLLRLSASFRESLQVVTSPCCPRQLPDVIPDSLSLDAGSPTPAVPPCALACCFHGVIGLPHGKVGRLPASIPRMTTSRRTAFRGRRYFVMFRPPSLLAPQIVPTAANTSAGQPGLLRPSRTRFVASPRIGYANRLNTRN*
Ga0126358_120543113300010856Boreal Forest SoilLPPASPPDLSWRYAGFIKPAAIRPRRPPSVQSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTVSSAFPTAKVGRLSASVPANYDFSQEASRGCRYSLMFRPPSLLAPQIVPTAAYTAAGQPGLLRPGLSCSSTRTGYANRPNPGN*
Ga0126348_109084513300010862Boreal Forest SoilSSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTMSSAFPTGKVGRLSASVLRITTSRRPLFRGCRYSLMFRPPSLLAPQIVPTAAHTDAGQPGLLRPGLSCSFTRTGYANRPTQAIDGTGTFTLSDSQ
Ga0126348_119597113300010862Boreal Forest SoilRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHSASFEESLQVVRSPCCPRELPDVISESLSLDAGSPTPAVHHVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRLTQAIDGTGTFTLSDSQPCRLLTSCQISL
Ga0137393_1147863223300011271Vadose Zone SoilSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISESLSLDAGSHTPAVHRVLSPVSSTMSSAFPKQRWVGCPRMPHELRLLVEGDSRGCRYSFMFRPPSLLAPQIVPTAANTAAGQPGLLHPGLSCFVTSTRTGYANRPTTGN*
Ga0138532_107983313300011305Peatlands SoilLSWRCAGFIRPAAIRPRKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSHTPAVHRVLLPVSSPVSSAFPQRRWGRLPASTPRITTSRGNVFRGCRYFLMFRPPSLLTPQIVPTAACTGAGQPGLLRPGLSCFVTSTRTGYANRPNTGN*
Ga0137389_1107207723300012096Vadose Zone SoilYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISESLSLDAGSHTPAVHRVLSPVSSTMSSAFPKQRWVGCPRMPHELRLLVEGDSRGCRYSFMFRPPSLLAPQIVPTAANTAAGQPGLLHPGLSCFVTSTRTGYANRPTTGN*
Ga0137388_1147288623300012189Vadose Zone SoilQLPVFGRLAYPVRVEATKHDRPGRKIPCCPRELPDVISESLSLDAGSHTPAVHRVLSPVSSTMSSAFPKQRWVGCPRMPHELRLLVEGDSRGCRYSFMFRPPSLLAPQIVPTAANTAAGQPGLLHPGLSCFVTSTRTGYANRPTTGN*
Ga0137399_1140282813300012203Vadose Zone SoilLSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLLAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGY
Ga0137384_1143810313300012357Vadose Zone SoilAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTVSSAFPTAKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVASTRTGYANRPNTGN*
Ga0137385_1085337713300012359Vadose Zone SoilWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKERMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAAHTAAGQPGLLRPGLSCSSTRTGYANRPNPGN*
Ga0137419_1141036613300012925Vadose Zone SoilELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGY
Ga0126369_1067067513300012971Tropical Forest SoilSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN*
Ga0137411_127557613300015052Vadose Zone SoilTLVRVHPRHSGLPSASPPELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRPNTGN*
Ga0182036_1076161513300016270SoilRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0182041_1049316813300016294SoilLGRRVLISSESVTPRSSLLRAHVPLPLGSLLLQHLASFGESLQVVPSPCCPWEFPDVISENLSLDAGSRTPAVPLRALTCFFRSVIGLLPAKMGSASRFAPQNDFTRSVFRGCRYSVMFRPPSLFTPQIVPTAAHTAAGQPGLLRPGLSCSKPRGIARILSASDENLRRIKISWQ
Ga0182035_1071205723300016341SoilISSESVTPRSSLLRAHVPLPLGSLLLQHLASFGESLQVVPSPCCPWEFPDVISENLSLDAGSRTPAVPLRALTCFFRSVIGLLPAKMGSPSRFAPQNDFTRSVFRGCRYSVMFRPPSLFTPQIVPTAAHTAAGQPGLLRPGLSCFVTSARTGYTNRPITGN
Ga0182035_1156087113300016341SoilQLRCYLCWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0182032_1094448013300016357SoilGEETAKGPEPLCPTSVLPPLGRRVFISSESVTPRSSLLRAHVPLPLGSLLLQHLASFGESLQVVPSPCCPWEFPDVISENLSLDAGSRTPAVPLRALTCFFRSVIGLLPAKMGSASRFAPQNDFTRSVFRGCRYSVMFRPPSLFTPQIVPTAAHTAAGQPGLLRPGLSCFVTSARTGYTNRPNTGN
Ga0182040_1100809613300016387SoilSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0134069_129525313300017654Grasslands SoilPSVLSPFAQRRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAAYRVLSPVSSTVSSAFPTGKVGRLSASVLRITTSRRPLFRGCRYSFMFRPPSLLAPQIVPTAAHTAAGQPGILRPGLSCSFTRTGYANRPIQAIDGTGTFTLSD
Ga0187783_1098961413300017970Tropical PeatlandPLSSLFLRFLASFRESLQVVPSPCCSRQLPDVISESPSLDAGSRSPAVHRVLAPVSSTVSSAFPKRRVGRLPASTLRMTTSRRIQFRGCRYFVMFRPPSLLAPRVVPTAANTAAGQLGLLRPGLLCFVASAHTGYANRPNTGN
Ga0184605_1015282813300018027Groundwater SedimentMASRRFYQAGRHTAKKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKERMGRLPASCPRITTSRRSVFRGCRYSLMFRPPSLLAPQIVPTAAYTAAGQPGLLRPGLSCSFTRTGYANRPTQAIDGT
Ga0184620_1008976513300018051Groundwater SedimentLPSASPPELSWRYAGFIKPAAIRPRKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSVDAGSRTPAVHRVLSPVSSTVSSAFPTAKVGRLSASVPANYDFSQEASRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVTSTRTGYANRPNTGN
Ga0184611_119970613300018067Groundwater SedimentPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVHSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKRRMGRLPAFVPRITTSRRLVFRGCRYSVMFRPPSLLSPQIVPTAVHTAAGQPGILRPGLSCFVTSTRTGYANRPNTGN
Ga0184624_1037877613300018073Groundwater SedimentLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVHSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKRRMGRLPAFVPRITTSRRLVFRGCRYSVMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVTSARTGYANRPNTGN
Ga0184625_1050388723300018081Groundwater SedimentCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVHSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKRRMGRLPAFVPRITTSRRLVFRGCRYSVMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVTSARTGYANRPNTGN
Ga0137408_125179313300019789Vadose Zone SoilLPSASPPELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRPNTGN
Ga0193733_106865313300020022SoilLPPASPPDPSWRLAGFIKPAAIRLRKPPSVLSPFAQRRCYLRWGDVNAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESSQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKQRMGRLPAFVPRMTTSRRSVFRGCRYSLMFRPPSLLAPQIVPTAANTAAGQPGILRPGLSCFVTSTRTGYANRPNTGN
Ga0193724_105660523300020062SoilLVRVHPRHSGLPSASPPELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPASCPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAAHTVAGQPGILRPGLSCFVASTRTGYANRPNTGN
Ga0210399_1102740613300020581SoilGRHTAEDAAKCPEPLCPTSVLPPLGRCECHLLRGRYPSVIVLRAHVPLPLGSLLLRHSASFEGSWQVVRSPCCPRELTDVISENLSLDAGSRTPAVHRVPSPVSSTMSSAFPKQRIGRLPAFCPTNYDFSQSVFRGCRYSLMFRPPSLLSPQIVPTAAATTAEQPGILRPGLSCFVTSTRTGYANRPNTGN
Ga0210404_1034787413300021088SoilIRPRRSGLPPASPLDLSWRCAGFIKPAAIRQRKPPSVLSPFAQLRCYLRWGDVRAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHHVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRPNAGN
Ga0210397_1019148823300021403SoilVALRRFYQAGRHTAKKPPSVQSPFAQLRCYLRWGDVSAISSEGVAPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISESLSLDAGSPTPAVYRVLSPVSSTVSSAFPTRKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0193695_108983913300021418SoilRRFYQAGRHTARKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTVSSAFPTAKVGRLSASVPANYDFSQEASRGCRYSLMFRPPSLLAPQIVPTAAYTAAGQPGLLRPGLSCSSTRTGYANRPNPGN
Ga0210394_1092009613300021420SoilVPRAPLPNSGVTFVGEMCTISWESITSPSSLVRAHAPLPLGSLLLRFLASFGESLQVVTSPCCPRQLPDVSSENLFLDAGPHTPAVLRVLAPVSSTAPSAFPMSRVGSASRVCPRMTTSRGVAFRGCRYFVMFRPPSLLAPQIVPTAANTAAGQPGLLRPGTSCFVTSARSGYANRPNTGNWRYGDLHPARF
Ga0210384_1174174913300021432SoilWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRPLASLEESWQVVRSPCCPGELPDVISIESFLGCWIPYPGGTPCALTCFFHDVIGLPQEKNGSASRVLSHELRLLAGPVFRGCRYSLMFRPPSLLSPQIVPTAAHTAAGQPGILRPGLSCFVASTRTGYANRPNTGN
Ga0242644_100513913300022498SoilTSPCCPRQLPDVSSENLFLDAGPHTPAVLRVLAPVSSTAPSAFPMSRVGSASRVCPRMTTSRGVAFRGCRYFVMFRPPSLLAPQIVPTAANTAAGQPGLLRPGTSCFVTSARSGYANRPNTGNWRYGDLHPARFSALSAASKKSGAPRCR
Ga0242650_100730023300022503SoilVALRRFYQAGRHTAKKPPSVQSPFAQLRCYLRWGDVSAISSEGVAPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISESLSLDAGSPTPAVYRVLSPVSSTVSSAFPTRKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNT
Ga0242642_103310323300022504SoilLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISESLSLDAGSPTPAVYRVLSPVSSTVSSAFPTRKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0242663_101779123300022523SoilFLASFGESLQVVTSPCCPRQLPDVSSENLFLDAGPHTPAVLRVLAPVSSTAPSAFPMSRVGSASRVCPRMTTSRGVAFRGCRYFVMFRPPSLLAPQIVPTAANTAAGQPGLLRPGTSCFVTSARSGYANRPNTGNWRYGDLHPARFSALSAASKKSGAPRCR
Ga0242656_113454813300022525SoilGDVSAISSEGVAPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISESLSLDAGSPTPAVYRVLSPVSSTVSSAFPTRKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0242669_100293633300022528SoilVALRRFYQAGRHTAKKPPSVQSPFAQLRCYLRWGDVSAISSEGVAPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISESLSLDAGSPTPAVYRVLSPVSSTVSSAFPTRKVGRLAAPVLRITTSRRPLFRGYRYFVMFRPPSLIAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0242675_103272313300022718SoilVALRRFYQAGRHTAQKPPSVQSPFAQLRCYLRWGDVSAISSEGVAPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISESLSLDAGSPTPAVYRVLSPVSSTVSSAFPTRKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYANRPNAGN
Ga0242675_109541013300022718SoilLPNSGVTFVGEMCTISWESITSSSSLVRAHAPLPLGSLLLRFLASFGESLQVVTSPCCPRQLPDVSSENLFLDAGPHTPAVLRVLAPVSSTAPSAFPMSRVGSASRVCPRMTTSRGVAFRGCRYFVMFRPPSLLAPQIVPTAANTAAGQPGLLRPGTSCFVTSARSGYANSPNTGNWRYGDLHPARFS
Ga0242665_1015149323300022724SoilEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISESLSLDAGSPTPAVYRVLSPVSSTVSSAFPTRKVDRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0257162_101547813300026340SoilLERFKRWSVYTPVILVCHSASAPELSWRYAGFIKPAALRPSEPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGY
Ga0257170_103415213300026351SoilIHPRHSGLPPASPPDLSWRLAGFIKPAAIRPRKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLILAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTVSSAFPTAKVGRLSASVPANYDFSQEASRGCRYSLMFRPPSLLAPQIVPTAAYTAAGQPGLLRPGLSCSSTRTGYANRPNPGN
Ga0257173_101890213300026360SoilLPSASPPELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACSVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRPNTGN
Ga0257159_102186213300026494SoilLPSASPLDPSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRPNTGN
Ga0209648_1042195613300026551Grasslands SoilLPPVSPLDPSWRYAGFIKPTAIRPRKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISESLSLDAGSHTPAVHRVLSPVSSTMSSAFPKQRWVGCPRMPHELRLLVEGDSRGCRYSFMFRPPSLLAPQIVPTAANTAAGQPGLLHPGLSCFVTSTRTGYANRP
Ga0209220_111740113300027587Forest SoilPPELSWRHAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKERMGRLPACFVLRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRPNTGN
Ga0208988_103005013300027633Forest SoilPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSHTPAVHRVLSPVSSTMSSAFPKQRWVGCPRMPHELRLLVEGDSRGCRYSFMFRPPSLLAPQIVPTAANTAAGQPGLLHPGLSCFVTSTRTGYANRPTTGN
Ga0209009_115589413300027667Forest SoilLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHHVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRPNAGN
Ga0209701_1052325513300027862Vadose Zone SoilAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISESLSLDAGSHTPAVHRVLSPVSSTMSSAFPKQRWVGCPRMPHELRLLVEGDSRGCRYSFMFRPPSLLAPQIVPTAANTAAGQPGLLHPGLSCFVTSTRTGYANRPTTGN
Ga0209583_1043774313300027910WatershedsSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHHVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRPNAGN
Ga0307504_1019931213300028792SoilPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVHHVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRPNAGN
Ga0307286_1023082813300028876SoilSLASCQVVALRRFYQAGRHTAKKPPSVQSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTVSSAFPTAKVGRLSASVPANYDFSQEASRGCRYSLMFRPPSLLAPQIVPTAAYTAAGQPGLLRPGLSCSSTRTGYANRPNPGN
Ga0307300_1008172413300028880SoilLRRFYQAGRHTAKKPPSVQSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSPTPAVHRVLSPVSSTVSSAFPTAKVGRLSASVPANYDFSQEASRGCRYSLMFRPPSLLAPQIVPTAAHTPAGQPGLLRPGLSCSSTRTGYANRPNPGN
Ga0222749_1042984013300029636SoilRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRKLPDVISESLSLDAGSPTPAVYRVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMFRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRPNAGN
Ga0210252_1016248713300030548SoilRKPPSVLSPFAQLRCCLRWGDVSVISSEGVTPPSLLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTVSSAFPTRKVRRLSASVLRITTSRRPLFRGCRYFVMLRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTS
Ga0265459_1265340913300030741SoilRKPPSVLSPFAQLRCCLRWGDVSVISSEGVTPPSLLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTVSSAFPTRKVGRLSASVLRITTSRRPLFRGCRYFVMLRPPSLLAPQIVPTAADTTAGQPGLLRPGLSCFVTSARTGYANRPNAGN
Ga0075373_1166334813300030945SoilLPSASPPELSWRYAGFIKPAAIRPSEPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSHTPAVHRVLSPVSSTMSSAFPTCGLGRLPASIPRTYDFSQEGSRGYRYSLMFRPPSLLAPQIVPTAAHTAAGQPGLLRPGLSCFVTSTRTGYANRPNTGN
Ga0075399_1136930313300030967SoilRFYQAGRLDSCEGRQVPRAPLPNSGVTFVGEMCTISWESITSPSSLVRAHAPLPLGSLLLRFLASFGESLQVVTSPCCPRQLPDVSSKNLFLDAGPHTPAVLRVLAPVSSTAPSAFPMSRVGSASRVCPRMTTSRGVAFRGCRYFVMFRPPSLLAPQIVPTAANTAAGQPGLLRPGTSCFVASARTGYANRPNTGN
Ga0308199_101672613300031094SoilLPSASPPELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSVDAGPRTPAVHRVLSPVSSTMSSAFPKERMGRLPASCPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAVHTAAGQPGILRPGLSCFVTSTRTGYANRPNTGN
Ga0308193_105519513300031096SoilYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESLQVVRSPCCPRELPDVISENLSLDAGSHTPAVHRVLSPVSSTMSSAFPKQRMGRLPAFVPRMTTSRRSVFRGCRYSLMFRPPSLLAPQIVPTAAHTAAGQPGLLRPGLSCSSTRTGYANRPTQAIDGTGTFTLSDSQPCRLLLSRHSAIQTTGRLTF
Ga0307501_1002628013300031152SoilMASRRFYQAGRHTATKPPSVQSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKRRMGRLPAFVPRITTSRRLVFRGCRYSFMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVTSARTGYANRPNTGN
(restricted) Ga0255310_1017020313300031197Sandy SoilPELSWRYAGFIKPAAIRPSKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASFEESWQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKGRMGRLPACFVPRITTSRRSVFRGCRYSLMFRPPSLLSPQIVPTAANTAAGQPGILRPGLSCFVASTRTGYANRP
Ga0318516_1038666713300031543SoilRCYLCWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318573_1047284613300031564SoilPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318515_1051303413300031572SoilCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318561_1071579013300031679SoilSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318574_1027687313300031680SoilVALRRFYQAGRHTAKKPPSVQSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0307469_1092368123300031720Hardwood Forest SoilHSPLPLGSLLLRHLASFGESLQVVTSPCCPRQLPDVISDSLSLDAGSPTPAVPPCAFACFFHGVIGLPQRKVGRLPASVPLETTSCGGRISRTRRYFVMFRPPSLLAPQIVPTAANTAAGQPGLLRPGTSCFVASARTGYANRPNTGNWRCRDSHPARLRPCRLLLTADLPHSGDRR
Ga0318500_1017799813300031724SoilLPPASPHDLSWRYAGFIKPTATRPRRPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0306918_1049410313300031744SoilLLRAHVPLPLGSLLLQHLASFGESLQVVPSPCCPWEFPDVISENLSLDAGSRTPAVPLRALTCFFRSVIGLLPAKMGSASRFAPQNDFTRSVFRGCRYSVMFRPPSLFTPQIVPTAAHTAAGQPGLLRPGLSCFVTSARTGYTNRPNTGN
Ga0318502_1057895813300031747SoilLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318537_1001462123300031763SoilGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318521_1076173913300031770SoilVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318546_1063300713300031771SoilAAIQPRKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318547_1090317513300031781SoilQLGCYLRWGDWSVSWEGITPPSSLILAHSPLPLSSLLLRLLASFGESLQVVTSPCCPRQLPDVISDSLSLDAGSPTPAVPPCARACFFHGVIGLPHGKVGRLPAPIPRMTTSRRTAFRGRRYFVMFRPPSLLAPQIVPTAANTAAGQPGLLRPSRTRFVASPRIGYTNRPNTRN
Ga0318576_1019234213300031796SoilVALRRFYQADRHTAKKPPSVQSPFAQLRCYVRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASACFPFAGRV
Ga0318523_1039787313300031798SoilVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVPRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTG
Ga0318517_1014235013300031835SoilVALRRFYQADRHTAKKPPSVQSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0306919_1128436013300031879SoilVGITPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318551_1088341313300031896SoilIVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0306923_1082034513300031910SoilVALRRFYQADRHTAKKPPSVQSPFAQLRCYVRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0310916_1047957213300031942SoilVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0306926_1243166013300031954SoilVFISSESVTPRSSLLRAHVPLPLGSLLLQHLASFGESLQVVPSPCCPWEFPDVISENLSLDAGSRTPAVPLRALTCFFRSVIGLLPAKMGSASRFAPQDDFSRPVFRGCRYSVMFRPPSLFTPQIVPTAAHTAAGQPGLLRPGLSCFVTSARTGYTNRPNTGN
Ga0318506_1028007613300032052SoilLRCYLCWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318504_1050319813300032063SoilTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0318514_1061009913300032066SoilLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0306924_1068915223300032076SoilVALRRFYQADRHTAKKPPSVQSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSC
Ga0318518_1056045713300032090SoilSEGITPPSSLLRAHVPLPLGSPRLRHLASFEESLQVVRSPCCPRELPDAISENLSLDAGSPTPSVPPCALTCCFHSVIGLPQRNMSRLPAFVPRYDFSQGVFRGCRYSVMFRPPSLLAPQIVPTAAHTPAGQPGLLRPGLSCFVAAARTGYTNRLNTGN
Ga0318519_1059890913300033290SoilLLRAHVPFPLGSLLLRHLASLEESWQVVRSPCCPRELPDVISENLSLDAGSPTPAVYRVLSPVSSTVSSAFPTGKVGRLSAPVLRITTSRRPLFRGYRYFVMFRPPSLLAPQIVPTAAHTTARQPGLLRPGLSCFVTSARTGYASRPNTGN
Ga0364941_184694_1_5343300034417SedimentGRHTAKKPPSVLSPFAQLRCYLRWGDVSAISSEGVTPPSSLLRAHVPLPLGSLLLRHSASFEESLQVVRSPCCPRELPDVISENLSLDAGSRTPAVHRVLSPVSSTMSSAFPKRRMGRLPAFVPRITTSRRLVFRGCRYSFMFRPPSLLSPQIVPTAAHTAAGQPGILRPGLSCFVTS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice