NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F082250


Overview

Basic Information
Family ID F082250
Family Type Metagenome / Metatranscriptome
Number of Sequences 113
Average Sequence Length 211 residues
Representative Sequence MSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK
Number of Associated Samples 82
Number of Associated Scaffolds 113

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 2.65 %
% of genes near scaffold ends (potentially truncated) 43.36 %
% of genes from short scaffolds (< 2000 bps) 59.29 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.25

Hidden Markov Model logo (rendered with Skylign)

Most Common Taxonomy
Group Bacteria (67.257 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (46.018 % of family members)
Environment Ontology (ENVO) Unclassified (42.478 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (46.018 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 26.42%, β-sheet: 14.63%, Coil/Unstructured: 58.94%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.25

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
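
The cutoff quoted in the note above can be expressed as a simple check. The following is a minimal sketch, not NMPFamsDB code: the function name `is_low_confidence` and the constant `LOW_CONFIDENCE_PTM` are hypothetical, and only the 0.7 threshold and the 0.25 score come from this page.

```python
# Hypothetical helper illustrating the "pTM < 0.7" rule stated on this page.
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the Low Quality Model note


def is_low_confidence(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


# Family F082250 reports a pTM-score of 0.25.
print(is_low_confidence(0.25))
```

Under this reading, a model scoring exactly 0.7 would not be flagged, since the note uses a strict inequality.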



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 113 Family Scaffolds
PF04828 | GFA | 14.16
PF01323 | DSBA | 12.39
PF06445 | GyrI-like | 3.54
PF00903 | Glyoxalase | 2.65
PF11185 | DUF2971 | 2.65
PF02641 | DUF190 | 2.65
PF03061 | 4HBT | 2.65
PF01494 | FAD_binding_3 | 1.77
PF01926 | MMR_HSR1 | 1.77
PF00484 | Pro_CA | 1.77
PF00106 | adh_short | 0.88
PF00092 | VWA | 0.88
PF02852 | Pyr_redox_dim | 0.88
PF00034 | Cytochrom_C | 0.88
PF13462 | Thioredoxin_4 | 0.88
PF01042 | Ribonuc_L-PSP | 0.88
PF01346 | FKBP_N | 0.88
PF01612 | DNA_pol_A_exo1 | 0.88
PF08002 | DUF1697 | 0.88
PF00990 | GGDEF | 0.88
PF13442 | Cytochrome_CBB3 | 0.88
PF01545 | Cation_efflux | 0.88
PF00326 | Peptidase_S9 | 0.88
PF00476 | DNA_pol_A | 0.88
PF12833 | HTH_18 | 0.88
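
The frequencies above are percentages of the 113 family scaffolds, so they can be converted back to approximate scaffold counts. The sketch below assumes each percentage was computed as count / 113 and rounded to two decimals; the helper name `percent_to_count` is hypothetical.

```python
# Assumption: each listed percentage is (count / 113) * 100, rounded to two decimals.
TOTAL_SCAFFOLDS = 113  # "Number of Associated Scaffolds" from the Overview


def percent_to_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Recover the nearest whole number of scaffolds for a percentage."""
    return round(percent / 100 * total)


# A few rows from the Pfam table above.
for pfam_id, percent in [("PF04828", 14.16), ("PF01323", 12.39), ("PF00106", 0.88)]:
    print(pfam_id, percent_to_count(percent))
```

With this assumption, the smallest listed frequency (0.88) corresponds to a single scaffold.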

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 113 Family Scaffolds
COG3791 | Uncharacterized conserved protein | Function unknown [S] | 14.16
COG0654 | 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases | Energy production and conversion [C] | 3.54
COG1993 | PII-like signaling protein | Signal transduction mechanisms [T] | 2.65
COG0288 | Carbonic anhydrase | Inorganic ion transport and metabolism [P] | 1.77
COG0578 | Glycerol-3-phosphate dehydrogenase | Energy production and conversion [C] | 1.77
COG0644 | Dehydrogenase (flavoprotein) | Energy production and conversion [C] | 1.77
COG0665 | Glycine/D-amino acid oxidase (deaminating) | Amino acid transport and metabolism [E] | 1.77
COG0053 | Divalent metal cation (Fe/Co/Zn/Cd) efflux pump | Inorganic ion transport and metabolism [P] | 0.88
COG0251 | Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 family | Defense mechanisms [V] | 0.88
COG0545 | FKBP-type peptidyl-prolyl cis-trans isomerase | Posttranslational modification, protein turnover, chaperones [O] | 0.88
COG0749 | DNA polymerase I, 3'-5' exonuclease and polymerase domains | Replication, recombination and repair [L] | 0.88
COG1230 | Co/Zn/Cd efflux system component | Inorganic ion transport and metabolism [P] | 0.88
COG3797 | Uncharacterized conserved protein, DUF1697 family | Function unknown [S] | 0.88
COG3965 | Predicted Co/Zn/Cd cation transporter, cation efflux family | Inorganic ion transport and metabolism [P] | 0.88



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 67.26 %
Unclassified | root | N/A | 32.74 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300002245|JGIcombinedJ26739_101252284 | Not Available | 632
3300002245|JGIcombinedJ26739_101383917 | Not Available | 596
3300002907|JGI25613J43889_10018075 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1977
3300005537|Ga0070730_10414484 | Not Available | 870
3300006893|Ga0073928_10001685 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 41192
3300006893|Ga0073928_10011474 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10329
3300006893|Ga0073928_10368774 | Not Available | 1058
3300007265|Ga0099794_10028843 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2583
3300007265|Ga0099794_10035004 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2368
3300007788|Ga0099795_10000917 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 5953
3300009088|Ga0099830_10457271 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae | 1037
3300009088|Ga0099830_11231275 | Not Available | 622
3300009089|Ga0099828_11688282 | Not Available | 557
3300009143|Ga0099792_10001363 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9021
3300010159|Ga0099796_10002361 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 4129
3300011270|Ga0137391_10143232 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2081
3300011270|Ga0137391_10322899 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1329
3300011270|Ga0137391_10753154 | Not Available | 806
3300011271|Ga0137393_10020121 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4858
3300011271|Ga0137393_10040294 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3600
3300012202|Ga0137363_10002675 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10674
3300012203|Ga0137399_11443508 | Not Available | 575
3300012205|Ga0137362_10079911 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2725
3300012359|Ga0137385_10393725 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1182
3300012362|Ga0137361_10157186 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2037
3300012582|Ga0137358_10143943 | All Organisms → cellular organisms → Bacteria | 1622
3300012683|Ga0137398_10320504 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae → Komagataeibacter → Komagataeibacter diospyri | 1045
3300012685|Ga0137397_10043133 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3226
3300012685|Ga0137397_10056169 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2834
3300012685|Ga0137397_10161374 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1663
3300012917|Ga0137395_10138098 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1659
3300012918|Ga0137396_10632108 | Not Available | 791
3300012922|Ga0137394_10264093 | All Organisms → cellular organisms → Bacteria | 1472
3300012923|Ga0137359_10086708 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 2754
3300012924|Ga0137413_10001260 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9647
3300012925|Ga0137419_10033795 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3163
3300012927|Ga0137416_10316833 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1298
3300012927|Ga0137416_10635156 | Not Available | 933
3300012929|Ga0137404_10002903 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 11444
3300012929|Ga0137404_10085540 | All Organisms → cellular organisms → Bacteria | 2510
3300012930|Ga0137407_12096790 | Not Available | 540
3300012944|Ga0137410_10004016 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9968
3300012944|Ga0137410_10094146 | All Organisms → cellular organisms → Bacteria | 2210
3300014501|Ga0182024_11456335 | Not Available | 784
3300015241|Ga0137418_10046580 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3990
3300015242|Ga0137412_10001193 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 19838
3300015245|Ga0137409_10104111 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 2631
3300015245|Ga0137409_10302606 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1405
3300015264|Ga0137403_10051910 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4224
3300019789|Ga0137408_1185606 | All Organisms → cellular organisms → Bacteria | 2657
3300019887|Ga0193729_1226815 | Not Available | 614
3300019890|Ga0193728_1296575 | Not Available | 616
3300020140|Ga0179590_1011916 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1895
3300020199|Ga0179592_10016885 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3211
3300020199|Ga0179592_10235172 | Not Available | 825
3300020579|Ga0210407_10476690 | Not Available | 975
3300020580|Ga0210403_10032244 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 4189
3300020581|Ga0210399_11082873 | Not Available | 642
3300021168|Ga0210406_10059158 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3327
3300021168|Ga0210406_10084787 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2707
3300021168|Ga0210406_10134854 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 2079
3300021168|Ga0210406_10208046 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 1621
3300021168|Ga0210406_10759530 | Not Available | 741
3300021168|Ga0210406_11040215 | Not Available | 607
3300021170|Ga0210400_10045603 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3408
3300021170|Ga0210400_10095464 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2351
3300021178|Ga0210408_10161332 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1778
3300021180|Ga0210396_10728790 | Not Available | 855
3300021404|Ga0210389_10130170 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1947
3300021405|Ga0210387_11798240 | Not Available | 516
3300021406|Ga0210386_10114697 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 2223
3300021420|Ga0210394_10021411 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 5970
3300021420|Ga0210394_10758587 | Not Available | 849
3300021475|Ga0210392_10230521 | Not Available | 1305
3300021478|Ga0210402_10043635 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3905
3300021478|Ga0210402_10543865 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1078
3300021479|Ga0210410_10167163 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 1964
3300022530|Ga0242658_1183060 | Not Available | 560
3300022533|Ga0242662_10158091 | Not Available | 690
3300022557|Ga0212123_10032605 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Chromatiales | 5292
3300022557|Ga0212123_10138914 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 1896
3300022557|Ga0212123_10666739 | Not Available | 646
3300026320|Ga0209131_1037083 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2822
3300026446|Ga0257178_1053161 | Not Available | 532
3300026482|Ga0257172_1064411 | Not Available | 672
3300026514|Ga0257168_1082101 | Not Available | 714
3300026557|Ga0179587_10354357 | Not Available | 952
3300027512|Ga0209179_1014334 | All Organisms → cellular organisms → Bacteria | 1454
3300027671|Ga0209588_1061678 | All Organisms → cellular organisms → Bacteria | 1212
3300027857|Ga0209166_10329904 | Not Available | 799
3300027903|Ga0209488_10003281 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 13044
3300027908|Ga0209006_10724827 | Not Available | 812
3300028536|Ga0137415_10147386 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 2191
3300028536|Ga0137415_10189407 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1880
3300028536|Ga0137415_10217942 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium | 1723
3300030855|Ga0075374_11325047 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 804
3300030934|Ga0075391_11226056 | Not Available | 582
3300031057|Ga0170834_100638570 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2002
3300031057|Ga0170834_105670590 | Not Available | 840
3300031122|Ga0170822_15468205 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 997
3300031128|Ga0170823_13883177 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2592
3300031231|Ga0170824_103142602 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1185
3300031231|Ga0170824_115358903 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 2032
3300031231|Ga0170824_121578877 | Not Available | 651
3300031525|Ga0302326_10824971 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1332
3300031715|Ga0307476_10251150 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 1291
3300031753|Ga0307477_10225280 | All Organisms → cellular organisms → Bacteria | 1301
3300031754|Ga0307475_10614702 | Not Available | 870
3300031823|Ga0307478_10151143 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 1842
3300031962|Ga0307479_10607155 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 1076
3300032174|Ga0307470_10000435 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 21216
3300032174|Ga0307470_10000664 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 15323
3300034163|Ga0370515_0181863 | Not Available | 898
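
Each scaffold entry above starts with an identifier of the form `taxon OID|scaffold ID`, where the numeric prefix matches a Taxon OID in the Associated Samples table. The sketch below shows one way to split such an identifier; the names `ScaffoldRef` and `parse_scaffold_ref` are hypothetical, and the format is inferred from the rows on this page rather than from an official specification.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Illustrative container for the two parts of a scaffold identifier."""
    taxon_oid: str
    scaffold_id: str


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split 'taxonOID|scaffoldID' at the first '|' (format inferred from this page)."""
    taxon_oid, sep, scaffold_id = text.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"not a 'taxonOID|scaffoldID' identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold_ref("3300002245|JGIcombinedJ26739_101252284")
print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the taxon OID as a string avoids accidental reformatting of the identifier when it is later used to look up a sample.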




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 46.02%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 25.66%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 6.19%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.19%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 5.31%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.77%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.77%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.77%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.89%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 0.89%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 0.89%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002907 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental
3300019887 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2 | Environmental
3300019890 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c1 | Environmental
3300020140 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly) | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300022530 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-O (Metagenome Metatranscriptome) (v2) | Environmental
3300022533 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022557 | Paint Pots_combined assembly | Environmental
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental
3300026446 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-B | Environmental
3300026482 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-B | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027512 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300030855 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - OA9 Emin (Eukaryote Community Metatranscriptome) | Environmental
3300030934 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - OB3 Emin (Eukaryote Community Metatranscriptome) | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031122 | Oak Spring Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031128 | Oak Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031525 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_3 | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300034163 | Peat soil microbial communities from wetlands in Alaska, United States - Goldstream_04D_14 | Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1012522841 | 3300002245 | Forest Soil | FPSNDPPQLAPAARCSGIRDKVLVGLVLFTLVRTGEALAGDQTSIANAALNADADGLRRSSPPVRALIELPQVFAAPTAAESQPFSATEFRPRKRTIFDSDPMVNSFGDAPMLRGTTVWQRMSEYKSHDRVRLLTLWESNVSTVSLQAGKRGDPSLQWTSRAMNRGGSTRGLLDRLFSGSLAGAGSRLRHADRPASTAATPSLAGAPAVA
JGIcombinedJ26739_1013839171 | 3300002245 | Forest Soil | MPEFRPSNDPPLLDLAACRHGIRDKVLVGLVLFTLVRTGEALAGDQAWVGNGALNPDVGDFRRSSPPAPALVTAPELFAPAAGDSQAFSTTEFRPRKHSVLDADPDANSFGDAPMLRGTTVWQRLSEYRAHDRVRVLTLWESGGSAVSLQAGKRGSPSLQWTSGTMLHGAATRGLLDRLFAVSLAGAGNRLRHPDRPA
JGI25613J43889_100180751 | 3300002907 | Grasslands Soil | MPEFRPSNDPLLLNLAACRHCIRDKVLVGLVLFTLVRTGEALAGDQASIGNAALNPDVGDFRRSSPPAPALITTPELFARAAGDSEAFSTTEFRPRKHSVLDADPVANSFGDAPMLRGTTVWQRLSEYRSHDRVRLLTLWESGGSTLSLQAGKRGSPSLQWTSGAMIHGAATRGLFDRLFAVSLAGAGSRLRHPDRPASAPAPPNQVKVPVAASVK*
Ga0070730_104144841 | 3300005537 | Surface Soil | MSEFRPFNIVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQASMANAALDPDAGNLRQSSPPIRALITAPEFFAAPTADSQNFSATDFRPRKRTVFDRDPVANSFGDAPMLRGTTVWQRMSEYRSQDRVRLLTLWESSGSTVSLQAGKRGGPSLQWTSRSLNHGGSTRGLLDRLFAVSLAGAGGRLRHADRPTGAAAAPSQPSQASVPVVAGTK*
Ga0073928_1000168517 | 3300006893 | Iron-Sulfur Acid Spring | MSEFRPSNSVPLLDLAARCNSIREKMLVGLVLFTLVRTGEALAGDQVSIANATPDPDFGNLRQPSPPARAFVTAPEFFTAPAAADSRLFSATDFRPRKPTVFDRDPTVNAFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRIRRSDRPTGAAATPSQAGVPVVAGAK*
Ga0073928_100114741 | 3300006893 | Iron-Sulfur Acid Spring | FTLVRSGEALAGDQASIANATLNAGADHLRRSSLPVRALITAPEIFAVPPPAESQVFSTTDFRPRKRTIFDRDPMTNSFGDAPMLRGTTVWQRMSEYKSRDRVRLLTLWESSASTVSLQAGKRGDPSLQWTSRAMNRGGSTRGLLDRLFSVSLAGAGNRLRRTDRPASAAATAGQASVPVIASSNR*
Ga0073928_103687742 | 3300006893 | Iron-Sulfur Acid Spring | MSESRPSNDAPQLDLAAHSNGIRDKMLVGLVLFTLVRSGEALAGDQMSIANGTLNADADNLRRSSPPARALITAPGVFAAPGVFAAPTAPDSQAFSATDFRPRKHTLFNSDPVVNSFVDTPMLRGTTVWQRMSEYKSHDRVRLLTLWESGDSTVSLQAGKRGDPSLQWTSRAMNRGGSTRGLLDRLFAVSLAGAGNRLRHTDRSASAAATPNPVSVPVIASVK*
Ga0099794_100288433 | 3300007265 | Vadose Zone Soil | MSEFRRSNSVPLLDLAARCNSICDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRSTGAAAATPGQASAPVVAGAK*
Ga0099794_100350044 | 3300007265 | Vadose Zone Soil | MSEFRPSNIAPLLDPAARGNCIRDKVLVGLVLFTMVRTGEALAGDQASIANAALHPDIGNLRQSSPPSAVLITAPELFAAPIAADREAFSATDFRPRKHTLLDSDPAVNSLGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGALNHGASARGLLDRLFAVSLAGAGNHLRHADRPPSAPAAPNPVGAPVTASVK*
Ga0099795_1000091763300007788Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK*
Ga0099830_1045727113300009088Vadose Zone SoilEFRRSNKVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQASIANATPDPNFGDLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK*
Ga0099830_1123127513300009088Vadose Zone SoilTGEALAGDQASLADATLNAGADLRRSSPPARALITAPQVFAAPTATESQAFSTTDFRPRKRTIFDTDPVANSVGDTPMLRGTTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTCRAMGHGGSTRGLLDRLFSVSFAGAGNRLRHTDRSATTAPSASQASDPVVASVK*
Ga0099828_1168828213300009089Vadose Zone SoilTGEALAGDQTSFAHATVDPDFGNLRQPSSPARALIMAPEFFTAPTAADSQTFSATDFRPRKPTVFDRPTVNSFGDAPMLRGTTVWQRMSEYKSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFSVSLLGAGNSLRNPSRSTSAPATAKPVSTPVVAGSK*
Ga0099792_1000136323300009143Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRSTGAAAATPGQASAPVVAGAK*
Ga0099796_1000236153300010159Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK*
Ga0137391_1014323243300011270Vadose Zone SoilMSEFRPSNIAPLLDPAARGNCIRDKVLVGLVLFTMVRTGEALAGDQASIANAALHPDIGNLRQSSPPSAVLITAPELFAAPIAADREAFSATDFRPRKHTLLDSDPAVNSLGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGALNHGASARGLLDRLFAVSLAGAGNRLRHADRPPSAPAAPNPVGAPVTASVK*
Ga0137391_1032289913300011270Vadose Zone SoilMSEFHPSSNVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQASVANATLAPDLGNLRQSSPPARSLITAPEFFAAPTTADSQTFSATDFRPRKPTVFDRDPMVNSFGDAPMLRGTTVWQRMSEYKSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFSVSLLGAGNSLRNPSRSTSAPATAKPVSTPVVAGSK*
Ga0137391_1075315413300011270Vadose Zone SoilAMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRHSDRPTGAAAATPSQASVPVVAGAK*
Ga0137393_1002012133300011271Vadose Zone SoilMSEFRPSNIAPLLDPAARGNCIRDKVLVGLVLFTLVRTSEALAGDQASIANAALHPDIGNSRQSSPRSPVLITAPELFAAPIAADREAFSATDFRPRKHTLLDSDPAVNSLGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGALNHGASARGLLDRLFAVSLAGAGNRLRHADRPPSAPAAPNPVGAPVTASVK*
Ga0137393_1004029423300011271Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK*
Ga0137363_1000267533300012202Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRSTGAAAATPGQASAPVVAGAK*
Ga0137399_1144350813300012203Vadose Zone SoilVRTGQALAGDQASIESGMPEPDFSKLRQPSPPPRALLTPLEFLAAPTAVDSKIFSATEFRPRKPTVFDSDPTVSSFGEAPMLRGTTVWQRLSEYKSRDRVRVLTLWESNEGTVSLQAGRRGDPSLQWTSRSMNRGGSTRGLLDRLFALSLAGASNRLRNSSRSTSAPATAKPVDMPGAPGQK*
Ga0137362_1007991133300012205Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK*
Ga0137385_1039372513300012359Vadose Zone SoilMTEFRPSNKVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATPDPDFGNLRQPPPARAFVAPPEFFTAPTAADNQLFSATDFRPRKLTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRHSDRPTGAAAAATPSQASV
Ga0137361_1015718633300012362Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK*
Ga0137358_1014394333300012582Vadose Zone SoilMSEFLPSNGVLPPDLAARCNGIREKILVGLVLFTLARTGEALAGDQASIAHATPDPDFGNLRQPSSRARALITAPEFFEAPATADSQKFSATDFRPRKPTIFNRDPVANSFGDAPMLRGTTVWQRMSEYKSHDRVRVLTLWESSASTVSLQAGKRGDPSVQWTSRVMSRGESTRGLLDRVFAVSLARAGNRLRHADRPAGAAAPGPAAGVPVSR*
Ga0137398_1032050423300012683Vadose Zone SoilAGDQASIGNGALKPDFGDFRRSSPAAPALITAPELFTPAAADKQEFSTTEFRPRKRSMLDADPVSNSFGDAPMLRGTTVWQRLSEYRSHDRVRLLTLWESGGSTLSLQAGKRGSPSLQWTSGAMIHGAATRGLFDRLFAVSLAGAGSRLRHPDRPASAPAPPNQVKVPVAASVK*
Ga0137397_1004313343300012685Vadose Zone SoilMPEFRSSNNATSLDPAARGNCIRDKVLVELVLFALVRTGEALAGDQASIANPALGPGVGNLRQSSPPAPVLIMSPDLFTAPIAGGPQVFSATDFSPRKRTLLDSDPALNFPGDAPMLHGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGAMIHGAAARGLLDRLFMVSLAGASNRLRHADRPASAPAAPNPVSGPVTATGEMSRGLHRLP*
Ga0137397_1005616923300012685Vadose Zone SoilMSEFRRSNSVPLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK*
Ga0137397_1016137423300012685Vadose Zone SoilMSEIRPHDDAPPPDDAAHCNSIRDRIFVGLVLITLVRTGEALAGDQASINNAAFDRDGGNVRHLSPSAPAAIATPGFFTAPTDAGRPVFSATDFRPRKHSVFDTDPAVNAFAEAPMLQGTTVWQRMSEYKSHDGVRLLTLWESRGSTLSLQAGNRGDPSLQWTSRTMNRGGSTRGVFDRLFSISISGAGSGLRNASRPTNAPAAPKPLGVPAVAVLK*
Ga0137395_1013809833300012917Vadose Zone SoilMSEFRRSNKVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK*
Ga0137396_1063210813300012918Vadose Zone SoilNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQASIESGMPEPDFGKLRQPSPPPRALLTPLEFLAAPTAVDSKIFSATEFRPRKPTVFDSDPTVSSFGEAPMLRGTTVWQRLSEYKSRDRVRVLTLWESNEGTVSLQAGRRGDPSLQWTSRSMNRGGSTRGLLDRLFALSLAGASNRLRNSSRSTSAPATAKPVDMPGAPGQK*
Ga0137394_1026409343300012922Vadose Zone SoilVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK*
Ga0137359_1008670833300012923Vadose Zone SoilMSEFLPSNGVLPPDLAARCNGIREKMLVGLVLFTLARTGEALAGDQASIAHATPDPDFGNLRQPSSRARALITAPEFFEAPATADSQKFSATDFRPRKPTIFNRDPVANSFGDAPMLRGTTVWQRMSEYKSHDRVRVLTLWESSASTVSLQAGKRGDPSVQWTSRVMSRGESTRGLLDRVFAVSLARAGNRLRHADRPAGAAAPGPAAGVPVIAGAK*
Ga0137413_1000126073300012924Vadose Zone SoilMPEFRPSNDPPLLNPAACRHGIRDKVLVGLVLFTLVRTGEALAGDQAWIGNGGLKPDFGDFRRSSPAAPALITAPELFTPAAADKQEFSTTEFRPRKRSMLDADPVSNSFGDAPMLRGTTVWQRLSEYRARDRVRVLTLWESGGSTVSLQAGKRGSPSLQWTSGAMIHGAATRGLLDRLFAVSLAGAGNRLRRPDHRPANAPALSNQINGPVVANVK*
Ga0137419_1003379533300012925Vadose Zone SoilMQEFRPSNDPPLLNLAACRHGIRDKVLVGLVLFTLVRTGEALAGDQASIGNAALNPDVGDFRRSSPAAPALITAPELFTPAAAAKQEFSTTEFRPRKRSMLDADPVANSYGDAPMLRGTTVWQRLSEYRARDRVRVLTLWESGGSNVSLQAGKRGSPSLQWTSGAMIHGAATRGLFDRLFAVSLAGAANRLHHPDRPASAPAPPNQVKVPVVASVK*
Ga0137416_1031683323300012927Vadose Zone SoilHEASEALEDADRAPVSKSPAMSEFLPSNGVLPPDLAARCNGIREKILVGLVLFTLARTGEALAGDQASIESRMPEPDFSKLRQPSPPPRALLTPLEFLAAPTAVDSKIFSATEFRPRKPTVFDSDPTVSSFGEAPMLRGTTVWQRLSEYKSRDRVRVLTLWESNEGTASMKTGRRGDNSLKWTSRSMNRGGSTRGLLDRLFALSLAGASNRLRNSSRSTSAPATAKPVDMPGAPGQK*
Ga0137416_1063515623300012927Vadose Zone SoilMPEFRPSNDPPLLNLAACRHCIRDKVLVGLVLFTLVRTGEALAGDQASIGNAALNPDVGDFRRSSPPAPALITTPELFARAAGDSEAFSTTEFRPRKHSVLDADPVANSFGDAPMLRGTTVWQRLSEYRSHDRVRLLTLWESGGSTLSLQAGKRGSPSLQWTSGAMIHGAATRGLFDRLFAVSLAGAA
Ga0137404_1000290343300012929Vadose Zone SoilMSEFRPSNDVPLLDLAAHCNSIREKMLVGLVLFTLVRTGEVLAGDQTAIADAALDPSVGNLRQSSPPSRSLITAPEFFAVPTASESQKFSATDFRPRKPTVFDRNPTVSSFGDAPMLRGTTVWQRMSEYKSHDRVRVLTLWESSASTVSLQAGRRGAPSLQWTSRSMNRGGSTRGLLDRLFSVSLAGAGNRLRHADRSTSAPAAPKPAETPVVAGIK*
Ga0137404_1008554013300012929Vadose Zone SoilMPEFRSSNNATSLDPAARGNCIRDKVLVELVLFALVRTGEALAGDQASIANPALGPGVGNLRQPSPPAPVLIMSPDLFTAPIAGGPQVFSATDFSPRKRTLLDSDPALNFPGDAPMLHGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGAMIHGAAARGLLDRLFTVSLAGAGNRLRHADRPASAPAAPNPVSGPVTAKVK*
Ga0137407_1209679013300012930Vadose Zone SoilPSAMPEFRPSNNATSLDPAARGNCIRDKVLVELVLFALVRTGEALAGDQASIANPALGPGVGNLRQSSPPAPVLIMSPDLFTAPIAGGPQVFSATDFSPRKRTLLDSDPALNFPGDAPMLHGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPPLQWTSGAMIHGAAARGL
Ga0137410_10004016183300012944Vadose Zone SoilMSEFRRSNSVPLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQTSIANATPDPDFGNLRQPPPPARAFVAPPEFFTAPTAADNQLFSATDFRPRKLTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRSTGAAAATPGQASAPVVAGAK*
Ga0137410_1009414613300012944Vadose Zone SoilMPEFRSSNNATSLDPAARGNCIRDKVLVELVLFALVRTGEALAGDQASIANPALGPGVGNLRQSSPPAPVLIMSPDLFTAPIAGGPQVFSATDFSPRKRTLLDSDPALNFPGDAPMLHGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGAMIHGAAARGLLDRLFMVSLAGASNRLRHADRPASAPAAPNPVSGPVTAKVK*
Ga0182024_1145633513300014501PermafrostLEDADKPIGREKFCTAMPELPPAKDAPPLEHAAGCSLVRDRMLVGLVLFTLVRAGEALAGDEASITNATVDTEFGNLHVVSPATKALLAAPGVFTAPAAADTQVFSATDFRPRKRTVLDSDPTVNSFADAPMIHGTTVWQRLSEYKSHNRVQLLTLWETTGSTVSLQAGKRGDPSLQWTSRLMNRGGSTQGLLDRLFAASLAHAGNGLHNATRSTS
Ga0137418_1004658063300015241Vadose Zone SoilMQEFRPSNDPPLLNLAACRHGIRDKVLVGLVLFTLVRTGEALAGDQASIGNGALKPDFGDFRRSSPAAPALITAPELFTPAAAAKQEFSTTEFRPRKRSMLDADPVANSYGDAPMLRGTTVWQRLSEYRARDRVRVLTLWESGGSNVSLQAGKRGSPSLQWTSGAMIHGAATRGLFDRLFAVSLAGAANRLHHPDRPASAPAPPNQVKVPVVASVK*
Ga0137412_1000119373300015242Vadose Zone SoilMPEFRPSNDPPLLNPAACRHGIRDKVLVGLVLFTLVRTGEALAGDQAWIGNGGLKPDFGDFRRSSPAAPALITAPELFTPAAADKQEFSTTEFRPRRRSMLDADPVSNSFGDAPMLRGTTVWQRLSEYRARDRVRVLTLWESGGSTVSLQAGKRGSPSLQWTSGAMIHGAATRGLLDRLFAVSLAGAGNRLRRPDHRPANAPALSNQINGPVVANVK*
Ga0137409_1010411133300015245Vadose Zone SoilMPEFRPSNNATSLDPAARGNCIRDKVLVELVLFALVRTGEALAGDQASIANPALGPGVGNLRQSSPPAPVLIMSPDLFTAPIAGGPQVFSATDFSPRKRTLLDSDPALNFPGDAPMLHGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGAMIHGAAARGLLDRLFMVSLAGASNRLRHADRPASAPAAPNPVSGPVTAKVK*
Ga0137409_1030260613300015245Vadose Zone SoilPSNIAPLLDPAARGNCIRDKVLVGLVLFTMVRTGEALAGDQASIANAALHPDIGNLRQSSPPSAVLITAPELFAAPIAADREAFSATDFRPRKHTLLDSDPAVNSLGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGALNHGASARGLLDRLFAVSLAGAGNRLRHADRPPSAPAAPNPVGAPVTASVK*
Ga0137403_1005191043300015264Vadose Zone SoilMPEFRPSNNATSLDPAARGKCIRDKVLVELVLFALVRTGEALAGDQASIANPALGPGVGNLRQPSPPAPVLIMSPDLFTAPIAGGPQVFSATDFSPRKRTLLDSDPALNFPGDAPMLHGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGAMIHGAAARGLLDRLFTVSLAGAGNRLRHADRPASAPAAPNPVSGPVTAKVK*
Ga0137408_118560623300019789Vadose Zone SoilVGGPVSKSSVMSEFRPSNDVPLLDLAAHCNSIREKMLVGLVLFTLVRTGEVLAGDQTAIADAALDPSVGNLRQSSPPSRSLITAPEFFAVPTASESQKFSATDFRPRKPTVFDRNPTVSSFGDAPMLRGTTVWQRMSEYKSHDRVRVLTLWESSASTVSLQAGRRGAPSLQWTSRSMNRGGSTRGLLDRLFSVSLAGAGNRLRHADRSTSAPAAPKPAETPVVAGIK
Ga0193729_122681513300019887SoilGIRDKMLVGLVLFTLVRTSEALAGDQMSIGNGTLNADADNLRRSSPPAREVITAPGIFTAPTAPDSQVFSATDFRPRKRTIFDSDPMVNSFVDTPMLRGTTVWQRMSEYKSHDRVRLLTLWESGDSTVSLQAGKRGDPSLQWTSRAMNRGGSTRGLLDRLFAVSLAGVGNRLRHTDRSASAAATPNPVSVPVIASVK
Ga0193728_129657513300019890SoilAPCCNSIRDKLLMGFVLFTLVRSGEALAGDPASIANAALDPDVRNLRQSSPPAPALITAPEFFATPATADRRVFSATDFSPRKHTVLDSDPVTNSFGDTPMLRGTTVWQRMSEYKSHDRVRLITLWESSASTVSLQAGKRGDPSLQWTSRAMNRGGSTRGLLDRLFAVSLAGAGNRLHHADKSTSAAATSNQVSVPVVASVK
Ga0179590_101191633300020140Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK
Ga0179592_1001688553300020199Vadose Zone SoilMSEFLPSNGVLPPDLAARCNGIREKILVGLVLFTLARTGEALAGDQASIAHATPDPDFGNLRQPSSRARALITAPEFFEAPATADSQIFSATDFRPRKPTIFNRDPVANSFGDAPMLRGTTVWQRMSEYKSHDRVRVLTLWESSASTVSLQAGKRGDPSVQWTSRVMSRGESTRGLLDRVFAVSLARAGNRLRHADRPAGAAAPGPAAGVPVIAGAK
Ga0179592_1023517213300020199Vadose Zone SoilGGAPVSKSPAMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRHSDRPTGAAAATPSQASVPVVAGAK
Ga0210407_1047669013300020579SoilMPEISPSNDPPPLGHAARCNGIRDRMLVGLVLFTLVRSSEALAGDQTFAGNSALDPDAGKLRRPSPPATAIMATPQLFSAPAVFDNQGFSPTEFRPRKHTVFDTDPALNSFGDAPMLRGTTVWQRLSEYKSHDRVRLLTLWQSSGGTVSLQAGKHGDPSLQWSSRLMNRGGATQGLLDRLFSVSLARAGNRLRSTARTTNAAATPKQVGVPVVAELK
Ga0210403_1003224473300020580SoilMTEFCPSNDPPHLAPAARCGGIRDKVLVGLVLFTLVRTGEALAGDQASIANAAPNAGADGLRRSSPPVRALIALPPVFAAPNAAESPAFSATEFRPRKRTLFDSDPMVNSFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESNVSTVSLQAGRRGDPSLQWTSRAMNRGESTRGLLDRLFSGSLAGAGSRLRHADRPASPAATPSQASAPVVGSVN
Ga0210399_1108287313300020581SoilNDPPPLGHAARCNGIRDRILVGLVLFTLVRSSEALAGDQTFAGNSALDPDAGKLRRPSPPATAIMATPQLFSAPAVFDNQGFSPTEFRPRKHTVFDTDPALNSFGDAPMLRGSTVWQRLSEYKSHDRVRLLTLWQSSGGTVSLQAGKHGDPSLQWSSRLMNRGGATQGLLDRLFSVSLARAGNRLRSTARTTNAAATPKQVGVPVVAELK
Ga0210406_1005915843300021168SoilMSEFRRSNSVPLLVLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATPDPDFGNLRHSPPPARAFVTAPEFFTAPTAADSQLFSATDFRPRKPTVFDRDPTVNAFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRNSSRSTSALATAKSVDMPGAPGEK
Ga0210406_1008478743300021168SoilMSEFRPSNDAPLLDRAARCNGIRDKMLVGLVLFTLVRASEALAGDQASIINATVTPDIGKLRLSSPSSTAITATPGAFSAPAATDNPVFSATEFRPRKHTVFDSDPTVNSFDTPMLRGTTVWQRLAEYRSHDRVRLLTLWESSGSSVSLQAGKGGNPSLQWTSRLMNRGGSTEGLLDRLFSVSLAHAGNGLRNAARATNAPPTPTPASTPGVAGLK
Ga0210406_1013485423300021168SoilMPESRPSNDAPPLDLAARSNGIRDKMLVGLVLFTLVRSSEALAGDQMSIANGTLNADADNLRRTSPPARALITVPSVFAAPGVFTAPTAPDSQAFSATDFRPRKHTIFDSDPVVNSFVDTPMLRGTTVWQRMTEYRSHDRVRLLTLWESGDNTVSLQAGKRGDPSLQWTSRAMNRGGSTRGLLDRLFAASIAGASNRLRHTDRSASAAATPNPVSVPVIASVK
Ga0210406_1020804633300021168SoilMSEFVSSNSVPLLDLAARCNSIREKMLVGLVLFTLVRTGEALAGDQASIGNATLDPGGGNLRQSSPPAHPLIRAPGFFAMPADTESQIFSATEFRPRKRTIFDHDPIANSVGDAPMLRGTTVWQRMSEYKSHDRVRVLTLWESSGSSVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLLAVSLAGAGNRLRHADRPTGSPPPPSQVSVPVVVSAK
Ga0210406_1075953013300021168SoilMPEISPSNDPPPLGHTARCNGIRDRMLVGLVLFTLVRSSEALAGDQTFAGNSALDPDAGKLRRPSPPATAIMATPQLFSAPAVFDNQGFSPTEFRPRKHTVFDTDPALNSFGDAPMLRGTTVWQRLSEYKSHDRVRLLTLWQSSGGTVSLQAGKHGDPSLQWSSRLMNRGGATQGLLDRLFSVSLARAGNRLR
Ga0210406_1104021513300021168SoilMQEFRPSNDPPLLDLAARRHGIRDKVLVGLVLFTLVRSGEALAGDQAWIGNGALNPEVGDFRRSSPPAPALITAPELFAPAASDSQAFSTTDFRPRKHSMLDADPVANSFGDAPMLRGTTVWQRLSEYRAHDRVRVLTLWESGGSTVSLQAGKRGSPSLQWTSGAMLHGASTRGLLDRLFAV
Ga0210400_1004560313300021170SoilLDLAGVRHGIRDKVLVGLVLFTLVRTGEALAGDQAWIGNGALNPDIGDFRRSPPPAPALITAPELFAPAAGDSQAFSTTEFRPRKHSVLDADPVANSFGDAPMLRGTTVWQRLSEYRAHDRVRVLTLWESGGSTVSLQAGKRGSPSLQWTSGAMLHGASARGLLDRLFAVSLAGPGNRLRHPDRPSSAPTPANQVNVPVVAASVK
Ga0210400_1009546423300021170SoilMSEFRPSSIVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQASMANATLDPDVGNLRQSSPPARALITAPDFFAAPAAADSQIFSATDFRPRKRTVFDRDPVVNSFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRHSDRPTGTAATPSQVSVPVVAGAK
Ga0210408_1016133233300021178SoilMSEFRRSNSVPLLVLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATPDPDFGNLRHSPPPARAFVTAPEFFTAPTAADSQLFSATDFRPRKSTVFDRDPTVNAFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGASNRLRHADRPTGSVP
Ga0210396_1072879023300021180SoilMQEFRPSNDPPLLDLAARRHGIRDKVLVGLVLFTLVRTGEALAGDQAWIGNGALNPDVGDFRRASPPAPALITAPELFAPAAGDSQAFSTTDFRPRKHSMLDADPVANSFGDAPMLRGTTVWQRLSEYRAHDRVRVLTLWESGGSTVSLQAGKRGSPSLQWTSGAMLHGAATRGLLDRLFAVSLAGAGN
Ga0210389_1013017013300021404SoilMQEFRPSNDPPLLDLAGFRHGIRDKVLVGLVLFTLVRTGEALAGDQAWIGNGALNPDVGDFRRSSPPAPALITAPELFAPAAGDSQAFSTTDFRPRKHSMLDADPVANSFGDAPMLRGTTVWQRLSEYRAHDRVRVLTLWESGGSTVSLQAGKRGSPSLQWTSGAMLHGASARGLLDRLFAVSLAGTGNRLRHPDRPSSAPTPANQVNVPVVAASVK
Ga0210387_1179824013300021405SoilKVLVGLVLFTLVRSGEALAGDQAWIGNGALNPEVGDFRRSSPPAPALITAPELFAPAASDSQAFSTTDFRPRKHSMLDADPVANSFGDAPMLRGTTVWQRLSEYRAHDRVRVLTLWESGGSTVSLQAGKRGSPSLQWTSGAMLHGASTRGLLDRLFAVSLAGAGNRLRHPD
Ga0210386_1011469733300021406SoilMLEFRPSNDAPQLELAARCNTIRDRVLVSCPVNTLLAAPRLFAPGGVAPPCPCPVYISASSARLARHENSRADRARYLRDRTLVGLVLFTLVRTGEVLAGDQASIANATLNADADHLHRSSPPARAAFVAAPDAFAAPTATGSQAFSTTDFRPRKRTIFDSDPLVNPFGDAPMLRGTTVWQRMSEYRSHDRVRVLTLWESSDSTVSLQAGKRGGPSLQWTSRWMNHGESTRGLLDRLFSVSLAGAANRLRHADRPASAAAAPTPASVPVIAGAN
Ga0210394_1002141163300021420SoilMSEIRTDDDAPPLDDAAHCTSIRDRIFVGLVLITLVRTGEALAGDHASINNANLDRGVGDFRQLSPSAPAATATPGFFTAPTDAGRQIFSATDFRPRKHTVFDTDPAVNAFGDAPMLQGTTVWQRMSEYKSHDGVRLLTLWETRGSTLSLQAGNRGDPSLQWTSRTMNRGGSTRGLFDRLFSIS
Ga0210394_1075858713300021420SoilFEPSGVAPPCHSARYGSSARLARHENSLADRARYLRDRTLVGLVLFTLVRTGEVLAGDQASIANATLNADADHLHRSSPPARAAFVAAPDAFAAPTATGSQAFSTTDFRPRKRTIFDSDPLVNPFGDAPMLRGTTVWQRMSEYRSHDRVRVLTLWESSDSTVSLQAGKRGDPSLQWTSRWMNHGESTRGLLDRLFSVSLAGAANRLRHADRPASAAAAPTPASVPVIAGAN
Ga0210392_1023052123300021475SoilMQEFRPSNDPPLLDLAARRHGIRDKVLVGLVLFTLVRSGEALAGDQAWIGNGALNPEVGDFRRSSPPAPALITAPELFAPAAGDSQAFSTTDFRPRKHSMLDADPVANSFGDAPMLRGTTVWQRLSEYRAHDRVRVLTLWESGGSTVSLQAGKRGSPSLQWTSGAMLHGASTRGLLDRLFAVSLAGAGNRLRHPDRPASAPTPTNQVNVPVAAASVK
Ga0210402_1004363563300021478SoilMPEISPSNDPPPLGHTARCNGIRDRMLVGLVLFTLVRSSEALAGDQTFAGNSALDPDAGKLRRPSPPATAIMATPQLFSAPAVFDNQGFSPTEFRPRKHTVFDTDPALNSFGDAPMLRGTTVWQRLSEYKSHDRVRLLTLWQSSGGTVSLQAGKHGDPSLQWSSRLMNRGGATQGLLDRLFSVSLARAGNRLRSTARTTNAAATPKQVGVPVVAELK
Ga0210402_1054386523300021478SoilMSEFRRSNSVPLLVLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATPDPDFGNLRHSPPPARAFVTAPEFFRAPTAADSQLFSATDFRPRKPTVFDRDPTVNAFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRNSSRSTSALATAK
Ga0210410_1016716313300021479SoilMSEFRPSNRVPLLDLAARSNSIRDKMLVGLVLFTLARTGEALAGEQASIANAKLDSDVGNLPRSSEPARASITAPDFFAAPTVVDSQLFSTTDFRPRKPTVFNHDPTVSTFDDAPMLRSTTIWQRMSEYRSRDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRLMSRGESTRGLLDRLFAVSLAGAGNRLRHADRPTGPAAPTPSQASVPVVAGSK
Ga0242658_118306013300022530SoilVLFTLVRSSEALAGDQMSIANGTLNADADNLRRTSPPARALITAPSVFAAPGVFTAPGVFTAPTAPDSQAFSATDFRPRKRTILDSDPVVNSFVDTPMLRGTTVWQRMTEYRSHDRVRLLTLWESGDNTVSLQAGKRGDPSLQWTSRAMNRGGSTRGLLDRLFAASIAGASNRLRHTDRSASAAAT
Ga0242662_1015809113300022533SoilAPVSKSPAMSEFRRSNSVPLLVLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATPDPDFGNLRHSPPPARAFVTAPEFFRAPTAADSQLFSATDFRPRKPTVFDRDPTVNAFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAATAATTPSQAGVPVVAGAK
Ga0212123_1003260543300022557Iron-Sulfur Acid SpringMSEFRPSNSVPLLDLAARCNSIREKMLVGLVLFTLVRTGEALAGDQVSIANATPDPDFGNLRQPSPPARAFVTAPEFFTAPAAADSRLFSATDFRPRKPTVFDRDPTVNAFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRIRRSDRPTGAAATPSQAGVPVVAGAK
Ga0212123_1013891433300022557Iron-Sulfur Acid SpringMPEFPPSDDALQLDLAALCNSLRDKMLVGLVLFTLVRSGEALAGDQASIANATLNAGADHLRRSSLPVRALITAPEIFAVPPPAESQVFSTTDFRPRKRTIFDRDPMTNSFGDAPMLRGTTVWQRMSEYKSRDRVRLLTLWESSASTVSLQAGKRGDPSLQWTSRAMNRGGSTRGLLDRLFSVSLAGAGNRLRRTDRPASAAATAGQASVPVIASSNR
Ga0212123_1066673913300022557Iron-Sulfur Acid SpringLDLAARSNGIRDKMLVGLVLFTLVRSGEALAGDQMSIANGTLNADADNLRRSSPPARALITAPGVFAAPGVFAAPTAPDSQAFSATDFRPRKHTLFNSDPVVNSFVDAPMLRGTTVWQRMSEYKSHDRVRLLTLWESGDSTVSLQAGKRGDPSLQWTSRAMNRGGSTRGLLDRLFAVSLAGAGNRLRHTDRSASAAATPNPVSVPVIASVK
Ga0209131_103708333300026320Grasslands SoilMPEFRPSNDPLLLNLAACRHCIRDKVLVGLVLFTLVRTGEALAGDQASIGNAALNPDVGDFRRSSPPAPALITTPELFARAAGDSEAFSTTEFRPRKHSVLDADPVANSFGDAPMLRGTTVWQRLSEYRSHDRVRLLTLWESGGSTLSLQAGKRGSPSLQWTSGAMIHGAATRGLFDRLFAVSLAGAGSRLRHPDRPASAPAPPNQVKVPVAASVK
Ga0257178_105316113300026446SoilAMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGL
Ga0257172_106441113300026482SoilMSEFRPSNSVPLLDLAARCNRIRDKMLVGLVLFTLVRTGEALAGDQPWIANATLDPDFGNLRQPPPARAFVTAPGFFTAPTAADGQLFSATDFRPRKPTVFDRDPTVNAFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRHSD
Ga0257168_108210113300026514SoilGAPVSKSPAMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRHSDRPTGAAAATPSHASVPVVAGAK
Ga0179587_1035435723300026557Vadose Zone SoilMSEIRPHDDAPPPDDAGRCNSIRDRIFVGLVLITLVRTGEALAGDQASINNAAFDRDGGNVRHLSPSAPAAIATPGFFTAPTDAGHPVFSATDFRPRKHSVFDTDPAVNAFAEAPMLQGTTVWQRMSEYKSHDGVRLLTLWESRGSTLSLQAGNRGDPSLQWTSRTMNRGGSTRGVFDRLFSISISGAGSGLRNASRPTNAPAAPKPLGVPAVATLK
Ga0209179_101433433300027512Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK
Ga0209588_106167833300027671Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSICDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPAVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRSTGAAAATPGQASAPVVAGAK
Ga0209166_1032990413300027857Surface SoilMSEFRPFNIVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQASMANAALDPDAGNLRQSSPPIRALITAPEFFAAPTADSQNFSATDFRPRKRTVFDRDPVANSFGDAPMLRGTTVWQRMSEYRSQDRVRLLTLWESSGSTVSLQAGKRGGPSLQWTSRSLNHGGSTRGLLDRLFAVSLAGAGGRLRHADRPTGAAAAPSQPSQ
Ga0209488_10003281123300027903Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSICDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRSTGAAAATPGQASAPVVAGAK
Ga0209006_1072482723300027908Forest SoilLFTLVRTGEALAGDQAWVGNGALNPDVGDFRRSSPPAPALVTAPELFAPAAGDSQAFSTTEFRPRKHSVLDADPDANSFGDAPMLRGTTVWQRLSEYRAHDRVRVLTLWESGGSAVSLQAGKRGSPSLQWTSGTMLHGAATRGLLDRLFAVSLAGAGNRLRHPDRPASAPTPTNQVNVPVVGASVK
Ga0137415_1014738633300028536Vadose Zone SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDHVSIANATLDPNFGNLRQPPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK
Ga0137415_1018940723300028536Vadose Zone SoilMPEFRPSNDPPLLNLAACRHCIRDKVLVGLVLFTLVRTGEALAGDQASIGNAALNPDVGDFRRSSPPAPALITTPELFAPAAGDSQAFSTTEFRPRKHSVLDADPVANSFGDAPMLRGTTVWQRLSEYRSHDRVRLLTLWESGGSTLSLQAGKRGSPSLQWTSGAMIHGAATRGLFDRLFAVSLAGAANRLHHPDRPASAPAPPNQVKVPVVASVK
Ga0137415_1021794223300028536Vadose Zone SoilMSEFRPSNIAPLLDPAARGNCIRDKVLVGLVLFTLVRTSEALAGDQASIANAALHPDIGNLRQSSPPSAVLITAPELFAAPIAADREAFSATDFRPRKHTLLDSDPAVNSLGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESGGSTVSLQAGKRGAPSLQWTSGALNHGASARGLLDRLFAVSLAGAGNRLRHADRPPSAPAAPNPVGAPVTASVK
Ga0075374_1132504713300030855SoilRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIVNATPDPDFGNLFRSPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVTAFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAVTPSQASVPVVAGAK
Ga0075391_1122605613300030934SoilSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATLDPNFGNLRQPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRHS
Ga0170834_10063857033300031057Forest SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATLDPNFGNLRQPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVTAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK
Ga0170834_10567059013300031057Forest SoilMTEFRPSNVPLLDLAVRSNSICDRMLVGLVLFTLVRTGEALAGEQAPITFATLDPGIGIARQSSPPVGSLITAPEFFGAPTMADNQIFSTTDFRPRKRTVFDRDPVVTSFGDAPMLRGTTVWQRMSEYKSRDRVRLLTIWESNASTVSLQAGRKGDPSLQWTSRSMNIGGSTRGLLD
Ga0170822_1546820513300031122Forest SoilNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIVNATPDPDFGNLRQPPPPARAFVTAPEFFKAPAAADSQLFSATDFRPRKPTVFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRHSDRPTGAPAATPSQASVPVVAGAK
Ga0170823_1388317743300031128Forest SoilMSEFRRSNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATLDPNFGNLRQPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVNAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVVAGAK
Ga0170824_10314260213300031231Forest SoilMSREASAWLEDAERAPVSKSPAMTEFRPSNVPLLDLAVRSNSICDRMLVGLVLFTLVRTGEALAGEQAPITFATLDPGIGIARQSSPPVGSLITAPEFFGAPTMADNQIFSTTDFRPRKRTVFDRDPVVTSFGDAPMLRGTTVWQRMSEYKSRDRVRLLTLWESNASTVSLQAGRKGDPSLQWTSRSMNIGGSTRGLLDRLFSVSLAGAGNRLRRTDRSTGAAAANQVSVPVVANAK
Ga0170824_11535890343300031231Forest SoilSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIANATLDPNFGNLRQPPPARAFVTAPEFFTAPTAADRQLFSATDFRPRKPTLFDRDPTVTAFGDAPMLRGSTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRHSDRPTGAPAATPSQASVPVVAGAK
Ga0170824_12157887713300031231Forest SoilPAMSEFRPTNSVPLLDLAARCNSIRDKMLVGLVLFTLVRTGEALAGDQVSIVNATPDPDFGNLRQPPPPARAFVTAPEFFKAPAAADSQLFSATDFRPRKPTVFDRDPTVNAFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSASTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPTGAAAATPSQASVPVV
Ga0302326_1082497113300031525PalsaMPELPPAKDAPPLEHAARCSLVRDRMLVGLVLFTLVRAGEALAGDEASITNATVDTEFGNLHVVSPATKALLAAPGVFTAPAAADTQVFSATDFRPRKRTVLDSDPTVNSFADAPMIHGTTVWQRLSEYKSHNRVQLLTLWETTGSTVSLQAGKRGDPSLQWTSRLMNRGGATRGLLDRLFPVSAINEGSGSRTAPRPANP
Ga0307476_1025115023300031715Hardwood Forest SoilMSEFRPSNDAPLLDRAARYNGIRDRMLVGLVLFALARTSEALAGDQASIINATAARDVGKLRLSSPSNTAITATPEAFSAPLATDNPIFSATDFRPRKHTVFDTGPTGNFFGDTPMLRDTTVWQRLAQYRSHDRVQLLTLWASSGSSVSLQAGKRGDPSLQWTSRLMNRGGSTQGLLDRLFSVSLARAGNRLRSATPPTNATAIPGPPSTPAVAGLK
Ga0307477_1022528033300031753Hardwood Forest SoilFRPSNRVPSLDLPARCNGIRDKMLVGLVLFTLVRTGEALAGEQASIANATLDSGAGNLRQSSQPTRASITTPDFFAAPTVVDSPLFSTTDFRPRKPTVFDRDPTVNTFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRLMSRGESTRGLLDRLFAVSLAGAGNRLRHADRPTSSAAPIPSQATVPVVAGAK
Ga0307475_1061470213300031754Hardwood Forest SoilMSEFRPSNRVPSLDLPARCNGIRDKMLVGLVLFTLVRTGEALAGDQVSIANATRDPDFGNLRQPPPPARAFVTAPEFFTAPTATDSQLFSATEFRPRKPTVFDRDPTVNAFDDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSESTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGNRLRRSDRPSGPAAATPSQASVPVVAGAK
Ga0307478_1015114313300031823Hardwood Forest SoilMSEFRPSNRVPSLDLPARCNGIRDKMLVGLVLFTLVRTGEALAGEQASIANATLDSGVGNLRQSSQPTRASITTPDFFAAPTVVDSPLFSTTDFRPRKPTVFDRDPTVNTFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRLMSRGESTRGLLDRLFAVSLAGAGNRLRHADR
Ga0307479_1060715523300031962Hardwood Forest SoilMPEFSPSNRVPLLDHAARCNSIRDKMLVGLVLFTLVRTGEALAGEQASIANATLDSGVGNLRQSSQPTGASIAAPDFFAAPTVVDSPLFSTTDFRPRKPTVFDRDPTVNTFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSGSTVSLQAGKRGDPSLQWTSRLMSRGESTRGLLDRLFAVSLAGAGNRLRHADRPPGAATATPSQVTVPVVAGAK
Ga0307470_10000435113300032174Hardwood Forest SoilMLVGLVLLTLIPAGEALAGDQASIANTASNPGAGNFRRSSPPVPAMITAPELFAPPADDSPAFSTTEFRPRKRSVLDIDPPANTFADAPMLRGTTVWQRLQEYRSHDRVRLLTLWESGGSTVSLQAGKGGSPSLQWTSGAMMHGAAARGLLDRLFTVSLAGAGNRLRHPDRPASAPAPAAATSPASVPVVASMK
Ga0307470_1000066483300032174Hardwood Forest SoilMSECRPSNSVPLLDLAARCNSIRDKMLVGLVLVTLVRTGEALAGDQASIANATLDPDFGNLRQPPPARALVTAPEFFTAPTAADSPLFSATDFRPRKPTIFDRDSTVNGFGDAPMLRGTTVWQRMSEYRSHDRVRLLTLWESSVSTVSLQAGKRGDPSLQWTSRSMNRGGSTRGLLDRLFAVSLAGAGSRLRHSDRPTGPTAATPSQTSVPVVAGAK
Ga0370515_0181863_2_6223300034163Untreated Peat SoilMSEFRPSNDAPVLDRAARCNGIRDRMLVGLVLFTLVRASEALAGDQASIINAAVTQDMGKLRLSSPTSTAISATLGAFSAPVTSDNPVFSATEFRPRKHTVFDSDPAVNSFGDTPMLRGTTVWQRLAEYRSHDRVRLLTLWESSGSSVSLQAGKRGDPSLQWTSRLMNRGGSTEGLLDRLFSVSLARAGNGLRNATRSTNAPPTLTP
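
Each row above fuses its four fields (protein ID, sample taxon ID, habitat, sequence) into one string. The Python sketch below shows one way to separate them again. It rests on assumptions drawn only from the rows listed here: taxon IDs are 10 digits beginning with `3300`, habitat names start with a capital and end with a lowercase letter, and sequences are uppercase residue letters with an optional trailing `*`. Reading that `*` as a translated stop codon follows common protein-sequence convention and is also an assumption. The function and field names are hypothetical.

```python
import re

# Assumed layout of one flattened family-sequence row:
#   <protein ID><taxon ID: 3300 + 6 digits><habitat><SEQUENCE>[*]
FAMILY_ROW = re.compile(
    r"^(?P<protein_id>\S+?)"                 # lazy: stops where the taxon ID fits
    r"(?P<taxon_id>3300\d{6})"               # 10-digit sample taxon ID
    r"(?P<habitat>[A-Z][A-Za-z\- ]*[a-z])"   # e.g. "Iron-Sulfur Acid Spring"
    r"(?P<sequence>[A-Z]+)"                  # amino-acid one-letter codes
    r"(?P<stop>\*?)$"                        # optional trailing asterisk
)


def parse_family_row(line: str) -> dict:
    """Split one flattened row into protein ID, taxon ID, habitat and sequence."""
    match = FAMILY_ROW.match(line.strip())
    if match is None:
        raise ValueError(f"row does not fit the assumed layout: {line[:40]!r}")
    fields = match.groupdict()
    # Record the asterisk as a flag so the sequence holds residues only.
    fields["has_stop"] = fields.pop("stop") == "*"
    fields["length"] = len(fields["sequence"])
    return fields


if __name__ == "__main__":
    # Sequence shortened for illustration; the ID prefix is taken from a row above.
    example = "JGI25613J43889_1001807513300002907Grasslands SoilMPEFRPSNDPLLLNLAACRH*"
    print(parse_family_row(example))
```

Because the habitat is the only part of a row that contains lowercase letters, the split between habitat and sequence falls after the last lowercase character. Protein IDs of varying shape, such as `Ga0370515_0181863_2_622`, are handled by anchoring on the 10-digit taxon ID that immediately precedes the habitat.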




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice