NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F068273

Metagenome / Metatranscriptome Family F068273

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F068273
Family Type Metagenome / Metatranscriptome
Number of Sequences 125
Average Sequence Length 177 residues
Representative Sequence MTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Number of Associated Samples 100
Number of Associated Scaffolds 125

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 4.84 %
% of genes near scaffold ends (potentially truncated) 42.40 %
% of genes from short scaffolds (< 2000 bps) 59.20 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score0.36

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.200 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(37.600 % of family members)
Environment Ontology (ENVO) Unclassified
(43.200 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(59.200 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 61.54%    β-sheet: 0.00%    Coil/Unstructured: 38.46%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.36
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 125 Family Scaffolds
PF05173DapB_C 24.00
PF02774Semialdhyde_dhC 21.60
PF01118Semialdhyde_dh 19.20
PF00701DHDPS 7.20
PF07521RMMBL 4.00
PF00312Ribosomal_S15 1.60
PF03167UDG 1.60
PF09594GT87 0.80
PF00154RecA 0.80
PF02912Phe_tRNA-synt_N 0.80
PF00575S1 0.80
PF03725RNase_PH_C 0.80
PF04296YlxR 0.80
PF03726PNPase 0.80

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 125 Family Scaffolds
COG02894-hydroxy-tetrahydrodipicolinate reductaseAmino acid transport and metabolism [E] 24.00
COG0002N-acetyl-gamma-glutamylphosphate reductaseAmino acid transport and metabolism [E] 21.60
COG0136Aspartate-semialdehyde dehydrogenaseAmino acid transport and metabolism [E] 21.60
COG03294-hydroxy-tetrahydrodipicolinate synthase/N-acetylneuraminate lyaseCell wall/membrane/envelope biogenesis [M] 14.40
COG0184Ribosomal protein S15P/S13ETranslation, ribosomal structure and biogenesis [J] 1.60
COG0692Uracil-DNA glycosylaseReplication, recombination and repair [L] 1.60
COG1185Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase)Translation, ribosomal structure and biogenesis [J] 1.60
COG1573Uracil-DNA glycosylaseReplication, recombination and repair [L] 1.60
COG3663G:T/U-mismatch repair DNA glycosylaseReplication, recombination and repair [L] 1.60
COG0016Phenylalanyl-tRNA synthetase alpha subunitTranslation, ribosomal structure and biogenesis [J] 0.80
COG0468RecA/RadA recombinaseReplication, recombination and repair [L] 0.80
COG0689Ribonuclease PHTranslation, ribosomal structure and biogenesis [J] 0.80
COG2123Exosome complex RNA-binding protein Rrp42, RNase PH superfamilyIntracellular trafficking, secretion, and vesicular transport [U] 0.80
COG2740Nucleoid-associated protein YlxR, Predicted RNA-binding, DUF448 familyGeneral function prediction only [R] 0.80


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.20 %
UnclassifiedrootN/A0.80 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000887|AL16A1W_10001948All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium558Open in IMG/M
3300000887|AL16A1W_10831198All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium570Open in IMG/M
3300001086|JGI12709J13192_1002388All Organisms → cellular organisms → Bacteria2494Open in IMG/M
3300001334|A2165W6_1026146All Organisms → cellular organisms → Bacteria → Terrabacteria group583Open in IMG/M
3300001334|A2165W6_1067811All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium595Open in IMG/M
3300001359|A3035W6_1002857All Organisms → cellular organisms → Bacteria1874Open in IMG/M
3300001361|A30PFW6_1008252All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium543Open in IMG/M
3300001538|A10PFW1_11971950All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium763Open in IMG/M
3300001593|JGI12635J15846_10003372All Organisms → cellular organisms → Bacteria13902Open in IMG/M
3300002558|JGI25385J37094_10005582All Organisms → cellular organisms → Bacteria4432Open in IMG/M
3300002560|JGI25383J37093_10110189All Organisms → cellular organisms → Bacteria → Proteobacteria789Open in IMG/M
3300002561|JGI25384J37096_10025023All Organisms → cellular organisms → Bacteria2324Open in IMG/M
3300002562|JGI25382J37095_10017653All Organisms → cellular organisms → Bacteria2740Open in IMG/M
3300002911|JGI25390J43892_10032657All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1256Open in IMG/M
3300002912|JGI25386J43895_10011363All Organisms → cellular organisms → Bacteria2547Open in IMG/M
3300005167|Ga0066672_10019661All Organisms → cellular organisms → Bacteria3507Open in IMG/M
3300005171|Ga0066677_10001222All Organisms → cellular organisms → Bacteria9031Open in IMG/M
3300005174|Ga0066680_10012988All Organisms → cellular organisms → Bacteria4345Open in IMG/M
3300005176|Ga0066679_10000530All Organisms → cellular organisms → Bacteria13143Open in IMG/M
3300005177|Ga0066690_10506663All Organisms → cellular organisms → Bacteria → Proteobacteria812Open in IMG/M
3300005178|Ga0066688_10004047All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes6479Open in IMG/M
3300005178|Ga0066688_10372391All Organisms → cellular organisms → Bacteria925Open in IMG/M
3300005181|Ga0066678_10194737All Organisms → cellular organisms → Bacteria1292Open in IMG/M
3300005186|Ga0066676_10006454All Organisms → cellular organisms → Bacteria5500Open in IMG/M
3300005445|Ga0070708_100069261All Organisms → cellular organisms → Bacteria3172Open in IMG/M
3300005447|Ga0066689_10368852All Organisms → cellular organisms → Bacteria896Open in IMG/M
3300005552|Ga0066701_10341580All Organisms → cellular organisms → Bacteria928Open in IMG/M
3300005554|Ga0066661_10808802All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Variovorax → unclassified Variovorax → Variovorax sp. URHB0020548Open in IMG/M
3300005555|Ga0066692_10033917All Organisms → cellular organisms → Bacteria2729Open in IMG/M
3300005586|Ga0066691_10058485All Organisms → cellular organisms → Bacteria2085Open in IMG/M
3300006041|Ga0075023_100165102All Organisms → cellular organisms → Bacteria828Open in IMG/M
3300006047|Ga0075024_100056158All Organisms → cellular organisms → Bacteria1654Open in IMG/M
3300006797|Ga0066659_10024821All Organisms → cellular organisms → Bacteria3497Open in IMG/M
3300006797|Ga0066659_10185281All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1511Open in IMG/M
3300006800|Ga0066660_10402936All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1130Open in IMG/M
3300007255|Ga0099791_10286308All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium784Open in IMG/M
3300007258|Ga0099793_10094951All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1374Open in IMG/M
3300009038|Ga0099829_10241811All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1470Open in IMG/M
3300009038|Ga0099829_10861258All Organisms → cellular organisms → Bacteria752Open in IMG/M
3300009038|Ga0099829_11489067All Organisms → cellular organisms → Bacteria → Terrabacteria group559Open in IMG/M
3300009088|Ga0099830_10017487All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes4585Open in IMG/M
3300009088|Ga0099830_10176911All Organisms → cellular organisms → Bacteria1658Open in IMG/M
3300009088|Ga0099830_10901557All Organisms → cellular organisms → Bacteria731Open in IMG/M
3300009089|Ga0099828_10205527All Organisms → cellular organisms → Bacteria1758Open in IMG/M
3300009089|Ga0099828_10275579All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1511Open in IMG/M
3300009090|Ga0099827_10206965All Organisms → cellular organisms → Bacteria1634Open in IMG/M
3300009090|Ga0099827_10493037All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia1052Open in IMG/M
3300010304|Ga0134088_10071834All Organisms → cellular organisms → Bacteria1609Open in IMG/M
3300010333|Ga0134080_10171484All Organisms → cellular organisms → Bacteria928Open in IMG/M
3300010336|Ga0134071_10000511All Organisms → cellular organisms → Bacteria12846Open in IMG/M
3300010905|Ga0138112_1084403All Organisms → cellular organisms → Bacteria → Terrabacteria group550Open in IMG/M
3300011269|Ga0137392_10029849All Organisms → cellular organisms → Bacteria3941Open in IMG/M
3300011269|Ga0137392_10151995All Organisms → cellular organisms → Bacteria1869Open in IMG/M
3300011269|Ga0137392_11323419All Organisms → cellular organisms → Bacteria → Terrabacteria group579Open in IMG/M
3300011270|Ga0137391_10250368All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1536Open in IMG/M
3300012096|Ga0137389_10106587All Organisms → cellular organisms → Bacteria2236Open in IMG/M
3300012189|Ga0137388_10093487All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia2565Open in IMG/M
3300012189|Ga0137388_10367562All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1328Open in IMG/M
3300012203|Ga0137399_10437514All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1093Open in IMG/M
3300012205|Ga0137362_10085262All Organisms → cellular organisms → Bacteria2641Open in IMG/M
3300012209|Ga0137379_11744789All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium518Open in IMG/M
3300012349|Ga0137387_10589473All Organisms → cellular organisms → Bacteria806Open in IMG/M
3300012351|Ga0137386_11037776All Organisms → cellular organisms → Bacteria → Terrabacteria group582Open in IMG/M
3300012363|Ga0137390_10202909All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1970Open in IMG/M
3300012363|Ga0137390_10716133All Organisms → cellular organisms → Bacteria962Open in IMG/M
3300012363|Ga0137390_11768596All Organisms → cellular organisms → Bacteria → Terrabacteria group550Open in IMG/M
3300012392|Ga0134043_1208054All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium976Open in IMG/M
3300012396|Ga0134057_1189610All Organisms → cellular organisms → Bacteria679Open in IMG/M
3300012398|Ga0134051_1365728All Organisms → cellular organisms → Bacteria → Terrabacteria group1131Open in IMG/M
3300012401|Ga0134055_1112039All Organisms → cellular organisms → Bacteria → Terrabacteria group695Open in IMG/M
3300012685|Ga0137397_10007126All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi7763Open in IMG/M
3300012918|Ga0137396_10219986All Organisms → cellular organisms → Bacteria1398Open in IMG/M
3300012925|Ga0137419_10312034All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1206Open in IMG/M
3300012927|Ga0137416_10009174All Organisms → cellular organisms → Bacteria5839Open in IMG/M
3300012927|Ga0137416_10803560All Organisms → cellular organisms → Bacteria832Open in IMG/M
3300012927|Ga0137416_10955135All Organisms → cellular organisms → Bacteria → Terrabacteria group764Open in IMG/M
3300012977|Ga0134087_10651813All Organisms → cellular organisms → Bacteria → Terrabacteria group552Open in IMG/M
3300013294|Ga0120150_1001080All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia7644Open in IMG/M
3300013764|Ga0120111_1118065All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium618Open in IMG/M
3300013765|Ga0120172_1033052All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1416Open in IMG/M
3300014154|Ga0134075_10038870All Organisms → cellular organisms → Bacteria1949Open in IMG/M
3300014823|Ga0120170_1046381All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium996Open in IMG/M
3300015052|Ga0137411_1152285All Organisms → cellular organisms → Bacteria2522Open in IMG/M
3300015359|Ga0134085_10064166All Organisms → cellular organisms → Bacteria1485Open in IMG/M
3300017659|Ga0134083_10017620All Organisms → cellular organisms → Bacteria2486Open in IMG/M
3300018431|Ga0066655_10001425All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia8610Open in IMG/M
3300018468|Ga0066662_10015361All Organisms → cellular organisms → Bacteria4131Open in IMG/M
3300020581|Ga0210399_10097045All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia2408Open in IMG/M
3300021046|Ga0215015_10957153All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1037Open in IMG/M
3300021432|Ga0210384_10763200All Organisms → cellular organisms → Bacteria864Open in IMG/M
3300021559|Ga0210409_10227022All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1695Open in IMG/M
3300021559|Ga0210409_10252525All Organisms → cellular organisms → Bacteria1597Open in IMG/M
3300026277|Ga0209350_1064400All Organisms → cellular organisms → Bacteria1039Open in IMG/M
3300026297|Ga0209237_1010445All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia5538Open in IMG/M
3300026298|Ga0209236_1001016All Organisms → cellular organisms → Bacteria16681Open in IMG/M
3300026301|Ga0209238_1048504All Organisms → cellular organisms → Bacteria1532Open in IMG/M
3300026310|Ga0209239_1004663All Organisms → cellular organisms → Bacteria7858Open in IMG/M
3300026318|Ga0209471_1000219All Organisms → cellular organisms → Bacteria41872Open in IMG/M
3300026322|Ga0209687_1005258All Organisms → cellular organisms → Bacteria4351Open in IMG/M
3300026324|Ga0209470_1079926All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1499Open in IMG/M
3300026329|Ga0209375_1007941All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes6793Open in IMG/M
3300026331|Ga0209267_1189119All Organisms → cellular organisms → Bacteria → Terrabacteria group802Open in IMG/M
3300026333|Ga0209158_1006497All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia6175Open in IMG/M
3300026334|Ga0209377_1007997All Organisms → cellular organisms → Bacteria6175Open in IMG/M
3300026524|Ga0209690_1003087All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi9512Open in IMG/M
3300026527|Ga0209059_1004398All Organisms → cellular organisms → Bacteria6337Open in IMG/M
3300026532|Ga0209160_1035191All Organisms → cellular organisms → Bacteria3094Open in IMG/M
3300026551|Ga0209648_10000605All Organisms → cellular organisms → Bacteria27615Open in IMG/M
3300026551|Ga0209648_10077003All Organisms → cellular organisms → Bacteria2793Open in IMG/M
3300026551|Ga0209648_10103655All Organisms → cellular organisms → Bacteria2326Open in IMG/M
3300027587|Ga0209220_1000015All Organisms → cellular organisms → Bacteria116182Open in IMG/M
3300027655|Ga0209388_1003361All Organisms → cellular organisms → Bacteria3958Open in IMG/M
3300027671|Ga0209588_1025154All Organisms → cellular organisms → Bacteria1882Open in IMG/M
3300027846|Ga0209180_10001973All Organisms → cellular organisms → Bacteria10275Open in IMG/M
3300027846|Ga0209180_10575994All Organisms → cellular organisms → Bacteria624Open in IMG/M
3300027846|Ga0209180_10669077All Organisms → cellular organisms → Bacteria → Terrabacteria group567Open in IMG/M
3300027862|Ga0209701_10025570All Organisms → cellular organisms → Bacteria3839Open in IMG/M
3300027862|Ga0209701_10080187All Organisms → cellular organisms → Bacteria2058Open in IMG/M
3300027875|Ga0209283_10370888All Organisms → cellular organisms → Bacteria936Open in IMG/M
3300027882|Ga0209590_10138942All Organisms → cellular organisms → Bacteria1495Open in IMG/M
3300027882|Ga0209590_10219995All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1205Open in IMG/M
3300027903|Ga0209488_10799309All Organisms → cellular organisms → Bacteria → Terrabacteria group669Open in IMG/M
3300027910|Ga0209583_10226796All Organisms → cellular organisms → Bacteria812Open in IMG/M
3300028536|Ga0137415_10002133All Organisms → cellular organisms → Bacteria19666Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil37.60%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil20.80%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil12.80%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil9.60%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost8.80%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil3.20%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds2.40%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.40%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.80%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.80%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.80%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000887Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A3-65cm-16A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300001086Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3EnvironmentalOpen in IMG/M
3300001334Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A21-65cm)- 6 month illuminaEnvironmentalOpen in IMG/M
3300001359Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A30-35cm)- 6 month illuminaEnvironmentalOpen in IMG/M
3300001361Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A30-PF)- 6 month illuminaEnvironmentalOpen in IMG/M
3300001538Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A10-PF 4A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006047Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010905Grasslands soil microbial communities from Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012392Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012396Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012398Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012401Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300013294Permafrost microbial communities from Nunavut, Canada - A3_65cm_0MEnvironmentalOpen in IMG/M
3300013764Permafrost microbial communities from Nunavut, Canada - A28_35cm_6MEnvironmentalOpen in IMG/M
3300013765Permafrost microbial communities from Nunavut, Canada - A30_80cm_6MEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014823Permafrost microbial communities from Nunavut, Canada - A3_80cm_0MEnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
AL16A1W_1000194813300000887PermafrostFKFATPPDWTYALLILTCLGGLGFIAFTIVVAAVSQRASGFLPLTRSSSRTVTLAFWIPLALLIAWPLCWVFAFVLPSTPSDPAIGVLVWLGIFFLVAGIIGRLVVTRFISPRAKVMEPAPGQFDRIVELRNVHPLFVAAVLQRQQASASQYAYRAQSPYLPGST*
AL16A1W_1083119813300000887PermafrostFKFATPPDWTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVALALWIPLALLIAWVTSWLIAAIFAFSSNDPTASAIAGASFWLGVFFLAAGMIGRLVVTPLISPRAKVRELALGQIDRIVELRNVHPLFVAAVLQRQQASVSQYAHPAQSPYLPGST*
JGI12709J13192_100238833300001086Forest SoilVSRVSIAASTSSSACALGKVQVWASQLWANDFPPVCAMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSSRTVRLAFWIPLVLLITWPVSWAIAVTFGLSSSDPTPAVIAGVSFWLGVLFLVAGLIGRLIINPLISPRAKVREPAPGQTDRIVELRNVHPLFVAAVLQRQQASALRYIAPAQSPYLPGST*
A2165W6_102614613300001334PermafrostGRPAETWKKFKFATPPDWTYALLILTCLGGLGFIAFTIVVAAVSQRASGFLPLTRSSSRTVTLAFWIPLALLIAWPLCWVFAFVLPSTPSDPAIGVLVWLGIFFLVAGIIGRLVVTRFISPRAKVMEPAPGQFDRIVELRNVHPLFVAAVLQRQQASASQYAYRAQSPYLPGST*
A2165W6_106781113300001334PermafrostGRPAETWKKFKFATPPDWTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVALALWIPLALLIAWVTSWLIAAIFAFSSNDPTASAIAGASFWLGVFFLAAGMIGRLVVTPLISPRAKVRELALGQIDRIVELRNVHPLFVAAVLQRQQASVSQYAHPAQSPYLPGST*
A3035W6_100285733300001359PermafrostMTGRPAETWKKFKFATPPDWTYALLILTCLGGLGFIAFTIVVAAVSQRASGFLPLTRSSSRTVTLAFWIPLALLIAWPLCWVFAFVLPSTPSDPAIGVLVWLGIFFLVAGIIGRLVVTRFISPRAKVMEPAPGQFDRIVELRNVHPLFVAAVLQRQQASASQYAYRAQSPYLPGST*
A30PFW6_100825213300001361PermafrostTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVALALWIPLALLIAWVTSWLIAAIFAFSSNDPTASAIAGASFWLGVFFLAAGMIGRLVVTPLISPRAKVRELALGQIDRIVELRNVHPLFVAAVLQRQQASVSQYAHPAQSPYLPGST*
A10PFW1_1197195013300001538PermafrostVAVVSQRASGFLPLTRSSSRTVALALWIPLALLIAWVTSWLIAAIFAFSSNDPTASAIAGASFWLGVFFLAAGMIGRLVVTPLISPRAKVRELALGQIDRIVELRNVHPLFVAAVLQRQQASVSQYAHPAQSPYLPGST*
JGI12635J15846_1000337273300001593Forest SoilMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSSRTVRLAFWIPLVLLITWPVSWAIAVTFGLSSSDPTPAVIAGVSFWLGVLFLVAGLIGRLIINPLISPRAKVREPAPGQTDRIVELRNVHPLFVAAVLQRQQASALRYIAPAQSPYLPGST
JGI25385J37094_1000558223300002558Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILALAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTXRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
JGI25383J37093_1011018913300002560Grasslands SoilKVQVWASQLWSNDFPPVCAMTGRPXETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
JGI25384J37096_1002502323300002561Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
JGI25382J37095_1001765323300002562Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAV*
JGI25390J43892_1003265713300002911Grasslands SoilGGIGILAXAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
JGI25386J43895_1001136333300002912Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPMTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTXRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0066672_1001966153300005167SoilVETWKKFNFATPPGWAYALLILICLGGLGFIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIVALASNDATAATITAVFLWLGILVLGVGLLGRLVFTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK*
Ga0066677_1000122253300005171SoilMTGRPVETWKKFNFATPPGWAYALLILICLGGLGFIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIVALASNDATAATITAVFLWLGILVLGVGLLGRLVFTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK
Ga0066680_1001298853300005174SoilMTGRPVETWKKFNFATPPGWAYALLILVCLGGLGLIAFAIVLAVVSQRASGYLPLTRSSSRTATLALWIPVGLLLAGPVAFAIALIFALASNDATASTITAVFLWLGILVLGVGLLGRLVVTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK
Ga0066679_1000053073300005176SoilMTGRPAETWKKFSFATPPDWAYALLLLVCLGGIGVAAYAVVIAIVSQRASGYLPLTRSSARTATLAFWIPVGILIAWPVCWFIALVVSVAGNNDPTANTFAAVLLFLGLACLLAGLVGRLVITRLIRPRAKVMQAAPGQTDRIVELRNVHPAFVAAVQQHQQARAVQYAPAPQAPFLPGPK*
Ga0066690_1050666323300005177SoilWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
Ga0066688_1000404733300005178SoilMTGRPGETWKKFKFATPPDWVYALLVLVCLGGIGILALAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0066688_1037239123300005178SoilMTGRRVETWKKFNFATPPGWAYALLILICLGGLGFIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIVALASNDATAATITAVFLWLGILVLGVGLLGRLVFTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK
Ga0066678_1019473723300005181SoilMTGRPVETWKKFNFATPPGWAYALLILICLGGLGFIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIFALASNDATASTITAVFLWLGILVLGVGLLGRLVVTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK
Ga0066676_1000645423300005186SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPQTRSSSRTAALAFGIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0070708_10006926133300005445Corn, Switchgrass And Miscanthus RhizosphereMTGHPAETWRKFKFATPPDWAYSLLILVCLGGLGFIAFAIVMTLVAERASGYLPLTRSSSRTVTLVTWIPIGLLIAWPISWAIALISSSSNDSTWPTIAAVFFWLGFLFFGAGLIGRLLITRLVCPRGKVFPVAPGQTDRIVELRNVHPLFVAAVLQRQQAPSPHLAPAPQSPYLPGST*
Ga0066689_1036885213300005447SoilFIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIVALASNDATAATITAVFLWLGILVLGVGLLGRLVFTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK*
Ga0066701_1034158023300005552SoilALGRVRVWSSQLWANDFPPVCAMTGRPVETWKKFNFATPPGWAYALLILICLGGLGFIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIFALASNDATASTITAVFLWLGILVLGVGLLGRLVVTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK*
Ga0066661_1080880213300005554SoilPVCAMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPG
Ga0066692_1003391713300005555SoilQLWANDFPPVCAMTGRRVETWKKFNFATPPGWAYALLILICLGGLGFIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIFALASNDATASTITAVFLWLGILVLGVGLLGRLVVTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK*
Ga0066691_1005848523300005586SoilLGKVQVWASQLWSNDFPPVCAMTGRPGETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
Ga0075023_10016510223300006041WatershedsMTGRPAETWKKFKFATPPDWTYALLVLTCLGGLGFIAFTIVVAAVSERASGFLPLTRSSSRTVALVFWIPLALLIAWPVCWAVALVTSGDPNASATTVGLVWLGIFFLVAGMIGRLVVTRFISPRAKVMELAPGHFDRIVELRNVHPLFVAAVLQRQQAQASQHAYPAQSPYLPGST*
Ga0075024_10005615833300006047WatershedsIAFTIVVAVVSERASGFLPLTRSSSRTVALVFWIPLALLIAWPVCWAVALVTSGDPNASATTVGLVWLGIFFLVAGMIGRLVVTRFISPRAKVMELAPGHFDRIVELRNVHPLFVAAVLQRQQAQASQHAYPAQSPYLPGST*
Ga0066659_1002482143300006797SoilMTGRPVETWKKFNFATPPGWAYALLILVCLGGLGLIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIVALASNDATAATITAVFLWLGILVLGVGLLGRLVFTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK
Ga0066659_1018528123300006797SoilAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
Ga0066660_1040293613300006800SoilMTGRPVETWKKFNFATPPGWAYALLILICLGGLGFIAFAIVLAVVSQRASGYLPLTRSSSRTATLALWIPVGLLLAGPVAFAIALIVALASNDATAATITAVFLWLGILVLGVGLLGRLVFTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPTPQAPFLLGPK
Ga0099791_1028630813300007255Vadose Zone SoilPDWAYALLILVCLGGLGILAFAIVVAIVSQRASGYLPLTKSSSRIATLAFWIPIGLLIAWPLSWAFAAIFGSSSNGSTVAVFLWLGILFLAVGLLGRLLVMPLVSPRGKVFQIAPGQTDRIVELRNVHPALVAAVQQHQHARAAQYAYPPQAPFLPRSK*
Ga0099793_1009495123300007258Vadose Zone SoilWANDFPPVCAMTGRPAETWKKFSFATPPDWAYALLFLVCLGGIGIAAFVVVVALVSQRASGYLPLTRSSARTVTLAFWIPLGLLIAWPVAWASAAIFGFGFNDSTSSTIAGVFFWLGIVFLITGLMGRLLIMRLTAPRAKVFPTPPGQTDRIVELRNVHPAFVAAVQQHQHARAAQYSPAPQAPLLPS*
Ga0099829_1024181123300009038Vadose Zone SoilLGKVQVWASQLWANDFPPVCAITGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSSRTVTLAFWTPLVLLIAWVVSWVIAAIFGFSSNDPTASVIAGTSFWVGVLFLAAGMIGRLVVTPLISPRAKVKELAPGQTDRIVELRNVHPLFVAAVLQRQQASASQYAYPAHSQYLPGST*
Ga0099829_1086125813300009038Vadose Zone SoilLGKVQVWASQLWANDFPPVCAMTGRSAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLTLLIAWPVFWALAATLAFSSNDPTASAIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVRELAPGQIDRIVELRNVHPLFVAGVLRRQQA
Ga0099829_1148906713300009038Vadose Zone SoilRKFKFATPPDWTYALLILTCLGGVGFIAFTIVVAAVSERASGFLPLTRSSGRTVTLVFWIPLALLIAWPICWGLALITAADPNASSTAGGSFWLGNFFLLAGMIGRLVVTRFVSPRAKVMELAPGQTDRIVELRNVHPLFVAAVLQRQQAYASHYGAIAQSPYLPRST*
Ga0099830_1001748753300009088Vadose Zone SoilLGKVQVWASQLWANDFPPVCAMTGRSAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLTLLIAWPVFWALAATLAFSSNDPTASAIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVRELAPGQIDRIVELRNVHPLFVAAVLQRQQASASQYVHTAQSQYLPGST*
Ga0099830_1017691123300009088Vadose Zone SoilMTGRPAETWKKFKFATPPDWTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVTLAFWIPLALLIAWPVFWGLAATFAFSSNDPTASAIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVMELAPGQTDRIVELRNVHPLFVAAVLQRQQAYTSQYATIAQSPYVPGSN
Ga0099830_1090155723300009088Vadose Zone SoilLGKVQVWASQLWANDFPPVCAITGRPAETWKKFKFATPPDWTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVTLAFWTPLVLLIAWVVSWVIAAIFGFSSNDPTASVIAGTSFWVGVLFLAAGMIGRLVVTPLISPRAKVKELAPGQTDRIVELRNVHPLFVAAVLQRQQA
Ga0099828_1020552713300009089Vadose Zone SoilVCAMTGRPAETWKKFSFATPPDWAYALLFLVCLGGIGIAAFAVVVALVSQRAGGYLPLTRSSARTVTLAFWIPLGLLIAWPVVWAIAAIFGFGFNDSTSSKIAGVSFWLGIVFLITGLIGRLLIMRLTSPQAKVFPTPPGQTDRVVELRNVHPAFVAAVQQHQHARAAQYAPAPQAPLLPS*
Ga0099828_1027557923300009089Vadose Zone SoilMTGRPAETWKKFKFATPPDWTYALLILTCLGGVGFIAFTIVVAAVSERASGFLPLTRSSGRTVTLVFWIPLALLIAWPICWGLALITAADPTASSITSGSFWLGNFFLVAGMIGRLVVTRFVSPRAKVMELAPGQTDRIVELRNVHPLFVAAVLQRQQAYASHYGAIAQSPYLPRST*
Ga0099827_1020696523300009090Vadose Zone SoilMTGRPAETWKKFKFATPPDWTYALLILTCLGGVGFIAFTIVVAAVSERASGFLPLTRSSGRTVTLVFWIPLALLIAWPICWGLALITAADPKASSITGGSFWLGNFFLLAGMIGRLVVTRFVSPRAKVMELAPRQTDRIVELRNVHPLFVAAVLQRQQAYASQYAALAPSPYVPRST*
Ga0099827_1049303723300009090Vadose Zone SoilMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLALLIAWPVFWALAAAFAFSSNDPTASAIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVMELAPGHFDRIVELRNVHPLFVAAVLQRQQAQASQYAYPAQSPYLPGST
Ga0134088_1007183423300010304Grasslands SoilMTGRTAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0134080_1017148423300010333Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLLRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0134071_1000051133300010336Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVFPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0138112_108440313300010905Grasslands SoilGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFGIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLIVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
Ga0137392_1002984923300011269Vadose Zone SoilLGKVQVWASQLWANDFPPVCAITGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSSRTVTLAFWIPLALLIAWPVFWGLAATFAFSSNDPTASAIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVMELAPGQTDRIVELRNVHPLFVAAVLQRQQAYTSQYATIAQSPYVPGSN*
Ga0137392_1015199533300011269Vadose Zone SoilWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVALVSQRASGFLPLTRSSSRTVTLAFWTPLVLLIAWVVSWVIAAIFGFSSNDPTASVIAGTSFWVGVLFLAAGMIGRLVVTPLISPRAKVKELAPGQTDRIVELRNVHPLFVAAVLQRQQASASQYAYPAHSQYLPGST*
Ga0137392_1132341913300011269Vadose Zone SoilKFNFATPPDWAYALLFLICLGGIGFVAFAVTIAVVSQRASGYLPLTRSSARTVRLAFWIPLGFLIAWPACWLIALIVGVAGNNDSTANGVAGTFVFLGLLCMLGGLVGRLVITRLVTPQAKVAAPAPGQTDRLVELRNVHPAFVVAVQQHQQARAAQYVPAPQLPLPPSHG*
Ga0137391_1025036823300011270Vadose Zone SoilWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSSRTVTLAFWTPLVLLIAWVVSWVIAAIFGFSSNDPTASVIAGTSFWVGVLFLAAGMIGRLVVTPLISPRAKVKELAPGQTDRIVELRNVHPLFVAAVLQRQQASASQYAYPAHSQYLPGST*
Ga0137389_1010658733300012096Vadose Zone SoilLGKVQVWASQLWANDFPPVCAMTGRPAETWKKFKFATAPDWTYALLILVCLGGLGFIAFAIVVALVSQRAGGYLPLTRSSSRTVTLAFWIPLVLLIAWPVSWLIAAIFGFSSNDPTASTIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVMEVAPGQTDRIVELRNVHPLFVAEVLQRQQAHAMQSASQSPYLPGST*
Ga0137388_1009348733300012189Vadose Zone SoilMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFAIVVALVSQRASGFLPLTRSSSRTVTLAFWIPLVVLIAWPVSWLTAAIFGFSSNDPTASTIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVMELAPGHFDRIVELRNV
Ga0137388_1036756223300012189Vadose Zone SoilMTGRPAETWKKFSFATPPDWAYALLFLVCLGGIGIAAFAVVVALVSQRASGYLPLTRSSARTVTLAFWIPLGLLIAWPVAWAIAAIFGFGFNDSTSSTIAGVFFWLGIVFLITGLIGRLLIMRLTSPQAKVFPTPPGQTDRVVELRNVHPAFVAAVQHHQHARAAQYAPAPQAPLLPS*
Ga0137399_1043751423300012203Vadose Zone SoilMTGRPAETWEKFSFATPPDWAYALLFLVCLGGIGIAAFVVVVALVSQRASGYLPLTRSSARTVTLAFWIPLGLLIAWPVAWASAAIFGFGFNDSTSSTIAGVFFWLGIVFLITGLMGRLLIMRLTAPRAKVFPTPPGQTDRIVELRNVHPAFVAAVQQHQHARAAQYSPAPQAPLLPS*
Ga0137362_1008526223300012205Vadose Zone SoilMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLALLIAWPVFWVVAATFAFSSNDPTASVIAGVSFWLGVLFLVAGLIGRLVVMPLISPRAKVMEPPPGQTDRIVELRNVHPLFVAAVLQRQQASVSQYAYPAQSPCLPGST
Ga0137379_1174478913300012209Vadose Zone SoilVCLGGIGILAFAIVTAVVSQRASGYLPLLRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
Ga0137387_1058947313300012349Vadose Zone SoilMTGRTAETWKKFKFATPPDWVYALLVLVCLGGIGILALAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0137386_1103777613300012351Vadose Zone SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFGIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILP
Ga0137390_1020290933300012363Vadose Zone SoilLGKVQVWASQLWANDFPPVCAMTGRPAETWKKFKFATAPDWTYALLILVCLGGLGFIAFAIVVALVSQRASGYLPLTRSSSRTVTRAFWIPLVLLIAWPVSWLIAAIFGFSSNDPTASTIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVMEVAPGQTDRIVELRNVHPLFVAEVLQRQQAHAMQSASQSPYLPGST*
Ga0137390_1071613313300012363Vadose Zone SoilLGKVQVWASQLWANDFPPVCAITGRPAETWKKFKFATPPDWTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVTLAFWTPLVLLIAWVVSWVIAAIFGFSSNDPTASVIAGTSFWVGVLFLAAGMIGRLVVTPLISPRAKVKELEPGQTDRIVELRNVHPLFVAAVLQRQQASASQYAYPAHSQYLPGST*
Ga0137390_1176859613300012363Vadose Zone SoilMTGRPAETWKKFKFATPPDWTYALLILTCLGGVGFIAFTIVVAAVSERASGFLPLTRSSSQTVALAFWIPLALLIAWPICWGLALITAGNPSASATTDGSFWLGVFFLLAGMIGRLVVTRFVSPRAKVMELAPGHFDRIVELRNVHPLFVAAVLQRQQAQASQYAYPAQ
Ga0134043_120805423300012392Grasslands SoilETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
Ga0134057_118961023300012396Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQ
Ga0134051_136572823300012398Grasslands SoilVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFGIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
Ga0134055_111203913300012401Grasslands SoilMTGRTAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVKPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0137397_1000712673300012685Vadose Zone SoilMTGRPAETWKKFKFATPPDWAYALLILVCLGGLGFIVFAVVMALVAQRASGYLPLTKSSSRTVSLAVWIPIGLLIAWPIAWVIALIFSSSNDSTASTIAAVFFWLGMLFFLAGLVGRLLITRLVTPQAKVFPVAPGQTDRIVELRNVHPLFVAAVSQRHQAQAQQFVPAPQSPYLPGST*
Ga0137396_1021998623300012918Vadose Zone SoilMTGRPAETWKKFKFATAPDWTYALLILVCLGGLGFIAFAIVVALVSQRASGYLPLTRSSSRTVTLAFWIPLALLIAWPVSWLIAAIFGFSSNDPTASTIAGVSFWLGVLFLVAGLIGRLIVTPLISPRAKVMELAPGQTDRIVELRNVHPLFVAAVLQRTASHYIAPAQSPYLPGST*
Ga0137396_1128247213300012918Vadose Zone SoilVALVSQRASGYLPLTRSSARTVTLAFWIPLGLLIAWPVAWASAAIFGFGFNDSTSSTIAGVFFWLGIVFLITGLMGRLLIMRLTAPRAKVFPTPPGQTDRIVELRNVHPAFVAAVQQHQHARAAQYSPAPQAPLLPS*
Ga0137419_1031203423300012925Vadose Zone SoilMTGRPAETWKKFKFATPPSWAYALLILVCLGGLGVIAFAIVMAIVAERASGYLPLTKSSSRTVSLAFWIPIGLLIAWPVSWGIALVFSAASDPTSSTIAAVFFWLGFLFFGAGLVGRLVITRLVAPRAKVFPIAPGQTDRIVELRNVHPLFVAAVSERQHAAPPQFAPAAQPPYLPGPT*
Ga0137416_1000917433300012927Vadose Zone SoilLGKVQVWASQLWANDFPPVCAMTGRPAETWKKFSFATPPDWAYALLFLVCLGGIGIAAFVVVVALVSQRASGYLPLTRSSARTVTLAFWIPLGLLIAWPVAWASAAIFGFGFNDSTSSTIAGVFFWLGIVFLITGLMGRLLIMRLTAPRAKVFPTPPGQTDRIVELRNVHPAFVAAVQQHQHARAAQYSPAPQAPLLPS*
Ga0137416_1080356023300012927Vadose Zone SoilMTGRPAETWKKFKFATPPSWAYALLILVCLGGLGIIAFAIVIAIVADRASGYLPLTKSSSRTVSLAFWIPIGLLIAWPFSWAIAVVFSSASDSTSSTIAAVFLWLGFLFFGAGLVGRLVITRLVAPRAKVFPIAPGQTDRIVELRNVHPVFVAAVSERQHAAPPQFAP
Ga0137416_1095513513300012927Vadose Zone SoilQLWANDFPPVCAMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLALLIAWPVFWVVAATFAFFSNDPTASVIAGVSFWLGVLFLVAGLIGRLVVMPLISPRAKVMEPPPGQTDRIVELRNVHPLFVAAVLQRQQASVSQYAYPTQSPYLPGST*
Ga0134087_1065181313300012977Grasslands SoilMTGRPGETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQ
Ga0120150_100108013300013294PermafrostAMTGRPAETWKKFKFATPPDWTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVALALWIPLALLIAWVTSWLIAAIFAFSSNDPTASAIAGASFWLGVFFLAAGMIGRLVVTPLISPRAKVRELALGQIDRIVELRNVHPLFVAAVLQRQQASVSQYAHPAQSPYLPGST*
Ga0120111_111806513300013764PermafrostMTGRPAETWKKFKFATPPDWTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVALALWIPLALLIAWVTSWLIAAIFAFSSNDPTASAIAGASFWLGVFFLAAGMIGRLVVTPLISPRAKVRELALGQIDRIVELRNVHPLFVAAVLQRQQASVSQYAHPAQSPYLPGST
Ga0120172_103305213300013765PermafrostTGRPAETWKKFKFATPPDWTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVALALWIPLALLIAWVTSWLIAAIFAFSSNDPTASAIAGASFWLGVFFLAAGMIGRLVVTPLISPRAKVRELALGQIDRIVELRNVHPLFVAAVLQRQQASVSQYAHPAQSPYLPGST*
Ga0134075_1003887033300014154Grasslands SoilDWVYVLLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK*
Ga0120170_104638113300014823PermafrostDWTYALLILTCLGGLGFIAFTIVVAAVSQRASGFLPLTRSSSRTVTLAFWIPLALLIAWPLCWVFAFVLPSTPSDPAIGVLVWLGIFFLVAGIIGRLVVTRFISPRAKVMEPAPGQFDRIVELRNVHPLFVAAVLQRQQASASQYAHPAQSPYLPGST*
Ga0137411_115228533300015052Vadose Zone SoilMTGRPAETWKKFKFATPPDWAYALLILVCLGGLGFIVFAVVMALVAQRASGYLPLTKSSSRTVSLAVWIPIGLLIAWPIAWVIALIFSSSNDSTASTIAAVFFWLGMLFFLAGLVGRLLITRLVTPQAKVFPVAPGQTDRIVELRNVHPLFVAAVSQRHHAQAQQFVPAPQSPYLPGST*
Ga0134085_1006416623300015359Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPQTRSSSRTAALAFGIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPALVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0134083_1001762033300017659Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPQTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0066655_1000142553300018431Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVWLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0066662_1001536143300018468Grasslands SoilMTGRPGETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0210399_1009704533300020581SoilMTGRPAERWKKFKFATPPDWTYALLVLTCLGGLGFIAFTIVVAAVSERASGFLPLTRTSSRIVTLAFWIPLALLIAWPICWLVALITAGDPNSSGTTGAFVWLGIFFLLAGMIGRLVVTRFVSPRAKVMELAPGHFDRIVELRNVHP
Ga0215015_1095715313300021046SoilLGKVQVWGSQLWANDFPPICAMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRAAGFLPLTRSSSRTVALALWIPLGLLIAWLISWLIAAIFGFSSNDPTASAIAGVSFWLGVFFLAAGMIGRLLVTPLISPRAKVMEVAPGQMDRIVELRNVHPLFVAAVLQRQQAAASQYAYPAQSPYLPGST
Ga0210384_1076320023300021432SoilMTGRPAERWKKFKFATPPDWTYALLVLTCLGGLGFIAFTIVVAAVSERASGFLPLTRTSSRIVTLAFWIPLALLIAWPICWLVALITAGDPNASGTTGAFVWLGIFFLLAGMIGRLVVTRFVSPRAKVMELAPGHFDRIVELRNVHPLFVAAVLQRQQAQASQRAYPAQSPYLPGST
Ga0210409_1022702223300021559SoilMTGRPAETWKKFKFATPPDWTYALLVLTCLGGLGFIAFTIVVAAVSERASGFLPLTRSSSRTVALAFWIPLALLIAWPICWGLALITAGDPNASGTTAALVWLGIFFLGAGMIGRLVVNRFVSPRAKVMELAPGHVDRIVELRNVHPLFVAAVLQRQQAAASQYAYPAQSPFLPGST
Ga0210409_1025252523300021559SoilMTGRPAESWKKFKFATPPDWTYALLVLTCLGGLGFIAFTIVVAAVSERASGFLPLTRTSSRIVTLAFWIPLALLIAWPICWLVALITAGDPNASGTTGAFVWLGIFFLLAGMIGRLVVTRFVSPRAKVMELAPGHFDRIVELRNVHPLFVAAVLQRQQAQASQHAYPAQSPYLPGST
Ga0209350_106440023300026277Grasslands SoilMTGRPGETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVFPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0209237_101044533300026297Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPMTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAV
Ga0209236_1001016103300026298Grasslands SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPMTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0209238_104850413300026301Grasslands SoilALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAV
Ga0209239_100466393300026310Grasslands SoilMTGRPAATWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0209471_1000219353300026318SoilMTGRPAETWKKFSFATPPDWAYALLLLVCLGGIGVAAYAVVIAIVSQRASGYLPLTRSSARTATLAFWIPVGILIAWPVCWFIALVVSVAGNNDPTANTFAAVLLFLGLACLLAGLVGRLVITRLIRPRAKVMQAAPGQTDRIVELRNVHPAFVAAVQQHQQARAVQYAPAPQAPFLPGP
Ga0209687_100525813300026322SoilLGRVRVWSSQLWANDFPPVCAMTGRPVETWKKFNFATPPGWAYALLILICLGGLGFIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIVALASNDATAATITAVFLWLGILVLGVGLLGRLVFTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK
Ga0209470_107992613300026324SoilETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPQTRSSSRTAALAFGIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0209375_100794153300026329SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFGIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0209267_118911923300026331SoilDFPPVCAMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILALAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0209158_100649763300026333SoilMTGRPGETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILP
Ga0209377_100799713300026334SoilALGKVQVWASQLWSNDFPPVCAMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILAFAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0209690_100308753300026524SoilMTGRPAETWKKFKFATPPDWVYALLVLVCLGGIGILALAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0209059_100439823300026527SoilMTGRPVETWKKFNFATPPGWAYALLILICLGGLGFIAFAIVLAVVSQRASGYLPLTSSSSRTATLALWIPVGLLLAGPVAFAIALIVALASNDATAATITAVFLWLGILVLGVGLLGRLVVTPLIRPRGKVMEAAPGQTDRIVELRNVHPAFVAAVQQHQQARAAQYAPAPQAPFLLGPK
Ga0209160_103519143300026532SoilAIVTAVVSQRASGYLPLTRSSSRTAALAFWIPVGLLIACPVAWAIAAIFGFSSGDSSAATITGVFFWLGLLFLAVGLLGRLLVTPLVSPRAKVWPIAPGQTDRIVELRNVHPAFVVAVQRQQQARAAQYTPAPPSPILPGSK
Ga0209648_10000605123300026551Grasslands SoilMTGRPAETWKKFNFATPPDWAYALLFLICLGGIGFVAFAVTIAVVSQRASGYLPLTRSSARTVRLALWIPVGFLLAWPACWLIALIVGVAGNNDSTANGVAGTFLFLGLLFMLGGLVGRLVVTRLLTPRAKVSAPAPGQTDRLVELRNVHPAFVAAVQQHQQARAAQYVLAPQAPFLPGA
Ga0209648_1007700323300026551Grasslands SoilMTGRPAETWKKFKFATPPDWTYALLILTCLGGVGFIAFTIVVAAVSERASGFLPLTRSSSRTVALAFWIPLALLMAWPICWVLALITAGNPSASATTGGLFWLGVFFLLAGMIGRLVVTRFVSPRAKVMELAPGHFDRIVELRNVHPLFVAAVLQRQQAQASQYAYPAQSPYLPGST
Ga0209648_1010365513300026551Grasslands SoilGRPAETWKKFKFAPPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLALLIAWPVFWVVAATFAFSSNDPTASVIAGVSFWLGVLCLVAGLIGRLVVMPLISPRAKVMEPPPGQTDRIVELRNVHPLFVAAVLQRQQASVSPYAYPAQSPYLPGST
Ga0209220_1000015833300027587Forest SoilVSRVSIAASTSSSACALGKVQVWASQLWANDFPPVCAMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSSRTVRLAFWIPLVLLITWPVSWAIAVTFGLSSSDPTPAVIAGVSFWLGVLFLVAGLIGRLIINPLISPRAKVREPAPGQTDRIVELRNVHPLFVAAVLQRQQASALRYIAPAQSPYLPGST
Ga0209388_100336133300027655Vadose Zone SoilMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLALLIAWPVFWVVAATFAFSSNDPTASVIAGVSFWLGVLFLVAGLIGRLVVMPLISPRAKVMEPPPGQTDRIVELRNVHPLFVAAVLQRQQASVSQYAYPAQSPYLPGST
Ga0209588_102515413300027671Vadose Zone SoilAMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLALLIAWPVFWVVAATFAFSSNDPTASVIAGVSFWLGVLFLVAGLIGRLVVMPLISPRAKVMEPPPGQTDRIVELRNVHPLFVAAVLQRQQASVSQYAYPAQSPYLPGS
Ga0209180_1000197373300027846Vadose Zone SoilMTGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFAIVVALVSQRASGFLPLTRSSSRTVTLAFWIPLVVLIAWPVSWLTAAIFGFSSNDPTASTIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVMELAPGHFDRIVELRNVHPLFVAAVLQRQQAQASQYAYPAQSPYLPGST
Ga0209180_1057599423300027846Vadose Zone SoilMTGRSAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLTLLIAWPVFWALAATLAFSSNDPTASAIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVRELAPGQIDRIVELRNVH
Ga0209180_1066907713300027846Vadose Zone SoilRKFKFATPPDWTYALLILTCLGGVGFIAFTIVVAAVSERASGFLPLTRSSGRTVTLVFWIPLALLIAWPICWGLALITAADPNASSTAGGSFWLGNFFLLAGMIGRLVVTRFVSPRAKVMELAPGQTDRIVELRNVHPLFVAAVLQRQQAYASHYGAIAQSPYLPGST
Ga0209701_1002557043300027862Vadose Zone SoilVSRVSIADSTSCSVCGLGKVQVWASQLWANDFPPVCAMTGRSAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSGRTVTLAFWIPLTLLIAWPVFWALAATLAFSSNDPTASAIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVRELAPGQIDRIVELRNVHPLFVAAVLQRQQASASQYVHTAQSQYLPGST
Ga0209701_1008018723300027862Vadose Zone SoilVSHVSTVASTSCSACDLGKVQVWASQLWANDFPPVCAMTGRPAETWKKFKFATPPDWTYALLILVCLGGVGFIAFTIVVAVVSQRASGFLPLTRSSSRTVTLAFWIPLALLIAWPVFWGLAATFAFSSNDPTASAIAGVSFWLGVLFLVAGLIGRLVVTPLISPRAKVMELAPGQTDRIVELRNVHPLFVAAVLQRQQAYTSQYATIAQSPYVPGSN
Ga0209283_1037088823300027875Vadose Zone SoilLGKVQVWASQLWANDFPPVCAITGRPAETWKKFKFATPPDWTYALLILVCLGGLGFIAFTIVVAVVSQRASGFLPLTRSSSRTVTLAFWTPLVLLIAWVVSWVIAAIFGFSSNDPTASVIAGTSFWVGVLFLAAGMIGRLVVTPLISPRAKVKELAPGQTDRIVELRNVHPLFVAAVLQRQQASASQYAYPAHSQYLPGST
Ga0209590_1013894223300027882Vadose Zone SoilVASTSCSACDLGKVQVWASQLWANDFPPVCAMTGRPAETWKKFKFATPPDWTYALLILTCLGGVGFIAFTIVVAAVSERASGFLPLTRSSGRTVTLVFWIPLALLIAWPICWGLALITAADPNASSTAGGSFWLGNFFLLAGMIGRLVVTRFVSPRAKVMELAPGQTDRIVELRNVHPLFVAAVLQRQQAYASHYGAIAQSPYLPGST
Ga0209590_1021999523300027882Vadose Zone SoilVVAAVSERASGFLPLTRSSSRTVALVFWIPLALLIAWPICWGLALITAADPKASSITGGSFWLGNFFLLAGMIGRLVVTRFVSPRAKVMELAPGQTDRIVELRNVHPLFVAAVLQRQQAYASQYAALAPSPYVPRST
Ga0209488_1079930913300027903Vadose Zone SoilTWMKFKFATPPDWAYALLILICVGGLGFIAFAIVMTLVAQRASGYLPLTRSASRTVSLAVWIPIGLLIAAPVTWAIALFAGSSNANTIGIVLLWLGILFLLAGLVGRLLVTRLVCPRAKVMEAAPGQTDRIVELRNVHPLFVTAVLERQHGRAARALL
Ga0209583_1022679613300027910WatershedsMTGRPAETWKKFKFATPPDWTYALLVLTCLGGLGFIAFTIVVAAVSERASGFLPLTRSSSRTVALVFWIPLALLIAWPVCWAVALVTSGDPNASATTVGLVWLGIFFLVAGMIGRLVVTRFISPRAKVMELAPGHFDRIVELRNVHPLFVAAVLQRQQAQASQHAYPAQSPYLPGST
Ga0137415_1000213383300028536Vadose Zone SoilMTGRPAETWKKFSFATPPDWAYALLFLVCLGGIGIAAFVVVVALVSQRASGYLPLTRSSARTVTLAFWIPLGLLIAWPVAWASAAIFGFGFNDSTSSTIAGVFFWLGIVFLITGLMGRLLIMRLTAPRAKVFPTPPGQTDRIVELRNVHPAFVAAVQQHQHARAAQYSPAPQAPLLPS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.