NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F091703

Metagenome Family F091703

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F091703
Family Type Metagenome
Number of Sequences 107
Average Sequence Length 210 residues
Representative Sequence LVVLATVVATLGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRREMM
Number of Associated Samples 85
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 28.71 %
% of genes near scaffold ends (potentially truncated) 92.52 %
% of genes from short scaffolds (< 2000 bps) 85.05 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (94.393 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(33.645 % of family members)
Environment Ontology (ENVO) Unclassified
(47.664 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.336 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 30.69%    β-sheet: 27.72%    Coil/Unstructured: 41.58%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 107 Family Scaffolds
PF01040UbiA 25.23
PF00034Cytochrom_C 3.74
PF13442Cytochrome_CBB3 1.87
PF04307YdjM 0.93
PF09459EB_dh 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 107 Family Scaffolds
COG1988Membrane-bound metal-dependent hydrolase YbcI, DUF457 familyGeneral function prediction only [R] 0.93


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms94.39 %
UnclassifiedrootN/A5.61 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000571|JGI1358J11329_10230176All Organisms → cellular organisms → Bacteria → Nitrospirae512Open in IMG/M
3300002558|JGI25385J37094_10136312All Organisms → cellular organisms → Bacteria → Nitrospirae673Open in IMG/M
3300005167|Ga0066672_10488741All Organisms → cellular organisms → Bacteria → Nitrospirae800Open in IMG/M
3300005172|Ga0066683_10345508All Organisms → cellular organisms → Bacteria → Nitrospirae924Open in IMG/M
3300005178|Ga0066688_10450122All Organisms → cellular organisms → Bacteria → Nitrospirae833Open in IMG/M
3300005178|Ga0066688_10985587All Organisms → cellular organisms → Bacteria → Nitrospirae514Open in IMG/M
3300005180|Ga0066685_10437936All Organisms → cellular organisms → Bacteria → Nitrospirae907Open in IMG/M
3300005186|Ga0066676_10104539All Organisms → cellular organisms → Bacteria → Nitrospirae1724Open in IMG/M
3300005186|Ga0066676_10272536All Organisms → cellular organisms → Bacteria → Nitrospirae1111Open in IMG/M
3300005187|Ga0066675_10067362All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira2266Open in IMG/M
3300005445|Ga0070708_101204198All Organisms → cellular organisms → Bacteria → Nitrospirae708Open in IMG/M
3300005446|Ga0066686_10039635All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii2818Open in IMG/M
3300005450|Ga0066682_10234810All Organisms → cellular organisms → Bacteria → Nitrospirae1179Open in IMG/M
3300005536|Ga0070697_100888416All Organisms → cellular organisms → Bacteria → Nitrospirae790Open in IMG/M
3300005540|Ga0066697_10159542All Organisms → cellular organisms → Bacteria → Nitrospirae1334Open in IMG/M
3300005540|Ga0066697_10508086All Organisms → cellular organisms → Bacteria → Nitrospirae683Open in IMG/M
3300005549|Ga0070704_101036953All Organisms → cellular organisms → Bacteria → Nitrospirae743Open in IMG/M
3300005554|Ga0066661_10090890All Organisms → cellular organisms → Bacteria → Nitrospirae1812Open in IMG/M
3300005555|Ga0066692_10845405All Organisms → cellular organisms → Bacteria → Nitrospirae562Open in IMG/M
3300005557|Ga0066704_10512073All Organisms → cellular organisms → Bacteria → Nitrospirae789Open in IMG/M
3300005557|Ga0066704_10782089All Organisms → cellular organisms → Bacteria → Nitrospirae595Open in IMG/M
3300005557|Ga0066704_10951520All Organisms → cellular organisms → Bacteria → Nitrospirae530Open in IMG/M
3300005558|Ga0066698_11051583All Organisms → cellular organisms → Bacteria → Nitrospirae516Open in IMG/M
3300005559|Ga0066700_10666983All Organisms → cellular organisms → Bacteria → Nitrospirae718Open in IMG/M
3300005561|Ga0066699_10139454All Organisms → cellular organisms → Bacteria → Nitrospirae1644Open in IMG/M
3300005568|Ga0066703_10079367All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira1896Open in IMG/M
3300005568|Ga0066703_10123476All Organisms → cellular organisms → Bacteria → Nitrospirae1541Open in IMG/M
3300005576|Ga0066708_10460593All Organisms → cellular organisms → Bacteria → Nitrospirae817Open in IMG/M
3300005586|Ga0066691_10312723All Organisms → cellular organisms → Bacteria → Nitrospirae928Open in IMG/M
3300006796|Ga0066665_10547415All Organisms → cellular organisms → Bacteria → Nitrospirae941Open in IMG/M
3300006797|Ga0066659_11531107All Organisms → cellular organisms → Bacteria → Nitrospirae559Open in IMG/M
3300006800|Ga0066660_10596198All Organisms → cellular organisms → Bacteria → Nitrospirae924Open in IMG/M
3300007265|Ga0099794_10289972All Organisms → cellular organisms → Bacteria → Nitrospirae847Open in IMG/M
3300009012|Ga0066710_102094996All Organisms → cellular organisms → Bacteria → Nitrospirae832Open in IMG/M
3300009012|Ga0066710_103488358All Organisms → cellular organisms → Bacteria → Nitrospirae595Open in IMG/M
3300009038|Ga0099829_10829258All Organisms → cellular organisms → Bacteria → Nitrospirae768Open in IMG/M
3300009038|Ga0099829_11271852All Organisms → cellular organisms → Bacteria → Nitrospirae609Open in IMG/M
3300009088|Ga0099830_10721146All Organisms → cellular organisms → Bacteria → Nitrospirae821Open in IMG/M
3300009088|Ga0099830_11430971All Organisms → cellular organisms → Bacteria → Nitrospirae575Open in IMG/M
3300009089|Ga0099828_12028764All Organisms → cellular organisms → Bacteria → Nitrospirae502Open in IMG/M
3300010304|Ga0134088_10030776All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae2420Open in IMG/M
3300010323|Ga0134086_10505996All Organisms → cellular organisms → Bacteria → Nitrospirae501Open in IMG/M
3300010335|Ga0134063_10449996All Organisms → cellular organisms → Bacteria → Nitrospirae638Open in IMG/M
3300010336|Ga0134071_10043473All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae2014Open in IMG/M
3300010336|Ga0134071_10183877All Organisms → cellular organisms → Bacteria → Nitrospirae1027Open in IMG/M
3300011269|Ga0137392_10184448All Organisms → cellular organisms → Bacteria → Nitrospirae1700Open in IMG/M
3300012096|Ga0137389_10602752All Organisms → cellular organisms → Bacteria → Nitrospirae944Open in IMG/M
3300012096|Ga0137389_11056104All Organisms → cellular organisms → Bacteria → Nitrospirae696Open in IMG/M
3300012096|Ga0137389_11439176All Organisms → cellular organisms → Bacteria → Nitrospirae585Open in IMG/M
3300012189|Ga0137388_10139447All Organisms → cellular organisms → Bacteria → Nitrospirae2133Open in IMG/M
3300012189|Ga0137388_11634660All Organisms → cellular organisms → Bacteria → Nitrospirae579Open in IMG/M
3300012206|Ga0137380_10481953All Organisms → cellular organisms → Bacteria → Nitrospirae1094Open in IMG/M
3300012349|Ga0137387_10388275All Organisms → cellular organisms → Bacteria → Nitrospirae1012Open in IMG/M
3300012349|Ga0137387_10504469All Organisms → cellular organisms → Bacteria → Nitrospirae878Open in IMG/M
3300012351|Ga0137386_10610992All Organisms → cellular organisms → Bacteria → Nitrospirae785Open in IMG/M
3300012353|Ga0137367_10280956All Organisms → cellular organisms → Bacteria → Nitrospirae1193Open in IMG/M
3300012354|Ga0137366_10491814All Organisms → cellular organisms → Bacteria → Nitrospirae887Open in IMG/M
3300012355|Ga0137369_10117229All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae2159Open in IMG/M
3300012356|Ga0137371_10037506All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira moscoviensis3742Open in IMG/M
3300012356|Ga0137371_10892161All Organisms → cellular organisms → Bacteria → Nitrospirae676Open in IMG/M
3300012363|Ga0137390_11031290All Organisms → cellular organisms → Bacteria → Nitrospirae774Open in IMG/M
3300012532|Ga0137373_10656266All Organisms → cellular organisms → Bacteria → Nitrospirae786Open in IMG/M
3300012927|Ga0137416_10386181All Organisms → cellular organisms → Bacteria → Nitrospirae1183Open in IMG/M
3300012927|Ga0137416_11511760All Organisms → cellular organisms → Bacteria → Nitrospirae610Open in IMG/M
3300012948|Ga0126375_11897026All Organisms → cellular organisms → Bacteria → Nitrospirae523Open in IMG/M
3300012976|Ga0134076_10140991All Organisms → cellular organisms → Bacteria → Nitrospirae982Open in IMG/M
3300014150|Ga0134081_10354647All Organisms → cellular organisms → Bacteria → Nitrospirae539Open in IMG/M
3300017659|Ga0134083_10182726All Organisms → cellular organisms → Bacteria → Nitrospirae860Open in IMG/M
3300017659|Ga0134083_10253381All Organisms → cellular organisms → Bacteria → Nitrospirae737Open in IMG/M
3300017997|Ga0184610_1218487All Organisms → cellular organisms → Bacteria → Nitrospirae637Open in IMG/M
3300018027|Ga0184605_10234927All Organisms → cellular organisms → Bacteria → Nitrospirae833Open in IMG/M
3300018052|Ga0184638_1060580All Organisms → cellular organisms → Bacteria → Nitrospirae1382Open in IMG/M
3300018052|Ga0184638_1188170All Organisms → cellular organisms → Bacteria → Nitrospirae730Open in IMG/M
3300018056|Ga0184623_10002898All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira moscoviensis7360Open in IMG/M
3300018061|Ga0184619_10332715All Organisms → cellular organisms → Bacteria → Nitrospirae694Open in IMG/M
3300018075|Ga0184632_10002401All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira7788Open in IMG/M
3300018468|Ga0066662_11829481All Organisms → cellular organisms → Bacteria → Nitrospirae635Open in IMG/M
3300021073|Ga0210378_10196322All Organisms → cellular organisms → Bacteria → Nitrospirae772Open in IMG/M
3300021344|Ga0193719_10221331All Organisms → cellular organisms → Bacteria → Nitrospirae805Open in IMG/M
3300022534|Ga0224452_1279575All Organisms → cellular organisms → Bacteria → Nitrospirae510Open in IMG/M
3300025160|Ga0209109_10479999All Organisms → cellular organisms → Bacteria → Nitrospirae568Open in IMG/M
3300025899|Ga0207642_10698740All Organisms → cellular organisms → Bacteria → Nitrospirae639Open in IMG/M
3300026297|Ga0209237_1202457All Organisms → cellular organisms → Bacteria → Nitrospirae635Open in IMG/M
3300026309|Ga0209055_1065621All Organisms → cellular organisms → Bacteria → Nitrospirae1525Open in IMG/M
3300026333|Ga0209158_1217762All Organisms → cellular organisms → Bacteria → Nitrospirae660Open in IMG/M
3300026537|Ga0209157_1048377All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira2275Open in IMG/M
3300026537|Ga0209157_1066893All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira1821Open in IMG/M
3300026540|Ga0209376_1214939All Organisms → cellular organisms → Bacteria → Nitrospirae859Open in IMG/M
3300026547|Ga0209156_10273133All Organisms → cellular organisms → Bacteria → Nitrospirae773Open in IMG/M
3300026552|Ga0209577_10389316All Organisms → cellular organisms → Bacteria → Nitrospirae1009Open in IMG/M
3300027846|Ga0209180_10074726All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira1906Open in IMG/M
3300028536|Ga0137415_11041164All Organisms → cellular organisms → Bacteria → Nitrospirae630Open in IMG/M
3300028799|Ga0307284_10393557All Organisms → cellular organisms → Bacteria → Nitrospirae564Open in IMG/M
3300028814|Ga0307302_10351582All Organisms → cellular organisms → Bacteria → Nitrospirae727Open in IMG/M
3300028828|Ga0307312_11081187All Organisms → cellular organisms → Bacteria → Nitrospirae531Open in IMG/M
3300031720|Ga0307469_11504802All Organisms → cellular organisms → Bacteria → Nitrospirae645Open in IMG/M
3300031820|Ga0307473_10173888All Organisms → cellular organisms → Bacteria → Nitrospirae1252Open in IMG/M
3300032156|Ga0315295_11739052All Organisms → cellular organisms → Bacteria → Nitrospirae594Open in IMG/M
3300032180|Ga0307471_100485540All Organisms → cellular organisms → Bacteria → Nitrospirae1381Open in IMG/M
3300033803|Ga0314862_0017954All Organisms → cellular organisms → Bacteria → Nitrospirae1352Open in IMG/M
3300034165|Ga0364942_0189403All Organisms → cellular organisms → Bacteria → Nitrospirae671Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil33.64%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil27.10%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.41%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment6.54%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.61%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.67%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.80%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.80%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.93%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment0.93%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.93%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.93%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.93%
PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland0.93%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.93%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.93%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000571Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 mEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028799Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032156Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G14_0EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033803Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_0_10EnvironmentalOpen in IMG/M
3300034165Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1358J11329_1023017613300000571GroundwaterFPRLEFKSDESKTPSIPLTVRLDIPDAFKQTQLFYKDSCGVSQGIPLGERLTEQVKADAAGVFEKTFVGGPKDPADAVLSATIETSEINLYIPRREIGEYKMTALVRLRITMTDSEGKSLFNEAIKGEGKWNVTTDGTECTVRALMLPVTEAMEKLSDRTVEELAQSAKI
JGI25385J37094_1013631213300002558Grasslands SoilLMRLVIVATVVATLGCGQKLVFPRLEFKSDESKXPSIPLTVRLEXPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTXLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRREMMATGRLGGSGGTGAPPAPT
Ga0066672_1048874123300005167SoilMLLLCSAQRLFMRLVVLATIVATLGCGQKLVFPRLEFKSDEAKTPSIPLTVRLEIPEALKHAQLFYKDSCGAPQAIPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSTAMETGEINLYIPRREVGEYKMTVLVRLRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTIQGLLLPVTEALEKLSDRMVENLTQSVKIRDWAIRL
Ga0066683_1034550813300005172SoilMATSFLALYQSISRLRRSDFVARFMVCFCVLLSVALSACGQKLVFPRLEFPSEESKTPSIPLAVRIEIPDHLKQAQLFYRDSCNVPQAIPLGERLADQVKADVAQVFEKIVDADSKEPVDAVLTAALETGELDLHIPRREIGEYPLKVLVRLRLTVTDTEGKVLYNEAIKGEGKWRVQTDGTACTVQGLMLPVTEAIEKLSDREVESLTQAVRIRDAAIRLSTRKELLAAGKAPAGAGPGVKGGAEAPG
Ga0066688_1045012213300005178SoilMMGCCIFLSLALSGCGQKLVFPRLEFKSEEPKTPSIPLTVRIEIPDGLKQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVETGSKEPVDAVLTAALETGELNLHIPRREIGEYPLKVLVRLRLTVTDTEGKVLYNEAIKGEGKWRV
Ga0066688_1098558713300005178SoilIYPPRPSDFVARFTVGFCILLSFTVSGCGQKLVFPRLEFKSEEPKTPSIPLAVRIEIPDVLRQAQLFYLDSCNVPQAIPLGERLADQVKADAAQVFEKIVGAESKEPVDAVLTAALDTGELDLHIPRREIGEYPLKVLVRLRITVTDTEGKVLYNEAIKGEGKWRVQTDGT
Ga0066685_1043793613300005180SoilMSNLRPLMRFFVVATVVATLGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKSDATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVMVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVT
Ga0066676_1010453913300005186SoilMSTLRPLMRFFVVATVVATLGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGAPQAIPLGERLAEQVKSDATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVMVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRL
Ga0066676_1027253613300005186SoilLSCSYRLVRPTLALLLLLLGATSGCGQKVVFPRLEFKSSEPQTPSIPLSVRLDLPDALKQAQMYYRDSCNVPQAIPLGERLAEQVKADAAQVFEKTFEGTAAKEPADAVLSAVLETSEINLNIPRREIGEYPLKVLIRLRISVSDTEGKSLFNDVIKAEGKWTARTDGTECRVQGIMLPVTEAIEKLSDRVVESLTQAVRIRDAAIRLRTRQELAAGAAAQPRSSAESPTLSFRASLEDENHNQI
Ga0066675_1006736213300005187SoilMRLFIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALKRAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKVRDWAIRLGTRKEMMAAGRLGGSGGTGAPPTPTAGAPTLSFRASLEDENRNQVLEPAEKLVVRVEVANAGPGVARGVAVELSGTPA
Ga0070708_10120419813300005445Corn, Switchgrass And Miscanthus RhizosphereLMATASFGCGQKLVFPRLEFKSEESKTPSIPLTVRVEIPDALKQAQLFYKDSCGMPQGIPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAVMETSEINLYIPRREIGEYKMSVLVRLRVTVVDTEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDRTVDSLTQSIKIRDYAIRIRTRQEMMALSKQGGGAVAPPAGTGTADPPTLSFRASLEDENRNQ
Ga0066686_1003963513300005446SoilMLLPNSALRPLMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRREMMATGRLGGSGGTGAPPAPTTGAPTLSFRASLEDENRNQVLEPA
Ga0066682_1023481023300005450SoilMLLPCSALRPLMPLFIVATVVATLGCGQKLVFPRLEFKSEESKVPSIPLKVRLEVPEALKQAQLFYKDSCGASQAVPLGERLVEQVKADAAGVFEKTFEGSPKDPADAVLSAALETSEINLHIPRREVGEYKMTTLIRLRVTVVDAEGKTLFNEPVKGEGKWTVTTDGTN
Ga0070697_10088841613300005536Corn, Switchgrass And Miscanthus RhizosphereLLSCSALRPLMSLFIVAAVVATLGCGQKLVFPRLEFKSEESKAPSIPLKVRLEVPGTLKQAQLFYKDSCGAAQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAALETSEINLHIPRREIGEYKMTTLIRVRVTVVDSEGKTLFNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRMVDGLNQSVKIRDWAIRLTTRREMMAAGRPGGGGGTGAPPAATAEAPTLSFRVSLEDENRNQVLEPVEKLVVRV
Ga0066697_1015954223300005540SoilMLLPCSALRPLMRLFIVATVVATLGCGQKLVFPRLEFKSEESKAPSIPLKVRLEVPEKLKQAQLFYKDSCGASQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLRAALETSEVNLHIPRREVGEYKMTTLIRLRVTVVDAEGKTLFNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRMVDGLNQSVKIRDWAIRLG
Ga0066697_1050808613300005540SoilSACGQKLVFPRLEFPSEESKTPSIPLAVRIEIPDHLKQAQLFYRDSCNVPQAIPLGERLADQVKADVAQVFEKIVEADSKEPVDAVLTAALDTGELDLHIPRREIGEYPLKVLVRLRITVTDTEGKVLYNEAIKGEGKWRVQTDGTACTVQGLMLPVTEAIEKLSDREVESLTQAVRIRDAAIRLSTRKELLAVGKAPAGAGPGVKGGAEAPGLSFRASLEDENRNQ
Ga0070704_10103695313300005549Corn, Switchgrass And Miscanthus RhizosphereMIISLLALRRRISGICWSEFAASSTMVFFVALILAGCGQKLVFPRLEFKSEEPKQPSIPLAVRIEIPDAFKQAQLFYRDSCNTPQAIPLGERLAEQLKADAAQVFETIVETGSTEPTDAVLTPVLESGELDLHIPRREIGEYPLKVLIRVRLTVTDTQGKALFNETIKGAGKWTVQTDGTACTVQGVMLPVTESIEKLSDREVESLTQSVGI
Ga0066701_1021387323300005552SoilLSCSYRLVRPTLALLLLLLGATSGCGQKVVFPRLEFKSSEPQTPSIPLSVRLDLPDALKQAQMYYRDSCNVPQAIPLGERLAEQVKADAAQVFEKTFEGTAAKEPADAVLSAVLETSEINLNIPRREIGEYPLKVLIRLRISVSDTEGKSLFNDVIKAEGKWTARTDGTECRVQGIMLPVTEAIEKLSDRVVESLTQAVRIRDAAIRLRTRQELAAGAAAQPRSSAESPTLSFRASLEDENHNQILEGGEKVLIRLEV
Ga0066661_1009089013300005554SoilMLLLCSAQRPFMRLVVLATIVATLGCGQKLVFPRLEFKSDEAKTPSIPLTVRLEIPEALKHTQLFYKDSCGAPQAIPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSTAMETGEINLYIPRREVGEYKMTVLVRLRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTIQGLLLPVTEALEKLSDRMVENLTQSVKIRDWAIRLGTRKEMMAAGRPGGSGGTGAPPTATVEAPTLSFRASLEDENRNQALEAGEKV
Ga0066692_1084540513300005555SoilLGCGQKLVFPRLVFTPEESKPPSIPMTVRLEIPEALKHAQLIYKDSCGTPQAIALGERLAEQVKADATGVFEKTFEGSAKDPADAVLSAALETGDMSLAIPRREIGEYPLKVLVRLRVTVVDAEGKTLYNEPVKGEGKWTVTTDGTNCTVQGVMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLG
Ga0066704_1051207313300005557SoilMATSFLALYQSISRLRRSDFGIRFMVCFCVLLSIALTGCGQKLVFPRLEFQSDEPKTPSIPLAVRIEIPDHLKQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVEADSKEPVDAVLTAALDTGELDLHIPRREVGEYPLKVLVRLRLTVTDTEGKVLYNEAVKGEGKWRVQTDGTACTVQGLMLPVTEAIEKLSDREVESLTQAVRIRDAAIRLRARQEMLAAGKAPAGAGPGVKG
Ga0066704_1078208913300005557SoilRPAHMLLLCSAQRPFMRLVVLATVVAALGCGQKLVFPRLEFKSDEAKTPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGSPKDPADAVFSAALETGEMNLVIPRREIGEYPLKVLVRLRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLENS
Ga0066704_1095152013300005557SoilIVATVVATLGCGQKLVFPRLEFKSDESKTPSIPLTVRLEIPEALTRAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAAVETSEIKLYIPRREVGEYKMTTLIRVRVTVIDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRL
Ga0066698_1105158313300005558SoilPEKLKQAQLFYKDSCGASQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLRAALETSEVNLHIPRREVGEYKMTTLIRLRVTVVDAEGKTLFNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRMVDGLNQSVKIRDWAIRLGTRREMMVAGRPGGSGGT
Ga0066700_1066698313300005559SoilSRLRRSDFVARFMVGFCVLLSVALSACGQKLVFPRLEFQSEEPKTPSIPLAVRIEIPDRLKQAQLFYRDSCNVPQAIPLGERLADQVKADVAQVFEKIVEADSKEPVDAVLTAALDTGELDLHIPRREIGEYPLKVLVRLRITVTDTEGKVLYNEAIKGEGKWRVQTDGTSCTVQGLMLPVTEAIEKLSDREVESLTQAVRIRDAAIRLRARQEMLAAGKAPAGAGPGVKGAMEAPGLS
Ga0066699_1013945423300005561SoilMATSFLALYRSISRLRRSDFVARFMVCFCVLLSVAISACGQKLVFPRLEFQSEEPRTPSIPLAVRIQIPDVLRQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVGAESKEPVDAVLTAALDTGELDLHIPRREIGEYPLKVLVRLRITVTDTEGKVLYNEAIKGEGKWRVQTDGTACTVQGLMLPVTEAIEKLSDREVESLTQAV
Ga0066703_1007936733300005568SoilMLLPNSAVRPLMRLFIVATVVATLGCGQKLVFPRLEFKSDESKTPSIPLTVRLEIPEALTRAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAAVETSEIKLYIPRREVGEYKMTTLIRVRVTVIDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMAAGRLGGSGGTGAPPTPT
Ga0066703_1012347613300005568SoilVRVFVAIGSHFLYNAAHMWLPMSTLRPLMRFFVVATVVVTLGCGQKLVFPRLEFKSDEAKSPSIPLSVRLEIPEALKHAQLFYRDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMAAGRPGGSGGTGAPPTAT
Ga0066708_1046059323300005576SoilLSCSYRLVRPTLALLLLLLGATSGCGQKVVFPRLEFKSSEPQTPSIPLSVRLDLPDALKQAQMYYRDSCNVPQAIPLGERLAEQVKADAAQVFEKTFEGTAAKEPADAVLSAVLETSEINLNIPRREIGEYPLKVLIRLRISVSDTEGKSLFNDVIKAEGKWTARTDGTECRVQGIMLPVTEAIEKLSDRVVESLTQ
Ga0066691_1031272313300005586SoilMLLPCSALRPLMPLFIVAAVVATLGCGQKLVFPRLEFKSEESKAPSIPLKVRLEVPEALKQAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLRAAVETSEINLHIPRREVGEYKMTTLIRVRVTVVDSEGKTLFNEPVKGEGKWTVTTDGTNCTVQGLMLPV
Ga0066665_1054741513300006796SoilMLLLCSVQRPFMRLVVLGTVVAALGCGQKLVFPRLVFTPEESKTPSIPMTVRLEIPEALKHAQLFYKDSCGTPQAIPLGERLAEQVKADATGVFEKTFEGSAKDPADAVLSAALETGDMNLAISRREIGEYPLKVLVRLRVTVVDAEGKTLYNEPVKGEGKWTVTTDGTNCTVQGVMLPVTEALEKLSD
Ga0066659_1153110713300006797SoilMLLPNSALRPLMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCT
Ga0066660_1059619813300006800SoilMATSFLALYRSISRLRRSDFVARFMVCFCVLLSVAISACGQKLVFPRLEFQSEEPRTPSIPLAVRIQIPDVLRQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVGAESKEPVDAVLTAALDTGELDLHIPRREIGEYPLKVLVRLRITVTDTEGKVLYNEAIKGEGKWRVQTDGTSCTVQGLMLPVTEAIEKLSDREVESLTQAVRIRDAAIRLRARQEMLAAGKAPAGAGPGVKGAMEAPGLSFRASLEDEN
Ga0099794_1028997213300007265Vadose Zone SoilMRLVVLATVVATLGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGAPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMAAGRPG
Ga0066710_10209499613300009012Grasslands SoilVTARVFVALGSHFLYNAAHMWLPMSTLRPLMRFFVVATVVATLGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVMVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTE
Ga0066710_10223783513300009012Grasslands SoilMATSFLALYQSISRLRRSDFVARFMVGFCVLLSVALSACGQKLVFPRLEFQSEEPKTPSIPLAVRIEIPDRLKQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVEADSKEPVDAVLTAALDTGELDLIVPRREIGENPLKLLVRLGLTGSDIEVNVMYNDAVKGECKWSTQHD
Ga0066710_10348835813300009012Grasslands SoilGVKETAVRVFVAIGSHFLYNAAHMWLPMSTLRPLMRFFVVATVVVTLGCGQKLVFPRLEFKSDEAKSPSIPLSVRLEIPEALKHAQLFYRDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQG
Ga0099829_1082925823300009038Vadose Zone SoilMLLLCSAQRPFMRLVVLATVVAALGCGQKLVFPRLEFKSDEAKTPSIPLTVRLEIPEALKHAQVFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGSPKDPADAVLSAAIETGEMNLAIPRREIGEYPLKVLVRLRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGL
Ga0099829_1127185213300009038Vadose Zone SoilLIVRSIAFMTRLHPALRPLLHVLVLLIATASFGCGQKLVFPRLEFKSEESKTPSIPLTVRVEIPDALKQAQLFYKDSCGVPQGIPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAVMETSEINLYIPRREIGEYPMSTLVRLRVTVIDAEGKTLFNEAIKGEGKWKVTTDGTECTVRGLMLPVTEAMEKLSDRVVDS
Ga0099830_1072114613300009088Vadose Zone SoilMRFFVVATVVATLGCGQKLVFPRLEFKSDEAKPPSIPLTVRLEIPEALKHAQLFYKDSCGAPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTIQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRREMMAAGRPVGSGGTGVPLTATAEAPTLSFLASLEDANRNQVLEAGEKVVVRVEV
Ga0099830_1133994513300009088Vadose Zone SoilMLLLCSAQRPFMRLVVLATVVAALGCGQKLVFPRLEFKSDEAKTPSIPLTVRLEIPEALKHAQVFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGSPKDPADAVFSAALETGETNLAIPRREIGEYPLKVLVRLRVTVVDAEGKTLLNEPVKGEGKWTV
Ga0099830_1143097113300009088Vadose Zone SoilRRHPALRPLLHVLVLLMATASFGCGQKLVFPRLEFKSEESKTPSIPLTVRVEIPDALKQAQLFYKDSCGMPQGIPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAVMETSEINLYIPRREIGEYKMSVLVRLRVTVVDTEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDRTVD
Ga0099828_1202876413300009089Vadose Zone SoilKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTALVRVRVTVVDPEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLG
Ga0134088_1003077633300010304Grasslands SoilMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGTWTVTTDGTNCTVQ
Ga0134086_1050599613300010323Grasslands SoilMRFFVVAMVVATLGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKSDATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVMVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTV
Ga0134063_1044999613300010335Grasslands SoilRLEFKSDESKTPSIPLTVRLEIPEALTRAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVGSLTQSVKIRDWAIRLGMRKEMMAAGRSGGSGGTGAPQTATAEAPTLSFRASL
Ga0134071_1004347313300010336Grasslands SoilMPLFIVAAVVATLGCGQKLVFPRLEFKSEESKAPSIPLKVRLEVPEALKQAQLFYKDSCGASQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAALETSEINLHIPRREVGEYKMTALIRMRVTVVDAEGKTLFNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRL
Ga0134071_1018387723300010336Grasslands SoilMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVGSLTQSVKIRDWAIRLGTRREMMAAGRLGGSGG
Ga0137392_1018444813300011269Vadose Zone SoilMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMAAGRLGGSGGTGAPPAPTAGAPTLSFRASLEDENRNQVLEPAEKLVVRVEVANAGPGVARGVAVEL
Ga0137389_1060275223300012096Vadose Zone SoilMLLLCSAQRPFMRLVVLATVVAALGCGQKLVFPRLEFKSDEAKTPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGSPKDPADAVFSAALETGEMNLVIPRREIGEYPLKVLVRLRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSV
Ga0137389_1105610413300012096Vadose Zone SoilISRKITQARRPPLIVRSIAFMTRLHPALRPLLHVLVLLIATASFGCGQKLVFPRLEFKSEESKTPSIPLTVRVEIPDALKQAQLFYKDSCGMPQGIPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAVMETSEINLYIPRREIGEYKMSVLVRLRVTVVDTEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDRTVDSLTQAIKIRDYAIRIRTRQ
Ga0137389_1143917613300012096Vadose Zone SoilNAAHMWLPMSTLRPLRRFFVVATVVATLGCGQKRVFLRLELKSDEAKTPSIPLTVRLEIPEALKHAQLFYKDSCGAPQAIPLGERLAEQVKADATGVFEKTIEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDR
Ga0137388_1013944723300012189Vadose Zone SoilMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEAL*
Ga0137388_1163466013300012189Vadose Zone SoilTLGCGQKLVFPRLEFKSDEAKTPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGSPKDPADAVFSAALETGETNLAIPRREIGEYPLKVLVRLRVTVVDAEGKTLLNAPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMM
Ga0137365_1002493363300012201Vadose Zone SoilMATSFLALYQSISRLRRSDFVARFMVGFCVLLSVALSACGQKLVFPRLEFQSEEPKTPSIPLAVRIEIPDRLKQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVEADSKEPVDAVLTAALDTGELDLHIPRREIGEYPLKVLVRLRLTVTDTEGKVLYNEAIK
Ga0137380_1048195323300012206Vadose Zone SoilMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRREMMAA
Ga0137387_1038827523300012349Vadose Zone SoilMRLFIVATVVATLGCGQKLVFPRLEFKSEESKAPSIPLKVRLEVPEALKQAQLFYKDSCGASQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLRAAVETSEINLHIPRREVGEYKMTTLIRLRVTVVDAEGKTLFNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRMVDGLNQSVKIRDWAIRLGTRREMMVAGRPGGSGGTGAPPAPTAEAPTLSFRVSLEDENRNQVLEPVEKLVV
Ga0137387_1050446923300012349Vadose Zone SoilMATSFLALYQSISRLRRSDFVARFMVGFCVLLSFALSACGQKLVFPRLEFQSEEPKTPSIPLAVRIEIPDRLKQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVEADSKEPVDAVLTAALDTGELDLHIPRREIGEYPLKVLVRLRLTVTDTEGKVLYNEAIKGEGKWRVQTDGTACTVQGLMLPVTEA
Ga0137386_1061099213300012351Vadose Zone SoilTSFLALYQSISRLRRSDFVARFMVGFCVLLSFALSACGQKLVFPRLEFQSEEPKTPSIPLAVRIEIPDRLKQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVEADSKESVDAVLTAALDTGELDLHIPRRESGEYPLKVLVRLRLTVTDTEGKVLYNEAIKGEGKWRVQTDGTACTVQGLMLPVTEAIEKLSDREVESLTQAVRIRDAAIRLRARQEMLAAGKAPAGAGPGVKGGAEAPGLSFRASLEDENRNQ
Ga0137367_1028095623300012353Vadose Zone SoilMRFFVLATVVTALGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLNVLVRLRVPVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMAAGRPGGSGGTGAPPTATAEAPTLSFLASLEDENRNQVL
Ga0137366_1049181413300012354Vadose Zone SoilLLVATASLGCGQKLVFPRLEFKSEESKTPSIPLTVRVEIPDALKQAQLFYKDSCGVPQGIPLGERLVEQVKADGAGVFEKTFEGSAKEPADAVLSAVMETSEINLSIPRREIGEYKTTVLVRLRVTVIDAEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDRTVDGLTQAIKIRDYAIRIRTRQEMMALNKQGGSAVAPPAGTGTTDPPTLSFRAS
Ga0137369_1011722913300012355Vadose Zone SoilLLVATASLGCGQKLVFPRLEFKSEESKTPSIPLTVRVEIPDALKQAQLFYKDSCGVPQGVPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAMMETSEINLSIPRREIGEYKTTVLVRLRVTVIDAEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDRTVDGLTQAIKIRDYAIRIRTRQEMIALSKQGGSAVAPPAGTGTADPPTLSFRASLEDENRNQVLDVGER
Ga0137371_1003750613300012356Vadose Zone SoilMRFFVVATVVATLGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVMVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMTAGRPGGSGGTGAPPTATAEAPTLSFLASLEDENRNQVLEA
Ga0137371_1089216113300012356Vadose Zone SoilMATSFLALYQSISRLRRSDFVARFMVGFCVLLSVALSACGQKLVFPRLEFQSEEPKTPSIPLAVRIEIPDRLKQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVEADSKEPVDAVLTAALDTGELDLHIPRREIGEYPLKVLVRLRLTVTDTEGKVLYNEAIKGEGKWRVQTDGTACTVQGLMLPVTEAIEKLSDREVESL
Ga0137390_1103129013300012363Vadose Zone SoilLEFKSDESKAPSIPLTVRLEIPEALKHAQLFYKDSCGAPQAVPLGERIAEQVKADAAGVFEKTFEASPKDPADAVLSAAVETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMAAGRLGGSGGTGAPPAPTTVAPTLSFRASLEDENRNQVLEPAEKLVVRVEVANAGPGVARGVAVELSGTPALVKEFV
Ga0137373_1065626613300012532Vadose Zone SoilMRFFVLATVVTALGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMAAGRPGG
Ga0137416_1038618123300012927Vadose Zone SoilMCLVVLATVVATLGCGQKLVFPRLEFKSDEAKTPSIPLTVRLEIPEALKHAQVFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGSPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMAAGRPGGSGGTGALPTATAEAPTLSFLASLEDENR
Ga0137416_1151176013300012927Vadose Zone SoilLVVLATVVATLGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRREMM
Ga0126375_1189702613300012948Tropical Forest SoilKLVFPRLEFKSDESKSPSIPLTVRLEVPEAFKKAQLFYKDSCGAAQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAAVQTSEINLYIPRREVGEYKMTTLIRLQVTVVDAEGKTLFNEPVKGEGKWNVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIR
Ga0134076_1014099123300012976Grasslands SoilMRLFIVATVVATLGCGQKLVFPRLEFKSEESKAPSIPLKVRLEVPEKLKQAQLFYKDSCGASQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLRAALETSEVNLHIPRREVGEYKMTTLIRLRVTVVDAEGKTLFNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALRSCRIGWWTA*
Ga0134081_1035464713300014150Grasslands SoilMVCFCVLLSVALSACGQKLVFPRLEFQSEEPKTPSIPLAVRIEIPDRLKQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVEADSKEPVDAVLTAALDTGELDLHIPRREIGEYPLKVLVRLRITVTDTEGKVLFNEAIKGEGKWTVQTDGTACTVQGIMLPVT
Ga0134083_1018272613300017659Grasslands SoilMLLPNSALRPLMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAAVETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRMVDGLNQSVKIRDWAIRLGTRREMMVAGRPGGSGGTGAPPAPTAEAPTLSFRVSLEDENRNQVLEPVEEA
Ga0134083_1025338113300017659Grasslands SoilVVATVVVTLGCGQKLVFPRLEFKSDEAKSPSIPLSVRLEIPEALKHAQLFYRDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVMVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRREMMATGRLGGSGGTGAPPAPTTGAPTLSFRASLEDENRNQVLEPAEKL
Ga0184610_121848713300017997Groundwater SedimentMSRRHPALRPLLHVLVLLMATASFGCGQKLVFPRLEFKSEEAKTPSIPLTVRVEIPDALKQAQLFYKDSCGVPQGIPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAMMETSEINLHIPRREIGEYKMSMLVRLRVTVVDTEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDR
Ga0184605_1023492723300018027Groundwater SedimentMIIPLLALRRRISALCCSDFAALSAIGLFVVLTLAVSGCGQKLVFPRLEFKSAEPKQPSIPLAVRIELPDAFKQAQLFYRDSCNVPQAIPLGERLAEQVKADAAQVFEKIVEPGSTEPVDAVLTAVLETGEMDLHIPRREIGEYPVKVLVRLRITVTDAEGKTLFNEPIKGEGKWTGRTDGTECTVQ
Ga0184638_106058023300018052Groundwater SedimentMSRRHPALRPLLHVLVLLMATASFGCGQKLVFPRLEFKSEESKTPSIPLTVRVEIPDALKQAQLFYKDSCGVPQGIPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAMMETSEINLSIPRREIGEYKMSMLVRLRVTVVDTEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDRTVDSLTQAIKILDYAIRIRTRQEMMALSKKGGGAVTPPAGTGTADPPTLSFRASLEDENRNQVLDVGEKLVMRVE
Ga0184638_118817013300018052Groundwater SedimentLLGIFLALLLTGLAGCGQKLVFPRLEFKSEDRPAPSIPLAVRLDIPDVLRQAQMFYRDSCNVPQAIPLGERLAEQIKADAAQVFEKTFEGNPTDPADAVLTAVLETSEIDLHIPRREIGEYPLKVLVRLRLRVKDAEGKILFDEAIKGEGKWTARTDGTECTVQGLMLPVTEAVEKLSDREVESLTQSIKIRDTAIRLQTRKEMGAAGRPPAGPPVQGQPGREPPGLSFRASLEDENRNQVL
Ga0184623_1000289883300018056Groundwater SedimentMSRRHPALRPLLHVLVLLMATASFGCGQKLVFPRLEFKSEEAKTPSIPLTVRVEIPDALKQAQLFYKDSCGVPQGIPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAMMETSEINLHIPRREIGEYKMSMLVRLRVTVVDTEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDRTVDSLTQAIKIRDYAIRIRTRQEMMALSKQGGGAVAPPAGTGTADPPTLSFRASLEDE
Ga0184619_1033271513300018061Groundwater SedimentLCCSDFAALPAIGLFVVLTLAISGCGQKLVFPRLEFESAEPKQPSIPLVVQIELPDAFKQAQLFYRDSCNVLQAIPLGERLAEQIKADAAQVFQKIVEPGSTEPVDAVLTAVLEAGEMDLHIPRREIGEYPLKVLVRLRITVTDAEGKVLFNEPIKGEGKWTGRTDGTECTVQGVMLPVTESIEKLSDREVESLTQAVKIRDAAIRLSTRKELLAAGKAPAAAVPGVKGGA
Ga0184632_10002401103300018075Groundwater SedimentMVMSLPAVCRSISRLRPPGLAARPALGLFLALLMALSGCGQKLVFPRLQFTAEDPKTPSIPLTVRVDIPDALRQAQLFYKDSCNVPQGIPVGERLAEQVKADAAQVFEKILEADSKEPPDAVLTAALETSEINLHIPRREIGEYPLKVLLRLRLTVTDTEGKSLFNEVIKAEGKWRVTTDGTNCTVQSLMLPVTEAIEKLSDREVDSLT
Ga0066662_1182948113300018468Grasslands SoilGQKLVFPRLEFKSDEAKSPSIPLSVRLEIPEALKHAQLFYRDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKIRDWAIRLGTRKEMMAAGRPGGSGGTGAPPTATAEAP
Ga0210378_1019632213300021073Groundwater SedimentMVMSLPAVCRSISRLRPPGLAARPALGLFLALLMALSGCGQKLVFPRLEFKSEEPKTPSIPLTVRVDIPDALRQAQLFYKDSCNVPQGIPVGERLAEQVKADAAQVFEKILEADSKEPPDAVLTAALETSEINLHIPRREIGDYPLKVLLRLRLTVTDAEGKSLFNEVIKAEGKWRVTTDGTNCTVQSLMLPVTEAIEKLSDREVDSLTQAVKIRDAAIRLHTRRELVAAGKL
Ga0193719_1022133123300021344SoilMIISLLALRRRISDLCWSDFAASPATIFFVALTLAINGCGQKLVFPRLEFKSEEPKQPSIPLAVRIEIPDVFKTAQLFYRDSCNTPQAIPIGERLAEQLKADAAQVFEKIVEPGSTEPADAVLTPVLEAGELDLHIPRREIGEYPLKVLIRLRLTVTDTQGKSLFNEAIKGAGKWTVQT
Ga0224452_127957513300022534Groundwater SedimentVAISACGQKLVFPRLEFPSEEPRTPSIPLAVRIEIPDHLKQAQLFYRDSCNVPQAIPLGERLADQVKADVAQVFEKIVEADSKEPVDAVLTAALDTGELDLHIPRREIGEYPLRVLVRLRLTVTDTEGKVLYNEAIKGEGKWRVQTDGTACTVQGLMLPVTEAIEKLSDR
Ga0209109_1047999913300025160SoilLLAALAGSGCGQKLVFPRLEFKADESKTPSIPLTVRLDIPDALKQYQLFYRDSCGEPQGIPLGERLAEQVKADAASVFEKTFVGGPKDPADAVLSAAIETGEINLNIPRREPNEYKMTALVRLRITMVDAEGKSLFNDAIKGEGKWNVNTDGTECTVRGLMLPVTEAMEKLSDRTVEELTQGVKIRDWA
Ga0207642_1069874013300025899Miscanthus RhizosphereMVFFVALILAGCGQKLVFPRLEFKSEEPKQPSIPLAVRIEIPDAFKQAQLFYRDSCNTPQAIPLGERLSEQLKADAAQVFEKVVETGSTEPTDAVLTPVLEAGEMDLHIPRREIGEYPLKVLIRVRLTVTDTQGKALFNETIKGAGKWTVQTDGTACTVQGVMLPVT
Ga0209237_120245713300026297Grasslands SoilVAIGSHFLYNAAHMWLPMSTLRPLMRFFVVATVVVTLGCGQKLVFPRLEFKSDEAKSPSIPLSVRLEIPEALKHAQLFYRDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQS
Ga0209055_106562113300026309SoilLSCSYRLVRPTLALLLLLLGATSGCGQKVVFPRLEFKSSEPQTPSIPLSVRLDLPDALKQAQMYYRDSCNVPQAIPLGERLAEQVKADAAQVFEKTFEGTAAKEPADAVLSAVLETSEINLNIPRREIGEYPLKVLIRLRISVSDTEGKSLFNDVIKAEGKWTARTDGTECRVQGIMLPVTEAIEKLSDRVVESLTQAVRIRDAAIRLRTRQEL
Ga0209801_118967713300026326SoilPWHMLLPNSALRPLMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVGSLTQSVKIRDWAIRLGTRREMMAAGRLGGSGGTGAPPAPTAGAPTLSFRASLEDENRNQVLEPAEKLVVRVEVANAGPGVAR
Ga0209158_121776213300026333SoilMLLPNSALRPLMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLAVTEALEKLSDRLVDSLTQSVKIRDWAIRLGT
Ga0209157_104837713300026537SoilMLLPNSALRPLMRLVIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALTHAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKEPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVGSLTQS
Ga0209157_106689333300026537SoilVRVFVAIGSHFLYNAAHMWLPMSTLRPLMRFFVVATVVVTLGCGQKLVFPRLEFKSDEAKSPSIPLSVRLEIPEALKHAQLFYRDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALE
Ga0209376_121493923300026540SoilMLLPISALRPLMRLFIVATVVAALGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALKRAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDR
Ga0209156_1027313313300026547SoilMLLPISALRPLMRLFIVATVVAALGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALKRAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKVRDWAIRLGTRREMMAAGRLGGSGGTGVPPAQPAGT
Ga0209577_1038931623300026552SoilMRLFIVATVVATLGCGQKLVFPRLEFKSDESKAPSIPLTVRLEVPEALKRAQLFYKDSCGAPQAVPLGERLAEQVKADAAGVFEKTFEGSPKDPADAVLSAALETSEIKLYIPRREVGEYKMTTLIRVRVTVVDAEGKTLLNEPVKGEGKWTVTTDGTNCTVQGLMLPVTEALEKLSDRLVDSLTQSVKVRDWAIRLGTRKEMMAAGRL
Ga0209180_1007472633300027846Vadose Zone SoilMTRRHPALRPLLHVLVLLMATASFGCGQKLVFPRLEFKAEESKTPSIPLTVRVEIPDALKQAQLFYKDSCGVPQGIPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAVMETSEINLYIPRREIGEYPMSTLVRLRVTVIDAEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDRTVDSLTQAIKIRDYAIRIRTRQEMMAL
Ga0137415_1104116413300028536Vadose Zone SoilMLLHCSIQRPLMCLVVLATVVATLGCGQKLVFPRLEFKSDEAKSPSIPLTVRLEIPEALKHAQLFYKDSCGVPQAIPLGERLAEQVKADATGVFEKTFEGNPKDPADAVLSAALETGEMNLAIPRREIGEYPLKVLVRLRVTVVDMEGKTLLNEPVKGEGKWTVTTDGTNCMVQGLMLPVTEALEKLSDRLVDSLTQSV
Ga0307284_1039355713300028799SoilPATIFFVALTVAINGCGQKLVFPRLEFKSEEPKQPSIPLAVRIEIPDVFKTAQLFYRDSCNTPQAIPIGERLAEQLKADAAQVFEKIVEPGSTEPADAVLTPVLEAGELDLHIPRREIGEYPLKVLIRLRLTVTDTQGKSLFNEAIKGAGKWTVQTDGTACTVQGVMLPVTESIEKLSDREVESLTQA
Ga0307302_1035158213300028814SoilMATSFLALYQSISLLRRSDFVARFMVCFCVLLSVAISACGQKLVFPRLEFPSEEPRTPSIPLAVRIEIPDHLKQAQLFYRDSCNVPQAIPLGERLADQVKADVAQVFEKILEADSKEPVDAMLTAALETGELDLHIPRREIGEYPLKVLVRLRLTVTDTEGKVLYNEAIKGEGKWRVQTDGTACTVQGLMLPVT
Ga0307312_1028847513300028828SoilMAISLLALRRRISDLCWSDFSASPAIIFFVALTLAINGCGQKLVFPRLEFKSEEPKQPSIPLAVRIEIPDVFKTAQLFYRDSCNTPQAIPIGERLAEQLKADAAQVFEKIVEPGSTEPADAVLTPVLEAGELDLHIPRREIGEYPLKVLIRLRLTVTDTQGKSLFNEAIKGAGKWTVQTDGTACTVQGVMLPVTESIEKLSDREVESLTQAVGIRDAAIRIRTRQELLAAGKAPVGSGPGVKGNDETPGLSFRASLEDENRNQVLDSG
Ga0307312_1108118713300028828SoilGCGQKLVFPRLEFNSAEPKQPSIPLAVRIELPDAFKQAQLFYRDSCNVPQAIPLGEHLAEQVKADAAQVFEKIVEPGSREPVDAVLTAVLEAGEMDLHIPRREIGEYPLKVLVRLRITVTDAQGKTLFNEPIKGEGKWTVRTDGTACTVQGVMLPVTESIEKLSDREVESLTQAVGI
Ga0307469_1150480213300031720Hardwood Forest SoilLSGCGQKLVFPRLEFAPEEPKTPSIPLAVRIEIPDALKQAQLFYRDSCNVPQAIPLGDRLADQVKADVAQVFEKIVDADSKEPVDAVLTAAMETGELDLHIPRREIGEYPLKVLVRLRLTVTDTEGKVLYNEAIKGEGKWRVQTDGTACTVQGIMLPVTEAIEKLSDREVESLTQAVRIRDAAIRLSTRKELLAAGKTPAGAVPGVKGGAEAPG
Ga0307473_1017388823300031820Hardwood Forest SoilMTRRQPAMRPLLYVLVLLMATASFGCGQKLVFPRLEFKSEESKTPSIPLTVRVEIPDALKQAQLFYKDSCGVPQGIPLGERLAEQVKADVAGVFEKTFEGSAKEPADAVLSAVMETSEINLYIPRREIGEYKMSILVRLRVTVVDTEGKTLFNEAIKGEGKWNVTTDGTECTVRGLMLPVTEAMEKLSDRTV
Ga0315295_1173905213300032156SedimentGQKLVFPRLEFKADESKTPSIPLTVRLDIPDALKQYQLFYKDSCGAPQGIPLGARLAEQVMADAASVFEKTFVGGPKDPADAVLSAAIETGEINLNIPRRETAGEYRMDALVRLRITVVDAEGKSLFNEAIKGEGKWTVTTDGTECAVRGLMLPVTEAMEKLSDRTVEGLTQGIKIRDWAIRVQARKEMAAGGRQGGG
Ga0307471_10048554023300032180Hardwood Forest SoilMATSFLALSRSTSRLRRSDFVARFTVGCCILLSLALSGCGQKLVFPRLEFKSEEPKTPSIPLAVRIEIPDALKQAQLFYRDSCNVPQAIPLGERLADQVKADAAQVFEKIVETGSKEPVDAVLTAALDTGELNLHIPRREIGEYPLKVLVRLRITVTDTEGKVLYNEAIKGEGKWTVQTDGT
Ga0314862_0017954_2_5263300033803PeatlandMILLAVLAGVGCGQKLVFPRMEFKAEESRAPSIPLTVRLDIPDSLKQTQVFYKDSCGTPQGIPLGERLAEQVKADAAGVFEKTFVGDPKDPADAVLSAALETGEVTLNIQRREINEYPMRALIRLRITMTDSDGKPLFNEAIKGEGKWNVSTDGIECTVRGLMLPITEAMEKLSD
Ga0364942_0189403_3_6413300034165SedimentMDSGDQVGYKDYLVRIAAPPIPFRRLRHVCLSAPVLRPVLGIFLALLLTGLAGCGQKLVFPRLEFKSEERAAPSIPLTVRLDIPDALRQAQMFYRDSCNVPQAIPLGERLAEQVKADAAQVFEKTFEGKPTDPADAVLTAVLETSEIDLHIPRREIGEYPLKVLVRLRLRVKDAEGKILFDEAIKGEGKWTARTDGTECTVQGLMLPVTEAVE


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.