NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F084682

Overview

Basic Information
Family ID F084682
Family Type Metagenome
Number of Sequences 112
Average Sequence Length 225 residues
Representative Sequence VNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLEMVKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIARAETLVANLL
Number of Associated Samples 87
Number of Associated Scaffolds 112

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 32.14 %
% of genes near scaffold ends (potentially truncated) 53.57 %
% of genes from short scaffolds (< 2000 bps) 71.43 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.71


Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(26.786 % of family members)
Environment Ontology (ENVO) Unclassified
(28.571 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 39.71%, β-sheet: 20.22%, Coil/Unstructured: 40.07%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.71
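
The pLDDT legend above groups per-residue confidence into four bins. As a minimal sketch of how a score could be mapped onto those bins, the helper below (`plddt_bin` is a hypothetical name, not part of NMPFamsDB or AlphaFold2) returns the matching label. How fractional scores that fall between two bins are handled is an assumption made here for illustration.

```python
def plddt_bin(score: float) -> str:
    """Return the legend bin label for a per-residue pLDDT score (0-100)."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    # Upper bounds of the first three legend bins. Fractional scores between
    # bins (e.g. 50.4) are assigned to the higher bin; this is an assumption.
    for upper, label in ((50, "0-50"), (70, "51-70"), (90, "71-90")):
        if score <= upper:
            return label
    return "91-100"


if __name__ == "__main__":
    for value in (12.0, 63.5, 88.0, 97.2):
        print(value, plddt_bin(value))
```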

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.198.1.1: Type III secretory system chaperone | d1ry9a_ | 1ry9 | 0.52459
d.198.1.1: Type III secretory system chaperone | d2fm8a1 | 2fm8 | 0.51068
d.198.1.1: Type III secretory system chaperone | d1jyoa_ | 1jyo | 0.5035
d.198.1.1: Type III secretory system chaperone | d1xkpc1 | 1xkp | 0.50142



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 112 Family Scaffolds
PF07733 | DNA_pol3_alpha | 5.36
PF07883 | Cupin_2 | 3.57
PF08734 | GYD | 3.57
PF12158 | DUF3592 | 2.68
PF02811 | PHP | 2.68
PF01743 | PolyA_pol | 1.79
PF00069 | Pkinase | 1.79
PF00106 | adh_short | 0.89
PF04255 | DUF433 | 0.89
PF00456 | Transketolase_N | 0.89
PF03551 | PadR | 0.89
PF00775 | Dioxygenase_C | 0.89
PF12681 | Glyoxalase_2 | 0.89
PF03929 | PepSY_TM | 0.89
PF13442 | Cytochrome_CBB3 | 0.89
PF02899 | Phage_int_SAM_1 | 0.89
PF08530 | PepX_C | 0.89
PF12867 | DinB_2 | 0.89
PF13185 | GAF_2 | 0.89
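
The frequencies above are percentages of the 112 family scaffolds, so each one corresponds to a whole number of scaffolds. The sketch below shows that arithmetic; `count_from_percent` is a hypothetical helper, and it assumes the page rounds percentages to two decimals.

```python
FAMILY_SCAFFOLDS = 112  # "Number of Associated Scaffolds" from the Overview


def count_from_percent(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole scaffold count."""
    return round(percent * total / 100)


# A few rows copied from the Pfam table above.
pfam_percent = {
    "PF07733": 5.36,
    "PF07883": 3.57,
    "PF12158": 2.68,
    "PF01743": 1.79,
    "PF00106": 0.89,
}

if __name__ == "__main__":
    for pfam_id, percent in pfam_percent.items():
        print(pfam_id, percent, count_from_percent(percent))
```

Under that rounding assumption, 0.89 % is a single scaffold, so most of the neighboring domains listed were seen only once.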

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 112 Family Scaffolds
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 7.14
COG0587 | DNA polymerase III, alpha subunit | Replication, recombination and repair [L] | 5.36
COG2176 | DNA polymerase III, alpha subunit (gram-positive type) | Replication, recombination and repair [L] | 5.36
COG4274 | Uncharacterized conserved protein, contains GYD domain | Function unknown [S] | 3.57
COG0617 | tRNA nucleotidyltransferase/poly(A) polymerase | Translation, ribosomal structure and biogenesis [J] | 1.79
COG0021 | Transketolase | Carbohydrate transport and metabolism [G] | 0.89
COG1695 | DNA-binding transcriptional regulator, PadR family | Transcription [K] | 0.89
COG1733 | DNA-binding transcriptional regulator, HxlR family | Transcription [K] | 0.89
COG1846 | DNA-binding transcriptional regulator, MarR family | Transcription [K] | 0.89
COG2442 | Predicted antitoxin component of a toxin-antitoxin system, DUF433 family | Defense mechanisms [V] | 0.89
COG2936 | Predicted acyl esterase | General function prediction only [R] | 0.89
COG3182 | PepSY-associated TM region | Function unknown [S] | 0.89
COG3295 | Uncharacterized conserved protein | Function unknown [S] | 0.89
COG3485 | Protocatechuate 3,4-dioxygenase beta subunit | Secondary metabolites biosynthesis, transport and catabolism [Q] | 0.89
COG3959 | Transketolase, N-terminal subunit | Carbohydrate transport and metabolism [G] | 0.89
COG4973 | Site-specific recombinase XerC | Replication, recombination and repair [L] | 0.89
COG4974 | Site-specific recombinase XerD | Replication, recombination and repair [L] | 0.89



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001867|JGI12627J18819_10014810All Organisms → cellular organisms → Bacteria → Acidobacteria3123Open in IMG/M
3300005174|Ga0066680_10347824All Organisms → cellular organisms → Bacteria → Acidobacteria945Open in IMG/M
3300005177|Ga0066690_10901127All Organisms → cellular organisms → Bacteria → Acidobacteria565Open in IMG/M
3300005436|Ga0070713_100076265All Organisms → cellular organisms → Bacteria2847Open in IMG/M
3300005436|Ga0070713_101585525All Organisms → cellular organisms → Bacteria → Acidobacteria635Open in IMG/M
3300005447|Ga0066689_10164012All Organisms → cellular organisms → Bacteria → Acidobacteria1329Open in IMG/M
3300005552|Ga0066701_10204931All Organisms → cellular organisms → Bacteria → Acidobacteria1210Open in IMG/M
3300005557|Ga0066704_10199479All Organisms → cellular organisms → Bacteria → Acidobacteria1353Open in IMG/M
3300005568|Ga0066703_10186401All Organisms → cellular organisms → Bacteria → Acidobacteria1256Open in IMG/M
3300005568|Ga0066703_10704717All Organisms → cellular organisms → Bacteria → Acidobacteria581Open in IMG/M
3300005569|Ga0066705_10249878All Organisms → cellular organisms → Bacteria → Acidobacteria1124Open in IMG/M
3300005575|Ga0066702_10501214All Organisms → cellular organisms → Bacteria → Acidobacteria739Open in IMG/M
3300005586|Ga0066691_10004031All Organisms → cellular organisms → Bacteria6245Open in IMG/M
3300005586|Ga0066691_10118627All Organisms → cellular organisms → Bacteria → Acidobacteria1500Open in IMG/M
3300005921|Ga0070766_10641219All Organisms → cellular organisms → Bacteria → Acidobacteria716Open in IMG/M
3300006028|Ga0070717_10562263All Organisms → cellular organisms → Bacteria → Acidobacteria1033Open in IMG/M
3300006050|Ga0075028_100683345All Organisms → cellular organisms → Bacteria → Acidobacteria616Open in IMG/M
3300006172|Ga0075018_10153660All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Edaphobacter → Edaphobacter lichenicola1063Open in IMG/M
3300006176|Ga0070765_100175668All Organisms → cellular organisms → Bacteria → Acidobacteria1929Open in IMG/M
3300006176|Ga0070765_101719116All Organisms → cellular organisms → Bacteria → Acidobacteria589Open in IMG/M
3300006755|Ga0079222_10343537All Organisms → cellular organisms → Bacteria → Acidobacteria1003Open in IMG/M
3300006800|Ga0066660_10047580All Organisms → cellular organisms → Bacteria2776Open in IMG/M
3300006804|Ga0079221_10049881All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1872Open in IMG/M
3300006954|Ga0079219_10569447All Organisms → cellular organisms → Bacteria → Acidobacteria818Open in IMG/M
3300006954|Ga0079219_11640808All Organisms → cellular organisms → Bacteria → Acidobacteria592Open in IMG/M
3300007788|Ga0099795_10058886All Organisms → cellular organisms → Bacteria → Acidobacteria1419Open in IMG/M
3300009038|Ga0099829_11238602All Organisms → cellular organisms → Bacteria → Acidobacteria618Open in IMG/M
3300009088|Ga0099830_10423928All Organisms → cellular organisms → Bacteria → Acidobacteria1078Open in IMG/M
3300009088|Ga0099830_11377170All Organisms → cellular organisms → Bacteria → Acidobacteria587Open in IMG/M
3300009090|Ga0099827_11070594All Organisms → cellular organisms → Bacteria → Acidobacteria700Open in IMG/M
3300011269|Ga0137392_10662185All Organisms → cellular organisms → Bacteria → Acidobacteria865Open in IMG/M
3300011269|Ga0137392_11473658All Organisms → cellular organisms → Bacteria → Acidobacteria539Open in IMG/M
3300011271|Ga0137393_10615027All Organisms → cellular organisms → Bacteria → Acidobacteria932Open in IMG/M
3300012189|Ga0137388_11389067All Organisms → cellular organisms → Bacteria → Acidobacteria641Open in IMG/M
3300012199|Ga0137383_10155945All Organisms → cellular organisms → Bacteria → Acidobacteria1671Open in IMG/M
3300012199|Ga0137383_10811169All Organisms → cellular organisms → Bacteria → Acidobacteria683Open in IMG/M
3300012205|Ga0137362_11649083All Organisms → cellular organisms → Bacteria → Acidobacteria528Open in IMG/M
3300012206|Ga0137380_10055001All Organisms → cellular organisms → Bacteria3647Open in IMG/M
3300012207|Ga0137381_10232933All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1599Open in IMG/M
3300012210|Ga0137378_10101737All Organisms → cellular organisms → Bacteria → Acidobacteria2639Open in IMG/M
3300012211|Ga0137377_10082810All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia3021Open in IMG/M
3300012349|Ga0137387_10083083All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2206Open in IMG/M
3300012351|Ga0137386_10147736All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1680Open in IMG/M
3300012351|Ga0137386_10597915All Organisms → cellular organisms → Bacteria → Acidobacteria794Open in IMG/M
3300012357|Ga0137384_10109403All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2309Open in IMG/M
3300012359|Ga0137385_10051051All Organisms → cellular organisms → Bacteria → Acidobacteria3696Open in IMG/M
3300012361|Ga0137360_10073904All Organisms → cellular organisms → Bacteria → Acidobacteria2533Open in IMG/M
3300012362|Ga0137361_10004767All Organisms → cellular organisms → Bacteria → Proteobacteria9042Open in IMG/M
3300012924|Ga0137413_10504868All Organisms → cellular organisms → Bacteria → Acidobacteria890Open in IMG/M
3300012927|Ga0137416_10698299All Organisms → cellular organisms → Bacteria → Acidobacteria891Open in IMG/M
3300012930|Ga0137407_10856716All Organisms → cellular organisms → Bacteria → Acidobacteria859Open in IMG/M
3300012957|Ga0164303_10282330All Organisms → cellular organisms → Bacteria → Acidobacteria969Open in IMG/M
3300012960|Ga0164301_10557379All Organisms → cellular organisms → Bacteria → Acidobacteria838Open in IMG/M
3300012986|Ga0164304_10287222All Organisms → cellular organisms → Bacteria → Acidobacteria1121Open in IMG/M
3300017927|Ga0187824_10009810All Organisms → cellular organisms → Bacteria → Acidobacteria2736Open in IMG/M
3300017930|Ga0187825_10001004All Organisms → cellular organisms → Bacteria8067Open in IMG/M
3300017993|Ga0187823_10000265All Organisms → cellular organisms → Bacteria13003Open in IMG/M
3300017994|Ga0187822_10000867All Organisms → cellular organisms → Bacteria5896Open in IMG/M
3300018468|Ga0066662_10216768All Organisms → cellular organisms → Bacteria → Acidobacteria1530Open in IMG/M
3300020579|Ga0210407_10226515All Organisms → cellular organisms → Bacteria → Acidobacteria1455Open in IMG/M
3300020581|Ga0210399_10026766All Organisms → cellular organisms → Bacteria → Acidobacteria4593Open in IMG/M
3300020583|Ga0210401_11133124All Organisms → cellular organisms → Bacteria → Acidobacteria641Open in IMG/M
3300021168|Ga0210406_10214593All Organisms → cellular organisms → Bacteria → Acidobacteria1592Open in IMG/M
3300021168|Ga0210406_10362840All Organisms → cellular organisms → Bacteria → Acidobacteria1167Open in IMG/M
3300021171|Ga0210405_10373162All Organisms → cellular organisms → Bacteria → Acidobacteria1125Open in IMG/M
3300021178|Ga0210408_10013780All Organisms → cellular organisms → Bacteria → Proteobacteria6603Open in IMG/M
3300021178|Ga0210408_10233543All Organisms → cellular organisms → Bacteria → Acidobacteria1464Open in IMG/M
3300021180|Ga0210396_10395830All Organisms → cellular organisms → Bacteria → Acidobacteria1215Open in IMG/M
3300021401|Ga0210393_11354674All Organisms → cellular organisms → Bacteria → Acidobacteria569Open in IMG/M
3300021404|Ga0210389_10945478All Organisms → cellular organisms → Bacteria → Acidobacteria670Open in IMG/M
3300021405|Ga0210387_11548523All Organisms → cellular organisms → Bacteria → Acidobacteria566Open in IMG/M
3300021420|Ga0210394_10140466All Organisms → cellular organisms → Bacteria2096Open in IMG/M
3300021420|Ga0210394_10200079All Organisms → cellular organisms → Bacteria → Acidobacteria1742Open in IMG/M
3300021432|Ga0210384_10012783All Organisms → cellular organisms → Bacteria → Acidobacteria8404Open in IMG/M
3300021432|Ga0210384_10239735All Organisms → cellular organisms → Bacteria → Acidobacteria1632Open in IMG/M
3300021432|Ga0210384_10400293All Organisms → cellular organisms → Bacteria → Acidobacteria1237Open in IMG/M
3300021476|Ga0187846_10054990All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1754Open in IMG/M
3300021478|Ga0210402_10187948All Organisms → cellular organisms → Bacteria1895Open in IMG/M
3300021478|Ga0210402_10697818All Organisms → cellular organisms → Bacteria → Acidobacteria938Open in IMG/M
3300021479|Ga0210410_10142072All Organisms → cellular organisms → Bacteria → Acidobacteria2137Open in IMG/M
3300021559|Ga0210409_10004139All Organisms → cellular organisms → Bacteria → Acidobacteria15782Open in IMG/M
3300021559|Ga0210409_10134844All Organisms → cellular organisms → Bacteria → Acidobacteria2266Open in IMG/M
3300025906|Ga0207699_10401068All Organisms → cellular organisms → Bacteria → Acidobacteria977Open in IMG/M
3300025906|Ga0207699_11480138All Organisms → cellular organisms → Bacteria → Acidobacteria502Open in IMG/M
3300025928|Ga0207700_11100243All Organisms → cellular organisms → Bacteria → Acidobacteria710Open in IMG/M
3300026333|Ga0209158_1010137All Organisms → cellular organisms → Bacteria4688Open in IMG/M
3300026333|Ga0209158_1219423All Organisms → cellular organisms → Bacteria → Acidobacteria657Open in IMG/M
3300027034|Ga0209730_1003924All Organisms → cellular organisms → Bacteria → Acidobacteria1232Open in IMG/M
3300027725|Ga0209178_1013753All Organisms → cellular organisms → Bacteria2535Open in IMG/M
3300027842|Ga0209580_10442975All Organisms → cellular organisms → Bacteria → Acidobacteria647Open in IMG/M
3300027862|Ga0209701_10379268All Organisms → cellular organisms → Bacteria → Acidobacteria793Open in IMG/M
3300027882|Ga0209590_10610689All Organisms → cellular organisms → Bacteria → Acidobacteria701Open in IMG/M
3300027889|Ga0209380_10182358All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1231Open in IMG/M
3300027903|Ga0209488_10323185All Organisms → cellular organisms → Bacteria → Acidobacteria1150Open in IMG/M
3300028047|Ga0209526_10050346All Organisms → cellular organisms → Bacteria → Acidobacteria2933Open in IMG/M
3300028047|Ga0209526_10323595All Organisms → cellular organisms → Bacteria → Acidobacteria1039Open in IMG/M
3300028047|Ga0209526_10414177All Organisms → cellular organisms → Bacteria → Acidobacteria892Open in IMG/M
3300028536|Ga0137415_10316299All Organisms → cellular organisms → Bacteria → Acidobacteria1365Open in IMG/M
3300028906|Ga0308309_10154338All Organisms → cellular organisms → Bacteria → Acidobacteria1851Open in IMG/M
3300031231|Ga0170824_119706199All Organisms → cellular organisms → Bacteria → Acidobacteria950Open in IMG/M
3300031715|Ga0307476_10360103All Organisms → cellular organisms → Bacteria → Acidobacteria1072Open in IMG/M
3300031720|Ga0307469_10013815All Organisms → cellular organisms → Bacteria → Acidobacteria4108Open in IMG/M
3300031753|Ga0307477_10044048All Organisms → cellular organisms → Bacteria3066Open in IMG/M
3300031754|Ga0307475_10032288All Organisms → cellular organisms → Bacteria → Acidobacteria3815Open in IMG/M
3300031823|Ga0307478_11151703All Organisms → cellular organisms → Bacteria → Acidobacteria647Open in IMG/M
3300031962|Ga0307479_10177616All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2094Open in IMG/M
3300031962|Ga0307479_10181746All Organisms → cellular organisms → Bacteria → Acidobacteria2069Open in IMG/M
3300032180|Ga0307471_100082782All Organisms → cellular organisms → Bacteria → Acidobacteria2832Open in IMG/M
3300032180|Ga0307471_100464011All Organisms → cellular organisms → Bacteria → Acidobacteria1408Open in IMG/M
3300032180|Ga0307471_101571365All Organisms → cellular organisms → Bacteria → Acidobacteria814Open in IMG/M
3300032180|Ga0307471_102078417All Organisms → cellular organisms → Bacteria → Acidobacteria714Open in IMG/M
3300032180|Ga0307471_102484244All Organisms → cellular organisms → Bacteria → Acidobacteria655Open in IMG/M
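
Each scaffold identifier in the table above has two parts separated by `|`; the numeric prefix appears to match the Taxon OID column of the Associated Samples table. A minimal sketch of splitting such an identifier follows. `ScaffoldRef` and `parse_scaffold_id` are hypothetical names introduced for illustration and are not part of any IMG/M or NMPFamsDB API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str   # numeric sample identifier before the "|"
    scaffold: str    # scaffold name after the "|"


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split '<taxon_oid>|<scaffold>' into its two components."""
    taxon_oid, sep, scaffold = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold)


if __name__ == "__main__":
    ref = parse_scaffold_id("3300001867|JGI12627J18819_10014810")
    print(ref.taxon_oid, ref.scaffold)
```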




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 26.79%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 19.64%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 12.50%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 10.71%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.36%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 4.46%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.46%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 4.46%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 3.57%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.68%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.79%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.89%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.89%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.89%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.89%



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001867Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300017927Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4EnvironmentalOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017993Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3EnvironmentalOpen in IMG/M
3300017994Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300027034Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027725 Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes) Environmental
3300027842 Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) Environmental
3300027862 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) Environmental
3300027882 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) Environmental
3300027889 Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes) Environmental
3300027903 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) Environmental
3300028047 Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) Environmental
3300028536 Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) Environmental
3300028906 Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) Environmental
3300031231 Coassembly Site 11 (all samples) - Champenoux / Amance forest Environmental
3300031715 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 Environmental
3300031720 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 Environmental
3300031753 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 Environmental
3300031754 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 Environmental
3300031823 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 Environmental
3300031962 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 Environmental
3300032180 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 Environmental

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12627J18819_1001481023300001867Forest SoilMNRFPLLVILMFALSFILPMIFRLFKRGPRDKARVVVEYVLRRGYVLLNPSLAQTLDSSRLEMMRNPALRNPTKASSDIADIEGLDNGTGDWLAFTCDLKSKEVTIFNLSKTSSINSSGASIPYKVAKIRAAGLPRFSLGRNSVLRTVENVVGKMVGAPKTTINVDARQYPEFSAHFWIRGSDPGAVTAYLSHDKIRFLETAKLQGILATNANYLVYFEDGVLLSEKDFDSFIARVETLVTNML*
Ga0066680_1034782413300005174SoilVSHLPLLVAFFFALSLIVPVIFRLFRSGPRETRVVVEYAQKRGYALVNPGLAQAVDMSYLEMLKNPAFRNFNKASLDVDDIEKLNGGSGDWLAFTCNLRSKEVTIFNLSVTSRRVDAQGGSLQYKVAKIKTAGLPRFSLGRNSALHAFENAVDKIAGASKPAINLDARQYPEFSAHFWIKGSDPAAVTAFLSGDKIRFLEDAKLQGTLATNANYLVYFEDGVLRSEQDFDSFIARAEALVANLL*
Ga0066690_1090112713300005177SoilRFFRSGPREMKGVVEYAQRRGYALVNPALAQAAEMSPLEMMKDPALRNLERASLDIANIERLDHGTGDWLAFTCRLGSKEATIFNLSVTSRQGSGGAGVHYKVAKIKAAGLPRFSLGKNSAMHTFENTVEKLAHLSQPEIRVDARLYPEFAAHFWIKGPDAAAVTGFLSGDKIRFLESAKLAGTLATN
Ga0070713_10007626523300005436Corn, Switchgrass And Miscanthus RhizosphereMWLRCNSYVPQQPESRSVASTCNYTFDCQSERKRRPDIPMNHLTLLIILIFVLSILLPMLFRFFRASPRDEAKSVVEYAQKRGYVLVNPALANALDSSLLEMARNPAIKNSIRASSDIADIEELHDGTGDWLAFTCTLRSREATIFNLSVSPRRPDTSGRSLPYKVVKIKAPGLPRFSLGRNSVVHTVVNVVDKIVGAPKRTIDLDARLFPEFSAHYWISGSDAVAVTSFLAPGKIRFLETAKLEGILATNANYLVYFEDGVLLTEQDFDSFIARAETLVANLL*
Ga0070713_10158552513300005436Corn, Switchgrass And Miscanthus RhizospherePALRNSNRASSDIDNIEQLSNGTGDWLAFSCNLGSKEVTIFNLRVASRRADAGGGDIHYKVAKIKAAGLPRFSLGRNSALHSFENVIDKVTGASKPAITLDARQYPEFSSHFWINGFEPAAVTAFLSGGKIRFLETANLQGTLATNANYFVYFEDGELVTEQDFDSFIAKVETIVANLL*
Ga0066689_1016401223300005447SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLGMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGARIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0066701_1020493123300005552SoilVNPALAQALDKSYLEMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDTRQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0066704_1019947913300005557SoilVEYAQKRGYALVNPGLAQAVDMSYLEMLKNPAFRNFNKASLDVDDIEKLNGGSGDWLAFTCNLRSKEVTIFNLSVTSRRVDAQGGSLQYKVAKIKTAGLPRFSLGRNSALHAFENAVDKIAGASKPAINLDARQYPEFSAHFWIKGSDPAAVTAFLSGDKIRFLEDAKLQGTLATNANYLVYFEDGVLRSEQDFDSFIARAEALVANLL*
Ga0066703_1018640113300005568SoilVNPALAQALDKSYLEMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0066703_1070471713300005568SoilYALVNPALAQAVEMSPLEMMKNPALRNLERASLDIANIERLDHGTGDWLAFTCRLGSKEATIFNLGVTSRQGSGGAGVHYKVAKIKAAGLPRFSLGRNSAMHTFENTVEKLAHVSQPEIRVDARMYPEFAAHFWIKGPDAAAVTGFLSGDKIRFLESAKLAGTLATNANYLVYFEDGKLETEQDFDAFIAKAE
Ga0066705_1024987823300005569SoilMSHLPFFVTIVVALSLVLPILYRFFRSGPREMKGVVEYAQRRGYALVNPALAQAVEMSPLEMMKDPALRNLERASLDIANIERLDHGTGDWLAFTCRLGSKEATIFNLSVTSRQGSGGAGVHYKVAKIKAAGLPRLSLGKNSAMHTFENTVEKLAHVSQPEIRLDSRLYPEFATHFWIKGPDPAAVTGFLSGDKIRFLESAKLAGTLATNANYLVYFEDGKLETEQDFDAFIAKAEAISANFL*
Ga0066702_1050121413300005575SoilAQAANMSPLEMMKDPALRNLERASLDIANIERLDHGTGDWLAFTCRLGSKEATIFNLSVTSRQGSGGAGVHYKVAKIKAVGLPRFSLGKNSAMHTFENTVEKLAHLSQPEIRVDARMYPEFAAHFWIKGPDAAAVTGFLSGDKIRFLESAKLAGTLATNANYLVYFEDGKLETEQDFDAFIAKAESSAGNFV*
Ga0066691_1000403123300005586SoilVVEYAQKRGYALVNPGLAQAVDMSYLEMLKNPAFRNFNKASLDVDDIEKLNGGSGDWLAFTCNLRSKEVTIFNLSVTSRRVDAQGGSLQYKVAKIKTAGLPRFSLGRNSALHAFENAVDKIAGASKPAINLDARQYPEFSAHFWIKGSDPAAVTAFLSGDKIRFLEDAKLQGTLATNANYLVYFEDGVLRSEQDFDSFIARAEALVANLL*
Ga0066691_1011862723300005586SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLEMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0070766_1064121913300005921SoilPLFFALVAVVSLVLPTLYRLLRGGPREMKGVVEYAQKRGYALVNPALAQAVDLSPIEMMKNQTFVNLNRASLDIYDIQKLDRGTGDWLAFTCKLGSKEVTIFNLTIPSQRTDGRGSDLHHKVAKIKAAGLPRFSLARNSAMHTFENVVENIAHVPQAEIHLDARQYPNFAKHFWIRGSDAGEVTSFLSDGKIRFLETANLGGTLATNANYLVYFEDGKLQTEQDFDSFIARAEAIVAN
Ga0070717_1056226323300006028Corn, Switchgrass And Miscanthus RhizosphereGIPSMWLRCNSYVPQQPESRSVASTCNYTFDCQSERKRRPNMPMNHLTLLIILIFVLSILLPMLFRFFRASPRDKAKSVVEYAQKRGYVLVNPALANALDSSLLEMARNPAFKNSIRASSDIADIEELHDGTGDWLAFTCTLRSREATIFNLSVSPRRPDTGGRSIPYKVVKIKAPGLPRFSLGRNSVVHTVVNVVDKIVGAPKGTINVDARLFPEFSAHYWISGSDAVAVTSFLPPGKIRFLETAKLEGILATNANYLVYFEDGVLLTEQDFDSFIARAETLVANLL*
Ga0075028_10068334513300006050WatershedsLGSSPVEMLKNPALRNSIQAGSDISDIEALDNGTGDWLAFTCNLRSHEVTIFNLSVTPRTVNSSSGNIHYKVAKISAVDLPRFSLGRNSGLHTLENVVGKLVGAPKPTVDLDARQYPEFAAHSWIRGSDAAAITAFLSPSKIKFLETANLPGVLATNANYLVYFEDGILRTGQDFDSFISRAETVAANLL*
Ga0075018_1015366013300006172WatershedsTNPAISQTLGSSPVEMLKNPALRNSIQAGSDISDIEALDNGTGDWLAFTCNLRSHEVTIFNLSVTPRTVNSSSGNIHYKVAKISAVDLPRFSLGRNSGLHTLENVVGKLVGAPKPTVDLDARQYPEFAAHSWIRGSDAAAITAFLSPSKIKFLETANLPGVLATNANYLVYFEDGILRTGQDFDSFISRAETVAANLL*
Ga0070765_10017566823300006176SoilMNYFPLLVLLLFALSFLLPMIFRLFSHGPRDKARVVVGYAQKRGYVLVNPSLAQALDSSPMEMLRNPTLRNSIKASSDVADIEGLDNGTGEWFAFTCNLRSKEVTIFNLNVTSRRAGTGGGGIPYKVAKIRAAGLPRFSVGRNSALHTIEDVVDKVAGASKPTIGADPRQYPEFSAHFWIRGSDPAAVNAFLSADKIRFLETAKLEGILATNANYLVYFEDGILLREQDFDVFIANVEKLVANIL*
Ga0070765_10171911613300006176SoilSPIEMMKNQTFVNLNRASLDIYDIQKLDRGTGDWLAFICKLGSKEVTIFNLTIPSQRTDGRGADLHHKVAKIKAAGLPRFSLARNSAMHTFENVVENIAHVPQAEIHLDARQYPNFAKHFWIRGSDAAAVTSFLSDGKIRFLETANLGGTLATNANYLVYFEDGKLTTEQDFDSFIARAEAIVANFL*
Ga0079222_1034353723300006755Agricultural SoilVSHLPFFVMLALGLSVVLPIFFRMLRSPREVKTVVEYAQKRGYALVNPGLAHAVDMSPLQMMKDPAFAHFERASLDISNIAKLDRGTGDWLAFTCRLGSKEATIFNLSVTSRQGSGGAGIHYKVAKIKAAGLPLFALARNSAMHTFENTVNKLAHLSQPEIRVDARLFPEFAAKFWITGPDAAAVTAFLSGDKIRFLEATKLPGTLAANANYLVYFEDGTLTAESDFDLFIAKAEAIAANFL*
Ga0066660_1004758023300006800SoilMSHLPFFVTIVVALSLVLPIVYRFFRSGPREMKGVVEYAQRRGYALVNPALAQAANMSPLEMMKDPALRNLERASLDIANIERLDHGTGDWLAFTCRLGSKEATIFNLSVTSRQGSGGAGVHYKVAKIKAAGLPRFSLGKNSAMHTFENTVEKLAHLSQPEIRVDARMYTEFAAHFWIKGPDAAAVTGFLSGDKIRFLESAKLAGTLATNSNYLVYFEDGKLETEQDFDAFITKAEAIAANFL*
Ga0079221_1004988113300006804Agricultural SoilVSHFTSVAAVLFALSLVVPILYRIFRSGPREARSVVEYAQRRGYALVNPGVAQAVDMSPLEMMKDPALRNLERASLDIANIERLDNGTGDWLAFTCRLGPKEATIFNLSVTSRQGGAGAGVHYKVAKIKAAGLPRFSLGRNSAMHTFESTVDKLAHLSQPEIRVDARMYPDFAANFWIKGPDAAAVTAFLSGDKIRFLEASKLAGTLATNADYLVYFEEGKLETEQDFDAFIAKAEAVAANFV*
Ga0079219_1056944713300006954Agricultural SoilMISGAPRKKARTLIEYAQKRGYALVNPAVLQALDMPYFEMLKSPVMKDSVKASADIDDIEKLSGGTGDWLAFTCNLRSKEVTIFNLNVTSRRADASGGNIHYKVAKIKASGLPRFSLGRNSRLHTFERVVDQLAGASKPEIKLDARQYPEFAAHFWIRGADAAAVTAFLSDAKIRFLEGANLAGTLATNAHYLVYFEDGILN
Ga0079219_1164080813300006954Agricultural SoilRMLRSPREVKTVVEYAQKRGYALVNPGLAHAVDMSPLQMMKDPAFAHFERASLDISNIAKLDRGTGDWLAFTCRLGSKEATIFNLSVTSRQGSGGAGIHYKVAKIKAAGLPLFALARNSAMHTFENTVNKLAHLSQPEIRADARLYPEFAANFWINGPDAAAVTAFLSDDKIRFLEATKLPGTLATNANYLVYFEDG
Ga0099795_1005888623300007788Vadose Zone SoilMSHFSLLAILLFALSILLPMIFRLFSSGPRDKARVVIEYAQRKGYALVNPSLAQALDVSRWEMLKNPAFRNNTQASSDIADIEGLDNGTGDWLAFTCNLRSKEVTIFNLNVTARNMNSQSPGISHKVAKTKSAGLPRFSLGRNSALHTFENVVDKIAGEPKAAINLDARQYPEFSSHFWIRGSDPGPVTAFLSSSKISFLQNAGLPGTLATNANYLVYFEDGVLLSENDFDSFIARVEILVTNML*
Ga0099829_1123860213300009038Vadose Zone SoilVLVNPSLAQALDTSRLEMLKNPALRNSIQASSDIADIERLHDGTGDWLAFTCNLRSKEVTIFNLSVTSRRTDTRGGSIPYKVAKIRAAGLPRFSLGRNSALHTFENAVDKIAGASKPAMDLDARQYPEFSSHFWIRGSDRAAVTAFLSGDKIRFLEDEKLEGILATNVNYLVYFEDGVLLTEQDFDSFIARAETLVANLL*
Ga0099830_1042392813300009088Vadose Zone SoilLELKKGNRDQTHSVNHFPLLVVFLFALSFVLPILFRLFNSGPREKAGVVVRYAQKRGYVLVNPSLAQALDTSRLEMLKNPALRNCIQASSDIADIERLHDGTGDWLAFTCNLRSKEVTIFNLSVTSRRTDTRGGSIPYKVAKIRAAGLPRFSLGRNSALHAIENAVDKIAGASKPAIDLGARQYPEFSSHFWIRGSDRAAVTAFLSGDKIRFLEAAKLEGILATNANYLVYFEDGVLLTEQDFDSFIARAETLVANLL*
Ga0099830_1137717013300009088Vadose Zone SoilKAGVVVRYAQKRGYALVNPSLAQALDMSPLEMMKNPDLRNSIQASSDIADIEKLHDGTGDWLAFTCNLGSKEVTIFNLSVTSRRADTRGGSIPYKVAKIRAAGLPRFSLGRNSALHTLENVVDKIAGTSKPAIDLDAGQYPEFSKHFWIRGSDRAAVTAFLSGDKIRFLETAKLEGILATNANYLVYFEDGVLLT
Ga0099827_1107059413300009090Vadose Zone SoilIADIERLHDGTGDWLAFTCNLRSKEVTIFNLSVTSRQADTRGGSIPYKVAKIRAAGLPRFSLGRNSALHTFENVVDKIAGASKPAIDLDACQYPGFSPLFWIRGSDRDAVTTFLSGDKIRFLEAAKLEGTLATNVNYLVYFEDGVLLTEQDFDSFIARAETLVANLL*
Ga0137392_1066218513300011269Vadose Zone SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLEMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHAFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFI
Ga0137392_1147365813300011269Vadose Zone SoilRYAQKRGYALVNPSLAQALDMSPLEMMKNPDLRNSIQASSDIADIEKLHDGTGDWLAFTCNLRSKEVTIFNLSVTSRRADTRGGSIPYKVAKIRAAGLPRFSLGRNSALHTFENTVDKIAGASKPAIDVEASQYPQFSSHFWIRGSDRAAVTAFLSGDKIRFLETAKLEGILATNANYL
Ga0137393_1061502713300011271Vadose Zone SoilSHFASLVAVAFVLSFILPIVYRFFRSGPREMKGVVEYAQKRGYALVNPALAQAVNVSPIEMMKNPAFANLDRASLDISNIPKLDGGTGDWLAFTCNLRSKEVTIFNLSVTSRRTDAGAGDLHYKVAKIKAAGLPRFSLARNSALHTFENVVEKIAHVPQAEIHVDARLYPEFAAHFWIRGSDAAAVTSFLSDGKIRFLETANLGGTLATNANYLVYFEDGKLATEQDFDSFIAKAESIVANFL*
Ga0137388_1138906713300012189Vadose Zone SoilPSLAQALDTSPLEMLKNPALRNSIQASSDIADIEKLHDGTGDWLAFTCNLGSKEVTIFNLSVTSRRADTRGGSIPYKVAKIRAAGLPRFSLGRNSALHTLENVVDKIAGTSKPAIDLDAGHYPEFSKHFWIRGSDRAAVTAFLSGDKIRFLETAKLEGILATNANYLVYFEDGVLLTEQDFDSFIARAETLVANIVVKQNRSQLSAPRRTPRT
Ga0137383_1015594513300012199Vadose Zone SoilVKKGAGRTYSVNHLSLLVPLLFALSLIVPILFRLFRSGPRAARVVVEYAQRRGYALVNPALAQALDKSYLEMVKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0137383_1081116913300012199Vadose Zone SoilMIFRLFNRGPRDKARVVVEYVLRRGYVLLNPSLAQTLDSSRLEMMRNPALRNPTKASSDIADIEGLDNGTGDWLAFTCDQKSKQVTIFNLSKTSSINSSGASIPYKVAKIRAAGLPRFSLGRNSVLHTVESVVGKMVGAPKTTINVDARQYPEFSAHFWIRGSDPGAVTAYLSHDKIRFLETAKLQGILATNANYLVYFED
Ga0137362_1164908313300012205Vadose Zone SoilMDISDIEGLHDGTGGWLAFTCNLGSKEVTIFNLSVSSQRSDTQGRSIHYKVARIKAAGLPRFSLGRNSALHTIENVVDKITGASNRAITLDARQHPEFSAHFWIKGADPAAVTAFLSGDKIRFLESAKLAGTLSTNANYLVYFEDGVLASEKDFDSFIAKAETVV
Ga0137380_1005500123300012206Vadose Zone SoilMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAADLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0137381_1023293333300012207Vadose Zone SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLGMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGT
Ga0137378_1010173723300012210Vadose Zone SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLGMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0137377_1008281033300012211Vadose Zone SoilMNRFPLLVILLFALSFILPMIFRLFNRGPRDKARVVVEYVLRRGYVLLNPSLAQTLDSSRLEMMRNPALRNPTKASSDIADIEGLDNGTGDWLAFTCDQKSKQVTIFNLSKTSSINSSGASIPYKVAKIRAAGLPRFSLGRNSVLHTVESVVGKMVGAPKTTINVDARQYPEFSAHFWIRGSDPGAVTAYLSHDKIRFLETAKLQGILATNANYLVYFEDGVLLSEKDFDSFIARVETLVTNML*
Ga0137387_1008308313300012349Vadose Zone SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLGMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKISFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0137386_1014773633300012351Vadose Zone SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLGMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIG
Ga0137386_1059791513300012351Vadose Zone SoilVVEYVLRRGYVLLNPSLAQTLDSSRLEMMRNPALRNPTKASSDIADIEGLDNGTGDWLAFTCDQKSKQVTIFNLSKTSSINSSGASIPYKVAKIRAAGLPRFSLGRNSVLHTVESVVGKMVGAPKTTINVDARQYPEFSAHFWIRGSDPGAVTAYLSHDKIRFLETAKLQGILATNANYLVYFEDGVLLSEKDFDSFIARVETLVTNML*
Ga0137384_1010940313300012357Vadose Zone SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLGMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFATHFWIKGPDPAAVIAFLSGAKIRFLEAADLQGTLATNANYLVYFEDGVLVTE
Ga0137385_1005105123300012359Vadose Zone SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLEMVKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIARAETLVANLL*
Ga0137360_1007390423300012361Vadose Zone SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSCLEMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0137361_1000476733300012362Vadose Zone SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLEMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHAFENVVDKIAGASKPGVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL*
Ga0137413_1050486823300012924Vadose Zone SoilLVNPSLAQALDVSRWEMLKNPAFRNNTQASSDIADIEGLDNGTGDWLAFTCNLRSKEVKIFNLNLTARNMNSQSPGISHKVAKTKSAGLPRFSLGRNSALHTFENVVDKIAGEPKAAINLDARQYPEFSSHFWIRGADPGPVTAFLSSSKISFLENAGLPGTLATNANYLVYFEDGVLLSENDFDSFIARVETLVTNML*
Ga0137416_1069829913300012927Vadose Zone SoilKNPALRNPIQASSDIADIEKLHDGTGDWLAFTCNLRSKEVTIFNLSATSRRVDTRGGSIPYKVAKIRAAGLPRFSLGRNSALHTFENVVEKIAGASKPTIDLDARQYPEFSPHFWIRGSDRAEVTAFLSGDKIRFLEAAKLEGILATNANYLVYFEDGVLLTEQDFDSFIARAETLVANLL*
Ga0137407_1085671623300012930Vadose Zone SoilMSHFSLLAILLFALSILLPMIFRLFSSGPRDKARVVIEYAQRKGYALVNPSLAQALDVSRWEMLKNPAFRNNTQASSDIADIEGLDNGTGDWLAFTCNLRSKEVTIFNLNVTARNMNSQSPGISHKVAKIKSAGLPRFSLGRNSALHTFENVVDKIAGEPKAAINLDARQYPEFSSHFWIRGADPGPVTAFLSSSKISFLENAGLPGTLATNANYLVYFEDGV
Ga0164303_1028233013300012957SoilMNHIPLLVIVLFAMSFILPMIFRLFSSGPRDKARVVVEYAQRRGYVLDNPSVARALDSSRLEMLRNPDLRNSIKASSDIADIEGLDNGTGDWLGFTCNLRSKQVTIFNLSVTLQRSNTGGGSIPYKVAKIRAAGLPRFSLGRNSVLHTVENVVGKMVGAPKATIDVDARQYPEFSAHYWIRGSDPGTVTAFLSPDKIRFLETAKLQGILATNEKYMVYFEDGVLLGEKDFDSFIARVEILVANML*
Ga0164301_1055737913300012960SoilALDSSRLEMLRNPDLRNSIKASSDIADIEGLDNGTGDWLGFTCNLRSKQVTIFNLSVTLQRSNTGGGSIPYKVAKIRAAGLPRFSLGRNSVLHTVENVVGKMVGAPKATIDVDARQYPEFSAHYWIRGSDPGTVTAFLSPDKIRFLETAKLQGILATNEKYLVYFEDGVLLGEKDFDSFIARVEILVANML*
Ga0164304_1028722213300012986SoilMNHIPLLVIVLFAMSFILPMIFRLFSSGPRDKARVVVEYAQRRGYVLDNPSVARALDSSRLEMLRNPDLRNSIKASSDIADIEGLDNGTGDWLGFTCNLRSKQVTIFNLSVTLQRSNTGGGSIPYKVAKIRAAGLPRFSLGRNSVLHTVENVVGKMVGAPKATIDVDARQYPEFSAHYWIRGSDPGTVTAFLSPDKIRFLETAKLQGILATNEKYLVYFEDGVLLGEKDFDSFIARVEILVANML*
Ga0187824_1000981023300017927Freshwater SedimentVNNFSLLIAFVFAASVVLPILFRLFRSGPREARAVVEYAQKRGYTLVNPGLAQALDKSYLETLKDPAFRNSGQASSDISDIEKLHDGTGGWMAFTCNVGSREVTIFNLSVSSRRTDSQGGGIRYKVAKIKAPGLPRFTLGRNSALHKFENVVEEIAGVSKAAVQVDARQFPDFAAHFWVKGADPVAVNAFLSGDKIQFLETAKLEGTLATNANYLVYFEDGVLRTEQDFDSFIARAEKLVANLL
Ga0187825_1000100443300017930Freshwater SedimentVNNFSLLIAFVFAASVVLPILFRLFRSGPREARAVVEYAQKRGYTLVNPGLAQALDKSYLEMLKDPALRNSGQASSDISDIEKLHDGTGGWMAFTCNVGSREVTIFNLSVSSRRTDSQGGGIRYKVAKIKAPGLPRFTLGRNSALHKFENVVEEIAGVSKAAVQVDARQFPDFAAHFWVKGADPVAVNAFLSGDKIQFLETAKLEGTLATNANYLVYFEDGVLRTEQDFDSFIARAEKLVANLL
Ga0187823_1000026553300017993Freshwater SedimentVNNFSLLIAFVFAASVVLPILFRLFRSGPREARAVVEYAQKRGYTLVNPGLAQALDKSYLEMLKDPALRNSEQASSDISDIEKLHDGTGGWMAFTCNVGSREVTIFNLSVSSRRTDSQGGGIRYKVAKIKAPGLPRFTLGRNSALHKFENVVEEIAGVSKAAVQVDARQFPDFAAHFWVKGADPVAVNAFLSGDKIQFLETAKLEGTLATNANYLVYFEDGVLRTEQDFDSFIARAEKLRAKLL
Ga0187822_1000086723300017994Freshwater SedimentVNNFSLLIAFVFAASVVLPILFRLFRSGPREARAVVEYAQKRGYTLVNPGLAQALDKSYLEMLKDPALRNSGQASSDISDIEKLHDGTGGWMAFTCNVGSREVTIFNLSVSSRRTDSQGGGIRYKVAKIKAPGLPRFTLGRNSALHKFENAVEEIAGVSKAAVQVDARQFPDFAAHFWVKGADPVAVNAFLSGDKIQFLETAKLEGTLATNANYLVYFEDGVLRTEQDFDSFIARAEKLVANLL
Ga0066662_1021676823300018468Grasslands SoilVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLEMVKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAVNLQGTLATNANYLVYFEDGVLVTEQDFDSFIGRAETLVANLL
Ga0210407_1022651513300020579SoilGLAQAQDKSYLEMLKDPALRSSSRASVDISDIEKLHNGTGDWLAFTCNLGSKEVTIFNLSVTPKGTNTQGGSIHYKIAKIKAAGLPRFSVGRNSALHTVENVVDKITGASHLAITVDARQYPEFSTHFWIKGADPSAVTAFLSGDRIRFLESAKLEGILATNANFLVYFEDGVLATEQDFDSFIAKAEKVVANLLSNKLEPVSLST
Ga0210399_1002676623300020581SoilMNHLPLLVILIFVLSIILPMIFRLFGSGPRDKARAVVEYAQRRGYVLVNPSVAQALDSSLLDMRRNPALKNSIHASSDIVDIEGLHNGTGDWLAFTCNLRSKEVTIFNFNVSSQRADASGRSIPHKVAKIRAAGLPRFSVGRNSVLHTVVTIVDNIVGAPNVTINVDARQYPEFSAHYWIRGSDPGAVTSFLSPDKIGFLETQKLEGILATNTNYLVYFEDGVLLSEQNFDSFIAKVDALVAAIL
Ga0210401_1113312413300020583SoilARDHMSSNHFPFLVVLLLALSFILPMIFRIIFTRGPRDKARVLVQYAQKRGYALVNPSLAQALDVSRLEVLKNPALRNSIRSASDIADIEGLDNGTGEWFAFTCNLRSKEVTIFNLNVASKSVNTDSSGISYKVAKIRAAGLPRFSLGKNSVVHTIENVVEKVAGASEPTIAVDEHQYPEFSTTFWIKGADSAATTAFLSANKIVFLQNAKLK
Ga0210406_1021459333300021168SoilMGLENEARDHMSSNHFPFLVVLLLALSFILPMIFRIIFTRGPRDKARVLVQYAQKRGYALVNPSLAQALDVSRLEVLKNPALRNSIRSASDIADIEGLDNGTGEWFAFTCNLRSKEVTIFNLNVTSKSVNTDSSGISYKVAKIRAAGLPRFSLGKNSVVHTIENVVEKVAGASEPTITVDEHQYPEFSTTFWIKGADSAATTAFLSANKIVFLQNAKLKGTL
Ga0210406_1036284023300021168SoilHNGTGDWLAFTCNLGSKEVTIFNLSVTPKGTNTQGGSIHYKIAKIKAAGLPRFSVGRNSALHTVENVVDKITGASHLAITVDARQYPEFSTHFWIKGADPSAVTAFLSGDRIRFLESAKLEGILATNANFLVYFEDGVLASEQDFDSFIAKAEKVVANLLSNKLEPVSLST
Ga0210405_1037316213300021171SoilSGPREMKGVVEYAQKRGYALVNPALAQAVDMSPIEMMKNQTFANLNRASLDIYDIQKLDRGTGDWLAFTCKLGSKEATIFNLTIPSPRSDGRGGDLHHKVAKIKAPGLPRFSLAKNSAMHTFENVVENIAHVPQAEIHLDARLYPDFAKHFWIRGSDAAAVTSFLCDGKIRFLEAAKLEGTLATNANYLVYFEDGKLTTEQDFDSFIARAEAIVANFL
Ga0210408_1001378083300021178SoilMGLENEARDHMSSNHFPFLVVLLLALSFILPMIFRIIFTRGPRDKARVLVQYAQKRGYALVNPSLAQALDVSRLEVLKNPALRNSIRSASDIADIEGLDNGTGEWFAFTCNLRSKEVTIFNLNVTSKSVNTDSSGISYKVAKIRAAGLPRFSLGKNSVVHTIENVVEKVAGASEPTIAVDEHQYPEFSTTFWIKGADSAATTAFLSANKIVFLQNAKLKGTLATNANYLVYFEQGILVSEQDFDVFIAEVEKLVANLL
Ga0210408_1023354323300021178SoilMNHLPLLVILIFVLSIILPMIFRLFGSGPRDKARAVVEYAQRRGYVLVNPSVAQALDSSLLDMRRNPALKNSIHASSDIADIEGLHNGTGDWLAFTCNLRSKEVTIFNFNVSSQRADASGRSIPHKVAKIRAAGLPRFSVGRNSVLHTVVTIVDNIVGAPNVTINVDARQYPEFSAHYWIRGSDPGAVTSFLSPDKIGFLETQKLEGILATNTNYLVYFEDGVLLSEQNFDSFIAKVDALVAAIL
Ga0210396_1039583013300021180SoilRSSSRASVDISDIEKLHNGTGDWLAFTCNLGSKEVTIFNLSVTPKGTNTQGGSIHYKIAKIKAAGLPRFSVGRNSALHTVEKVVDKITGASQLAITVDARQYPEFSTHFWIKGADPSAVTAFLSGDRIRFLESAKLEGILATNANFLVYFEDGVLASEQDFDSFIAKAEKVVANLLSNKLEPVSLST
Ga0210393_1135467413300021401SoilLEMLKDPALRSSSRASVDISDIEKLHNGTGDWLAFTCNLGSKEVTIFNLSVTPKGTNTQGGSIHYKIAKIKAAGLPRFSVGRNSALHTVENVVDKITGASHLAITVDARQYPEFSTHFWIKGADPSAVTAFLSGDRIRFLESAKLEGILATNANFLVYFEDGVLASEQDFDSFIAKAEKVVANLLSNKL
Ga0210389_1094547813300021404SoilRGGPREAKAVVEYAQKRGYALVNPGLAQAQDKSYLEMLKDPALRSSSRASVDISDIEKLHNGTGDWLAFTCNLGSKEVTIFNLSVTPKGTNTQGGSIHYKIAKIKAAGLPRFSVGRNSALHTVEKVVDKITGASQLAITVDARQYPEFSTHFWIKGADPSAVTAFLSGDRIRFLESAKLEGILATNANFLVYFEDGVLASEQDFDSFIAKAEKVVANLLSNKL
Ga0210387_1154852313300021405SoilFRRGPRDKARVVVEYAQKKGYVLVNPSLAQALDSSPLEMLKNPALRNSIKASSDIADIEGLGNGTGEWFAFTCNLRSKEVTIFNLNATSRGAGTSGGGIPYKVAKIRATGLPRFSLGRNSVLHTIEDAVGKVTGGSKPTIHVDARQYPEFSTHFWIRGSDPAAVTAFLYADKIRFLETAKLEGILATT
Ga0210394_1014046623300021420SoilMSYLPLFFALVAVVSLVLPALYRLLRGGPREMRTVVEYAQRRGYALVNPGAAQAVDMSPLEMMKNPALRNLERASLDIYDIQKLDRGTGDWLAFICKLGSKEVTIFNLTIPSQRTDGRGSDLHHKVAKIKAAGLPRFSLARNSAMHTFENVVENIAHVPQAQIHLDARQYPNFAKHFWIRGSDAGEVTSFLSDGKIRFLETANLGGTLATNANYLVYFEDGKLQTEQDFDSFIARAEAIVANFL
Ga0210394_1020007913300021420SoilMHSMNYLPLLVVLFFALSFILPMIFRLFSGGPRDKARVVVGYAQKRGYVLVNPSLTQALDSSRLEMLKNPAFRNSIKASSDIADIEGLDNGTGEWLAFNCNLRSKEVTIFNLNVTSRGVGTSGRGIPYKVAKIRAAGLPRFSLGRNSVLHAIEDAVNKVAGASKPTIDVDARQYPEFSTHFWIRGSDPAAVTAFLSADKIRFLEAAKLEGILATNANYLVYFEGGILLREQDFDVFITGVEKLVANIL
Ga0210384_1001278383300021432SoilLQDPAHRHVILVCPCKKSPCNELKIPAMLFAGSQNELSDLIIHPKSHLALLFILLFALSFLLPMIFRLFSRSPRNNAKALVAYVQKRGYALVNPSLAQALDTPLPEMFKNPALRNSIRASSDITDINWLDNGTGDWLAFTCNLRSKEVTIFNLNVTSQQNATGVGIPYQVAKIKVPGLPRFSLGRNSVLHTLENVVGQVAGTSNPAITLDPCQYPEFAAHFWIRASDPAAVTSFLPPDKVKFLETAKLEGIVATNTHYLVYFEDGTLLKEQDFDAFISRVDTIIANLL
Ga0210384_1023973523300021432SoilMGLENEARDHMSSNHFPFLVVLLLALSFILPMIFRIIFTRGPRDKARVLVQYAQKRGYALVNPSLAQALDVSRLEVLKNPALRNSIRSASDIADIEGLDNGTGEWFAFTCNLRSKEVTIFNLNVTSKSVNTDSSGISYKVAKIRAAGLPRFSLGKNSVVHTIENVVEKVAGASEPTIAVDEHQYPEFSTTFWIKGADSAAITAFLSANKIVFLQNAKLKGTLATNANYLVYFEQGILVSEQDFDVFIAEVEKLVANLL
Ga0210384_1040029323300021432SoilPFLVAIVVALSMVGPIVYRIFRSGPREMKGVVEYAQKRGYALVNPALAQAVDMSPIEMMKNQTFANLNRASLDIYDIQKLDRGTGDWLAFTCNLRSKEVAIFNLTIPSQRTDGRGGDIHHKVAKIKAPGLPRFSLAKNSAMHTFENVVENIAHVPQAEIHLDARLYPDFAKHFWIRGSDAAAVTSFLSDGKIRFLETANLEGTLATNANYLVYFEDGKLTSEQDFDSFIARAEAIVANFL
Ga0187846_1005499023300021476BiofilmMRYLPFLAPVAFAFSVVLSILYRLFRSGPREAKLVVEYAQRRGYALVNPGVAQVVDLSPLEMMRNPALRNLERPSLDIANIEKLDRGTGDWLAFTCRLGSKDATIFNLIVTSRRADGKGGDLHYKVAKIKAAGLPRFSLARNSAMHSFENAVEKLAHQAQPDIRVDARLYPEFAAHFWIKGPDAAVVTAFLSDGKIRFLENAKLAGTLATNANYLVYFEGGKLVTEQDFDAFIATAQSIVANFL
Ga0210402_1018794823300021478SoilMSHFPLLVALLFVLIFILSTIFRLFSGPRPLNQAQAVVEYAQKKGYALVNPAISQALGSSPVEMLKNPALRNSIRAAADISDIEGLENGTGDWLAFTCSLRSHEVTIFNLSVTPRTVNSSSGNIHYKVAKICAADLPRFSLGRNSGLHTLENVVGKLVGTPKPTVDLDARQYPEFAAHSWIRGSDAAAITTFLSPSKIKFLGTAYLPGVLATNAKYLVYFEDGALRTGADFDSFVSRVETIVANVL
Ga0210402_1069781813300021478SoilMNNFSFIVALVFALSVVFPILFRLFRGGPREAKAVVEYAQKRGYDLVNPGLAQALDKSYLEMLKDPALRSSSRASVDISDIEKLHNGTGDWLAFTCNLGSKEVTIFNLSVTPKGTNTQGGSIHYKIAKIKAAGLPRFSVGRNSALHTVEKVVDKITGASQLAITVDARQYPEFSTHFWIKGADPGAVTAFLSGDRIRFLESAKLEGILATNANFLVYFEDGVLASEQDFDSFIAKAEKVVANLLSNKLEPVSLST
Ga0210410_1014207233300021479SoilMGLENEARDHMSSNHFPFLVVLLLALSFILPMIFRIIFTRGPRDKARVLVQYAQKRGYALVNPSLAQALDVSRLEVLKNPALRNSIRSASDIADIEGLDNGTGEWFAFTCNLRSKEVTIFNLNVTSKSVNTDSSGISYKVAKIRAAGLPRFSLGKNSVVHTIENVVEKVAGAAEPTITVDEHQYPEFSTTFWIKGADSAATTAFLSANKIVFLQNAKLKGTLATNANYLVYFEQGILVSEQDFDVFIAEVEKLVANLL
Ga0210409_1000413983300021559SoilLQDPAHRHVILVCPCKKSPCNELKIPAMLFAGSQNELSDLIIHPKSHLALLFILLFALSFLLPMISRLFSRSPRNNAKALVAYVQKRGYALVNPSLAQALDTPLPEMFKNPALRNSIRASSDITDINWLDNGTGDWLAFTCNLRSKEVTIFNLNVTSQQNATGVGIPYQVAKIKVPGLPRFSLGRNSVLHTLENVVGQVAGTSNPAITLDPCQYPEFAAHFWIRASDPAAVTSFLPLDKVKFLETAKLEGIVATNTHYLVYFEDGTLLKEQDFDAFISRVDTIIANLL
Ga0210409_1013484413300021559SoilVDMSPIEMMKNQTFANLNRASLDIYDIQKLDRGTGDWLAFTCNLRSKEVAIFNLTIPSQRTDGRGGDLHHKVAKIKAPGLPRFSLARNSAMHTFENVVENIAHVPQAEIHLDARLYPDFAKHFWIRGSDAAAVTSFLSDGKIRFLETAKLEGTLATNANYLVYFEDGKLTSEQDFDSFIARAEAIVANFL
Ga0207699_1040106813300025906Corn, Switchgrass And Miscanthus RhizosphereSLNPTLYLFETSSVIFFFTPVSRDVPALVTKSGIPSMWLRCNSYVPQQPESRSVASTCNYTFDCQSERKRRPNIPMNHLTLLIILIFVLSILLPMLFRFFRASPRDKAKSVVEYAQRRGYVLVNPALANALDSSLLEMARNPAFKNSIRASSDIADIEELHDGTGDWLAFTCTLRSREATIFNLSVSPRRPDTGGRSIPYKVVKIKAPGLPRFSLGRNSVVHTVVNVVDKIVGAPKGTIDVDARLFPEFSAHYWISGSDAVAVTSFLAPGKIRFLETARLEGILATNANYLVYFEDGVLVTEQDFDSFIARAETLVANLL
Ga0207699_1148013813300025906Corn, Switchgrass And Miscanthus RhizosphereALAQALDMSYLEMQKNPALRNSNRASSDIDNIEQLSNGTGDWLAFSCNLGSKEVTIFNLRVASRRTDASGGDIHYKVAKIKAAGLPRFSLGRNSALHSFENVIDKVTGASKPAITLDARQYPEFSSHFWINGSDPAAVTAFLSGGKIRFLETSNLQGTLATNANYFV
Ga0207700_1110024313300025928Corn, Switchgrass And Miscanthus RhizosphereRKRRPDIPMNHLTLLIILIFVLSILLPMLFRFFRASPRDEAKSVVEYAQKRGYVLVNPALANALDSSLLEMARNPAIKNSIRASSDIADIEELHDGTGDWLAFTCTLRSREATIFNLSVSPRRPDTSGRSLPYKVVKIKAPGLPRFSLGRNSVVHTVVNVVDKIVGAPKRTIDLDARLFPEFSAHYWISGSDAVAVTSFLAPGKIRFLETAKLEGILATNANYLVYFEDGVLLTEQ
Ga0209158_101013743300026333SoilVVEYAQKRGYALVNPGLAQAVDMSYLEMLKNPAFRNFNKASLDVDDIEKLNGGSGDWLAFTCNLRSKEVTIFNLSVTSRRVDAQGGSLQYKVAKIKTAGLPRFSLGRNSALHAFENAVDKIAGASKPAINLDARQYPEFSAHFWIKGSDPAAVTAFLSGDKIRFLEDAKLQGTLATNANYLVYFEDGVLRSEQDFDSFIARAEALVANLL
Ga0209158_121942313300026333SoilQCSWLQLCFPWEVKKGTGRTYSVNHLSLLVPLLFALSLIVPILFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLEMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAGASKPAVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKI
Ga0209730_100392413300027034Forest SoilMNRFPLLVILMFALSFILPMIFRLFKRGPRDKARVVVEYVLRRGYVLLNPSLAQTLDSSRLEMMRNPALRNPTKASSDIADIEGLDNGTGDWLAFTCDLKSKEVTIFNLSKTSSINSSGASIPYKVAKIRAAGLPRFSLGRNSVLCTVENVVGKMVGAPKTTINVDARQYPEFSAHFWIRGSDPGAVTAYLSHDKIRFLETAKLQGILATNANYLVYFEDGVLLSEKDFDSFIARVETLVTNML
Ga0209178_101375323300027725Agricultural SoilVSHLPFFVMLALGLSVVLPIFFRMLRSPREVKTVVEYAQKRGYALVNPGLAHAVDMSPLQMMKDPAFAHFERASLDISNIAKLDRGTGDWLAFTCRLGSKEATIFNLSVTSRQGSGGAGIHYKVAKIKAAGLPLFALARNSAMHTFENTVNKLAHLSQPEIRVDARLFPEFAAKFWITGPDAAAVTAFLSGDKIRFLEATKLPGTLAANANYLVYFEDGTLTAESDFDLFIAKAEAIAANFL
Ga0209580_1044297513300027842Surface SoilILYRLFRSGPREARSVVEYAQRRGYALVNPALAQAVELSPIEMMKNPALRNLERASLDIANIEKLDGGTGDWLAFTCKLGSKEATIFNLSVSSRRADGRGADLHYRVAKIKAAGLPRFSLARNSAMHTFENAVEKIAHVSQAEIHLDARQYPNFAAHFWIRGSDAAAVASFLSDGKIRFLEAANLGGTLATNANYLVYFEDGKLTTEQDFDAFIA
Ga0209701_1037926813300027862Vadose Zone SoilVNHFPLLVVFLFALSFVLPILFRLFNSGPREKAGVVVRYAQKRGYVLVNPSLAQALDTSRLEMLKNPALRNSIQASSDIADIEKLHDGTGDWLAFTCNLGSKEVTIFNLSVTSRRADTRGGSIPYKVAKIRAAGLPRFSLGRNSALHTLENVVDKIAGTSKPAIDLDAGQYPEFSKHFWIRGSDRAAVTAFLSGDKIRFLETAKLEGILATNANYLVYFEDGVLLT
Ga0209590_1061068913300027882Vadose Zone SoilQALDTSPLEMLKNPALRNSIQASSDIADIERLHDGTGDWLAFTCNLRSKEVTIFNLSVTSRQADTRGGSIPYKVAKIRAAGLPRFSLGRNSALHTFENVVDKIAGASKPAIDLDACQYPGFSPLFWIRGSDRDAVTTFLSGDKIRFLEAAKLEGTLATNVNYLVYFEDGVLLTEQDFDSFIARAETLVANLL
Ga0209380_1018235823300027889SoilSMSYLPLFFALVAVASLVLPTLYRLLRSGPREMRTVVEYAQKRGYALVNPALAQAVDLSPIEMMKNQTFVNLNRASLDIYDIQKLDRGTGDWLAFTCKLGSKEVTIFNLTIPSQRTDGRGSDLHHKVAKIKAAGLPRFSLARNSAMHTFENVVENIAHVPQAEIHLDARQYPNFAKHFWIRGSDAGEVTSFLSDGKIRFLETANLGGTLATNANYLVYFEDGKLQTEQDFDSFIARAEAIVANFL
Ga0209488_1032318513300027903Vadose Zone SoilVNPALAQALDKSYLEMLKDPALRNSNKASLDISDIEKLHEGTGDWLAFTCNLGSKEVTVFNLGVTSRRADNSGGSIRYKVAKIKAAGLPRFSLGRNSALHAFENVVDKIAGASKPGVNLDARQYPEFSAHFWIKGPDPAAVTAFLSGAKIRFLEAANLQGTLATNANYLVYFEDGVLVTEQDFDSFI
Ga0209526_1005034623300028047Forest SoilMNHFSLFVILLFALTLILPMIFRHFRGSPRDKARVVVEYAQRRGYALVNPALAQALDSSRLEMLRNPALRDSINASSDIADIDELDNGTGDWLAFTCDLRAKQVTIFNVSVNSRRINTSGTSIPYKVAKIRAAGLPRFSLGRNSVVHTVENAVSKIVGAANATINVDARQYPEFSAHFWIKGSDPDAVNSFLSPDKIRFLETAKLQGTLATNANYLVYFEDGVLQSEKDFDSFIARLEMLVSKIL
Ga0209526_1032359513300028047Forest SoilHMRSMNHFPFLVVLLLALSFVLPMIFRIVFTRGPRDKAKVVVEYAQKKGYALVNPSLAQALDVSLLEVLKNPAFRNSIKSASDIADIEGLDNGTGEWFAFTCNLGSKEVTIFNLNVTSKGVNTASSGISYKVAKIRAAGLPRFSLGRNSVVNTIENAVDKIAGAPKPTIAVDAHQYPDFATHFWIKGPDSAAVTAFLSTDKIVFLQNAKPKGTLATNTNYLVYFEPGTLVSEQDFDVFIAAVEKLVANLL
Ga0209526_1041417713300028047Forest SoilNPALAQALDKSYLEMVKDPALRNSAKASLDISDIEKLNDGTGDWLAFTCNLRSREVTIFNLSVTSRRTDTRGGSIPYKVAKIKAAGLPRFSLGRNSALHTFENVVDKIAHASKPAINLDARLYPEFSAHFWIRGSDSAAVTSFLSGDKIRFLETAKLGGILATNANYLVYFEDGVLRTEQDFDSFIGRAETLVANLL
Ga0137415_1031629913300028536Vadose Zone SoilLFRLFRSGPREARVVVEYAQRRGYALVNPALAQALDKSYLEMLKDPVLRNSNKASLDISDIEKLHEGTGDWLAFTCNLRSKEVTIFNLSVTSRRADTNGGSIPYKVAKIRAEGLPRFSIGRNSALHTFENVVEKIAGASKPTIDLDARQYPEFSPHFWIRGSDRAEVTAFLSGDKIRFLEAAKLEGILATNANYLVYFEDGVLLTEQDFDSFIARAETLVANLL
Ga0308309_1015433823300028906SoilMNYFPLLVLLLFALSFLLPMIFRLFSHGPRDKARVVVGYAQKRGYVLVNPSLAQALDSSPMEMLRNPTLRNSIKASSDVADIEGLDNGTGEWFAFTCNLRSKEVTIFNLNVTSRRAGTGGGGIPYKVAKIRAAGLPRFSVGRNSALHTIEDVVDKVAGASKPTIGADPRQYPEFSAHFWIRGSDPAAVNAFLSADKIRFLETAKLEGILATNANYLVYFEDGILLREQDFDVFIANVEKLVANIL
Ga0170824_11970619923300031231Forest SoilVEYAQKRGYALVNPGVAQALDKSYLEMLKDPALRSSSRPSMDISDIEKLHDGTGDWLAFMCNLGSKEVTVFNLSVTPRGTNTQGGSIHYKVAKIKAAGLPRFSVGRNSALHTVENVVDKITGAAHPAISVDARQYPEFSAHFWVKGADPSAVIAFLSGDRIRFLESAKLEGILATNANFLVYFEDGVLATEQDFDAFIAKAEKVVANLLSNKLEPVSLST
Ga0307476_1036010323300031715Hardwood Forest SoilSHLPFFVAVVVALSMVGPIVYRIFRGGPREMKGVVEYTQKRGYALVNPALAQAVDLSPIEMMKNQTFVNLNRASLDIYDIQKLDRGTGDWLAFTCKLGSKEVTIFNLTIPSQRTDGRGADLHHKVAKIKAAGLPRFSLARNSAMHTFENVVENIAHVPQAEIHLDARQYPNFARHFWIRGSDAAAVTSFLSDGKIRFLETANLGGTLATNANYLVYFEDGKLTTEQDFDAFIAKTESIVANFL
Ga0307469_1001381523300031720Hardwood Forest SoilMNRFPLLVILMFALSFILPMIFRLFKRGPRDKARVVVEYVLRRGYVLLNPSLAQTLDSSRLEMMRNPALRNPTKASSDIADIEGLDNGTGDWLAFTCDLKSKEVTIFNLSKTSSINSSGASIPYKVAKIRAAGLPRFSLGRNSVLHTVENVVGKMVGAPKTTINVDARQYPEFSAHFWIRGSDPGAVTAYLSHDKIRFLETAKLQGILATNANYLVYFEDGVLLSEKDFDSFIARVETLVTNML
Ga0307477_1004404833300031753Hardwood Forest SoilMSYLPLFFALVAVASLVLPTLYRLLRSGPREMKGVVEYTQKRGYALVNPALAQAVDLSPIEMMKNQAFVNLNRASLDIYDIQKLDRGTGDWLAFTCKLGSKEVTIFNLTIPSQRTDGRGADLHHKVAKIKAAGLPRFSLGRNSAMHTFENVVENIAHVPQAEIHLDARQYPNFAKHFWIRGSDAAAVTSFLSDGKIRFLETANLGGTLATNANYLVYFEDGKLQTEQDFDAFIAKTESIVANFL
Ga0307475_1003228823300031754Hardwood Forest SoilVSHLPFFVAIVVALSMVGLIVYRIFRSGPREMKGVVEYAQKRGYALVNPALAQAVDMSPIEMMKNQTFANLNRASLDIYDIQKLDRGTGDWLAFTCNLRSKEVTIFNLSVTSRRSDGGGGDIHHKVAKIKAPGLPRFSLARNSAMHTFENVVENIAHVPQAEIHLDARLYPDFAKHFWIRGSDAAAVTSFLCDGKIRFLEATKLEGTLATNANYLVYFEDGKLATEQDFDAFIARAEAIVANFL
Ga0307478_1115170313300031823Hardwood Forest SoilSAVRPIETQKDLRRGASVSHLPFFIAVVVALSMVGPIIYRIFRGGPREMKGVVEYTQKRGYALVNPALAQAVDLSPIEMMKNQTFVNLNRASLDIYDIQKLDRGTGDWLAFTCKLGSKEVTIFNLTIPSQRTDGRGADLHHKVAKIKAAGLPRFSLARNSAMHTFENVVENIAHVPQAEIHLDARQYPNFAKHFWIRGSDAAAVTSFLSDGKIRF
Ga0307479_1017761613300031962Hardwood Forest SoilLSHFTSFVALFFALSVILPIVYRLFRSGPREVRGVVEYAQKRGYALVNPALAQAVVMSPIEMMKNQTFANLNRASLDIYDIQKLDRGTGDWLAFTCNLRSKEVTIFNLSVTSRRADGGGGDIHHKVAKIKAAGLPRLSLGKNSALHSFENVVNKLAHVSQPEIQLDARLYPDFAKHFWIKGPDAAAVSSFLSNGKIRF
Ga0307479_1018174623300031962Hardwood Forest SoilVSHFSLLAVLLFALSFILPIIFRLFSSGPREKARDVVKYAQKRSYALVNPSLAQAIDMSRLEMMKNPALRNSIQAASDIADIEKLHDGTGDWLAFTCNLRSKEVTIFNLSVTSRRVDTRSTSIPYKVAKIRAAGLPRFSLGRNSALNTFENAVEKIAGASKPAIDLDARQYPEFSSHFWIRGSDRAAVTAFLSGDKIRFLESAKLDGTLATNANYLVYFEDGTLLSEQDFDSFIARAETLVANLL
Ga0307471_10008278223300032180Hardwood Forest SoilSGPRETKVVVEYAQKRSYALVNPGLAQAVDMSYLEMLKNPAFRNFNKASLDVDDIEKLNGGTGDWLAFTCTLRSKEVTIFNLSVTSRRVDAQGGSLQYKVAKIKTAGLPRFSLGRNSALHTFENAVDKIAGASKPAINLDARQYPEFSAHFWIKGSDPAAVTAFLSGDKIRFLEDAKLQGTLATNANYLVYFEDGVLRSEQDFDSFIARAEALVANLL
Ga0307471_10046401113300032180Hardwood Forest SoilQALDMSRLEMMKNPALRNSIQASSDIADIEKLHDGTGDWLAFTCNLRSKEVTIFNLSVTSRRADARGGSIPYKVAKIRAGGLPRFSLGRNSALHTFENTMEKMAGASNPAIDLDARQFPEFSHDFWIRGSDRAAVTAFLSGDKIRFLEAAKLEGTLATNVNYLVYFEDGVLLTEQEFDSFIARAETIVANLL
Ga0307471_10157136513300032180Hardwood Forest SoilDIEGLGNGTGDWLAFTCDLKSKEVTIFNLSKTSSINSSGASIPYKVAKIRAAGLPRFSLGRNSVLHTVENVVGKMVGAPKTTINVDARQYPEFSAHFWIRGSDPGAVTAYLSHDKIRFLETAKLQGILATNANYLVYFEDGVLLSEKDFDSFIARVETLVTNML
Ga0307471_10207841713300032180Hardwood Forest SoilLPILFRLFRSGPREARAVVEYAQKRGYALVNPGLAEALDKSYLEMLKDPALRSSNRASMDISDIEGLHDGTGDWLAFTCNLGSKEVTIFNLNVSSQRSDTQGGGIHYKVARIKAAGLPRFSLGRNSALHTIENVVDKITGASNRAITLDARQHPEFSAHFWIKGADPAAVTAFLSGDKITFLESAKLAGTLATNANYLVYFEDGVLASEKDFDSFIAKAETVVANLL
Ga0307471_10248424413300032180Hardwood Forest SoilRVAQRSRYAQLCFPLEVGAHKTQKRICGEGIPLSHFTSFVALFFALSVILPIVYRLFRSGPREVRGVVEYAQKRGYALVNPALAQAVGMSPIEMMNNPAFANLNRASLDISDIPKLDGGTGDWLAFTCNLRSKEVTIFNLSVTSRRADGGGGDIHHKVAKIKAAGLPRFSLGKNSALHSFENVVNKLAHVSQPEIQLDARLYPDFAKHFWIKGPDAAA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice