NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F064851



Overview

Basic Information
Family ID F064851
Family Type Metagenome / Metatranscriptome
Number of Sequences 128
Average Sequence Length 344 residues
Representative Sequence MAIQLDDVNTTTTKEIMPGVVDGYFRAGPVIAMAKARFTRKWVGPQIQENFMYKPMKGGAYKKGAPFNVTRHQTRTGLLFTPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAYHHGQALPGDDRSAEINGLEEALNDGTNASWNGNVFPSYGGQTRADVSPALTPPTGLIAANNANVLYRVLRHSYFSAIIGNEAPTIGVTTNRMMGFISENFLPHQIVDTTQPEIAWPGLKFDKATIVMSQYAPSQDGVNDPDLGNYLAAGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRILHGFTA
Number of Associated Samples 108
Number of Associated Scaffolds 128

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 69.53 %
% of genes near scaffold ends (potentially truncated) 40.62 %
% of genes from short scaffolds (< 2000 bps) 37.50 %
Associated GOLD sequencing projects 98
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.56


Most Common Taxonomy
Group Unclassified (74.219 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil (15.625 % of family members)
Environment Ontology (ENVO) Unclassified (32.812 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline) (42.188 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 20.48%, β-sheet: 12.23%, Coil/Unstructured: 67.29%

Predicted 3D Structure

pTM-score: 0.56

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name              % Frequency in 128 Family Scaffolds
PF13392   HNH_3             3.12
PF13385   Laminin_G_3       2.34
PF01464   SLT               2.34
PF06067   DUF932            1.56
PF00958   GMP_synt_C        0.78
PF13527   Acetyltransf_9    0.78
PF12694   cpYpsA            0.78
PF03167   UDG               0.78
PF05050   Methyltransf_21   0.78
PF15780   ASH               0.78
PF12728   HTH_17            0.78
PF01312   Bac_export_2      0.78
PF13884   Peptidase_S74     0.78
PF10162   G8                0.78
PF05869   Dam               0.78
PF05345   He_PIG            0.78
PF08484   Methyltransf_14   0.78
PF06182   ABC2_membrane_6   0.78
PF01507   PAPS_reduct       0.78
PF01391   Collagen          0.78
PF14359   DUF4406           0.78
PF00132   Hexapep           0.78
PF00961   LAGLIDADG_1       0.78
PF00300   His_Phos_1        0.78
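
The frequencies in the table above are percentages of the 128 family scaffolds, and the listed values look consistent with whole scaffold counts (for example, 0.78 % is roughly 1 of 128). The sketch below shows one way to recover those counts. The helper name `percent_to_count` is hypothetical and not part of NMPFamsDB, and rounding to the nearest integer is an assumption about how the page derived its percentages.

```python
def percent_to_count(percent: float, total: int = 128) -> int:
    """Convert a percentage of `total` scaffolds to the nearest whole count.

    Assumption: the page's percentages were derived from integer counts,
    so rounding to the nearest integer recovers the original count.
    """
    return round(percent / 100 * total)


# A few rows copied from the table above: (Pfam ID, % frequency).
rows = [
    ("PF13392", 3.12),
    ("PF13385", 2.34),
    ("PF06067", 1.56),
    ("PF00958", 0.78),
]

counts = {pfam_id: percent_to_count(pct) for pfam_id, pct in rows}

for pfam_id, n in counts.items():
    print(f"{pfam_id}: about {n} of 128 scaffolds")
```

The same conversion applies to the other percentage columns on this page that are expressed relative to the 128 family members.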

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name                                                           Functional Category                          % Frequency in 128 Family Scaffolds
COG0519   GMP synthase, PP-ATPase domain/subunit                         Nucleotide transport and metabolism [F]      0.78
COG0692   Uracil-DNA glycosylase                                         Replication, recombination and repair [L]    0.78
COG1573   Uracil-DNA glycosylase                                         Replication, recombination and repair [L]    0.78
COG3663   G:T/U-mismatch repair DNA glycosylase                          Replication, recombination and repair [L]    0.78
COG3694   ABC-type uncharacterized transport system, permease component  General function prediction only [R]         0.78



Phylogeny

NCBI Taxonomy

Name            Rank   Taxonomy        Distribution
Unclassified    root   N/A             74.22 %
All Organisms   root   All Organisms   25.78 %

Associated Scaffolds


Scaffold   Taxonomy   Length
3300004633|Ga0066395_10133414   Not Available   1239
3300004799|Ga0058863_10077697   Not Available   2829
3300005180|Ga0066685_10000297   All Organisms → cellular organisms → Bacteria   15960
3300005332|Ga0066388_100000137   All Organisms → cellular organisms → Bacteria   26367
3300005336|Ga0070680_100002642   All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → unclassified Mesorhizobium → Mesorhizobium sp.   13283
3300005365|Ga0070688_100290937   Not Available   1177
3300005439|Ga0070711_100003741   All Organisms → cellular organisms → Bacteria   8921
3300005451|Ga0066681_10271467   Not Available   1033
3300005518|Ga0070699_100000124   All Organisms → cellular organisms → Bacteria   71076
3300005549|Ga0070704_100015222   Not Available   4826
3300005713|Ga0066905_100000028   All Organisms → cellular organisms → Bacteria   22624
3300005713|Ga0066905_100224923   Not Available   1422
3300005713|Ga0066905_100255300   Not Available   1350
3300005713|Ga0066905_100411413   Not Available   1100
3300005901|Ga0075274_1007618   Not Available   1302
3300006224|Ga0079037_100180120   Not Available   1879
3300006237|Ga0097621_100193087   Not Available   1764
3300006845|Ga0075421_100000091   All Organisms → cellular organisms → Bacteria   71451
3300006845|Ga0075421_100696614   Not Available   1181
3300006854|Ga0075425_100379818   Not Available   1627
3300006871|Ga0075434_100114317   Not Available   2712
3300006904|Ga0075424_100284555   Not Available   1759
3300007265|Ga0099794_10081651   Not Available   1595
3300009090|Ga0099827_10079561   Not Available   2555
3300009091|Ga0102851_10053407   Not Available   3262
3300009146|Ga0105091_10000004   All Organisms → cellular organisms → Bacteria   69732
3300009157|Ga0105092_10000021   All Organisms → cellular organisms → Bacteria   70775
3300009176|Ga0105242_10000074   All Organisms → cellular organisms → Bacteria   70146
3300009537|Ga0129283_10007095   All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → Thermoleophilales → Thermoleophilaceae → unclassified Thermoleophilaceae → Thermoleophilaceae bacterium   4930
3300009597|Ga0105259_1000133   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   10815
3300009609|Ga0105347_1000121   All Organisms → cellular organisms → Bacteria   37450
3300010047|Ga0126382_10025537   All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria   3156
3300010359|Ga0126376_10001355   All Organisms → cellular organisms → Bacteria   13151
3300010362|Ga0126377_10000042   Not Available   53370
3300010362|Ga0126377_10000289   All Organisms → cellular organisms → Bacteria   29901
3300010373|Ga0134128_10014252   Not Available   9448
3300010376|Ga0126381_100000175   All Organisms → cellular organisms → Bacteria   69818
3300010401|Ga0134121_10131509   Not Available   2121
3300011415|Ga0137325_1017706   Not Available   1411
3300011423|Ga0137436_1010422   Not Available   2287
3300011423|Ga0137436_1012314   Not Available   2102
3300011438|Ga0137451_1012475   Not Available   2451
3300011443|Ga0137457_1000188   All Organisms → cellular organisms → Bacteria   15788
3300011444|Ga0137463_1000055   All Organisms → cellular organisms → Bacteria   49090
3300012161|Ga0137336_1004105   Not Available   2464
3300012168|Ga0137357_1008569   Not Available   1931
3300012171|Ga0137342_1010928   Not Available   1532
3300012206|Ga0137380_10000048   All Organisms → cellular organisms → Bacteria   68723
3300012232|Ga0137435_1028586   Not Available   1610
3300012916|Ga0157310_10126703   Not Available   855
3300012923|Ga0137359_10685561   Not Available   894
3300012971|Ga0126369_10175791   Not Available   2040
3300014656|Ga0180007_10090594   Not Available   2065
3300014656|Ga0180007_10181812   Not Available   1346
3300014861|Ga0180061_1016359   Not Available   1069
3300014872|Ga0180087_1000470   Not Available   5671
3300014878|Ga0180065_1018519   Not Available   1405
3300014880|Ga0180082_1015544   Not Available   1535
3300015200|Ga0173480_10015718   Not Available   3099
3300015255|Ga0180077_1004451   Not Available   2107
3300017997|Ga0184610_1000045   All Organisms → cellular organisms → Bacteria   31247
3300018056|Ga0184623_10027604   Not Available   2545
3300018059|Ga0184615_10001197   All Organisms → cellular organisms → Bacteria   14186
3300018063|Ga0184637_10001212   Not Available   17870
3300018063|Ga0184637_10004236   Not Available   8862
3300018063|Ga0184637_10348993   Not Available   890
3300018077|Ga0184633_10298736   Not Available   821
3300018079|Ga0184627_10127459   Not Available   1347
3300018084|Ga0184629_10001025   Not Available   11757
3300018084|Ga0184629_10001627   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   8954
3300018084|Ga0184629_10003176   Not Available   6133
3300018431|Ga0066655_10434662   Not Available   866
3300018481|Ga0190271_10000098   All Organisms → cellular organisms → Bacteria   46249
3300018481|Ga0190271_10115341   Not Available   2513
3300019249|Ga0184648_1186652   Not Available   2307
3300019886|Ga0193727_1000203   All Organisms → cellular organisms → Bacteria → Acidobacteria   21173
3300020020|Ga0193738_1019048   Not Available   2175
3300020068|Ga0184649_1165932   Not Available   2027
3300021081|Ga0210379_10003199   Not Available   5919
3300021081|Ga0210379_10069547   Not Available   1431
3300021090|Ga0210377_10130296   Not Available   1656
3300021432|Ga0210384_10067133   Not Available   3233
3300021976|Ga0193742_1018485   Not Available   3551
3300024232|Ga0247664_1030432   Not Available   1256
3300024254|Ga0247661_1018421   Not Available   1210
3300025146|Ga0209322_10085985   Not Available   1462
3300025155|Ga0209320_10009441   Not Available   4772
3300025159|Ga0209619_10011815   Not Available   5940
3300025174|Ga0209324_10246870   Not Available   1172
3300025174|Ga0209324_10351932   Not Available   934
3300025318|Ga0209519_10019651   Not Available   3566
3300025318|Ga0209519_10266584   Not Available   999
3300025322|Ga0209641_10162799   Not Available   1690
3300025325|Ga0209341_10001919   All Organisms → cellular organisms → Bacteria   20177
3300025325|Ga0209341_10403670   Not Available   1107
3300025326|Ga0209342_10270093   Not Available   1488
3300025326|Ga0209342_10339164   Not Available   1294
3300025912|Ga0207707_10034417   Not Available   4432
3300025916|Ga0207663_10002511   All Organisms → cellular organisms → Bacteria   8837
3300025917|Ga0207660_10057526   Not Available   2785
3300025934|Ga0207686_10000102   All Organisms → cellular organisms → Bacteria   70150
3300026540|Ga0209376_1047639   Not Available   2497
3300027362|Ga0208320_1002509   Not Available   2086
3300027513|Ga0208685_1001569   Not Available   7344
3300027533|Ga0208185_1008181   Not Available   2616
3300027655|Ga0209388_1000001   All Organisms → cellular organisms → Bacteria   59186
3300027675|Ga0209077_1000138   All Organisms → cellular organisms → Bacteria   18080
3300027722|Ga0209819_10000010   All Organisms → cellular organisms → Bacteria   70794
(restricted) 3300027856|Ga0255054_10054150   Not Available   2006
(restricted) 3300027865|Ga0255052_10220846   Not Available   926
3300027874|Ga0209465_10014987   Not Available   3503
3300027909|Ga0209382_10042495   Not Available   5444
3300027947|Ga0209868_1008064   Not Available   1004
3300028802|Ga0307503_10000200   Not Available   16685
3300028803|Ga0307281_10000002   All Organisms → cellular organisms → Bacteria   66713
3300031199|Ga0307495_10008492   Not Available   1435
3300031720|Ga0307469_10021137   Not Available   3540
3300031740|Ga0307468_100131098   Not Available   1567
3300031740|Ga0307468_100147673   Not Available   1503
3300031772|Ga0315288_10256891   Not Available   1854
3300031858|Ga0310892_10344997   Not Available   954
3300031949|Ga0214473_10014617   Not Available   9199
3300031949|Ga0214473_10058809   Not Available   4499
3300032205|Ga0307472_100037334   Not Available   2893
3300034149|Ga0364929_0001208   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   5828
3300034176|Ga0364931_0063406   Not Available   1136
3300034178|Ga0364934_0007844   Not Available   3731
3300034178|Ga0364934_0096152   Not Available   1111
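
Each scaffold entry above appears to combine two identifiers joined by `|`: a numeric part that matches the Taxon OID column of the Associated Samples table, followed by a scaffold name. The sketch below splits an entry on that basis. The `ScaffoldRef` type and `parse_scaffold_ref` function are hypothetical names introduced for illustration, and the interpretation of the two parts is inferred from this page rather than from an official IMG/M specification.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # numeric part before "|" (assumed to be the sample's Taxon OID)
    scaffold_id: str    # part after "|", e.g. "Ga0066395_10133414"
    restricted: bool    # True if the entry carried a "(restricted)" prefix


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split a scaffold entry such as '3300004633|Ga0066395_10133414'."""
    text = text.strip()
    marker = "(restricted)"
    restricted = text.startswith(marker)
    if restricted:
        text = text[len(marker):].strip()

    taxon_oid, sep, scaffold_id = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unrecognized scaffold entry: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_id, restricted)


# Two entries copied from the table above.
first = parse_scaffold_ref("3300004633|Ga0066395_10133414")
second = parse_scaffold_ref("(restricted) 3300027856|Ga0255054_10054150")
print(first)
print(second)
```

Grouping parsed entries by `taxon_oid` would then show which samples contribute more than one scaffold to the family.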

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat   Taxonomy   Distribution
Soil   Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil   15.62%
Groundwater Sediment   Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment   8.59%
Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil   7.81%
Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil   5.47%
Tropical Forest Soil   Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil   5.47%
Tropical Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil   4.69%
Populus Rhizosphere   Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere   4.69%
Groundwater Sediment   Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment   3.91%
Vadose Zone Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil   3.91%
Soil   Environmental → Terrestrial → Soil → Loam → Unclassified → Soil   3.91%
Freshwater Sediment   Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment   3.12%
Hardwood Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil   3.12%
Corn, Switchgrass And Miscanthus Rhizosphere   Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere   3.12%
Sediment   Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment   3.12%
Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil   2.34%
Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil   2.34%
Corn Rhizosphere   Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere   2.34%
Freshwater Wetlands   Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands   1.56%
Groundwater   Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater   1.56%
Seawater   Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater   1.56%
Terrestrial Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil   1.56%
Soil   Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil   1.56%
Miscanthus Rhizosphere   Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere   1.56%
Sediment   Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment   0.78%
Beach Aquifer Porewater   Environmental → Aquatic → Unclassified → Unclassified → Unclassified → Beach Aquifer Porewater   0.78%
Grasslands Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil   0.78%
Soil   Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil   0.78%
Rice Paddy Soil   Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil   0.78%
Switchgrass Rhizosphere   Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere   0.78%
Groundwater Sand   Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand   0.78%
Host-Associated   Host-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated   0.78%
Miscanthus Rhizosphere   Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere   0.78%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID   Sample Name   Habitat Type
3300004633   Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio   Environmental
3300004799   Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-3 (Metagenome Metatranscriptome)   Host-Associated
3300005180   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134   Environmental
3300005332   Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)   Environmental
3300005336   Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG   Environmental
3300005365   Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3H metaG   Environmental
3300005439   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG   Environmental
3300005451   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130   Environmental
3300005518   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG   Environmental
3300005549   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG   Environmental
3300005713   Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)   Environmental
3300005901   Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_5C_80N_201   Environmental
3300006224   Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 4 metaG   Environmental
3300006237   Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)   Host-Associated
3300006845   Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5   Host-Associated
3300006854   Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4   Host-Associated
3300006871   Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3   Host-Associated
3300006904   Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3   Host-Associated
3300007265   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1   Environmental
3300009090   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG   Environmental
3300009091   Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 3 metaG (Illumina Assembly)   Environmental
3300009146   Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015   Environmental
3300009157   Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015   Environmental
3300009176   Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG   Host-Associated
3300009537   Microbial community of beach aquifer porewater from Cape Shores, Lewes, Delaware, USA - D-2W   Environmental
3300009597   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299   Environmental
3300009609   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890   Environmental
3300010047   Tropical forest soil microbial communities from Panama - MetaG Plot_30   Environmental
3300010359   Tropical forest soil microbial communities from Panama - MetaG Plot_15   Environmental
3300010362   Tropical forest soil microbial communities from Panama - MetaG Plot_22   Environmental
3300010373   Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4   Environmental
3300010376   Tropical forest soil microbial communities from Panama - MetaG Plot_28   Environmental
3300010401   Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1   Environmental
3300011415   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT469_2   Environmental
3300011423   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT119_2   Environmental
3300011438   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT500_2   Environmental
3300011443   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2   Environmental
3300011444   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT800_2   Environmental
3300012161   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT300_2   Environmental
3300012168   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT860_2   Environmental
3300012171   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT466_2   Environmental
3300012206   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG   Environmental
3300012232   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT100_2   Environmental
3300012916   Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S213-509R-2   Environmental
3300012923   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG   Environmental
3300012971   Tropical forest soil microbial communities from Panama - MetaG Plot_1   Environmental
3300014656   Groundwater microbial communities from the Aspo Hard Rock Laboratory (HRL) deep subsurface site, Sweden - MM_PC_MetaG   Environmental
3300014861   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT27_16_10D   Environmental
3300014872   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT790_16_10D   Environmental
3300014878   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200A_16_10D   Environmental
3300014880   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT660_16_10D   Environmental
3300015200   Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1 (version 2)   Environmental
3300015255   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT466_16_10D   Environmental
3300017997   Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex   Environmental
3300018056   Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1   Environmental
3300018059   Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex   Environmental
3300018063   Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2   Environmental
3300018077   Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1   Environmental
3300018079   Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1   Environmental
3300018084   Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1   Environmental
3300018431   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104   Environmental
3300018481   Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 T   Environmental
3300019249   Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32 (Metagenome Metatranscriptome)   Environmental
3300019886   Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2   Environmental
3300020020   Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a1   Environmental
3300020068   Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65 (Metagenome Metatranscriptome)   Environmental
3300021081   Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redo   Environmental
3300021090   Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redo   Environmental
3300021432   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M   Environmental
3300021976   Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c1   Environmental
3300024232   Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK05   Environmental
3300024254   Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK02   Environmental
3300025146   Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 1   Environmental
3300025155   Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 4   Environmental
3300025159   Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 3   Environmental
3300025174   Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 3   Environmental
3300025318   Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1   Environmental
3300025322   Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 (SPAdes)   Environmental
3300025325   Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes)   Environmental
3300025326   Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 (SPAdes)   Environmental
3300025912   Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)   Environmental
3300025916   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)   Environmental
3300025917   Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)   Environmental
3300025934   Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)   Host-Associated
3300026540   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)   Environmental
3300027362   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299 (SPAdes)   Environmental
3300027513   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 (SPAdes)   Environmental
3300027533   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700 (SPAdes)   Environmental
3300027655   Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)   Environmental
3300027675   Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015 (SPAdes)   Environmental
3300027722   Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 (SPAdes)   Environmental
3300027856 (restricted)   Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_23   Environmental
3300027865 (restricted)   Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_21   Environmental
3300027874   Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)   Environmental
3300027909   Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)   Host-Associated
3300027947   Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 (SPAdes)   Environmental
3300028802   Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 17_S   Environmental
3300028803   Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120   Environmental
3300031199   Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_S   Environmental
3300031720   Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515   Environmental
3300031740   Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05   Environmental
3300031772   Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_20   Environmental
3300031858   Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D2   Environmental
3300031949   Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197   Environmental
3300032205   Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05   Environmental
3300034149   Sediment microbial communities from East River floodplain, Colorado, United States - 20_j17   Environmental
3300034176   Sediment microbial communities from East River floodplain, Colorado, United States - 21_j17   Environmental
3300034178   Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17   Environmental

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any feature of the restricted sequences below requires obtaining a license from the corresponding author(s) of the respective datasets.

Columns: Protein ID, Sample Taxon ID, Habitat, Sequence
Ga0066395_1013341413300004633Tropical Forest SoilVNTTVTKEIEPGVVDGYFKAGPFIAMAKNRFNRKWVGPQIQENFMYRPMKGGAYRKGTSFDIIRRQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQPIAGDDRSAEINGLEEALTNGSDPTWTNNVFPSYGGQTRADVVPALTTPTGLIPSNLGGQPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKSTITMSQYCPGADGVNDDELGNYFSPFETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLSGIGS*
Ga0058863_1007769733300004799Host-AssociatedVKTLWSLMRRGFDALVAHPRLVALVVLILGVCTHLLHPAVGSVFLIGAVQLDDVNTITTKDIMPGVADNFFRSGPVTAYMRARFNRKWVGPQIQENYLYKPMKGGAYKKGGTFDITKRQVASGLLFTPRYYDVNVTEYLEDLEVEQTGPHAMFSRIKLDMSTAALSLSAILEIAFFHHGQSLVGDDRSAEVNGAEEALTDGTSQTMFGNIFPSYGGQTRVDVAPALNSPTGLVAASAGPALMFRVLEHSFMSTVIGQERPKMGVTSNRGMGFIAENFSPQQKIDVMDPEINWPGFKFNTATIVASQYAPSQDGVNDPDLGNYLATATQGEPLLWLNPGPQGDDAYMRLYIAQSPKFAFGFTGFKGARDDNMVAGQILFAGNYTFRSPRLSRWLYGFTK*
Ga0066685_1000029723300005180SoilMAIQLDDVNTVTTKEIMPGVVDGYFRAGPFIAMCRRRFTRKWVGPQIQENFLYKPMKGGAYRKGTAFDTTRRQTRTGMLFTPRYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAPRHGQALAGDDRSAELNGLEEALNDGVNASWAGNIFPSYGGQTRADVTPALTPPTGLIPALNTTMFYRILRHSYFSCVIGNEAPGIGITTNRLMGFIAENFLPHQIVDTTQPEISWPGLKFDKATILMSQYWPGQDGTNDPDLGNYSAAGETFTWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGARDDNQVSGQLLFGGNLTVKALRLSRFLHGFTA*
Ga0066388_100000137333300005332Tropical Forest SoilVPIQLDDVNTTVTKEIEPGVVDGYFKAGPLIAMAKARFNRKWIGPQIQENFMYKPMKGGSYKKGATFDITRRQTRTGLLFGPRYYQVTVTEFLEDLEVELAGPRAAFSVIRTDMSQAALTMSAILEIAAFHHGQPIAGDDRSSEINGLEEALTAGGDATWTGNVFPSYGGQTRADVAPALNPPTGLIAANLGGSPISYRVLRHSYFSCIIGNEAPGTGITTNRCMGYIAENFLPHQIIDTTQPEIAWPGLKFDKATIMMSQYCPGADGVNDDDLGNYFANNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGAREDNQVSGQILYAGNLTVKALRLSRVLFGIGG*
Ga0070680_100002642143300005336Corn RhizosphereMAWREMAAEGVLPVKTLWSLMRRGFDALVAHPRLVALVVLILGVCTHLLHPAVGSVFLIGAVQLDDVNTITTKDIMPGVADNFFRSGPVTAYMRARFNRKWVGPQIQENYLYKPMKGGAYKKGGTFDITKRQVASGLLFTPRYYDVNVTEYLEDLEVEQTGPHAMFSRIKLDMSTAALSLSAILEIAFFHHGQSLVGDDRSAEVNGAEEALTDGTSQTMFGNIFPSYGGQTRVDVAPALNSPTGLVAASAGPALMFRVLEHSFMSTVIGQERPKMGVTSNRGMGFIAENFSPQQKIDVMDPEINWPGFKFNTATIVASQYAPSQDGVNDPDLGNYLATATQGEPLLWLNPGPQGDDAYMRLYIAQSPKFAFGFTGFKGARDDNMVAGQILFAGNYTFRSPRLSRWLYGFTK*
Ga0070688_10029093723300005365Switchgrass RhizosphereVALQLDDVNTTVTKTIMPGVVDGYFKAGPMIAMMKSRFNRKWIGPQIQDNIMFRPMIGGSYQKGASFNIVRQQTRTGLLFSPRYYEVNVTEFLEDLEVEMAGPTAAFDVLKTDMAQASLTMSAILEIAFWHHGQALAGDDRSMELNGIEEALNDGTNTSWAGNLFPSYGGQTRADVNGALTPPTGLISANVNGPISYRILRHSYLSCIIGNERPTIGVTTNRCMGFIAENFLPHQIVDTTQPEIGFPGLKFDQATIIMSQYAPGRDGVNDPFLGNYLNSTGEIFAWLNFGPSGDQAYGRLFIAQSSKFAFGFTGFKGARDDNQVSGQILFGGQIV
Ga0070711_100003741153300005439Corn, Switchgrass And Miscanthus RhizosphereMADIQFTEVNTVATKLINPGVVDNYFKAGPLMAYLKTRFNRKWTGPQIQENYEYGALRGGAYKKGATFNITPRQTRSGIIFDPRYYQVSITEFLEDLEVEMAGPTAVFSKLKADMANAALTLSAILEIALFHNGQNVGGADRTAELNGLEEALTDGTSATWTGAVFPTYGGQTRTAVAPALNSPTGLIPASVAQTSFRMLEHSFQSCTIGAERPKLGITTNREMGYIAETFTPQQKIDVLDPEINWPGLKFNTATLVESQYCPGQDGVNDPFLGNYSNTSETFWWLNPGPQGDDAYLRLYIAQSPKFAFGFTGFKGARDDNQVSGQILFGGNFTVRAPRLSRGLYGMTN*
Ga0066681_1027146713300005451SoilMFKPMKGGAYKKGSTFDIARRQTRTGLLFTPRYYEVNVTEFLEDLEVEMAGPRTAFSVIRTDMQQAALTLSAILEIAFFHHGQALAGDDRSAEINGIEEAFNDNLTASWAGNLFPSYGGQTRADVAPALTPPTGLITPSNATIAYRVLRHSYFSCIIGNEAPTVGITTNRLMGFISENFLPHQVVDTTQPEINWPGLKFDKATIVMSQYAPGQDGTNDADLGNYNATGETLAWLNFGPTGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVAGQI
Ga0070699_100000124153300005518Corn, Switchgrass And Miscanthus RhizosphereMHLLKRVDAFVRANPLFTSLLLMLLALLTQHPGTGLLLLAGPILLDDVNTVTTKTIMPGVVDNFFKAGPLIAYLKSRFTRRWIGPQIQENYMYAPMKGGAYKKGATFNILKRQTRSGMLFTPRYYEVNVTEFLEDIEVEQVGPNAVFNVVKTDMAEAALTMSAILEIAAFHHGQALAGDDRSAEINGLEEILTDGTNVTWTGSTFTSYGGQLRVSVSPALNSPVGLITPSVAGMSFRALQHSYLSTCIGNEHPAIGLTTNRGMGYIAETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLASAGETFWWVNPGPQGDDAYIRLYIAQSPKFAFGFTGFKGARDDNMVSGQILFAGNLTGRAPRLSRALFGIAK*
Ga0070704_10001522223300005549Corn, Switchgrass And Miscanthus RhizosphereMPILLDDVNTVTTKTIMPGVVDNFFKAGPLIAYLKSRFTRRWIGPQIQENYMYAPMKGGAYKKGATFNILKRQTRSGMLFTPRYYEVNVTEFLEDIEVEQVGPNAVFNVVKTDMAEAALTMSAILEIAAFHHGQALAGDDRSAEINGLEEILTDGVNVTWTGSTFTSYGGQLRVSVTPALNSPVGLITPSVAGMSFRALQHSYLSTCIGNEHPAIGLTTNRGMGYIAETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLASAGETFWWLNPGPQGDDAYLRLYIAQSPKFAFGFTGFKGARDDNMVAGQILFAGNMTGRAPRLSRALFGIAK*
Ga0066905_100000028153300005713Tropical Forest SoilVAIQLDDVNTTVTKEIEPGVVDGYFKAGPFIAMAKNRFNRKWIGPQIQENFMYKPMKGGAYRKGSSFDITRRQTRTGLLFGPRYYQVGVTEFLEDLEVEMAGPRAAFSVIRTDMNQASLTMSAILEIAAFHHGQAIPGDDRSAEINGLEEAFNNGTDPSWTNLVFPSYGGQTRADVVPALTPPTGLIAGNLAGQPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGFIAENFLPHQIVDTTQPEINWPGMKFDKATITMSQYCPGADGVNDEDLGNYFSPNETFWWLNFGPPGDDAYIRLYIAQSRKFAFGFTGFKGSRQDNQVAGQILFAGNLTVKALRLSRYLSGIGS*
Ga0066905_10022492313300005713Tropical Forest SoilVPIQLDDVNTTVTKEIEPGVVDGYFKAGPFIAMAKNRFNRKWIGPQIQENFMYKPMKGGSYRKGTSFDITRRQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSTEINGLEEAFNDGTTASWAGNIFPSYGGQTRVDVTPALDPPKGLIAQDLGGQPISYRVLRHSYFSCIIGNEAPTVGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDVDLGSYYAPNETFWWLNFGPQGDDAYLRLYIAQSRKFAFGFTGFKGSRQDNQVAGQILFAGNLTVKALRLSRVISGIGS*
Ga0066905_10025530013300005713Tropical Forest SoilVAIQLDDVNTVVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYRKGTTFDITRRQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSVLRTDMAQASLTMSAILEIAAFHHGQAIAGDDRSAEINGLEEALNDGVTPSWTNTLFPSYGGQTRPDVAPALTPPSGLVSANLGGSPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDEDLGNYFSPNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLSGISG
Ga0066905_10041141313300005713Tropical Forest SoilPGVVDGYFKAGPFIAMAKNRFNRKWIGPQIQENFMYKPMKGGSYRKGASFDILRRQTRTGLLFGPRYYQVGVTEFLEDIEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSAEINGLEEALNDGSNVSWTNSLFPSYGGQTRADVTPALTPPTGLIAANLAGSPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGFIAENFLPHQIVDTTQPEINWPGMKFDKATITMSQYCPGADGVNDVDLGNYFATNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLSGIGS*
Ga0075274_100761823300005901Rice Paddy SoilMAIQLDDVNTVTTKEIMPGVVDGYFRAGPFVAMAKARFTRKWIGPQIQENFMYKPMKGGAYKKGAAFNVVRHQTRTGLLFTPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAFHHGQALPGDDRSAEINGLEEALNDGVNAGMFGNIFPSYGGQTRADVAPALTTPTGLIPAANTNVLYRTLRHSYFSCILGNEAPTVGLTTNRMMGFISENFLPHQIIDTTQPEINWPGLKFDKATILMSQYAPGQDGVNDPDLGNYNAAGETFAWLNWGPQGDDAYIRLYIAQSSKFA
Ga0079037_10018012013300006224Freshwater WetlandsENLMYKPMKGGAYKKGATFDITRQQTRTGILFAPKYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMAQAALTMSALLEIAFFHHGQALVGDDRSAEINGIEEALNDGTNASWAGNLFPSYGGQTRADIAPAATPPTGLVAANVNGPISYRILRHSYLSCQIGNEAPSLGITTNRGMGFIAENFLPHQLVDTTNPEINWPGLKFDRATIVMSQYAPGRDGTNDPFLGNYYDADGETFCWLNFGPQGDDAYIRLYIAQSPKFAFGFTGFKGARDDNQVSGQILFGGNLTVKAIRLSRILHGITG*
Ga0097621_10019308723300006237Miscanthus RhizosphereVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYRKGTSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEINGLEEAFNNGTSASWAGNIFPSYGGQTRTDVSPALDPPSGLISQDLGGAPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDVDLGNYYAANETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVISGIGS*
Ga0075421_100000091113300006845Populus RhizosphereVAIQLDDVNTVVTKEIEPGVVDGYFKAGPFIAMAKARFSRKWIGPQIQENFMYKPMRGGAYRKGTTFDITKRQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSVLRTDMAQASLTMSAILEIAAFHHGQAITGDDRSAEINGLEEAFNDGSTASWTGNVFPSYGGQTRADVVPALTPPTGLISANLAGAPISYRVLRHSYFSCIIGNEAPSIGITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDEDLGNYYAPYETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLSGIGS*
Ga0075421_10069661413300006845Populus RhizosphereTRRWVGPQIQENYMYAPMRGGAYKKGATFNITKRQTRSGMLFTPRYYEVNITEYLEDIEVEMTGPNAVFSIVKTDAAEAALTMSAILEIAIFRNGQNLGVGLDRSAEINGLEEALTNGTDVTWTGATFTAYGGQTRVDVAPALNSPTGLITPSMGSMSFRGLQHSYLSTCIGAEHPQIGVTTNRGLGYISETFLPHQIVDVVDPEINWPGLKFNQARIVVSQYAPGADGVNDPDLGNYLASAGETFWWLNPGPQGDDAYLRLYIAQSAKFAFGFTGFKGARDDNMVAGQILFGGNFTCRAPRLSRSLYGIAK*
Ga0075425_10037981823300006854Populus RhizosphereVAIQLDPVNTVATKRILPGVVDNFFKAGPLIAFLKTRFNRKWAGPQIQENFLYGVQGKGGAYAKGGNFNTVQQQSMTGMLFTPRYYYVNVTEFLEDIEVEMAGPTAILNRVKVDLANAALQMSSMLEIALYRNGQNVGGVDRTLELNGLEEALTNGTDTTFSGATFTSYGGQSRISVTPALNSPTGLIPASNTTTSFRLLEHSYQSCVIGNERPTMGLTSNRGMGFIAETFSPQQRVDAIQPEINWPGMKFNQATIMQSQYFPSQDGINDPNLGNYSASNESLL
Ga0075434_10011431713300006871Populus RhizosphereMHLLKRVDAFVRANPLFTSLLLALLAYWHGSVLPGLLLLGPILLDDVNTVTTKTIMPGVVDNFFKAGPLIAYLKSRFTRRWIGPQIQENYMYAPMKGGAYKKGATFNILKRQTRSGMLFTPRYYEVNVTEFLEDIEVEQVGPNAVFNVVKTDMAEAALTMSAILEIAAFHHGQALAGDDRSAEINGLEEILTDGSNVTWTGSTFTSYGGQLRVSVTPALNSPVGLIIPSVAGMSFRALQHSYLSTCIGNEHPAIGLTTNRGMGYIAETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLASAGETFWWLNPGPQGDDAYIRLYIAQSPKFAFGFTGFKGARDDNMVAGQILFAGNLTGRAP
Ga0075424_10028455513300006904Populus RhizosphereMHLLKRVDAFVRANPLFTSLLLALLAYWHGSVLPGLLLLGPILLDDVNTVTTKTIMPGVVDNFFKAGPLIAYLKSRFTRRWIGPQIQENYMYAPMKGGAYKKGATFNILKRQTRSGMLFTPRYYEVNVTEFLEDIEVEQVGPNAVFNVVKTDMAEAALTMSAILEIAAFHHGQALAGDDRSAEINGLEEILTDGSNVTWTGSTFTSYGGQLRVSVTPALNSPVGLITPSVAGMSFRALQHSYLSTCIGNEHPAIGLTTNRGMGYIAETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLASAGETFWWLNPGPQG
Ga0099794_1008165123300007265Vadose Zone SoilVAIQLDDVNTVVTKEIAPGVVDGYFKGGPCVAMSKARFTRKWIGPQIQENYMFRPMKGGSYKKGSTFDVSRRQTRSGMLFTPRYYEVNITEFLEDLEIEMAGPRAAFSVIRTDMQQAALTMSAILEIAFFRHGQALGGDDRSAELNGIEEAFNDGTNASWAGNLFTSYGGQLRSDVSPALTPPTGLVAASNTTMSYRVLRHSYFSCIIGNEAPTVGVTTNRLMGFIAENFLPHQIVDTKQPEIDWPGLKFDKATIVMSQYAPGQDGVNDADLGNYNAAGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVAGQVLFGGNLTFKALRPSRILHGFTA*
Ga0099827_1007956123300009090Vadose Zone SoilMRLGAALTWLRDHPRLTSVLLTLLATWIHPTGLLALPMVLAIQLDDVNTVTTKEIMPGVVDGYFRGGPCIAMCKARFTRKWIGPQIQENFMFKPMQGGAYKKGATFDVKRRQTRTGLLFTPRYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAFFRHGQALAGDDRSAEINGIEEAFNDNLTASWAGNLFPSYGGQTRVDVAPALTPPTGLITPSNSTMSYRVLRHSYFSCVIGNEAPGVGVTTNRLMGFIAENFLPHQIIDTTQPEINWPGLKFDKATIMMSQYAPGQDGVNDPDLGNYNAAGETFAWLNFGPQGDDAYLRLYIAQSSKFAFGFTGFKGAREDNQVSGQLLFGGNLTVKAIRLSRILHNFTA*
Ga0102851_1005340723300009091Freshwater WetlandsMGLALLAAAVGVALAPDQYWPLLLLGVIVLDDVNTVTTKEIIPGVVDGYFKAGPLIAMCKSRFTRKWVGPQVQENLMYKPMKGGAYKKGATFDITRQQTRTGILFAPKYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMAQAALTMSALLEIAFFHHGQALVGDDRSAEINGIEEALNDGTNASWAGNLFPSYGGQTRADIAPAATPPTGLVAANVNGPISYRILRHSYLSCQIGNEAPSLGITTNRGMGFIAENFLPHQLVDTTNPEINWPGLKFDRATIVMSQYAPGRDGTNDPFLGNYYDADGETFCWLNFGPQGDDAYIRLYIAQSPKFAFGFTGFKGARDDNQVSGQILFGGNLTVKAIRLSRILHGITG*
Ga0105091_10000004863300009146Freshwater SedimentVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYKKGSSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEMNGLEEALNNGTTASWAGNIFPSYGGQTRVDVSPALDPPSGLIAQDLGGAPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDADLGSYYAPNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVISGIGS*
Ga0105092_10000021673300009157Freshwater SedimentVLPIDLPDLSAIGGLLSLLSLLAFAAGMAIQLDEVNTVATKRIMPGVVDNFFKAGPLMAYLKARFNRKWTGPLIQENYLYKPMRGGAYKKGATFNVTKRQTYSGLLFTPRYYEVNVTEFLEDLEVEMAGPTAMFSTLKVDLGNAALTLSSILEIALFHHGQNVGGNDRTAEINGLEEALTDGTNTTWTGATFPTYGGQLRASVAPALNSPVGLVPVSNTTTSFRVLEHSFMSCVIGAERPKLGLTTNREMGFIAETFSPQQKIDVLDPEINWPGMKFNQATIVVSQYCPGQDGVDDPDLGDYNADSETFWWLNPGPSGDDAYLRLFIAQSAKFAFGFTGFKGARDDNQVSGQILFGGNFTCRSPRLSRGMYGFTK*
Ga0105242_1000007473300009176Miscanthus RhizosphereVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKARFSRKWIGPQIQENFMYKPMKGGAYRKGSSFDITRRQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEINGMEEALCRTGDTSWTGAAFPSYGGQTRVDVSPALDPPAGLIPSDLGGAPISYRTLRHSYFSCILGNEAPTIAITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDDDLGNYYAPFETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLKGIGS*
Ga0129283_1000709513300009537Beach Aquifer PorewaterLDDINTVTTKEIQPGVVDNYFKAGPIMAMCRSRFTRRWIGPQIQENYLFRAMKGGAYQKGGSFDLTKPQTRSGLLFSPRYYQTNVTEFLEDIEVEMVGPRAAFNVIRTDMQQAALTLSAILEIACIRHGQNLAGDNRSIELNGLAEALSDGTNASWDGNTFPVYGGQTRADVSPALNSPAGFQAANLNGPISYRALRHSYFSCVLGNEAPTIGVTTNRCMGFIAENFLPHQIVDTTQPEINWPGLKFDRATIVMSQYMPGQDGVNDADLGNYNNASETLAWFNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNLTVRTPRLSRILFGITA*
Ga0105259_1000133103300009597SoilVALLLDDVNTTVTKDIQPGVVDGYFKSGPLIAMCRRRFTRKWIGPQIQENFMFKPMKGGAYAKGASFDVTRRQTRAGMLFGPKYYQVNVTEFLEDLEVEMAGPNAAFSVIRTDMAQAALTMSAILEIAAFHHGQNLAGDNRSLELNGLEEALNDGTNASWAGNLFPSYGGQTRADVAPALTPPTGLVASPNIGAAPYLGSMSYRVLRHSYLSCCIGNQAPDTGLTTRRGMGFISENFLPHQVIDTTQPEIAWPGIKFDRATVMMSDYCPGQDGVNDADLGNYNAAGETFWWLNFGPTGDDAYIRLYIAQSAKFAFGFTGFKGAREDNQVSGQILFGGNGPIVKALRLSRVLHGFTS*
Ga0105347_100012133300009609SoilMRHLARAGDFIRANPIFTTLCMALLLWASGVAPAAGLPLLMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRPSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDTDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK*
Ga0126382_1002553733300010047Tropical Forest SoilVAIQLDDVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYRKGASFDITRRQTRTGLLFGPRYYQVGVTEFLEDIEVEMAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQAIAGDDRSMEINGLEEAFGSAGTASWANNLFPSYGGQTRADVSPALDAPTGLISANLAGQPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDEDLGNYYAPYETFWWLNFGPPGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLSGIGS*
Ga0126376_1000135593300010359Tropical Forest SoilVPIQLDDVNTTVTKEIEPGVVDGYFKAGPLIAMAKARFNRKWIGPQIQENFMYKPMKGGSYKKGATFDITRRQTRTGLLFGPRYYQVTVTEFLEDLEVEMAGPRAAFSVIRTDMSQAALTMSAILEIAAFHHGQPIAGDDRSSEINGLEEALTAGGDATWTGNVFPSYGGQTRADVAPALNPPTGLIAANLGGSPISYRVLRHSYFSCIIGNEAPGTGITTNRCMGYIAENFLPHQIIDTTQPEIAWPGLKFDKATIMMSQYCPGADGVNDDDLGNYFANNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGAREDNQVSGQILYAGNLTVKALRLSRVLFGIGG*
Ga0126377_10000042693300010362Tropical Forest SoilVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYKKGSSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMELNGLEEALNNGTSASWAGNIFPSYGGQTRVDVSPALDPPSGLIAQDLGGAPISYRVLRHSYFSAIIGNEAPTIGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDADLGGYFAPNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVISGIGS*
Ga0126377_10000289493300010362Tropical Forest SoilVPIQLDDVNTTVTKEIEPGVVDGYFKAGPFIAMAKGRFNRKWIGPQIQENFMYKPMKGGSYKKGTTFDITRRQTRTGLLFGPRYYQVTVSEFLEDLEVEMAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQPVVGDDRSMEINGLEEALTNGVDPTWTNNVFPSYGGQTRVDVAPALTAPTGLVAANLNGAPISYRVLRHSYYSCIIGNEAPGTAITTNRCMGFISENFLPHQIIDTTQPEIAWPGMKFDKATIMMSQYCPGADGVNDDDLGNYYAPNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGAREDNQVAGQILYAGNLTVKALRLSRVLHGIGS*
Ga0134128_10014252123300010373Terrestrial SoilMADIQFTEVNTVATKRINRGIVDNYFKAGPLMAYMKTRFNQKWTGPQIQENYEYGALRGGAYKKGATFNITPRQTRSGIIFDPRYYQVSITEFLEDLEVEMAGPTAVFSKLKADMANAALTLSAILEIALFHNGQNVGGVDRTAELNGLEEALTDCTSETWSGAVFPTYGGQTRSAVAPALNSPTGLIPASVGQTSFRMLEHSFQSCTIGAERPKLGITTNREMGYIAETFTPQQKIDVLDPEINWPGLKFNTATLVESQYCPGQDGVNDPFLGNYSNTSETFWWLNPGPQGDDAYLRLYIAQSPKFAFGFTGFKGARDDNQVSGQILFGGNFTVRAPRLSRGLYGMTN*
Ga0126381_100000175893300010376Tropical Forest SoilVAIQLDDVNTVVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMRGGAYKKGTTFDIIKYQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSILRTDMAQASLTMSAILEIAAFHHGQALSGDDRTAEINGLEEALNDGAANSWAGNVFPSYGGQTRVDVAPALTPPTGLVAGNLAGAPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGFIAENFLPHQIVDTTQPEINWPGMKFDKATITMSQYCPGADGVNDAFLGNYYAPQETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLSGIGG*
Ga0134121_1013150923300010401Terrestrial SoilVAIALSELNTVMTATIMPGVVDGYFKAGPFIAMCKARFTRKWQGHTIQENFMYKPMKGGAYAKGAAFDITRHRTRTGIQFNPRYYQVNVTEFLEDLEVEMAGPNAAFSVLRTDMAQAALTISAILEIAAFRHGQNLAGDDRSLEISGVAEALNDGTNAGWDGNTFPSYGGQTRTDVAPALTPPTGLVPANIGAAPYNGAISYRVLRHSYLSSTIGNEAPGVGLTTKRAMGFIAENFLPHQIVDTTQPEIAWPGLKFDRATIMASDYCPGQDGVNDTFLGNYNAAGETFFWLNFGPQGDDAYIRLYIAASSKFAFGFTGFKGARDDNQIAGQILFGGNLIFRNIRLSRAHHGITA*
Ga0137325_101770613300011415SoilVFGLTCLRAIVDLANAHQRLVSVIAALFGAYWHPDLALSVVLIGAIQLDDVNTTVTKEIMPGVVDGYFKAGPLIAMMKARFTRKWIGPQIQENFMYKPMKGGAYKKGASFDVTRRQTRTGLLFTPRYYEVNVTEFLEDIEVEMAGPRAAFSVIRTDMQQAALTISAILEIATFHHGQALPGDDRSAELNGLEEALNDGTNASIFGNTFPSYGGQTRVDVRPALTPPTGLVAANVAGPMSYRALRHSYFSTIIGNERPTVGLTTNRNMGFIAENFLPHQIIDTTQPEINWPGMKFDQATIVMSQYAPGQDGENDPDLGNYNAATESFYWLNFGPQGDDAFIRLYIAQSSKF
Ga0137436_101042223300011423SoilMRHLARAGDFIRANPIFVTLCMALLLWASGVAPAAGLPLLMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRPSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDADGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK*
Ga0137436_101231423300011423SoilMALLLWASGVAPAAGLPLLMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRLSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQSKIVVSQYAPGADGVNDPDLGNYLDTDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNISK*
Ga0137451_101247523300011438SoilMAIQLDDVNTVTTKEIMPGVVDGYFRAGPFIAMAKARFTRKWIGPQIQENFMYKPMKGGAYKKGAAFNVVRHQTRTGLLFTPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAFHHGQSLPGDDRSAEINGIEEAYNDGTNASWAGNVFPSYGGQTRVDVAPALTPPTGLIAAQNANVLYRVLRHSYFSCILGNEAPTIGLTTNRMMGFISENFLPHQIVDTTQPEINWPGLKFDKATVLMSQYAPSQDGVNDADLGNYLAAGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRILHGFTA*
Ga0137457_1000188193300011443SoilVALLLDDVNTTVTKDIQPGVVDGYFKSGPLIAMCRRRFTRKWIGPQIQENFMFKPMKGGAYAKGGSFDVTRRQTRAGMLFGPKYYQVNVTEFLEDLEVEMAGPNAAFSVIRTDMAQAALTMSAILEIAAFHHGQNLAGDNRSLELNGLEEALNDGTNASWAGNLFPSYGGQTRADVAPALTPPTGLVASPNIGAAPYLGSMSYRVLRHSYLSCCIGNQAPDTGLTTRRGMGFISENFLPHQVIDTTQPEIAWPGIKFDRATVMMSDYCPGQDGVNDADLGNYNAAGETFWWLNFGPTGDDAYIRLYIAQSAKFAFGFTGFKGAREDNQVSGQILFGGNGPIVKALRLSRVLHGFTS*
Ga0137463_1000055323300011444SoilVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYKKGSSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEMNGLEEALNNGTTASWAGNIFPSYGGQTRTDVSPALDPPAGLIAQDLGGAPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGYIAENFLPHQIVDTTQPEINWPGMKFDKATITMSQYCPGADGVNDVDLGSYYAPNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVISGIGS*
Ga0137336_100410543300012161SoilGIMRQLARALDFVRANPIFTTLCMALLLWASGGSLAYGMPLLMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRLSVGGALNSPVGLIAPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDADGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK*
Ga0137357_100856923300012168SoilMVDLANAHKRIVTVIAALFGAYWHPDLALSVMLIGAIQLDDVNTTVTKEIMPGVVDGYFKAGPLIAMMKARFTRKWIGPQIQENFMYKPMKGGAYKKGASFDVTRRQTRTGLLFTPRYYEVNVTEFLEDIEVEMAGPRAAFSVIRTDMQQAALTISAILEIAAFHHGQALPGDDRSAELNGLEEALNDGVNPSIFGNTFPSYGGQTRVDVRPALTPPTGLVAANVAGPMSYRALRHSYFSTIIGNERPTVGLTTNRNMGFIAENFLPHQIIDTTQPEINWPGMKFDQATIVMSQYAPGQDGENDPDLGNYNASTESFYWLNFGPQGDDAFIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNMTVRALRLSRAMFGFTS
Ga0137342_101092813300012171SoilVAIQLDDVNTTVTKEIMPGVVDGYFRAGPFIAMAKARFTRKWIGPQIQENFMYKPMKGGAYKKGASFDVTRRQTRTGLLFTPRYYEVNVTEFLEDIEVEMAGPRAAFSVIRTDMQQAALTISAILEIAAFHHGQALPGDDRSAELNGIEEALNDGTNPSIFGNTFPSYGGQTRTDVTPALTPPTGLVPANVAGPMSYRALRHSYFSSIIGNERPTVGITTNRNMGFIAENFLPHQIVDTTQPEINWPGMKFDQATIVMSQYAPGVDGTNDPDLGNYSAASESFYWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNMTFRALRLSRAMFGFTS*
Ga0137380_10000048223300012206Vadose Zone SoilMQSIRWLTATTRRVCLWLGAHPRLLAVVVALVALYVRPESLPFLPIALAIQLDDVNTVTTKEIMPGVVDGYFKGGPLIAMMKARFTRKWIGPQIQENYLFKPMKGSAYKKGATFDVARRQTRSGLLFTPRYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAYHHGQALAGDDRSAEINGIEEGFNDGVNASYAGNIFTSYGGQLRSDVSPALTPPTGLIAASNATMSYRVLRHSYFSTIIGNESPTVGLTTNRLMGFIAENFLPHQIVDTTQPEINWPGLKFDKATIMMSQYAPGQDGTNDADLGNYNASGETFAWLNFGPQGDDAYIRLYISQSSKFAFGFTGFKGAREDNQVSGQLLFGGNLTLKALRLSRILHNFTA*
Ga0137435_102858613300012232SoilWVGPQIQENFMYKPMKGGAYKKGATFDVSRHQTRTGMLFTPRYYQVNVTEFLEDLEVEMVGPRAAFNVIRTDMQQASLTMSAILEIASMQHGQTLVGDDRSAELNGFAEALNDGVNASWDGNVYPSYGGQTRADVAPALTTPTGLVAANVAGPISYRILRHSYFSCILGNEAPSVGITTNRCMGFIAENFLPHQIVDTTQPEINWPGLKFDKATILMSQYAPGADGVNDTFLGNYNAAAETFFWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRGLRGITS*
Ga0157310_1012670313300012916SoilIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYKKGSSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEMNGLEEALNNGTTASWAGNIFPSYGGQTRPDVSPALDPPAGLIPQDLGGQPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDPDLGSYYAPNETFWW
Ga0137359_1068556113300012923Vadose Zone SoilVTTKEIMPGVVDGYFRAGPFIAMCKARFTRKWVGPQIQENFMFKPMKGGAYKKGATFDVTRRQTRTGILFTPRYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAASRHGQALVGDDRSAELNGFEEAYNDGITASYAGNVFTSYGGQLRSDVTPALTPPTGLVVPSNTTISYRVLRHSYFSTIIGNEAPGTGLTTNRLMGFIAENFLPHQIVDTTQPEINWPGLKFDKATIMMSQYMPGQDGVNDADLGNYSSTGETFAWLNFGPQGDDAYIRLYIAQSSKFA
Ga0126369_1017579123300012971Tropical Forest SoilVAIQLDDVNTTVTKEIEPGVVDGYFKAGPFIAMAKNRFNRKWVGPQIQENFMYRPMKGGAYRKGTSFDIIRRQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQPIAGDDRSAEINGLEEALTNGSDPTWTNNVFPSYGGQTRADVVPALTTPTGLIPSNLGGQPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKSTITMSQYCPGADGVNDDELGNYFSPFETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLSGIGS*
Ga0180007_1009059413300014656GroundwaterKQQTRTGLLFTPRYYDANVTEFLEDLEVEMAGPHAVFSILKVDLQTAAMTLSAILEIAAYRHGQAYAGDDRSGEINGMEEALTNGVDQTFSGKTFLSYGGQTRVDVSPALNSPTGVIPASIVGGGINFKVLEHSQLSCSIGAERPKIGITTLSGLGFIAESFSPQQKVDVVDPEINWPGLKFNTATIVAGNYVPGAQGVNDPDIGDYSADHETFWWLNPGPQGDDAYIRLYIAQSPKFAFGFTGFKGARDDNMVSGQILFAGNLTFRAPRLSRGLYNISK*
Ga0180007_1018181213300014656GroundwaterVNVTEFLEDLEVEMAGPTLAFSIIKTDMAEAALTMSAIQEIAFYHHGQNLGVGNDRSLEINGLEEMLTTTAGVTWAGNTFPSYGGQTRVDVAPALNSPVGLIPANLAALGGAIAYRPLLHSYLSCVVGGEHPQIGVTTNRGFGFIAETFTPQQQIDVVQPEINWPGFKFMSATIVASQYCPGADGVNDANLGNYNNATETFWWLNWGPQGDDAFIRFYIAQSSKFAYGFTGFKGARGDNQVSGQLLFGGNLTGRAPRYSRVLYGFTR*
Ga0180061_101635913300014861SoilTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGTNVTWTGATFTSYGGQDRLSVGGALNSPVGLIAPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDVDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK*
Ga0180087_100047083300014872SoilMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRLSVGGALNSPVGLIAPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDADGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK*
Ga0180065_101851923300014878SoilYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRLSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDTDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK*
Ga0180082_101554423300014880SoilMRHLARAGDFIRANPIFTTLCMALLLWASGVAPAAGLPLLMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRPSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDAD
Ga0173480_1001571823300015200SoilVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYKKGSSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFQHGQALAGDDRSMEMNGLEEALNNGTTASWAGNIFPSYGGQTRPDVSPALDPPAGLIPQDLGGQPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDPDLGSYYAPNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVISGIGS*
Ga0180077_100445123300015255SoilMYKPMKGGAYKKGASFDVTRRQTRTGLLFTPRYYEVNVTEFLEDIEVEMAGPRAAFSVIRTDMQQAALTISAILEIAAFHHGQALPGDDRSAELNGIEEALNDGTNPSIFGNTFPSYGGQTRTDVTPALTPPTGLVPANVAGPMSYRALRHSYFSSIIGNERPTVGITTNRNMGFIAENFLPHQIVDTTQPEINWPGMKFDQATIVMSQYAPGVDGTNDPDLGNYSAASESFYWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNMTFRALRLSRAMFGFTS*
Ga0184610_1000045333300017997Groundwater SedimentVTLWQRFHAAIGWIVAHPRLTGLALWALMAWIRPESLLLAPMILAIQLDDVNTTVAKEISPGVVDGYFKAGPFIAMAKARFTRKWIGPQIQENFMYKPMKGGAYKKGASFDVTRRQTRTGLLFNPRYYQVNVTEFLEDIEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAFHHGQALPGDDRSAELNGIEEALNDGTNVSIFGNAFPSYGGQTRVDVAPALTPPTGLVAANVAGPMSYRALRHSYFSSIIGNERPTVGITTNRNMGFISENFLPHQVVDTTQPEINWPGMKFDQATIVMSQYAPGQDGTNDPDIGNYNASTESFYWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNLAWRALRLSRAMFGFVQ
Ga0184623_1002760433300018056Groundwater SedimentMRHLARAGDFIRANPIFVTLCMALLLWASGVAPAAGLPLLMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRLSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDTDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK
Ga0184615_10001197113300018059Groundwater SedimentVFVLTCLKQLADLANAHKRLVAVIAALFGSYWHPEFASIVLIGAIQLDDVNTTVTKEIMPGVVDGYFKAGPLIAMMKARFTRKWIGPQIQENFMYKPMKGGAYKKGASFDVTRRQTRTGLLFTPRYYEVNVTEFLEDIEVEMAGPRAAFSVIRTDMQQAALTISAILEIAAFHHGQALPGDDRSAELNGLEEALNDGINASIFGNTFPSYGGQTRTDVRPALTPPTGLVAANVNGPMSYRALRHSYFSTIIGNERPTVGLTTNRNMGFIAENFLPHQIIDTTQPEINWPGMKFDQATIVMSQYAPGQDGENDPDLGNYNAATESFYWLNFGPQGDDAFIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNMTVRALRLSRAMFGFTS
Ga0184637_1000121273300018063Groundwater SedimentVPNLPRSLRPLVRATDWLRCHPRLVALVVTCIASYLHPASLVAVPFVIGAIQLDDVNTTTTKEIMPGVVDGYFKAGPLIAMMKARFTRKWIGPQIQENFMYKPMKGGAYKKGAAFNVDRRQTRTGLLFGPRYYQVNVTEFLEDLEVEMVGPRAAFSVIRTDMQQAALTMSAILEIAAFRHGQALVGDDRSAEINGLEEALNDGINASWAGNIFASYGGQTRADVAPALTTPTGLIAANVAGPISYRILRHSYFSSIIGNEAPTIGITTNRCMGFISENFLPHQIVDTTQPEINWPGLKFDKATILMSQYIPGADGVNDADLGNYNAAAETFAWLNFGPQGDDAYVRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRILFGITS
Ga0184637_10004236123300018063Groundwater SedimentVRILYCLRDAALWLRSHPLLTALAVWLWALLDLESASGALLVVGIIQLDDVNTTVTKEIQPGVVDGYFKAGPNIAMAKARFTRKWIGPQIQENFMYKPMQGGAYKKGTAFNVQRRQTRTGLLFSPRYYQVNVTEFLEDLEVELAGPRAAFSVIRTDMQQAALTMSAILEIAFMRHGQALPGDDRSAEINGVEESYNDGVNASWAGNVFPSYGGQTRVDVSPALNPPTGLIAANVGGPISYRVLRHSYFSSIIGNEAPGVGITTNRCMGFIAENFLPHQIVDTTQPEINWPGLKFDKATIMMSQYMPGADGVNDPDLGNYNAAAETFAWLNYGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNLTVKALRLSRILHGITS
Ga0184637_1034899313300018063Groundwater SedimentMRGGAYKKGATFNIQKRQTRTGMLFTPRYYEVNVTEFLEDVEVEQAGPNAMFSIVKTDMAEAALTLSAILEIAIYHHGQALVGDDRSDEINGLEEALTNGLLTTWTGATFLSYGGQTRADVNGALNSPTGLILPTMPSMNFRGLQHSYLSTCIGAEHPEIGVTTNRGLGYISETFLPHQIVDVIDPEIKWPGLKFNQSRIVVSQYAPGADGVNDADLGNYLASAGETFWWLNPGPIGDDAYLRLYIAQSPKFAFGFTGFKGARDDNMVAGQILFAGNFTCRAPRLQRGLFAIAK
Ga0184633_1029873613300018077Groundwater SedimentAGPNIAMAKARFTRKWIGPQIQENFMYKPMQGGAYKKGTAFNVQRRQTRTGLLFSPRYYQVNVTEFLEDLEVELAGPRAAFSVIRTDMQQAALTMSAILEIAFMRHGQALPGDDRSAEINGVEESYNDGVNASWAGNVFPSYGGQTRVDVSPALNPPTGLIAANVGGPISYRVLRHSYFSSIIGNEAPGVGITTNRCMGFIAENFLPHQIVDTTQPEINWPGLKFDKATIMMSQYMPGADGVNDPDLGNYNAAAETFAWLNYGPQGDDAYIRL
Ga0184627_1012745913300018079Groundwater SedimentMRTMLQAVTAVLLLMGLIWLHPELAYALPFALPVVGVILLDDVNTVTTKTIMPGVVDNFFRAGPTIAYLKSRFSRKWIGPQIQENYMYAPMRGGAYKKGATFNIQKRQTRTGMLFTPRYYEVNVTEFLEDVEVEQAGPNAMFSIVKTDMAEAALTLSAILEIAIYHHGQALVGDDRSDEINGLEEALTNGLLTTWTGATFLSYGGQTRADVNGALNSPTGLILPTMPSMNFRGLQHSYLSTCIGAEHPEIGVTTNRGLGYISETFLPHQIVDVIDPEIKWPGLKFNQSRIVVSQYAPGADGVNDADLGNYLASAGETFWWLNPGPMGDDAYLRLYIAQSPKFAFGFTGFK
Ga0184629_10001025163300018084Groundwater SedimentMAIQLDDVNTVTTKEIMPGVVDGYFRAGPFIAMAKARFTRKWIGPQIQENFMYKPMKGGAYKKGAAFNVVRHQTRTGLLFTPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAFHHGQSLPGDDRSAEINGIEEAYNDGTNASWAGNVFPSYGGQTRVDVAPALTPPTGLIAAQNANVLYRVLRHSYFSCILGNEAPTIGLTTNRMMGFISENFLPHQIVDTTQPEINWPGLKFDKATVLMSQYAPSQDGVNDADLGNYLAAGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRILHGFTA
Ga0184629_10001627133300018084Groundwater SedimentTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRLSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQSKIVVSQYAPGADGVNDPDLGNYLDTDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNISK
Ga0184629_1000317653300018084Groundwater SedimentMTHALRHGRDFCRANPIVTMLVMTALLWLIRGSEALAVPLIMGSILLDDVNTVTTKTIMPGVVDNFFKAGPVIAYMRSRFTRRWVGPQIQENYMYAPMRGGAYKKGGTFNITKRQTRSGMLFTPRYYEVNITEYLEDIEVEQTGPNAVFSIVKTDAAEAALTLSAILEIAVFHHGQNLGIGNDRSAELNGLEEALTDGINPTYAGNIFPSYGGQTRIDVSPALNSPVGLVVPSMGSMSFRGLQHSYLSTCIGSEHPAIGTTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQARIVVSQYCPGADGVNDPDLGNYLAPAGETFWWLNPGPQGDDAYMRLYIAQSAKFAFGFTGFKGARDDNMVAGQILFGGNYTHRAPRLSRGLYAIAK
Ga0066655_1043466213300018431Grasslands SoilWVGPQIQENFLYKPMKGGAYRKGTAFDTTRRQTRTGMLFTPRYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAPRHGQALAGDDRSAELNGLEEALNDGVNASWAGNIFPSYGGQTRADVTPALTPPTGLIPALNTTMFYRILRHSYFSCVIGNEAPGIGITTNRLMGFIAENFLPHQIVDTTQPEISWPGLKFDKATILMSQYWPGQDGTNDPDLGNYSAAGETFTWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGARDDNQVSGQLLFG
Ga0190271_10000098323300018481SoilVAIQLDDVNTTITKEIEPGVVDGYFKAGPFIAMAKSRFSRKWIGPQIQENFMYKPMKGGAYRKGGSFDISRRQTRTGLLFGPRYYQVGVTEFLEDLEVEMAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQNIPGNDRSMELNGMEEALNDGVTASWENNLFPSYGGQTRADVAPALTPPTGLIPANLNGQPISYRVLRHSYFSCIIGNEAPTIAITTNRCMGFIAENFLPHQIVDTTQPEINWPGMKFDKATITMSQYCPGADGVNDEDLGNYYAPNETFWWLNFGPPGDDAYIRFYIAQSRKFAFGFTGFKGARQDNQVSGQILYCGNLVTKALRLSRVLTGIGS
Ga0190271_1011534113300018481SoilVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYRKGTSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQPVVGDDRSMEINGLEEALNNGTTASWAGNVFPSYGGQTRTDVSPALDPPAGLIGQDLGGQPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDADLGSYYAPNETFWWLN
Ga0184648_118665233300019249Groundwater SedimentMRHLARALTFVRANPIFTTLCMALLLWASGVAPAAGLPLLMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRLSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQSKIVVSQYAPGADGVNDPDLGNYLDTDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNISK
Ga0193727_100020383300019886SoilVTICRFLVQPPLDALVAHPRLLLLVTITVAALTHHLHPGLSLLVIGAIQLDDVNTITTKDIMPGVADNFFRSGPVTAYMRGRFTRKWVGPQIQENYLFKPMKGGAYKKGAQFDITKRQVASGLLFTPRYYDVNVTEFLEDLEVEQTGPHAMFSRIKLDMSTAALSMSALLEIAFFHHGQSLAGDDRSAELNGAEEALTDGTSATMFGNLFPSYGGQTRVDVAPALNSPTGLVAASAGPSLMFRVLEHSFMSTVIGTERPKLGVTSNRGMGFIAENFSPQQKIDVMDPEINWPGFKFNTATIVASQYAPSQDGVNDPDLGNYLATAAQGEPLIWINPGPQGDDAYIRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFAGNYTFRSPRLSRWLYGFTK
Ga0193738_101904843300020020SoilPGVVDNFFKAGPLMAYLKARFTRKWTGPTIQENYLYKPMKGGAYKKGATFNIDKQQTFTGMQFTPRYYETNVTEYLEDLEVEATGPTAMFSMLKTDLANAALTLSSILEIALFHHGQNVGGDDRTAEINGMEEQLTNGTDLTWTGKTFTTYGGQLRASVAPALNSPTGLFGPDISQTSFRALEQTYLSCRVGAESPKMGLTTKRMIGYIAETFSPQQRIDVLDPEINWPGMKFNQATIMSSDYCPGADGVDDPDLGNYYSPTETFWWLNPGPQGDDAYLRLYIAASSKFAFGFTGFKGARNDNKVAGQILFGGNMTNRAPRLGRAMYGFEN
Ga0184649_116593213300020068Groundwater SedimentAIQLDDVNTTVTKEIMPGVVDGYFKAGPLIAMMKARFTRKWIGPQIQENFMYKPMKGGAYKKGASFDVTRRQTRTGLLFTPRYYEVNVTEFLEDIEVEMAGPRAAFSVIRTDMQQAALTISAILEIAAFHHGQALPGDDRSAELNGLEEALNDGINASIFGNTFPSYGGQTRTDVRPALTPPTGLVAANVNGPMSYRALRHSYFSTIIGNERPTVGLTTNRNMGFIAENFLPHQIIDTTQPEINWPGMKFDQATIVMSQYAPGQDGENDPDLGNYNAATESFYWLNFGPQGDDAFIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNMTVRALRLSRAMFGFTS
Ga0210379_1000319943300021081Groundwater SedimentVTRVLQSIRWVDRQVRTHPRLIASAIALLILAFAPDRLAAAAPLFIIGAIQLDDVNTVTTKEIMPGVVDGYFKAGPVIAMAKARFTRRWVGPQIQENFMYKPMKGGAYKKGATFDVSRHQTRTGMLFTPRYYQVNVTEFLEDLEVEMVGPRAAFNVIRTDMQQASLTMSAILEIASMQHGQALVGDDRSAELNGFAEALNDGINASWDGNVYPSYGGQTRADVAPALTTPTGLVAANVAGPISYRILRHSYFSCILGNEAPSVGITTNRCMGFIAENFLPHQIVDTTQPEINWPGLKFDKATILMSQYAPSQDGVNDADLGNYLAAGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRILHGFTA
Ga0210379_1006954723300021081Groundwater SedimentNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRPSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDADGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK
Ga0210377_1013029623300021090Groundwater SedimentLAIQLDDVNTTVTKEIMPGVVDGYFKAGPLIAMMKARFTRKWIGPQIQENFMYKPMKGGAYKKGASFDVTRRQTRTGLLFTPRYYQVNVTEFLEDIEVEMAGPRAAFSVIRTDMQQAALTISAILEIAAFHHGQSLPGDDRSAELNGLEESLNDGTNASIFGNLFPSYGGQTRADVAPALTPPTGLVAANVAGPMSYRALRHSYFSSIIGNERPTVGLTTNRNMGFLAENFLPHQIVDTTQPEINWPGMKFDQATIVMSQYAPGQDGTNDPDLGNYNAASESFYWLNFGPQGDDAYIRFYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNMTVRALRLSRAMFGFTS
Ga0210384_1006713363300021432SoilMFKPMIGGSYKKGATFNVLRQQTRTGMLFNPRYYEVNLTEFLEDLEVEMAGPRAAFSVIRTDMAQAALTMSAILEIAFYQHGQALAGADRSAEINGIEEAFNDGTNASWNGNVFPSYGGQTRPDVAPALTPPTGLIAAQVAGAKMSYRVLRHSYFSCIIGNERPTVGITTNRGMGFIAENFLPHQLVDTTQPEIAWPGIKFDQSTILMSQYAPGQDGVNDAYLGNYLAPAVGTGSAGETLSWLNFGPVGEDAYIRLYIAQSQKFAFGFTGFKGAREDNQVSGQILFGGNLTCR
Ga0193742_101848573300021976SoilVRETMTRLGAAVRWLGRHPRLVNVLVALLVLWIHPAGLFALPVVLAIQLDDVNTVTTKEIMPGVVDGYFKGGPCIAMCKARFTRKWIGPQIQENFMFKPMKGGAYKKGTTFNVDRRQTRTGLLFTPRYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAFFRHGQALPGDDRSAEINGIEEGYNDGVNPSWAGNVFPSYGGQTRTDVAPALNPPAGLIGALNTTMSYRVLRHSYFSSVIGNEAPGVGITTNRLMGFIAENFLPHQIIDTTQPEINWPGLKFDKATIMMSQYAPGQDGVNDADLGNYNAAGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQLLFGGNLTLKALRLSRILHGFTA
Ga0247664_103043223300024232SoilMPGVVDNYFKAGPLMAYLKTRFNRKWTGPQIQENYLYKAMKGGAYAKGATFDVTKRQTFTGMLFTPRYYQVNVTEFLEDLEVEMAGPTAMFSTLKVDLGNAALTLSSILEIALFHNGQNVGGVDRTLEVNGLEEALTNGTDTTWTGATFPIYGGQTRASVSPALNSPTGLVPASNPATSFRTLEHSFMSCVIGAERPKMGLTTNTMMGFIAETFSPQQKIDVLDPEINWPGMKFNTATIMVSQYCPGKYGANDADLGNYLASAGETFWWLNPGPQGDDAYLRLYIAQSPKFAFGFTGFKGARDDNQVSGQTLFGGNFTCRAPRLSRALYGFTS
Ga0247661_101842123300024254SoilVAIQLDEVNTVATKMIMPGVVDNYFKAGPLMAYLKTRFNRKWTGPQIQENYLYKAMKGGAYAKGATFDVTKRQTFTGMLFTPRYYQVNVTEFLEDLEVEMAGPTAMFSTLKVDLGNAALTLSSILEIALFHNGQNVGGVDRTLEVNGLEEALTNGTDTTWTGATFPIYGGQTRASVSPALNSPTGLVPASNPATSFRTLEHSFMSCVIGAERPKMGLTTNTMMGFIAETFSPQQKIDVLDPEINWPGMKFNTATIMVSQYCPGKYGANDADLGNYLASAGETFWWLNPGPQGDDAYLRLYIAQSPKFAFGFTGFKGARDDNQVSGQTLFGGNFTCRAPRLSRALYGFTS
Ga0209322_1008598523300025146SoilMATRPFRALLTALVAVLSGWLHPETLALMPLAIGAILLDDVNTVTTKTIMPGVVDNFFKAGPLIAYLKARFTRKWVGPQIQENYMYAPMRGGAYKKGATFNISKRQTRTGMLFTPRYYECNVTEFLEDIEVEQTGPNAMFNIVRTDMAEAALTLSAILEIAMYHHGQALVGDDRSAEINGLEEALTDGTNVTWTGATFPSYGGQTRVDVSPALNSPVGLITPALASMSFRGLQHSYLSTCIGSEHPLIGVTTNRGLGYISETFLPHQIVDVVDPEINWPGLKFNQSRIVVSQYAPGADGVNDADLGNYLDADGETFWWLNPGPQGDDAYLRLYIAQSPKFAFGFTGFKGARDDNMVSGQILFGGNFTCRAPRLSRGLYSIAK
Ga0209320_1000944153300025155SoilMAIQLDDVNTTTTKEIMPGVVDGYFRAGPVIAMAKARFTRKWVGPQIQENFMYKPMKGGAYKKGAPFNVTRHQTRTGLLFTPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAYHHGQALPGDDRSAEINGLEEALNDGTNASWNGNVFPSYGGQTRADVSPALTPPTGLIAANNANVLYRVLRHSYFSAIIGNEAPTIGVTTNRMMGFISENFLPHQIVDTTQPEIAWPGLKFDKATIVMSQYAPSQDGVNDPDLGNYLAAGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRILHGFTA
Ga0209619_1001181523300025159SoilMATRPFRALLTALVAVLSGWLHPETLALMPLAIGAILLDDVNTVTTKTIMPGVVDNFFKAGPLIAYLKARFTRKWVGPQIQENYMYAPMRGGAYKKGATFNISKRQTRTGMLFTPRYYECNVTEFLEDIEVEQTGPNAMFNIVRTDMAEAALTLSAILEIAMYHHGQALVGDDRSAEINGLEEALTDGTNVTWTGATFPSYGGQTRVDVSPALNSPVGLITPALASMSFRGLQHSYLSTCIGSEHPLIGVTTNRGLGYISETFLPHQIVDVVDPEINWPGLKFNQARIVVSQYAPGADGVNDADLGNYLDADGETFWWLNPGPQGDDAYLRLYIAQSPKFAFGFTGFKGARDDNMVSGQILFGGNFTCRAPRLSRGLYSIAK
Ga0209324_1024687013300025174SoilVRTILTCLDRLLAWTRSYPRITQVILGLVAWYLHPASVVAIPIIFGIIQLDDVNTVTAKEIIPGVVDNYFRAGPIVAMCRSRFTRKWIGPQIQENYLFGSLKGGAYAKGASFDVTKKRTKAPLLFGPRYYQTNVTEFLEDIEVEMVGPRAAFNVIRTDMQEAAMTMSAILEIAAVQHGQNIAGDNRSLEINGLEEALNDGTNGSWDGTTFTSYGGQTRADVSPALNSPAGFQAANVAGAITYRALRHSYFSCVIGNEAPTIAVTTNRCMGFIAENFLPHQIVDTRQPEIDWPGLKLDKATIMMSQYCPGQDGTNDPDLGDYNNAT
Ga0209324_1035193213300025174SoilVGVIQLDDVNTVTTKTIMPGVVDNFFKAGPLIAYLKARFTRKWVGPQIQENYMYAPMRGGAYKKGATFNISKRQTRTGMLFTPRYYECNVTEFLEDIEVEQTGPNAMFNIVRTDMAEAALTLSAILEIAMYHHGQALVGDDRSAEINGLEEALTDGTNVTWTGATFPSYGGQTRVDVSPALNSPVGLITPALASMSFRGLQHSYLSTCIGSEHPLIGVTTNRGLGYISETFLPHQIVDVVDPEINWPGLKFNQARIVVSQYAPGADGVNDADLGNYLDADGETFWWLNPGPQGDDAYLRLYIAQSPKFAFG
Ga0209519_1001965133300025318SoilMAIQLDDVNTTTTKEIMPGVVDGYFRAGPVIAMAKARFTRKWVGPQIQENFMYKPMKGGAYKKGAPFNVTRHQTRTGLLFTPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAYHHGQALPGDDRSAEINGLEEALNDGTNASWNGNVFPSYGGQTRADVSPALTPPTGLIAANNANVLYRVLRHSYFSAIIGNEAPTVGITTNRMMGFISENFLPHQIVDTTQPEIAWPGLKFDKATIVMSQYAPSQDGVNDPDLGNYLAAGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRILHGFTA
Ga0209519_1026658413300025318SoilVLRHQTRTGLLFTPRYYQVNVTEFLEDLEVEMVGPRAAFSVIRTDMQQAALTMSAILEIAAFHHGQNLAGDDRSAEINGLEEAYNDGVNASWAGNLFPSYGGQTRADVSPALTPPTGLIAASNANIMYRVLRHSYFSCILGNEAPTVGITTNRMMGFISENFLPHQIVDTTQPEIAWPGLKFDKATIVMSQYAPGQDGVNDADLGNYNAAGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTARALRLSRIEDLQREAQRACPRGLPVDLFHPGRARREAKRPHL
Ga0209641_1016279923300025322SoilVAIQLDDVNTVTQKEIVPGVTDGYFKAGPYIAMAKARFTRKWVGPQIQENFLYAPMKGGAYAKGGAFDVIRRQTRTGLLFSPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDLQQAALTMSAFLEVAAFRHGQNLAGDDRSLELNGLEEAFGSAGTASFAGNLFPSYGGQLRSDVTPALDAPVGQIGANVAGVITYRILRHSYFSCIIGNEAPTVGITTNRCMGYISENFLPHQIIDTTQPEINWPGMKFDKATLVMSQYAPGQDITNSEQTDLGAIRPTSGETFWWLNFGPQGDDAYIRLYIAQSAKFAFGFTGFKGARTDNQVSGQVLFGGNQTVKALRLSRVLYGITG
Ga0209341_1000191993300025325SoilMALQFDDVNTTVTKEIQPGVVDNYFKAGPMIAMCKSRFTKKWIGPQIQENFLFRPMIGGSYKKGGSFNVTRPQTRTGMLFTPRYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDLAQAALTISAILEIAFWHHGQALAGDDRSGEINGIEEALGSITQASWAGNLFPSYGGQTRADVNGALTAPTGLVSNPDISVFAAPGAISYRILRHSYLSTIIANERPTVGVTTNRCFGFIAENFLPHQIVDTTQPEIGFPGFKFDNATIMMSQYAPGQDGVNDAFLGNYLDTGGEVFAWLNFGPQGDDAFIRLYIAQSAKFAFGFTGFKGAREDNQVSGQTLFGGNLTVRAPRLSRILHHITA
Ga0209341_1040367013300025325SoilDAADWLRARPIYTALAVWVWYGLHPESAMAAILVVGVIQLDDVNTTVTKEIQPGVVDGYFKAGPCIAMAKARFTRKWIGPQIQENFMYKPMQGGAYKKGTAFNVQRRQTRTGLLFSPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAFFRHGQALPGDDRSAEINGIEEALNNGVAASWAGNVFPSYGGQTRVDVSPALDPPVGLITANVGGPISYRVLRHSYFSSIIGNEAPGVGITTNRCMGFIAENFLPHQIVDTTQPEINWPGLKFDKSTIMMSQYAPGQDGVNDPDLGNYNAAAETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNL
Ga0209342_1027009323300025326SoilMTRTCWRRGVAILRRPFMQKLLSLTGALVAIWLRPDDMLETLGVLLVGALQLDDVNTTVTKEIVPGVVDGYFKGGPFVAMCRRRFTRKWVGPTIQENLMFKPMIGGSYKKGGAFNVTRRQTRTGLLFTPRYYQTNITEFLEDLEVEMAGPRAAFSVIRTDMQQAALTLSAILEIAAFHHGQALAGDDRSAEVNGLEEAYGHATNASIFGNVFPSYGGQTRTDVSPALDSPIGVINANVAGPMSYRVLRHSYFSCVINNDRPTVGITTNRNMGYISENFLPHQVIDTTQPEIGWPGMKFDQATILMSQYAPGQDGVNDADLGNYNASTESFYWLNFGPQGDDAYIRLYISQSSKFAFGFTGFKGAREDNQVSGQIL
Ga0209342_1033916423300025326SoilVRTILTCLDRLLAWTRSYPRITQVILGLVAWYLHPASVVAIPIIFGIIQLDDVNTVTAKEIIPGVVDNYFRAGPIVAMCRSRFTRKWIGPQIQENYLFGSLKGGAYAKGASFDVTKKRTKAPLLFGPRYYQTNVTEFLEDIEVEMVGPRAAFNVIRTDMQEAAMTMSAILEIAAVQHGQNIAGDNRSLEINGLEEALNDGTNGSWDGTTFTSYGGQTRADVSPALNSPAGFQAANVAGAITYRALRHSYFSCVIGNEAPTIAVTTNRCMGFIAENFLPHQIVDTRQPEIDWPGLKLDKATIMMSQYCPGQDGTNDPDLGDYNNATETLFWLNFGPQGDDAYIRLYVAQSAKFAFGFTGFKGAREDNQVSGQLLFGGNLT
Ga0207707_1003441763300025912Corn RhizosphereVKTLWSLMRRGFDALVAHPRLVALVVLILGVCTHLLHPAVGSVFLIGAVQLDDVNTITTKDIMPGVADNFFRSGPVTAYMRARFNRKWVGPQIQENYLYKPMKGGAYKKGGTFDITKRQVASGLLFTPRYYDVNVTEYLEDLEVEQTGPHAMFSRIKLDMSTAALSLSAILEIAFFHHGQSLVGDDRSAEVNGAEEALTDGTSQTMFGNIFPSYGGQTRVDVAPALNSPTGLVAASAGPALMFRVLEHSFMSTVIGQERPKMGVTSNRGMGFIAENFSPQQKIDVMDPEINWPGFKFNTATIVASQYAPSQDGVNDPDLGNYLATATQGEPLLWLNPGPQGDDAYMRLYIAQSPKFAFGFTGFKGARDDNMVAGQILFAGNYTFRSPRLSRWLYGFTK
Ga0207663_10002511143300025916Corn, Switchgrass And Miscanthus RhizosphereMADIQFTEVNTVATKLINPGVVDNYFKAGPLMAYLKTRFNRKWTGPQIQENYEYGALRGGAYKKGATFNITPRQTRSGIIFDPRYYQVSITEFLEDLEVEMAGPTAVFSKLKADMANAALTLSAILEIALFHNGQNVGGADRTAELNGLEEALTDGTSATWTGAVFPTYGGQTRTAVAPALNSPTGLIPASVAQTSFRMLEHSFQSCTIGAERPKLGITTNREMGYIAETFTPQQKIDVLDPEINWPGLKFNTATLVESQYCPGQDGVNDPFLGNYSNTSETFWWLNPGPQGDDAYLRLYIAQSPKFAFGFTGFKGARDDNQVSGQILFGGNFTVRAPRLSRGLYGMTN
Ga0207660_1005752613300025917Corn RhizospherePRYYDVNVTEYLEDLEVEQTGPHAMFSRIKLDMSTAALSLSAILEIAFFHHGQSLVGDDRSAEVNGAEEALTDGTSQTMFGNIFPSYGGQTRVDVAPALNSPTGLVAASAGPALMFRVLEHSFMSTVIGQERPKMGVTSNRGMGFIAENFSPQQKIDVMDPEINWPGFKFNTATIVASQYAPSQDGVNDPDLGNYLATATQGEPLLWLNPGPQGDDAYMRLYIAQSPKFAFGFTGFKGARDDNMVAGQILFAGNYTFRSPRLSRWLYGFTK
Ga0207686_10000102803300025934Miscanthus RhizosphereVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKARFSRKWIGPQIQENFMYKPMKGGAYRKGSSFDITRRQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEINGMEEALCRTGDTSWTGAAFPSYGGQTRVDVSPALDPPAGLIPSDLGGAPISYRTLRHSYFSCILGNEAPTIAITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDDDLGNYYAPFETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLKGIGS
Ga0209376_104763923300026540SoilMAIQLDDVNTVTTKEIMPGVVDGYFRAGPFIAMCRRRFTRKWVGPQIQENFLYKPMKGGAYRKGTAFDTTRRQTRTGMLFTPRYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAPRHGQALAGDDRSAELNGLEEALNDGVNASWAGNIFPSYGGQTRADVTPALTPPTGLIPALNTTMFYRILRHSYFSCVIGNEAPGIGITTNRLMGFIAENFLPHQIVDTTQPEISWPGLKFDKATILMSQYWPGQDGTNDPDLGNYSAAGETFTWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGARDDNQVSGQLLFGGNLTVKALRLSRFLHGFTA
Ga0208320_100250913300027362SoilVALLLDDVNTTVTKDIQPGVVDGYFKSGPLIAMCRRRFTRKWIGPQIQENFMFKPMKGGAYAKGASFDVTRRQTRAGMLFGPKYYQVNVTEFLEDLEVEMAGPNAAFSVIRTDMAQAALTMSAILEIAAFHHGQNLAGDNRSLELNGLEEALNDGTNASWAGNLFPSYGGQTRADVAPALTPPTGLVASPNIGAAPYLGSMSYRVLRHSYLSCCIGNQAPDTGLTTRRGMGFISENFLPHQVIDTTQPEIAWPGIKFDRATVMMSDYCPGQDGVNDADLGNYNAAGETFWWLNFGPTGAD
Ga0208685_100156943300027513SoilMRHLARAGDFIRANPIFTTLCMALLLWASGVAPAAGLPLLMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRPSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDTDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK
Ga0208185_100818143300027533SoilRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRPSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLDTDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNIAK
Ga0209388_1000001663300027655Vadose Zone SoilVLTLRRLAAAFRWLRQHPRLSAGLATLVAAVWHPDALVLLVIGAIQLDDVNTVTTKEIMPGVVDGYFRAGPFIAMCKARFNRKWIGPQIQENFMFKPMKGGAYKKGSTFDITRRQTRTGLLFNPRYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTISAILEIAAFRHGQALAGDDRSAEINGLEEALNDGSTASWAGNVFPSYGGQTRADVTPALTPPAGLVTPVNTVISYRVLRHSYFSAVIGNEAPGVGITTNRLMGFISENFLPHQVIDTTQPEINWPGLKFDKATIMMSQYAPGQDGTNDADLGNYNATGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVAGQVLFGGNLTFKALRLSRILHGFTA
Ga0209077_1000138113300027675Freshwater SedimentVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYKKGSSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEMNGLEEALNNGTTASWAGNIFPSYGGQTRVDVSPALDPPSGLIAQDLGGAPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDADLGSYYAPNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVISGIGS
Ga0209819_10000010843300027722Freshwater SedimentVLPIDLPDLSAIGGLLSLLSLLAFAAGMAIQLDEVNTVATKRIMPGVVDNFFKAGPLMAYLKARFNRKWTGPLIQENYLYKPMRGGAYKKGATFNVTKRQTYSGLLFTPRYYEVNVTEFLEDLEVEMAGPTAMFSTLKVDLGNAALTLSSILEIALFHHGQNVGGNDRTAEINGLEEALTDGTNTTWTGATFPTYGGQLRASVAPALNSPVGLVPVSNTTTSFRVLEHSFMSCVIGAERPKLGLTTNREMGFIAETFSPQQKIDVLDPEINWPGMKFNQATIVVSQYCPGQDGVDDPDLGDYNADSETFWWLNPGPSGDDAYLRLFIAQSAKFAFGFTGFKGARDDNQVSGQILFGGNFTCRSPRLSRGMYGFTK
(restricted) Ga0255054_1005415013300027856SeawaterIAYLKSRFTRRWTGPTIQENYEYKPMKGGAYKKGATFDVTRRQTRSGIQFTPRYYQVNVTEFLEDLEVEMAGPTAMFSTLKVDLASAALTLSSILEIAMYHHGQNVGTDRTAELNGLAEALSDGSAASWDGNTFTSYGGQTRADVAPALNSPTGLIPASVAVTSFRMLEHSYMSCVIGSERPKLGITTNREMGFIAETFSPQQKIDVTDPEINWPGIKFNQATIVVSQYAPGQDGVNDPDIGNYTNSSETFWWLNPGPQGDDAYMRLYIAQSPKFAFGFTGFKGARDDNQVSGQILFGGNYTHRASRLSRGLYGLTS
(restricted) Ga0255052_1022084613300027865SeawaterTFDVTRRQTRSGIQFTPRYYQVNVTEFLEDLEVEMAGPTAMFSTLKVDLASAALTLSSILEIAMYHHGQNVGTDRTAELNGLAEALSDGSAASWDGNTFTSYGGQTRADVAPALNSPTGLIPASVAVTSFRMLEHSYMSCVIGSERPKLGITTNREMGFIAETFSPQQKIDVTDPEINWPGIKFNQATIVVSQYAPGQDGVNDPDIGNYTNSSETFWWLNPGPQGDDAYMRLYIAQSPKFAFGFTGFKGARDDNQVSGQILFGGNYTHRASRLSRGLYGLTS
Ga0209465_1001498733300027874Tropical Forest SoilVAIQLDDVNTTVTKEIEPGVVDGYFKAGPFIAMAKNRFNRKWVGPQIQENFMYRPMKGGAYRKGTSFDIIRRQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQPIAGDDRSAEINGLEEALTNGSDPTWTNNVFPSYGGQTRADVVPALTTPTGLIPSNLGGQPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKSTITMSQYCPGADGVNDDELGNYFSPFETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLSGIGS
Ga0209382_1004249563300027909Populus RhizosphereVAIQLDDVNTVVTKEIEPGVVDGYFKAGPFIAMAKARFSRKWIGPQIQENFMYKPMRGGAYRKGTTFDITKRQTRTGLLFGPRYYQVGVTEFLEDLEVELAGPRAAFSVLRTDMAQASLTMSAILEIAAFHHGQAITGDDRSAEINGLEEAFNDGSTASWTGNVFPSYGGQTRADVVPALTPPTGLISANLAGAPISYRVLRHSYFSCIIGNEAPSIGITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDEDLGNYYAPYETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVLSGIGS
Ga0209868_100806413300027947Groundwater SandMKGGAYKKGSTFNTDKRQTRTGLLFSPRYYQMNITEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAFFRHGQALAGDDRSAEINGIEEAFNDGVNASWAGNIFPSYGGQTRVDVSPALNPPTGLVVANVNGPISYRVLRHSYFSSIIGNEAPGVGITTNRCMGFIAENFLPHQIVDTTQPEINWPGLKFDRSTIMMSQYAPGADGVNDADLGNYNAAAETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVIRITHQPRLRPPRWPFSSMKDHIKPVQVHVRQQGRDHSPLRDAQPIASPRRLAVISHLLHRRSQPQ
Ga0307503_10000200193300028802SoilVAIQLDEVNTTVTKEIEPGVVDGYFKAGPLIAMCKARFNRKWIGPQIQENFMYKPMKGGAYRKGGSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEINGLEEAFNSGSVNSWAGNQFPSYGGQTRVDVSPALDPPQGLISPDLGGQPISYRVLRHSYFSCIIGNEAPSIGITTNRCMGYIAENFLPHQIVDTTQPEINWPGMKFDKATITMSQYCPGADGVNDVDLGNYFAANETFWWLNFGPPGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRVISGIGS
Ga0307281_10000002543300028803SoilVAIQFDEVNTVATKQIQPGIVDNYFKAGPLMAYLKKRFNRKWTGPLIQENYEYKAMTGGAYKKGTAFNTLRAQTRSGIIFTPRYYEVNITEFLEDLEVEMAGPTAMFSVLKADMANAALTMSAMLEIDLFRNGQNVGGNDRTAHLNGLEEALTNGVDTTWTGATFPSYGGQTRTDVSPALNSPTGLIAASVTTTSFRMLEHSFQSCCIGAERPKLGLTTLREMGFIAETFTPQQKIDVLDPEINWPGMKFNTATIVQSNYCPGQDGVNDAQLGNYNNSSETFWWLNPGPTGDDAYMRLYIAQSPKFAFGFTGFKGARDDNQVSGQVLFAGNFVDRAPRLSRGLYGLTK
Ga0307495_1000849213300031199SoilKGTSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSAEINGLEEAFNNGTTASWAGNTFPSYGGQTRVDVSPALDPPSGLISQDLGGAPISYRVLRHSYFSCIIGNEAPSIGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDVDLGSYFAPNETFWWLNFGPQGDDAFIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTCKALRLSRVISGIGS
Ga0307469_1002113763300031720Hardwood Forest SoilMRTALARLSAACRWLGDHPRLTAMLGALILWALHPHALAASLLVGVIQLDDVNTVVTKEIQPGVVDGYFRGGPCIAMCKARFTRKWIGPQIQENFMFKPMKGGAYKKGASFNTDRRQTRTGLLFVPKYYEVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAFFRHGQNIAAEDRSAELNGIEEGYNDGINASWAGNVFPSYGGQTRTDVAPALNPPVGLIQPLNTTMSYRVLRHSYFSAIIGNEAPAVGVTTNRLMGFIAENFLPHQIVDTTQPEINWPGLKFDKATIMMSQYAPGQDGVNDADLGNYNAAFETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGARDDNQVSGQLLFGGNLTLKALRLSRILHGFTA
Ga0307468_10013109823300031740Hardwood Forest SoilVPIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFTRKWVGPQIQENFMYKPMKGGAYRKGTSFDITKRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEINGLEEAFNNGVDTSWTGATFPSYGGQTRADVAPALTPPAGLINSDLAGAPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGYIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDEDLGNYYAPNETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDNQVAGQILFAGNLTVKALRLSRIISGIGS
Ga0307468_10014767313300031740Hardwood Forest SoilIQENYMYAPMKGGAYKKGSTFNILKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQVGPNAVFSVVKTDMAEAALTMSAILEIAMFHHGQALAGDDRSAEINGLEESLTDGTNVTWTGATLTSYGGQLRASVAPALNSPTGLITPLVSGMSFRALQHSYLSVCIGNEHPLIGLTTNRGMGYIAETFLPHQIVDTVDPEINWPGLKFNQSKIVVSQYAPGADGVNDPDLGNYFASAGETFWWLNPGPQGEDAYLRLYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFTNRAPRLSRVLYGIAK
Ga0315288_1025689113300031772SedimentPLVAMAKARFTRKWVGPQIQENFMYKPMKGGAYKKGTAFNVLRHQTRTGLLFTPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAAFHHGQNLPGDDRSAEINGLEEALNDGVNASWAGNLFPSYGGQTRVDVAPALTPPTGLIAANNANVLYRVLRHSYFSCILGNEAPTVGITSNRMMGFISENFLPHQIVDTTQPEINWPGLKFDKATILMSQYMPSQDGVNDADLGNYLAAGETFCWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRILHGFTA
Ga0310892_1034499713300031858SoilTFNIAKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQVGPNAVFSVVKTDMAEAALTMSAILEIAMYHHGQALAGDDRSAEINGLEEAFTDGTNVTWTGATFPSYGGQLRVSVAPALNSPTGLIQPSVPGMSFRALQHSYLSTCIGNEHPVIGITTNRGMGYIAETFLPHQIVDVVDPEINWPGLKFNQAKIVVSQYAPGADGVNDPDLGNYLASAGETFWWANPGPQGDDAYLRLYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFTVRAPRLQRGLYAIAK
Ga0214473_1001461743300031949SoilVAILLDEVNTVATKTIQPGIVDNYFKAGPLMAYLKKRFTRKWTGPLIQENYEYAAMRGGAYKKGASFNTQRRQTRTGMLFTPRYYQCNVTEFLEDLEVEMAGPTAMFSVLKADLANAALTLSAILEIAFFHHGQNVGGNDRTAEINGLEEALTDGTNVTWTGATFPTYGGQLRASVAPALNSPVGLVGPSVTTTSFRMLEQSFQSCAIGAERPKLGITTTRGMGFIAETFSPQQKIDVLDPEINWPGMKFNTATIVQSNYCPGADGVVDPDLGDYYNASETFWWLNPGPVGDDAFIRLFIAASPKFAFGFTGFKGARDDNQVSGQILVACNLTVRAPRLSRGLYGITK
Ga0214473_1005880923300031949SoilMAKARFTRKWVGPQIQENFMYKPMKGGAYKKGTAFNVLRHQTRTGLLFTPRYYQVNVTEFLEDLEVEMAGPRVAFSVIRTDMQQAALTMSAILEIAALHHGQALAGDDRSAEINGLEEALNDGTNASWAGNLFPSYGGQTRADVAPALTPPTGLIGANNANVLYRVLRHSYFSCILGNEAPTVGITSNRMMGFISENFLPHQIVDTTQPEINWPGLKFDKATILMSQYMPSQDGVNDADLGNYLSTGETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTVKALRLSRILHGFTA
Ga0307472_10003733413300032205Hardwood Forest SoilVAIQLDEVNTTVTKEIEPGVVDGYFKAGPFIAMAKSRFNRKWIGPQIQENFMYKPMKGGAYRKGTSFDITRRQTRTGLLFGPRYYQVGITEFLEDLEVELAGPRAAFSVIRTDMAQASLTMSAILEIAAFHHGQALAGDDRSMEINGLEEAFNNGTSASWAGNIFPSYGGQTRTDVSPALDPPSGLISQDLGGAPISYRVLRHSYFSCIIGNEAPTIGITTNRCMGFIAENFLPHQIIDTTQPEINWPGMKFDKATITMSQYCPGADGVNDEDLGNYFAANETFWWLNFGPQGDDAYIRLYIAQSRKFAFGFTGFKGARQDN
Ga0364929_0001208_4539_56483300034149SedimentMALLLWASGVAPAAGLPLLMGPILLDDVNTVTTKTIMPGVVDNFFRAGPLVAYLKTRFTRKWIGPQIQENYMYAPMRGGAYKKGATFNITKRQTRTGMLFTPRYYEVNVTEFLEDIEVEQTGPNAMFSIVKTDMAEAALTLSGILEIAMFHHGQALPGDDRSAEINGLEEALTDGINVTWTGATFTSYGGQDRLSVGGALNSPVGLITPSVPSMSFRALQHSYLSTCIGSEHPAIGVTTNRGMGYISETFLPHQIVDVVDPEINWPGLKFNQSKIVVSQYAPGADGVNDPDLGNYLDTDGETFWWLNPGPQGDDAYLRFYIAQSPKFAFGFTGFKGARDDNMVAGQILFGGNFSCRAPRLQRGLYNISK
Ga0364931_0063406_297_11363300034176SedimentKRQVASGMLFTPRYYDINVTEYLEDLEVEQVGPHAMFSRLKLDMSTAALSMSALLEIALFRNGQNVGGVDRTAELNGLEEALTDPTSVTWTGATFPSYGGQTRVDVSPALNSPTGLIAMNAGPSLMFRILEHSFMSCVIGNERPKAGFTTNRAMGFIAENFSPQQKIDVMDPEINWPGFKFNQATIMASQYAPGADGVNDDDLGNYLNATETFWWFNPGPQGDEAFIRFFIAQSPKFAFGFTGFKGARDDNMVSGQILFAGNLTVRSPRLSRTMYGFTR
Ga0364934_0007844_674_18523300034178SedimentVRILYCLRDAALWLRSHPLLTALAVWLWALLDLESAAGALLVVGAIQLDDVNTTVTKEIQPGVVDGYFKAGPNIAMAKARFTRKWIGPQIQENFMYKPMQGGAYKKGTAFNVQRRQTRTGLLFSPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAFMRHGQALPGDDRSAEINGIEEAYNNGVDASWAGNVFPSYGGQTRVDVSPALNPPTGLIAANVGGPISYRVLRHSYFSSIIGNEAPGVGITTNRCMGFIAENFLPHQIVDTTQPEINWPGLKFDKATIMMSQYMPGADGVNDPDLGNYNAAAETFAWLNYGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFGGNLTVKALRLSRILHGITS
Ga0364934_0096152_165_10613300034178SedimentMYKPMQGGAYKKGTAFNVQRRQTRTGLLFAPRYYQVNVTEFLEDLEVEMAGPRAAFSVIRTDMQQAALTMSAILEIAALHHGQALAGDDRSAEINGFEEGLNNGINASWAGNVFPSYGGQTRVDVSPALNPPAGLVAANVAGPISYRVLRHSYFSSMIGNEAPGVGITTNRCMGFIAENFLPHQIVDTTQPEINWPGLKFDKATIMMSQYMPGADGVNDADLGNYNAAAETFAWLNFGPQGDDAYIRLYIAQSSKFAFGFTGFKGAREDNQVSGQILFAGNLTLKALRLSRILHGITS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"