NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F064221

Metagenome / Metatranscriptome Family F064221

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F064221
Family Type Metagenome / Metatranscriptome
Number of Sequences 129
Average Sequence Length 201 residues
Representative Sequence YDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAGQLPVGNPGDVSYVLPLVDKVQTAISYVTTRPTPTLHSLAGDLALNDAKLRETLHARGILTVGIPRTVEPLSPTPTPEAVQQLLTAAGLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGACVQLTMAVLAHNAATVRRIRYGRLTSRAQKFRRLLHLKSPNLLKNKE
Number of Associated Samples 113
Number of Associated Scaffolds 129

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 6.98 %
% of genes from short scaffolds (< 2000 bps) 6.98 %
Associated GOLD sequencing projects 105
AlphaFold2 3D model prediction Yes
3D model pTM-score0.39

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (93.023 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand
(21.705 % of family members)
Environment Ontology (ENVO) Unclassified
(30.233 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(36.434 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 40.82%    β-sheet: 2.45%    Coil/Unstructured: 56.73%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.39
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 129 Family Scaffolds
PF10387DUF2442 1.55
PF13565HTH_32 0.78
PF03400DDE_Tnp_IS1 0.78
PF13007LZ_Tnp_IS66 0.78
PF03811Zn_Tnp_IS1 0.78
PF00589Phage_integrase 0.78
PF13683rve_3 0.78
PF13372Alginate_exp 0.78

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 129 Family Scaffolds
COG1662Transposase and inactivated derivatives, IS1 familyMobilome: prophages, transposons [X] 0.78
COG3677Transposase InsAMobilome: prophages, transposons [X] 0.78


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A93.02 %
All OrganismsrootAll Organisms6.98 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005330|Ga0070690_100869964All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium703Open in IMG/M
3300005334|Ga0068869_101046137All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium712Open in IMG/M
3300009801|Ga0105056_1035026All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium665Open in IMG/M
3300010043|Ga0126380_11553990All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium588Open in IMG/M
3300010047|Ga0126382_11186513All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium683Open in IMG/M
3300014271|Ga0075326_1019738All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium1656Open in IMG/M
3300015245|Ga0137409_10696988All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium849Open in IMG/M
3300018429|Ga0190272_11613339All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium666Open in IMG/M
3300027187|Ga0209869_1024824All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium683Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand21.71%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil10.85%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere9.30%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil7.75%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds3.10%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil3.10%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment3.88%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil3.88%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere3.88%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.33%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere2.33%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere2.33%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment1.55%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands1.55%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.55%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil1.55%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands1.55%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.55%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil1.55%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.55%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.55%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.55%
Wetland SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment0.78%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.78%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil0.78%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil0.78%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.78%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.78%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.78%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.78%
SoilEnvironmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil0.78%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.78%
Switchgrass RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere0.78%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.78%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.78%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300002122Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_2EnvironmentalOpen in IMG/M
3300003993Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailC_D2EnvironmentalOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004268Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBioEnvironmentalOpen in IMG/M
3300005330Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaGEnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005544Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaGEnvironmentalOpen in IMG/M
3300005548Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaGHost-AssociatedOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005718Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2Host-AssociatedOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005842Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2Host-AssociatedOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300006047Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013EnvironmentalOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006846Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300009092Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-4 metaGHost-AssociatedOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009177Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaGHost-AssociatedOpen in IMG/M
3300009797Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_10_20EnvironmentalOpen in IMG/M
3300009799Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_0_30EnvironmentalOpen in IMG/M
3300009801Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_20_30EnvironmentalOpen in IMG/M
3300009804Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40EnvironmentalOpen in IMG/M
3300009805Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_0_10EnvironmentalOpen in IMG/M
3300009807Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_0_10EnvironmentalOpen in IMG/M
3300009808Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50EnvironmentalOpen in IMG/M
3300009812Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60EnvironmentalOpen in IMG/M
3300009814Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60EnvironmentalOpen in IMG/M
3300009815Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10EnvironmentalOpen in IMG/M
3300009822Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40EnvironmentalOpen in IMG/M
3300010039Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot56EnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011003Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t9i015EnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300014271Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_CattailA_D2EnvironmentalOpen in IMG/M
3300014325Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaGHost-AssociatedOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017965Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 220 TEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018432Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 550 TEnvironmentalOpen in IMG/M
3300018481Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 TEnvironmentalOpen in IMG/M
3300019377Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 TEnvironmentalOpen in IMG/M
3300019767Populus adjacent soil microbial communities from riparian zone of Oak Creek, Arizona, USA - 239 TEnvironmentalOpen in IMG/M
3300025318Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026023Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026090Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_CattailA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026888Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300027006Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027032Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_0_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027056Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027068Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027169Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300027187Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027277Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027384Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027831Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T0Bare3Fresh (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300027950Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027952Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027954Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027961Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027964Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 HiSeqEnvironmentalOpen in IMG/M
3300028043 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MGEnvironmentalOpen in IMG/M
3300028889Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300030903Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_369 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031092Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_367 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032005Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-1Host-AssociatedOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M
3300033814Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J11758_1305733013300000789SoilAPELTEDQRARWDVTLTLALRAHQQIVTQSRRLTHGKPLTHCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVFAARLPVGNPADVSYVLPLVEQVQTACAKVTTRPTPTIHSLAGDLAVNDATLREQLHAHGILTVGIPRTVEPLSPTPSPDAIDELLSTADLHGTRTPRQVQLACAA
F14TB_10017789513300001431SoilCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILAEPATGFVFAARLPVGNPSDVSYVLPLVDQVQQAFAQVTTRPVPAVHSLAGDLAVNDPTLRERLHLRGILTVGLPRTSEPLSPTPSPASIDELLTAADLHRKRTPRQVQMAGAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMTVMAHNAATVRRIRYGRLTTRAQKFRRLLHLKAPNLLKNKE*
C687J26623_1016247813300002122SoilRKPGIIAEPATGFVFAAQLPVGNPSDGSYVLPLVAKVQTAISHVTTRPTPAIHSLAGDLALNDATLRERLHARGILTVGIPRTVEPLSPTPTPEAVHQLLTAAGLSRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGACVQVTMAILAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGIN*
Ga0055468_1015024823300003993Natural And Restored WetlandsARLPVGNPSDVSYVVPLLDQAQAACRQVTTRPAPTILSLAGDLAVNDSTLRETLHTRGILTVGIPRTVEPLASTLSPETIDEVLTTTDLHGTRTPRHVQLACAAGYSRPVIESMIASLLSRGAGHLRYKGAHGARVQFTMAVLAHNAATVRRIRLGQLTTRAQKFRRLLHLKSLNPLKNKE*
Ga0055437_1026522613300004009Natural And Restored WetlandsCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVFAARLPMGNPSDVSYVLPLVEQVQSACTKVTTRPAPAIHSLAGDLAVNDATLREQLHARGILTVGIPCTIEPLSPPPSPEAVHELLTSANLQRKRTPRQVQLACAAGYSRPVVESLIASLLTRGAGQLRDKGWHGACVQLTMAVMAHN
Ga0066398_1006161313300004268Tropical Forest SoilIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAAQLPVGNPSDASYVLPLVDQVQTACAKVTPRPGPAIHSLAGDLAVNDPTLRERLHARGILTVGIPHSLEPLSPTPTPEAVDQVLSTAGLLRKRTSHQVQMACAAGYSRPVVESMIASLLNRGAAQLRYKGWHGAGVQITMAVMAHNAATVRRIQFGRLTTRAQKFRRLLRLKSPNSLKLNAGIN*
Ga0066398_1010194613300004268Tropical Forest SoilGFVFAARLPVGNPSEVSYVLPLVDQVQTACSRVTTRPAPTIHSLAGDLAVNDPTLRERLQARGILTVGIPRTVEPLSPTPSPEAIQELLTTAALPHKPPLRHVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRYGRLTTRAQKFRRLLHLKPPNLLKNKEGIN*
Ga0070690_10086996413300005330Switchgrass RhizosphereLAQAAQAQVQTAAALTEDQRARWDTTLTLALVAQQQIAIQSRRLTRGKPLTRCKIVNAYDLTIAPICKGKSNCPTQFGRKPGIVAEPATGFVFAAQLPVGNPSDTSYVLPLVDQVQATCARVTPRPAPSIHSLAGDLAVNDPTLREKLHTRGILTVGIPRTIESLSPTPSPEAIQELLTTADLPRKRTPRQVQLAYAAGYSRPVVESIIASLLSRGAGQLRYKGWHGAQVQFS
Ga0066388_10304511613300005332Tropical Forest SoilSRRLTHGKPLTQGKIVNAYDPTIAPIGKGKSNCPTQFGRKPGIIGEPATGVVLAARRPVGNPSDVSYVRPLVEQGQSAYTKVTNHPAPALHSLAGDLAVNDATVREQLHARGILTVGIPCTIAPLAPTPSPEAVHELLISANLQRKRTPRQLQLACAAGYSRPVVASLIASLLSRGAGQLRYKGWHGASVQLTMAVMAHNAATVRRIRLGHLTPRAQKFRRLLHLKSLILLKNKE*
Ga0068869_10104613713300005334Miscanthus RhizosphereVQTAPELTDDQRARWEVTLTLALRAHQQIVSQSRRLTHGKPLTHCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVLAARLPVGNPSDGSYVLPLLDQVQTALAKVTTRPAPAIHSLAGDLAVNDATLRETLHARGILTVGIPRTVEPLAPTLSPETIDEVLTTASLSHKRTPRQVQLACAAGYSRPVVESIITSVLGRGAGQLRYKGWHGACVQLTMAVMAHNAATVR
Ga0070671_10183928513300005355Switchgrass RhizosphereATGFVFAARLPVGNPSDVSYVLPLVAQVQIACAKVTTRPTPAIHSLAGDLAVNDATVREQLHARGILTVGIPRTIEPLAPTLSPETITAVLTTADLHGTRTPRQVQLACAAGYSRPVVESLIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRLGQLTTRAQKFRRLLHLK
Ga0070708_10194654813300005445Corn, Switchgrass And Miscanthus RhizosphereHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDVSYVVPLVDKVQTAVTHVTRRPRPALHSLAGDLAVNDPTLRETLHARGILTVGIPRTVEPLSPTPSSETIQEMLTATGLHRNRTSRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGW
Ga0070686_10025195823300005544Switchgrass RhizosphereATGFVFAAQLPVGNPSEGSYVVPLVDKVQTAVTHVTRRLQPALHALAGDLALNDPKLRETLQARGILTVGIPRTVEPLSLTPSPEDIHEVLIPAGLHRKRTPRQVQLACAAGYRRPVVESLIASLLSRGASQLRDKGWHGAWVQVTMAVLAHHAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNHEGIN*
Ga0070686_10044362223300005544Switchgrass RhizosphereALGTLQQIATQSRSLTQGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAAQLPVGNPTDGSYVVPLVDQVQTALSYVTTPSTPTIHSLAGDLALNDVKLRETLHARHILTVGIPRTVESLSPTPTPAEIHEALTTAGLSHKRTPRQVQLACTAGYSRPVVESIIASLLSRGAAQLRYKGWHGAWVQLTMTVMAHNAATVRRIRLGRLTTRAQKFRRLLRLTPPNLLKNQEGLN*
Ga0070665_10187233313300005548Switchgrass RhizosphereGRKPGIIAEPATGFVFAAQLPVGNPSDTSYVLPLVDQVQATCARVTPRPAPSIHSLAGDLAVNDPTLREKLPARGILTVGIPRPIESLSPTPSPEAIQELLTTADLPRKRTPRQVQLAYAAGYSRPVVESIIASLLSRGAGQLRYKGWHGAQVQFSMAVMAHNAATVRRIRHGRLTPRAQKFRRLLHLKSPNLLKNKE*
Ga0068859_10313826713300005617Switchgrass RhizosphereIAEPATGFVFAAQLPVGNPSDGSYVLPLVAKVQTAISHVTTRPTPAIHSLAGDLALNDATLRETLHARGILTVGIPRTVEPLSPTPTPAAVHQLLTAAGLSRTRTPRQVQLACVAGYSRPVVESIIASLLSRGAAQLRYKGWHGACVQVTMAVLAHNAATIRRIRLG
Ga0066905_10174572513300005713Tropical Forest SoilVFAAQLPVGNPSDVSYVLPLVDQVQTACARVTTRPAPSIHSLAGDLAVNDATLRERLHTRGILTVGIPRTTEPLSPTPSPEAIQELLTTADLHRKCTPRQVQLACAAGYSRPVVESIIASLLSRGAGQLRDKGWHGASVQLTLAVMAHNAATVRRIRHGRLTMRAQKFRRLLHLKPSNLLKNKQEIN*
Ga0068866_1087563413300005718Miscanthus RhizosphereCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAAQLPVGNPTDGSYVVPLVDQVQTALSYVTTPSTPTIHSLAGDLALNDVKLRETLHARHILTVGIPRTVESLSPTPTPAEIHEALTTAGLSHKRTPRQVQLACTAGYSRPVVESIIASLLSRGAAQLRYKGWHGAWVQLTMTVMAHNAATVRRIRLGRLTTRAQKFRR
Ga0066903_10686427313300005764Tropical Forest SoilGNPSDVSYVLPLVDQVQAACAKVTTRPAPAIHSLAGDLAVNDSTLRETLHARGILTVGIPRTVEPLSPTPSPEAVHELLISANLQRKRTPRQLQLACAAGYSRPVVASLIASLLSRGAGQLRYKGWHGASVQLTMAVMAHNAATVRRIRLGHLTPRAQKFRRLLHLKSLILLKNKE*
Ga0068858_10183526713300005842Switchgrass RhizosphereSDTSYVLPLVDQVQATCARVTPRPAPSIHSLAGDLAVNDPTLREKLHTRGILTVGIPRTIESLSPTPSPEAIQELLTTADLPRKRTPRQVQLAYAAGYSRPVVESIIASLLSRGAGQLRYKGWHGAQVQFSMAVMAHNAATVRRIRHGRLTPRAQKFRRLLHLKSPNLLKNKE*
Ga0068860_10181644413300005843Switchgrass RhizosphereTTTQLAEDQRTRWDTTLTVALTTHQQIATQSRHLTHGKPLRRCKIVNAYDATIAPICKGKSNCPTQFGRKPGLLAEPATGFVFAAQLPVGNPSDVSYVLPLVDQVQTACSYVTTRPPPAIHALAGDLAVNDPTLRERLHARGILTVGIPRTIEPLSPTPSPETIHELLTTAGLPCKRTSRQVQLACAAGYSRPVVESIIASLLSRGAGQ
Ga0075024_10003057153300006047WatershedsSNCPTQFGRKPGIIAEPATGFVFATQLPVGNSSDGSYVGPLVDKVQTALTHVTRRSRPALHSLEGDLALNDPKLRETLHARGILTVGIPRTIEPLSPTPSPEAIHEVLTAAGLHHKRTPRQMQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGAQVQVTMAVLAHNAATVRRIRYGRLTTRAQKFRQLLHLKLPNLLKNQERVN*
Ga0075028_10085942413300006050WatershedsCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDVSYVLPLVDQVQTAFTYVSTRLAPALHSLAGDMALNDATLRETLQARGILTVGIPRTVEPLSPTPTPEEIHEVLTTAGLHRKRTPRQVQLACAAGYSRPVVESIIASLLNRGAGQLRYKGWHGACVQLTMAVM
Ga0075428_10230539813300006844Populus RhizosphereGNPSDGSYVLPLLDQVQTALAKVTTRPAPAIHSLAGDLAVNDATLRETLHARGILTVGIPRTVEPLAPTLSPETIDEVLTTASLSHKRTPRQVQLACAAGYSRPVVESIITSVLGRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRCGYLTPRAQKFRRLLHLKSFNLLKNKE*
Ga0075421_10040954913300006845Populus RhizosphereIIGEPATGFVFAARLPVGNPSDTSYVLPLVDQVQAACARITTRATPSIYSLAGDLAVNDATLRESLHNRGILTVGIPRSVEPLLPTPSPEAINEMLATADLHRKRTPRQGELACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRSGRLTSRAQKFRRLLHLRPLNPLKIKQGTN*
Ga0075421_10118934923300006845Populus RhizosphereCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVFAARLPVGNPTDVSYVLPLVDHVQTACAKVTTHPAPAIHSLAGDLAVNDATLREQLHLRGILTVGIPRSVEPLAPTPSPAAIDELLTTTGLHRKRTPRQVQLACAAGYSRPVVESIIASVLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRCGQLTSRAQKFRRLLHLKPLNLLKNKQ*
Ga0075421_10193632913300006845Populus RhizosphereCPTQFGRKPGLLAEPATGFVFAARLPVGNPSDASYVLPLVDQVQQAFAQVTTRPVPAVHSLAGDLAVNDPTLRENLHTRGILTVGIPRSVEPLSPTPSPEVLQEVLTTADLHRKRTPCQVQLAYAAGYSRPVVESIIASLLSRGAGQLRYKGWHGAGVQLTMAVMAHNAATVHRIRYGRLTLRAQKFRRLLHLKPPNLLKNKQRIN*
Ga0075430_10133633613300006846Populus RhizospherePSDTSYVLPLVDQVQAACARITTRATPSIYSLAGDLAVNDATLRESLHNRGILTVGIPRSVEPLLPTPSPEAINEMLATADLHRKRTPRQGELACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRSGRLTSRAQKFRRLLHLRPLNPLKIKQGTN*
Ga0075433_1144947413300006852Populus RhizosphereGRKPGIIAEPATGFVFAAQLPVGNPSDVSYVVPLVDKVQTAVTHVTRRPRPALHSLAGDLAVNDPTLRETLHARGILTVGIPRTVEPLSPTPSSETIQEMLTATGLHRNRTSRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNPLKNQ
Ga0075425_10124121113300006854Populus RhizosphereLSTAQEAHQQISTQSRRLTQGKKLHHCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAALLPVGNPSDVSYVVPLVDKVQTAVTHVTRRPRPALHSLAGDLAVNDPTLRETLHARGILTVGIPRTVEPLSPTPSSETIQEMLTATGLHRNRTSRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNPLKNQEGIN*
Ga0075425_10195485913300006854Populus RhizospherePICKGKSNCPTQFGRKPGIIGEPATGFVFAAQLPVGNPTDGSYVVPLVDQVQTALSYVTTPSTPTIHSLAGDLALNDVKLRETLHARHILTVGIPRTVESLSPTPTPAEIHEALTTAGLSHKRTPRQVQLACTAGYSRPVVESIIASLLSRGAAQLRYKGWHGAWVQLTMTVMAHNAATVRRIRLGRLTTRAQKFRRLLRLTPPNLLKNQEGLN*
Ga0075424_10187618113300006904Populus RhizosphereSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDVSYVVPLVDKVQTAVTHVTRRPRPALHSLAGDLAVNDPTLRETLHARGILTVGIPRTVEPLSPTPSSETIQEMLTATGLHRNRTSRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNPLKNQEGIN*
Ga0105250_1054674713300009092Switchgrass RhizosphereRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDTSYVLPLVDQVQATCARVTPRPAPSIHSLAGDLAVNDPTLREKLHTRGILTVGIPRTIESLSPTPSPEAIQELLTTADLPRKRTPRQVQLAYAAGYSRPVVESIIASLLSRGAGQLRYKGWHGA
Ga0075418_1209553713300009100Populus RhizosphereTAPELTEDQRARWDTTLTLALLAHQQIVTQSRRLTHGKPLTQCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVFAARLPVGNPTDVSYVLPLVDHVQTACAKVTTHPAPAIHSLAGDLAVNDATLREQLHLRGILTVGIPRSVEPLAPTPSPAAIDELLTTTGLHRKRTPRQVQLACAAGYSRPVVESIIASVLS
Ga0114129_1129347013300009147Populus RhizosphereQFGRKPGLLAEPATGFVFAAQLPVGNPSDASYVLPLVAQAQQALSHVPTQPPPTIHSLAGDLAVNDATLRESLHNRGILTVGIPRSVEPLLPTPSPEAINEMLATADLHRKRTPRQGELACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRSGRLTSRAQKFRRLLHLRPLNPLKIKQGTN*
Ga0111538_1181924223300009156Populus RhizosphereCKIVNAYDATIAPICKGKSNCPTQFGRKPGLLAEPATGFVFAAQLPVGNPSDVSYVLPLVDQVQTACARVTTRPTPSIHSLAGDLAVNDPTLRESLHIRGILTVGIPRTVEPFSPILSPEAIDERLTTADLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMTVMAHNAATVRRIRHGHLTTRAQKFRRLLHLKPLNLLKSKKGIN*
Ga0105248_1083970623300009177Switchgrass RhizosphereQSRSLTQGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAAQLPVGIPTDGSYVVPLVDQVQTALSYVTTPSTPTIHSLAGDLALNDVKLRATLHARHILTVGIPRTVESLSPTPTPAEIHEALTTAGLSHKRTPRQVQLACTAGYSRPVVESIIASLLSRGAAQLRYKGWHGAWVQLTMTVMAHNAATVRRIRLGRLTTRAQKFRRLLRLTPPNLLKNQEGLN*
Ga0105080_103439713300009797Groundwater SandNCPTQFGRKPGILGEPATGFVFAARLPVGNPSDVSYVVPLVDQVQAACAKVTTRPAPAIHSLAGDLAVNDSTLRETLHARGILTVGIPRTVEPISPTPSPEAVHELLTSADLHRKRTPRQVQLACAAGYSRPVVESLIASLLSRGLGHLRYKGWHGACVQLTMAVLAHDAATVRRIRHGRLTTRAQKFRRLLHLK
Ga0105075_101148213300009799Groundwater SandGFVFAAQLPVGNPTDVSYVVPLGDKVQTAVTQVTRRPRPVLHSLAGDLAVNDPPLRETLHARGILTVGIPRSVEPLAPTPSPEAIQELLTATGLQRKRTPRQVQLACAAGYSRPVVESLIARLLSRGAGQLRYKGWHGAQVQLTMAVLAHNAATVRHIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGIN*
Ga0105056_103502613300009801Groundwater SandQAQVQTAAALTEDQRARWDTTLTRALVAHQQIATQSRRLTHGKSLPRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPASGFIFAFELPVGNPTDPTYVVPLVDKVQTAMALVAGRSSLAIHSLAGDLALNDAKLRAALHTRGILTVGIPHTVEPLSPLPPPEDVRQMLTAAGLHGKRTPYQVQLACACGYSRPIVESLIASLLGRGAARLTYKGQR
Ga0105063_100971223300009804Groundwater SandGNIVKAYDPPIAPICQGKSNCPTPFGRKSGIIAEPATGFVCAAQLPVGNPTEVSSVVPLGDKVQTAVTQVTRRPRPVLHSLAGDLAVNDPPLRETLHARGILTVGIPRSVEPLAPTPSPEAIQELLTATGLQRKRTPRQVQLACAAGYSRPVVESLIARLLSRGAGQLRYKGWHGAQVQLTMAVLAHNAATVRHIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGIN*
Ga0105079_104911813300009805Groundwater SandKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPTDVSYVGPLVDKVQTALTHVTTRPTPAIHSRAGDLALNDAKLRETLHARGILTVGIPRTVEPLSPTPAPEAIQEVLTAAGLHRNRSPRQVQLACAAGYSRPVVESLIASLLS
Ga0105061_104301413300009807Groundwater SandAGQLPVGNPSDGSYVVPLVDKVQTALSHVTTRPTPAIHSLAGDLALNDATLRETLHARGILTVGIPHTVEPLSPTPPPEVIHHMLITAGLQRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGAQVQVMMAVMAHNAATVRRIRLGRLTTRAQKFRRLLHLKSPNLLKNQEGLN*
Ga0105071_111250213300009808Groundwater SandPICKGKSNCPTQFGRKPGIIGEPASGFIFAFALPVGNPADLSSVVPLVDKVQTAIAQVAGRPTLAIHSVAGDLALNDAKLRETLHTRGILTVGIPQTVEPLSPTPTPEEIRRILHEAGLHPQRTPHQVRLACAGGYSRPVVESIIASLLCRGAARITYKGQRGAIVQI
Ga0105067_104308913300009812Groundwater SandIAPICKGKSNCPTQFGRKPGLLAEPATGFVFAVQLPVGNPSDASYVLPLVDQVQTACSHVTTRPTPAIHSLAGDLALNDPTLRERLHARGILTVGIPCTVAPLAPPPSPAAIHELLTAADLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRYGRLTTRTQKFRRLLHLKPPNLLKNKEGIN*
Ga0105067_106228813300009812Groundwater SandFAPQLPVGNPSEGSYVLPLVDQVQTALSPVTTRAIPTLHSLAGDLALNDATLRATLHARGILPVGIPHPLAPLSPTPTPEAVQQVLTAAGLQRKRTPRQVQLACAAGYSRPVGESLLASRLSRGAAQVRYKGWHGAQVQVTMAVLAHHAATIRRLQLGRLTTRAQKFRRFLRLKPPNLVKNKEEIN*
Ga0105067_108410713300009812Groundwater SandVALAAHQQIATHSCRLTHGKTLSQCKIVNAYAPTIAPIGQGKSNCPTQLGRNPGLIAEPASGFIFAFALPVGNPTDLSYVVPLVDKVQHALTQVAGRSALAIHSLAGDLALNDATLRAALHTRGILTVGIPHTVAPLSPLPTPEDVRQMLTEAGLHGKRTPHQVQLACACGYSRPIVESLIASLL
Ga0105082_104030513300009814Groundwater SandHGKPLTRGKIVKAYAPPIAPLCQGKRNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDVSYVGPLVDKVQTALTQVTTRPPAIHARAGDLALNDAKLRETLHARGILTVGIPRTVEPLSPTPAPEAVQQMLTTAGLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGAQVQVMMAVMAHNAATVRRIRLGRLTTRAQKFRRLLHLKSPNLLKNQEGLN*
Ga0105070_107962613300009815Groundwater SandFVCAAQLPVGNPTDVSSVVPLGDKVQTAVTQVTRRPRPVLHSLAGDLAVNDPPLRETLHARGILTVGIPRTVEPISPTPSPEAVHELLTSADLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGLN*
Ga0105066_115325513300009822Groundwater SandATGFVFAPQLPVGNPSEGSYVLPLVDQVQTALSPVTTRAIPTLHSLAGDLALNDATLRATLHARGILPVGIPHPLAPLSPTPTPEAVQQVLTAAGLQRKRTPRQVQLACAAGYSRPVGESLLASRLSRGAAQVRYKGWHGAQVQVTMAVLAHHAATIRRLQLGRLTTRAQKFRRFL
Ga0126309_1015028523300010039Serpentine SoilVHLPVGNPSDASYVLPLVDKVHAAFPHVTTRPTPAIHSLAGDLALNDPKLREVLHGRRILTVGIPHTVEPLSPTPTPAAVQQLLTTAGLSHKRTPHQVQLACAAGYSRPVVESIIASLLSRGATRLRYKGWHGAQVQVHMAVLAHNAATVRRIQLGCLTTRAQKFRRLLHLKSPNLLKYQEGIN*
Ga0126380_1155399013300010043Tropical Forest SoilQAQVQTATELTADQRARWDTTLTLALVAHQQIATQSRCLTHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKSGLLAEPATGFVFAARLPVGNPSDVSYVLPLVDQVQAACARVTTRPTPHIHSLAGDLAVNDPTLRESLHLRGILTVGIPRTVEPLSPTPSPEAIDELLTTADLHPKPTPCQVQLACAAGY
Ga0126382_1118651313300010047Tropical Forest SoilVVAFAHTAQARVQSASHMTAEQRTRWDTMLTVAVATHDQIATQSRRLTHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGLLAEPATGFVFAAQLPAGNPSDTSYVLPLVDQVQTACARVTTRSAPRIHSLAGDLAVNDSTLRERLHARGILTVGIPRTIEPLSPTPSPEAIDELLTTAGLQRKRTSRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKG
Ga0126382_1228421013300010047Tropical Forest SoilHQPIATQSRRLTNGKPLTQCKIVNAYDATIAPICKGKSNCPTQFGRKPGILAEPATGFVFAAQLPVGNPSDVSYVLPLVDQVQTAGARVTTRPAPRIHSLAGDLAVNDATLRERLHTRGILTVGIPRTTEPLSPTPSPEAIQELLTTADLHRKCTPRQLQLACAAGYSRPVVE
Ga0126376_1177390813300010359Tropical Forest SoilPVGNPSHVSYALPLVDQVQTACARVPTRPAPCIHSLAGDLALNDPTLRERLHARGILTVGIPRTVEPLAPMPSPEAIDAMLTTAALPRKRTSRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMTVMAHNAATVRRIRYGRLTTRAQKFRRLLHLKPPKLLKNKQRIN*
Ga0126372_1165657813300010360Tropical Forest SoilIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAARLPVGNPSDVSYVLPLLEHVQTACRKVTTRPVPAIQSLAGDLAVNDATLREQLHAQGILTVGIPRSVEPLSPTPSPEAITAVLSTADLQRKRTPRQVQLASAAGYSRPVVESLIASLLSRGAGQLRYKGWHGACVQFSMAVMAHNAATVRRIRLGQLTTRAQKFRRLLHLKPPNLLKNKE*
Ga0126377_1099975823300010362Tropical Forest SoilCKIVNAYDPSIAPICKGKSNCLTQFGRKPGIIGEPATGFVFAAQLPVGNPSDASYVLPLVDQVQTACAKVTTRPAPAIHSLAGDLALNDPTLRERLHARGILTVGIPHSLEPLSPTPTPEAVDQVLSTAGLLRKRTSHQVQMACAAGYSRPVVESMIASLLNRGAAQLRYKGWHGAGVQITMAVMAHNAATVRRIQFGRLTTRAQKFRRLLHLKPSNPLKNKQRIN*
Ga0126379_1025222533300010366Tropical Forest SoilLAEPATGFVFAARLPVGNPSDISYVLPLLDQVPHALSHVPTQPAPAIHALAGDLAVNDTTLRESLHARGILTVGIPHTAEPLLPTPAPATVQELLIAVHWHRKRTLRQVQLACAAGYRRPVVESILASLLSRGAGQLRYKGWHGACVQLTMAVLAHNAATVRRLRWGRLTSRAQKFRRLLHLKPPHLLKKQQEIN*
Ga0126379_1278248513300010366Tropical Forest SoilLAHQQIVTQSRRLTHGKSLTHCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAARLPVGNPSDVSYVLPLVEQVQTACAKVTTCPAPAIHSLAGDLAVNDATVREHLHARGILTVGIPCTVEPLSPTPSPEAVHELLTSANLQRKRTPRQVQLACAAGYSRPVVESLIASLLSRGAGQLRYKGWH
Ga0134124_1320909113300010397Terrestrial SoilRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDGSYVLPLVAKVQTAISHVTTRPTPAIHSLAGDLALNDATLRETLHARGILTVGIPRTVEPLSPTPTPAAVHQLLTAAGLSRTRTPRQVQLACVAGYSRPVGGKYHCQFAEPRGR
Ga0126383_1215196013300010398Tropical Forest SoilRARWDVTLTLALLAHQQIVTQSRRLTHGKPLTHCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAARLPVGNPSDVSYVLPLVEQVQTACAKVTTCPAPAIHSLAGDLAVNDATVREHLHARGILTVGIPCTVEPLSPTPSPEAVHELLTSANLQRKRTPRQVQLACAAGYSRPVVESLIASLLSRGAGQLRYKGWHGACVQLT
Ga0134123_1154269913300010403Terrestrial SoilCKIVNAYDATIAPICKGKSNCPTQFGRKPGLLAEPATGFVFAAQLPVGNPSDVSYVLPLVDQVQTACSYVTTRPPPAIHSLAGDLAVNDPTLRERVHARGLLTVGIPRTIEPLSPTPSPETIHELLTPAGLPCKRTSRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRYGRLTTRAQKFRRLLHLKSPNLLKNKE*
Ga0138514_10003643813300011003SoilIATQSRRLTHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDVSYVLPLVDQVYTACSHVTTWPTPTIHSLAGDLALNDAKLRETLHTRHILTVGIPRTVEPLSPTPTPKGIHEVLTTAGLHHKRTPRQVQLACAAGHSRPVVESIIASLLSRGAAQLRYKGWHGACVQVTMAVLAHNAATIRRIRLGRLTTRAQKFRRLLHLQPPNLLKYHEGIN*
Ga0137399_1080319813300012203Vadose Zone SoilTRCKIVNAYDSTIAPICKGKSNCPTQFGRKPGILAESATGFVFAVQLPVGNPRDVSYVLPLLDQVQTALRQVTTRPTPAIHSLAGDLAVNDATLRETLHTRGILTVGIPHTIEPLAPTPPPGALQELLTAAGLLGKRTPRQAQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWYGARVQVTMAVMAHNAATVRRIRYGQLTTRAQKFRRLLRLKSPNLLKMKEGIN*
Ga0137362_1106611713300012205Vadose Zone SoilTGFIFAVQLPVGNPSAASYVVPLVDKVQTALAHVTTRPTPAIHSVAGDLALNDPTVRDILHSRGILTVGISRTVAPLSPSPSPEEIRQMLTEAGLTQKRTPHQVQLACAAGYSRPVVESIIASLLGRGAARLTYKGQRGAIVQVGMAVLAHNAATVQRIHQDRLSKRAHKFRRLLRLKHHKTNEFKVSKN*
Ga0137410_1080174513300012944Vadose Zone SoilGKSLSQCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFILAVQLPVGNPSAASYVVPLVDKVQTALAHVTTRPTPAIHSVAGDLALNDPTVRDILHSRGILTVGISRTVAPLSPSPSPEEIRQMLTEAGLTQKRTPHQVQLACAAGYSRPVVESIIASLLGRGAARLTYKGQRGALVQVGMAVLAHTAATVQRIHQDRLSKRAHKFRRLLRLKHHKTNEFKVSKN*
Ga0126375_1136538613300012948Tropical Forest SoilTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAAQLPVGNPSDASYVLPLVDQVQSACAKVTPRPGPAIHSLAGDLAVNDPTLRERLHARGILTVGIPHSLEPLSPTPTPEAVDQVLSTAGLLRKRTSHQVQMACAAGYSRPVVESMIASLLNRGAAQLRYKGWHGAGVQITMAVMAHNAATVRRIQFGRLTTRAQKFR
Ga0163162_1302019313300013306Switchgrass RhizosphereVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVFAARLPVGNPADVSYVLPLVEQVQTACAKVTTRPTPTIHSLAGDLAVNDATLREQLHAHGILTVGIPRTVEPLSPTPSPDAIDELLSTADLHGTRTPRQVQLACAAGYSRPVVESIIASVLSRGAGQLRYKGWHGACVQLTMA
Ga0157375_1297899313300013308Miscanthus RhizosphereEPATGFVFAAQLPVGNPSDTSYVLPLVDQVQATCARVTPRPAPSIHSLAGDLAVNDPTLREKLHTRGILTVGIPRTIESLSPTPSPEAIQELLTTADLPRKRTPRQVQLAYAAGYSRPVVESIIASLLSRGAGQLRYKGWHGAQVQFSMAVMAHNAATVRRIRHGRLTPRAQKFRRLLHLKSPNLLKN
Ga0075326_101973813300014271Natural And Restored WetlandsTQSRRLTHGKPLTQCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFIFAARLPVGNPSDVSYVVPLLDQAQAACRQVTTRPAPTILSLAGDLAVNDSTLRETLHTRGILTVGIPRTVEPLASTLSPETIDEVLTTTDLHGTRTPRHVQLACAAGYSRPVIESMIASLLSRGAGHLRYKGAHGARVQFTMAVLAHNAATVRRIRLGQLTTRAQKFRRLLHLKSLNPLKNKE*
Ga0163163_1329142813300014325Switchgrass RhizosphereRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDGSYVLPLVAKVQTAISHVTTRPTPAIHSLAGDLALNDATLRETLHARGILTVGIPRTVEPLSPTPTPAAVHQLLTAAGLSRTRTPRQVQLACVAGYSRPVVESIIASLLSRGAAQ
Ga0137409_1069698813300015245Vadose Zone SoilTIAPICKGKSNCPTQFGRKPGIIAEPATGFIFAVQLPVGNPSDASYVVPLVDKVQTALAHVTTRPTPAIHSVAGDLALNDPTVRDILHSRGILTVGISRTVAPLSPSPSPEEIRQMLTEAGLTQKRTPHQVQLACAAGYSRPVVESIIASLLGRGAARLTYKGQRGAIVQVGMAVLAHNAATVQRIHQDRLSKRAHKFRRLLRLKHHKTNEFKVSKN*
Ga0132257_10138247713300015373Arabidopsis RhizosphereGLLAEPATGFVFAARLPVGNPSDTSYVLPLIDQVQMACARVTTRPAPHIHSLAGDLAVNDPTLRESLHTRGILTVGIPRTVEPLSPTPSPEAIQELLTTADLQRTRTPRQGQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRYGRLTSRAQKFRQLLHLNPPNLLKNKEGIN*
Ga0132255_10396949913300015374Arabidopsis RhizosphereQRARWDVTLTLALLAHQQIVTQSRRLTHGKPLTQCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILAEPATGFVLAARLPVGNPSDVSYVLPLVEQVQTACAKVSARPAPAIHSLAGDLAVNDARLREQLHARGILTVGIPCTVEPLALTPSPEADYELLTSATLHRKRTPRHVQLACAVGYSRPVVESIIASLLSRGAGQLRYKGWHG
Ga0190266_1122394113300017965SoilAPICKGKSNCPTQFGRKPGLLAEPATGFVFAAQLPVGNPSDASYVLPLVAQVQRALSHVTTRPAPTIHSLAGDLAVNDPPLRERLHAQGILPVGIPRTIEPLSPTPSPEAIHEVLTTAGLQHKRTPRQVRLACAAGYSRPVVESIIACLLRRGAGQVRYKGWHGACVQLTMAVM
Ga0184634_1011273123300018031Groundwater SedimentPLPHGKIVNAYAPTIAPICKGKSNCPTQFGRKPGIIAEPPTGFVFAAQLPVGNPTDVSYVLPLVDQVQTACSHVTSRPTPTLHSLAGDLALNDAKVRETLHARHILTVGIPRTVEPLSPTPTPEEIHEVLTTAGLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGAQVQVTMAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGIN
Ga0184634_1034497613300018031Groundwater SedimentCPTQFGRKPGILAEPATGFVFAAQVPVGNPSAGSDVLPLVDQVQTALSQVTARPTPASHSRAGDLALNDATLRETLQTRGILPVGIPHPVEPLAPTPTPADIHEVLTTAGLHRRRTPRQVQLACAAGYRRPVVESLIASLLSRGAAQLRSKGWPGACVQVTMAVMAHTAATVRRIRLGRLTARAQKFRRLLHLKAPNLLKNKEGIN
Ga0184640_1046825513300018074Groundwater SedimentDPTITPICKSKSNCPTQFGRKPGIIGEPASGFIFAFALPVGNPADLSYVVPLVDKVQTAMALVAGRSALAIHSLAGDLALKYSKLRETLHTRGILTVGIPHTVEPLSPTPTPEAVHQLLTAAGLSRKRTPRQVPLACAAGYRRPVVESILASLLSRGAAQLRYKGWHGAQVQVTMAVLAHNAATV
Ga0184627_1041866923300018079Groundwater SedimentVGNPTDVRYLLPLVDQVQTACSHVTSRPTPTLHSLAGDLALNDAKVRETLHARHLLTFGIPRTGEPLSPTPPPEEIHEVLTTAGLHRNRTPRQLQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGAQVQVTMAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGIN
Ga0184627_1068365813300018079Groundwater SedimentATGFICAVHLPVGKPSAASSVLPVVDKGQTAFSHVPTRATPALHSLAGDLALNDPKRRASLPARSLLTVGIPHTVEPLAPTPTPEAVQRLLTPAGLSRKRPPRQVQLACAAGSSRPVGESIIASLLSRGAARVRYKGWHGAQVQVPMAVLAHTAATVRRIQLGRLIP
Ga0190265_1381903413300018422SoilPHQQIVPQSRRLTHSKPLTRCTIVNAYDPTIAPIGKGKSNCPPQFGRKPGIIAEPATGFVFAAQLPVGNPHDVRYVLPLVDQVQTALTHVTLRPRPALHSLAGDLALTDTTLRTTLHARGILTVGIPHTVEPLSPTPTPEAVHPVLTTTGLHRKRTPRQVQLACAAG
Ga0190272_1161333913300018429SoilQAQLQSAAELTQDQRAHWDMTLTLALVTHQQNVTQSRRITHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPNDVSYVLPLVDKVQTALTHVTLRPRPALHSLAGDLALNDTTLRTILHARGILTVGIPHTVEPLSPTPTPEAVPQVLTTTGLHRQRTSRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHG
Ga0190272_1165647513300018429SoilSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPNDGSYVLPLVAKVQTAISHVTTRPTPAIHSLAGDLALNDATLRETLHARGILTVGIPRTVEPLAPTPTPEVVHQLLTAAGLSRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGACVQLTMAILAHNAATVRRIRLGRLTTRAQEFRRLLHLRPPNLLKYQEGIN
Ga0190275_1289155613300018432SoilRARWDTTLTRALVTHQQIATQSRRLTPGKPLTRCKIVNAYAPTIAPLCKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDGSYVLPLVAKVLTAISPVPPRPTPTIHSLAGDLALNDATLRETLHTRGILTVGIPRTVEPLVPTPPPEVVHQLLTTAGLSRKRTPRQVQLACAAGYSRP
Ga0190271_1143363013300018481SoilHQQIATQSRRLTHSKLLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAAQLPVGNPSDGSYVIPLVDQVQTALSYVTTRATPTIHSLAGDLALNDTKLRETLHARHILTVGIPRTVEPLSPTPTPAQIHAVLTTAGLHRKRTPRQVQLACAAGYSRPVVESLIASLLSRGAGQLRYKGWHGACVQLTMAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKYQEGLN
Ga0190264_1049826513300019377SoilAVHLPVGNPSDASYVLPLVDKVQTAFPHVPPRRAPAIHSLAGDLALNDAQLRTTLHTRGVLTVGIPHSVEPLSPTPTPEAVQQLLTTPGVSRKRTPRQVQLACAAGYSRPVVESIIASLLNRGATRLRYKGWHGAQVQVHMAVLAHNAATVRRIQLGRLTTRAQKFRRLLHLKPPNRLKYQERIN
Ga0190267_1018061023300019767SoilKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAAPATGFVFAAQLPVGNPGEGSYVLPLVDKVQTALSPVPTRPTPAIHSLAGDLALNDATLRETLYARGILTVGIPCTVEPLSPTLPPGAVHQLLTTAGLSRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGACVQVTMAILAHNAATVRRIRLGRLTTRAQKFRRLLHLKSPNLLKYQGE
Ga0190267_1029271313300019767SoilQRLTSALNAAMKTQEHIRKQSLRLTQGKKLSHCKIVNAYDPTIAPICKGKSNCSTQFGRKPGIIAEPATGFVFAAQLPVGNPTDVSYVLPVVDKVQTALTHVTLRPRPALHSLAGDLALNDTTLRTILHARGILTVGIPPTGEPLSPTPTPEAVHQVLTTTGLHRKRTPRQVQLACAAGYSRLVVESIIASLLSRGAAQLRYKGWHGACVQVTRAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGIN
Ga0209519_1064012313300025318SoilNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPAPGFVFAAQLPVGNPSDGSYVLPLVDKVQTARSHVTTRPTPAIHSLAGDLALNDAKLRETLHARGILTVGIPRTVEPLSPTPTPEAVHQLLTAAGLSRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGACVQVTMAILAHNAATVRRIRL
Ga0209640_1075498123300025324SoilPTIAPICKGKSNCPTQFGRKPGIIGAPATGFVFAAQLPVGNPSDGSYVVPLVDQVQTALSYVTTRATPTIHSLAGDLALNDAKLRETRPARHLLTVGIPRTVAPLSPTPTPAEIHAVLTTAGLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLMMAVMAHNAATVRRIRYGRLTTRAQKFRRLLHLKPPHLLKNQEGLN
Ga0207684_1110925313300025910Corn, Switchgrass And Miscanthus RhizospherePVGNPSDASFVLPLVDQVQTALSQVTTRPTPTIHSLAGDLALNDSQLRETLHTRDILTVGIPHTVEPLSPTPPPAVIHQFLTAAGLHRKRTPRQVQLACAAGYSRPVIESIIASLLSRGAAQLRYKGWHGARVQVTMAVMAHNAATVRRIHLGRLTTRAQKFRRLLHLTPPNLLKIKEGI
Ga0207677_1166178113300026023Miscanthus RhizosphereTDDQRARWEVTLTLALRAHQQIVSQSRRLTHGKPLTHCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVLAARLPVGNPSDGSYVLPLLDQVQTALAKVTTRPAPAIHSLAGDLAVNDATLRETLHARGILTVGIPRTVEPLAPTLSPETIDEVLTTASLSHKRTPRQVQLACAAGYSRPVVESIITSV
Ga0207641_1243418213300026088Switchgrass RhizosphereVTQSRRLTHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGMIAEPATGFVFAAQLPVGNPSDTSYVLPLVDQVQATCARVTPRPAPSIHSLAGDLAVNDPTLREKLHTRGILTVGIPRTIESLSPTPSPEAIQELLTTADLQRPRTPRQVQLACAAGYSRPVVESLIAS
Ga0208912_107069113300026090Natural And Restored WetlandsQCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFIFAARLPVGNPSDVSYVVPLLDQAQAACRQVTTRPAPTILSLAGDLAVNDSTLRETLHTRGILTVGIPRTVEPLASTLSPETIDEVLTTTDLHGTRTPRHVQLACAAGYSRPVIESMIASLLSRGAGHLRYKGA
Ga0207675_10156495823300026118Switchgrass RhizosphereLPVGNPSDVSYVLPLVDQVQTACSYVTTRPPPAIHSLAGDLAVNDPKLRERLHARGILTVGIPRTIEPLSPTPSPETIHELLTTAGLPCKRTSRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRYGRLTTRAQKFRRLLHLKSPNLLKNKQGIN
Ga0209900_101609913300026888Groundwater SandGILGEPATGFVFAARLPVGNPSDVSYVVPLVDQVQAACAKVTTRPAPAIHSLAGDLAVNDSTLRETLHARGILTVGIPRTVEPISPTPSPEAVHELLTSADLHRKRTPRQVQLACAAGYSRPVVESLIASLLSRGAGHLRYKGWHGACVQLTMAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGL
Ga0209900_101952813300026888Groundwater SandVALAAHQQIATQSRRLTHGKPLSQCKIVNAYDPTIAPICKGKSNGPTQFGRKPGIIGEPASGFIFAFALPVGNPADLSYVVPLVDKVQTAIAYVAGRPPLAIHSLAGDLALNDATLRETLHGRGILTVGIPHTVAPLSPSPTLEDIRQMLTAAGLHGKRTPHQVHLACASGYSRPVVESL
Ga0209896_103282213300027006Groundwater SandAHQQIATQSRRLTHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVFAARLPVGNPSDVSYVVPLVDQVQAACAKVTTRPAPAIHSLAGDLAVNDSTLRETLHARGILTVGIPRTVEPISPTPSPEAVHELLTSADLHRKRTPRQVQLACAAGYSRPVVESLIASLLSRGLGHLRYKGWHGAC
Ga0209877_101835613300027032Groundwater SandQRARWDTTLLLALDAHQQIATQSRRLTHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVFAARLPVGNPSDVSYVVPLVDQVQAACAKVTTRPAPAIHSLAGDLAVNDSTLRETLHARGILTVGIPRTVEPISPTPSPEAVHELLTSADLHRKRTPRQVQLACAAGYSRPVVESLIASLLSRGLGHLRYKGWHGACVQLTMAVLA
Ga0209879_107200013300027056Groundwater SandPGILGEPATGFVFAARLPVGNPSDVSYVVPLVDQVQAACAKVTTRPAPAIHSLAGDLAVNDSTLRETLHARGILTVGIPRTVEPISPTPSPEAVHELLTSADLHRKRTPRQVQLACAAGYSRPVVESLIASLLSRGLGHLRYKGWHGACVQLTMAVLAHNAATVRRIRHGRLTTRAQKFR
Ga0209898_102978613300027068Groundwater SandGQSNCSTQFGRKPGMIAEPATGFVFAPQLPVGNPSEGSYVLPLVDQVQTALSPVTTRAIPTLHSLAGDLALNDATLRATLHARGILPVGIPHPLAPLSPTPTPEAVQQVLTAAGLQRKRTPRQVQLACAAGYSRPVGESLLASRLSRGAAQVRYKGWHGAQVQVTMAVLAHHAATIRRLQLGRLTTRAQKFRRFLRLKPPNLVKNKEEIN
Ga0209898_103831813300027068Groundwater SandLTRCKIVTAYDPTIAPICKGKSNCPTQFGRQPGILGEPATGFVFAARLPVGNPSDVSYVVPLVDQVQAACAKVTTRPAPAIHSLAGDLAVNDSTLRETLHARGILTVGIPRTVEPISPTPSPEAVHELLTSADLHRKRTPRQVQLACAAGYSRPVVESLIASLLSRGLGHLRYKGWHGACVQLTMAVLAHNAATVRRIRHGRLTT
Ga0209897_105557613300027169Groundwater SandVFAVQLPVGNPSDASYVLPLVDQVQTACSHVTTRPTPAIHSLAGDLALNDPTLRERLHARGILTVGIPCTVAPLAPPPSPAAIHELLTAADLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATVRRIRYGRLTTRTQKFRRLLHLKPPNLLKNKEGIN
Ga0209869_102482413300027187Groundwater SandGQSAAALTAEQRAGWDPTLTLALVTHQQLVTPSRRLTHGKPLTRGKIVHAYAPSLAPIGKGQSNCSTQFGRKPGMIAEPATGFVFAPQLPVGNPSEGSYVLPLVDQVQTALSPVTTRAIPTLHSLAGDLALNDATLRATLHARGILPVGIPHPLAPLSPTPTPEAVQQVLTAAGLQRKRTPRQVQLACAAGYSRPVGESLLASRLSRGAAQVRYKGWHGAQVQVTMA
Ga0209846_105606713300027277Groundwater SandLAAHQQIATQSRRLTHGKPLSQCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPASGFIFAFALPVGNPADLSYVVPLVDKVQTAIAYVAGRPTLAVHSLAGDLALNDATLRATLHGRGILTVGIPQTVAPLSPTPTPEEIRRMLHEAGLPGQRTPHQVRLACAAGYSRPVVESLIASLLSRGAARITYKGQRGAIV
Ga0209854_104699013300027384Groundwater SandYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAGQLPVGNPGDVSYVLPLVDKVQTAISYVTTRPTPTLHSLAGDLALNDAKLRETLHARGILTVGIPRTVEPLSPTPTPEAVQQLLTAAGLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGACVQLTMAVLAHNAATVRRIRYGRLTSRAQKFRRLLHLKSPNLLKNKE
(restricted) Ga0233416_1027602513300027799SedimentATGFVFAARLPVGNPTDVSYVLPLLNHFQAACRKVTTRPASAIQSLAGDLAVNDAPLRETLQARGILTVGIPRTVEPLAPTPTPEALQELLAAPEQPRTRTPRQVQLAYAAGYSRPVVESIIASLLSRGAGHLRYKGQHGACVQLTMAVMAHNAATVRRIRHGRLTPRAQKFRRLLHLKAPNWLKNKE
Ga0209797_1040588613300027831Wetland SedimentRRLTHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDGSYVRPLVDKVQTALSYVTTRPAPTIHSLAGDLALNDATLRETLHGRGILTVGIPRTVEPLSPTPTPAEIHTVLTTAGLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKG
Ga0209583_1041863613300027910WatershedsEAQRARWDTTLTLALVTHHQIATQSRCLTHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIGEPATGFVFAALLPVGNPSDGSYVVPLVDQVQMACSHVTTRPSPTIHSLAGDLALNDAKLRETLHARGILTVGIPRTVEPLSPTPTPEAVQQMLTTAGLHRTRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGAQI
Ga0209069_1038579313300027915WatershedsDPTLAPICKGKSNCPTQFGRKPGIIAEPATGFVFATQLPVGNSSDGSYVGPLVDKVQTALTHVTRRSRPALHSLEGDLALNDPKLRETLHARGILTVGIPRTIEPLSPTPSPEAIHEVLTAAGLHHKRTPRQMQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGAQVQVTMAVLAHNAATVRRIRYGRLTTRAQKFRQLLHLKLPNLLKNQERVN
Ga0209885_102880313300027950Groundwater SandPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAPQLPVGNPSEGSYVLPLVDQVQTALSPVTTRAIPTLHSLAGDLALNDATLRATLHARGILPVGIPHPLAPLSPTPTPEAVQQVLTAAGLQRKRTPRQVQLACAAGYSRPVGESLLASRLSRGAAQVRYKGWHGAQVQVTMAVLAHHAATIRRLQLGRLTT
Ga0209889_103838613300027952Groundwater SandVKADAPTIAPIGKGKSNCPTQFGRQPGISAEPATGFVFAAQLPVGNPSAVSYVGPLVDKVQTALTHVTTRPTPAIHSRAGDLALNDAKLRETLHARGILTVGIPRTVAPLSPTPAPEAIQEVLTAAGLHRNRSPRQVQLACAAGYSRPVVESLIASLLSRGAAQLRYKGWHGAQVQVTMAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGLN
Ga0209859_104329713300027954Groundwater SandIATQSRRLTHGKPLTRCKIVKAYAPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDVSYVGPLVDKVQTALTHVTTRPTPAIHSRAGDLALNDAKLRETLHARGILTVGIPRTVAPLSPTPAPEAIQEVLTAAGLHRNRSPRQVQLACAAGYSRPVVESLIASLLSRGAAQLRYKGWHGAQVQVTMAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNQEGIN
Ga0209853_112278923300027961Groundwater SandPVGNPSDVSYVVPLVDKVQTALTQVTRRPRPALHSLAGDLALNDPKLRATLHARGILTVGIPRTVEPLSPTPSPEAIQEMLTATGLHRKRTPRQVQLACAAGYSRPVVESLIASLLSRGAAQLRYKGWHGAQVQVTMAVMAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNPLKYQEGI
Ga0256864_124947013300027964SoilTVAVATHQQIVTQSRRLTHGKPLTRCKIVNAYDATIAPICKGKSNCPTQFGRKPGILAEPATGFVFAARLPVGNPSDVSYVLPLVDQVQTARARVTTRPAPSIHSLAGDLAVNDPTLRERLHTRGLLTVGIPRTAEPLSPTPSPEAIQELLTTADLQCQRTPRQVQLAC
(restricted) Ga0233417_1018080613300028043SedimentYDPTIAPICKGKSNCPTQFGRKPGILGEPATGFVFAARLPVGNHSDVSYVLPLLDHVQTALAKVTTRPAPGIQSLACDLAVNDATLREQLHARGILTVGIPRSVEPLAPTPSPEDIDEALTTADLHGKRTPRQVHLACAAGYSRPVVESIIASLLSRGAGHLRYKGWHGAQVQFAMTVLAHNAATLRRIRHGRLTTRAQKFRRLLHLKAPNLLKNKE
Ga0247827_1074626713300028889SoilAPICKGKSNCPTQFGRKPGLLAEPATGFVFAARLPVGNPSDASYVLPLVDQVQQAFAQVTTRPVPAVHSLAGDLAVNDPTLRENLHTRGILTVGIPRSVEPLSPTPSPEVLQEVLTTADLHRKRTPCQVQLAYAAGYSRPVVESIIASLLSRGAGQLRYKGWHGAGVQLTMAVMAHNAATVRRIRYGRLTLRAQKFRRLLHLKPPNLLKNKQRI
Ga0268386_1066846313300030619SoilCKGKSNCPTQFGRKPGLLAEPATGFVFAARLPVGNPSDVSYVLPLVGQVQTACARVPTRPTPRLHSLAGDLAVNDPTLRESLHLRGILTVGIPRTVAPLSPTPSPEAIDALLTTTDLHRKPTLRQAQLACAAGYSRPVVASIIASLLSRGAGQLRYKGWHGAWVQVTMAVMAHNAATVRRIRCGHLTSRAQKFRRLLHLKPPNLLKNKERIN
Ga0268386_1101948813300030619SoilLTRCKIVNAYDATIAPICKGKSNCPTQFGRKPGILAEPATGFVFAARLPVGNPTDLSYVLPLVDQVQTACARVSSRPTPRIHSLAGDLAVNDSTLRESLHLRGILTVGIPRTVEPLSPTPSPEAIQEGLTAADLHRKRTAHQVRLACAAGYSRPVVESIIASLLSRGAG
Ga0308206_101384923300030903SoilMFAVQLPVGNPSAASYVLPLIAKVQPALTPVTTRLLPAIHSLAGDLALNDSQLREPLHARHLLTVGIPHTFEPLSPTPPPEVIHQLLTIASLHRNRTPRQVQLACTAGYSRPVVESISASLLNRGAAQLRYKGWHGACVQVPMVGLAHNAATVRRIQLGRLTTHAQKFR
Ga0308204_1006406413300031092SoilMFAVQLPVGNPSAASYVLPLIAKVQPALTPVTTRLLPAIHSLAGDLALNDSQLREPLHARHLLTVGIPHTFEPLSPTPPPEVIHQLLTIASLHRKRTPRQVQLACTAGYSRPVVESISASLLNRGAAQLRYKGWHGAWVQVPMAGLAHNAANVRRIQLGRLTTHAQKFRRLLHLK
(restricted) Ga0255311_110879013300031150Sandy SoilIATQSRRLTHGKPLTRGQIVNAYAPPIAPICKGKSNCPTQFGRKPGILAEPATGFVFAAQLPVGTPSDVSYVVPLVDKVQTALTQVTRRPRPALHSLAGDLALNDPQLREILHTRGILTVGIPRTIEPLAPTPLPEAIQAVLTAAGLHRKRSPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGAQVQVTMAVL
Ga0307469_1122591623300031720Hardwood Forest SoilPATGFVFAAQLPVGNPSDGSYVVPLVDKVQTAVTHVTRRLQPALHSLAGDLALNDPKLRETLHVRGILTVGIPRTVEPLSPTPSPEDIHEVLTTAGLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGASQLRYKGWHGAWVQVTMAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKPPNLLKNHERIN
Ga0307468_10163865013300031740Hardwood Forest SoilPATGFVFAAQLPVGNPSDGSYVVPLVDKVQTAVTHVTRRLQPALHSLAGDLALNDPKLRETLHARGILTVGIPRTVEPLSPTPSPEDIHEVLTTAGLHRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYKGWHGACVQVTMAVLAHNAATVRRIRLGRLTTRAQKFRRLLHLKLPNLLKNQMGIN
Ga0307473_1119929213300031820Hardwood Forest SoilKPLTQCKIVNAYDPTIAPICKGKSNCPTQFGRKPGLIAEPAAGFIFAFQLPGGNPTDPSYVAPLVDKVQTAIAHVAGRSPLAIHSLAGDLALNDSRLRETLHGRGILTVGIPHTVEPLTPAPAPKEIRRMLHEAGLQHQRTPHQVQLACAGGYSRPVVESIIASLLGRGAARITYKGHRGAIVQVGM
Ga0307411_1111100213300032005RhizosphereQLLELGQPIGKLAQSVQQCLQEASHLRETTRERLASQLRVAMTAHQQISKQSHRLTQGKRLTQCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIISGPAEGFVFAVQLPVGNPSDASYVVPLVDKVEQAIARVTSRPKLAIHSVAGDMGVNAPKVHHTLHARGILTIGIAKTVEPMNPKPTPEEVLAILNEAGLNRTRTPHQVRLACACGYSRPVVESHIASLLARGAGQLRYKGPQ
Ga0335084_1141236113300033004SoilTHGKPLTRCKIVNAYDATIAPICKGKSNCPTQFGRKPGILGEPATGFVFAARLPVGNPSDVSYVLPLVEQVQTACARVMTRPTPTIHSLAGDLAVNDATLRERLHLHGILTVGIPRTVEPLSPTPSTDAIDELLTTAHLHGTRTPRQVQLACAAGYSRPVVESIIASLLSRGAGQLRYKGWHGACVQLTMAVMAHNAATLRRIRHGRLTTRAQKFRRLLHLKPSNLLKN
Ga0247830_1123919913300033551SoilTLTLALLAHQQIVTQSRRLTHGKPLTHCKIVNAYDPTIAPICKGKSNCPTQFGRKPGILAEPATGFVFAAQLPVGNPSDVSYVVPLIDQVQAACRKVTTRPAPAIHSLAGDLAMNDPTLRETLHARGILTVGIPRTVEPLVLPSSPEDIREILAATGLHRTRTPRQVQLACAAGDSRPVVESIIASLLSRGAGQLRYK
Ga0364930_0222650_3_6383300033814SedimentARWDTTLTLALVTHQQIATQSRRLTHGKPLTRCKIVNAYDPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPSDGSYVLPLVAKVQTAISHVTTRPTPAIHSLAGDLALNDATLRERLHARGILTVGIPRTVEPLSPTPPPEVIRQVLTTAGLQRKRTPRQVQLACAAGYSRPVVESIIASLLSRGAAQLRYNTTLLEVAIP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.