NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F063526



Overview

Basic Information
Family ID F063526
Family Type Metagenome / Metatranscriptome
Number of Sequences 129
Average Sequence Length 107 residues
Representative Sequence MCKYFTATDEFAVPTTQDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Number of Associated Samples 97
Number of Associated Scaffolds 129

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 60.00 %
% of genes near scaffold ends (potentially truncated) 1.55 %
% of genes from short scaffolds (< 2000 bps) 5.43 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.77

Hidden Markov Model (logo powered by Skylign)

Most Common Taxonomy
Group Unclassified (92.248 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (41.861 % of family members)
Environment Ontology (ENVO) Unclassified (53.488 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (64.341 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution:
α-helix: 23.65%
β-sheet: 29.73%
Coil/Unstructured: 46.62%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.77

Potential Novel Structural Fold:

This family has a high confidence model (pTM >=0.7) with no significant hits to either SCOPe or PDB biological assemblies. It is, therefore, classified as a potential novel structural fold.
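
The classification rule stated above can be written as a small predicate. This is only a sketch of the rule as worded on this page: the function name, its parameters, and the idea that "significant hit" arrives as a ready-made boolean are assumptions for illustration, not part of NMPFamsDB.

```python
def is_potential_novel_fold(ptm_score, has_scope_hit, has_pdb_hit, ptm_cutoff=0.7):
    """Apply the rule quoted above: confident model AND no known-structure hit.

    ptm_score     -- pTM-score of the predicted 3D model
    has_scope_hit -- True if a significant SCOPe hit exists (hypothetical input)
    has_pdb_hit   -- True if a significant PDB biological assembly hit exists
    ptm_cutoff    -- threshold taken from the text (pTM >= 0.7)
    """
    confident = ptm_score >= ptm_cutoff
    return confident and not (has_scope_hit or has_pdb_hit)


# Family F063526: pTM-score 0.77 and no significant hits reported.
print(is_potential_novel_fold(0.77, has_scope_hit=False, has_pdb_hit=False))
```

How a hit is judged "significant" is not described on this page, so that decision is left to the caller.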



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 129 Family Scaffolds
PF02803 | Thiolase_C | 43.41
PF00296 | Bac_luciferase | 19.38
PF10604 | Polyketide_cyc2 | 3.10
PF13508 | Acetyltransf_7 | 2.33
PF01261 | AP_endonuc_2 | 1.55
PF00583 | Acetyltransf_1 | 1.55
PF00180 | Iso_dh | 1.55
PF16112 | DUF4830 | 0.78
PF02782 | FGGY_C | 0.78
PF02325 | YGGT | 0.78
PF00005 | ABC_tran | 0.78
PF03176 | MMPL | 0.78
PF05977 | MFS_3 | 0.78
PF09261 | Alpha-mann_mid | 0.78
PF06905 | FAIM1 | 0.78
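
The frequencies above are percentages of the 129 family scaffolds, so each one should correspond to a whole number of scaffolds. The sketch below recovers those counts, assuming the page rounds a simple count/129 ratio to two decimals; the function name is hypothetical and only a few rows are included.

```python
def implied_scaffold_count(percent, total_scaffolds=129):
    """Convert a '% Frequency' value back to the nearest whole scaffold count."""
    return round(percent / 100 * total_scaffolds)


# A few rows copied from the Pfam table above.
pfam_freq = {
    "Thiolase_C": 43.41,
    "Bac_luciferase": 19.38,
    "Polyketide_cyc2": 3.10,
    "DUF4830": 0.78,
}

for name, pct in pfam_freq.items():
    count = implied_scaffold_count(pct)
    # Recompute the percentage from the count to confirm the round trip.
    recomputed = round(100 * count / 129, 2)
    print(f"{name}: {count} scaffolds ({recomputed} %)")
```

Under that assumption, the smallest value in the table (0.78 %) corresponds to a single scaffold.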

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 129 Family Scaffolds
COG0183 | Acetyl-CoA acetyltransferase | Lipid transport and metabolism [I] | 43.41
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 19.38
COG0383 | Alpha-mannosidase | Carbohydrate transport and metabolism [G] | 0.78
COG0762 | Cytochrome b6 maturation protein CCB3/Ycf19 and related maturases, YggT family | Posttranslational modification, protein turnover, chaperones [O] | 0.78
COG1033 | Predicted exporter protein, RND superfamily | General function prediction only [R] | 0.78
COG2409 | Predicted lipid transporter YdfJ, MMPL/SSD domain, RND superfamily | General function prediction only [R] | 0.78
COG2814 | Predicted arabinose efflux permease AraJ, MFS family | Carbohydrate transport and metabolism [G] | 0.78



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 92.25 %
All Organisms | root | All Organisms | 7.75 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300005174|Ga0066680_10009156 | All Organisms → cellular organisms → Bacteria | 4995
3300005446|Ga0066686_10116587 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1737
3300005556|Ga0066707_10138206 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1535
3300009088|Ga0099830_10078715 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 2411
3300009147|Ga0114129_10563105 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1481
3300012209|Ga0137379_10759855 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 874
3300012918|Ga0137396_10536664 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 866
3300026328|Ga0209802_1037227 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 2504
3300028536|Ga0137415_10442084 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1106
3300031421|Ga0308194_10028960 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1286
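
Each scaffold identifier above has two parts joined by a vertical bar, and the numeric prefix matches the Taxon OID values listed under Associated Samples (for example 3300005174). A minimal sketch for splitting such identifiers follows; the `ScaffoldRef` type and `parse_scaffold_ref` function are hypothetical names, and reading the prefix as a Taxon OID is an inference from this page, not a documented format.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric prefix, assumed to be the sample's Taxon OID
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold_ref(text):
    """Split 'TAXON_OID|SCAFFOLD_ID' into its two parts."""
    taxon_oid, sep, scaffold_id = text.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold_ref("3300005174|Ga0066680_10009156")
print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the Taxon OID as a string avoids accidental numeric formatting when it is later used as a lookup key.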




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 41.86%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 22.48%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 13.95%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 8.53%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.10%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.88%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.55%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.55%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.78%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 0.78%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.78%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.78%
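
Several habitat names appear in more than one row above because they sit under different ecosystem paths. The sketch below totals the distribution by habitat name and checks that the rows add up to roughly 100 %. The values are copied from the table; grouping by name alone is an illustrative choice, not something the page prescribes.

```python
from collections import defaultdict

# (habitat name, % of family members), one entry per table row above.
rows = [
    ("Soil", 41.86),
    ("Vadose Zone Soil", 22.48),
    ("Soil", 13.95),
    ("Grasslands Soil", 8.53),
    ("Grasslands Soil", 3.10),
    ("Corn, Switchgrass And Miscanthus Rhizosphere", 3.88),
    ("Groundwater Sediment", 1.55),
    ("Hardwood Forest Soil", 1.55),
    ("Groundwater Sediment", 0.78),
    ("Permafrost", 0.78),
    ("Soil", 0.78),
    ("Populus Rhizosphere", 0.78),
]

# Sum percentages for rows that share a habitat name.
by_habitat = defaultdict(float)
for habitat, pct in rows:
    by_habitat[habitat] += pct

total = sum(pct for _, pct in rows)

for habitat, pct in sorted(by_habitat.items(), key=lambda kv: -kv[1]):
    print(f"{habitat}: {pct:.2f} %")
print(f"Total: {total:.2f} %")
```

The total lands slightly off 100 because each row is already rounded to two decimals.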




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental
3300010322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300013772 | Permafrost microbial communities from Nunavut, Canada - A10_80_0.25M | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018027 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019885 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2 | Environmental
3300020006 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m2 | Environmental
3300020018 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s2 | Environmental
3300021344 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2 | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes) | Environmental
3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental
3300026326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes) | Environmental
3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026523 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes) | Environmental
3300026528 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes) | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026530 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes) | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental
3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental
3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental
3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental
3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028718 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194 | Environmental
3300028768 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_119 | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300028799 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123 | Environmental
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental
3300028811 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149 | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300028884 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195 | Environmental
3300028885 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185 | Environmental
3300031421 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome) | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0066672_100742974 | 3300005167 | Soil | MPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSRPSDQTA*
Ga0066672_103457202 | 3300005167 | Soil | MSKYFEADDALAVETTRDMDPDHPGRRVLPGLMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAIDRTNVRNRPN*
Ga0066677_101458061 | 3300005171 | Soil | MCKYFEADDALAVETTRDMDPDHPGRRVLPGLMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLG
Ga0066683_103890932 | 3300005172 | Soil | MPGLMKAVVKIYEPDDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA*
Ga0066680_100091566 | 3300005174 | Soil | MPGLMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALALSRSRPHDQTA*
Ga0066680_101252582 | 3300005174 | Soil | MCRYFTATDELAVETTQDMDPEHRGTRLMPGLMKAMVTIYEPVDDAHPSLPTLREFLAKHTDTESGLVRYDFFGPTLTIDPTPLLELPPEYLDYRVGPGCSFVTHEALTRSRSHPSGQPS
Ga0066679_100012395 | 3300005176 | Soil | MCKYFEADDALAVETTRDMDPDHPGRRVLPGLMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAIDRTNVRNRPN*
Ga0066690_100209913 | 3300005177 | Soil | MCKYFTATDEFALPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSRPSDQTA
Ga0066688_100292333 | 3300005178 | Soil | MCKYFTATDEFAVPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSRPSDQTA
Ga0066688_100357822 | 3300005178 | Soil | MPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA*
Ga0066688_106102822 | 3300005178 | Soil | DDALAVETTRDMDPDHPGRRVLPGLMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAIDRTNVRNRPN*
Ga0066684_102832191 | 3300005179 | Soil | MDPEHRGTRLMPGLMKAVVKIYEPDDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQ
Ga0066685_100243305 | 3300005180 | Soil | MCKYFTATDEFAVPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSLSDQTA*
Ga0066685_107123312 | 3300005180 | Soil | MPGLMKAVVKIYEPDDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGVSFVTDEALALSRSRPSDQTA*
Ga0066675_108351821 | 3300005187 | Soil | MPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSLSDQTA*
Ga0066686_101165871 | 3300005446 | Soil | MPGLMKAMVKIYEPDDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA*
Ga0066682_108690731 | 3300005450 | Soil | LMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA*
Ga0070706_1001088632 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MDARTSTDSSEERFVCKYFTAGDEYAVETTRDMDPERRGNRVMSGLMKAHITVYEPGDYAHPSLPTLRAFLADHIEPSTGLLSYTTLGPNLTIDPAPLMALPEEYLGYRVSPGATFVTHEALRQAQLQS*
Ga0070706_1007949491 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | AVETKKDMNPARPGTRLMSGLMKAMVKVYESGEVADPSLPTLREFLAKHTDPESGLVSYDYAGPTLTIDPRPLLQLPPEYLDYRVGPGSSFVTHEALAHSGSHPSARST*
Ga0070697_1020669541 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MSKWFTATDDLAVETKKDMNPARPGTRLMSGLMKAMVKVYESGEVADPSLPTLREFLAKHTDPESGLVSYDYAGPTLTIDPRPLLQLPPEYLDYRVGPGSSFVTHEALAHSGSHPSARST
Ga0066697_100429042 | 3300005540 | Soil | MDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA*
Ga0070695_1014952241 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MSKWFTATDDLAVETKKDMNPARPGTRLMSGLMKAMVKVYESGEVADPSLPTLREFLAKHTDPESGLVSYDYAGPTLAIDPRPLLQLPPEY
Ga0066701_100427194 | 3300005552 | Soil | MCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAMVKIYEPDDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Ga0066695_100198531 | 3300005553 | Soil | MCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEA
Ga0066707_101382063 | 3300005556 | Soil | MCKYFTATDEFAVPTTQDMDPEHRGTRLMPGLMKAMVTIYEPDDDAHHSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPHDQTA
Ga0066704_105129562 | 3300005557 | Soil | MPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALALSRSRPHDQTA*
Ga0066698_103101851 | 3300005558 | Soil | MCKYFTATDEFAVPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFV
Ga0066703_104282982 | 3300005568 | Soil | MSKWFTATDDLAVETAKDMNPAGPGHRLMPGLMKAMVKVYEPGEVADPSLPTLREFLAKHTDPKSGLVSYDYAGPTLTIDPRPLLQLPPEYLDYRVGPGSSFVTHEALAHFT*
Ga0066703_105228701 | 3300005568 | Soil | MKAMVAVYEPIDDPHPSLLTLREFLATHTDPESGLVRYDHFGPNLSIDPAPLLQLPPEYLDYRVGPGSSFVTDDALALARSKGSH*
Ga0066705_100357382 | 3300005569 | Soil | MCKYFEADDALAVETTRDMDPDHPGRRVLPGLMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAPGRVNGRNRPN*
Ga0066702_103194022 | 3300005575 | Soil | PSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSRPSDQTA*
Ga0066708_101975242 | 3300005576 | Soil | MCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEAALAWSRSRPSDQTA*
Ga0066708_103598362 | 3300005576 | Soil | MCKYFTATDEFAVPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDAHPSLPTLREFLANNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSRPSDQTA
Ga0066691_108996431 | 3300005586 | Soil | GTRLMPGLMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALALSRSRPHDQTA*
Ga0066696_102484872 | 3300006032 | Soil | MCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRIGPGASFVTDEALALSRSRPSDQTA
Ga0066696_105882772 | 3300006032 | Soil | LAVETTRDMDPDHPGRRVLPGLMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAIDRTNVRNRPN*
Ga0066665_100109268 | 3300006796 | Soil | MCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Ga0066659_102597892 | 3300006797 | Soil | MCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGATFVTDEALALSRSRPSDQTA
Ga0066659_111489571 | 3300006797 | Soil | EFAVPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSRPSDQTA*
Ga0066660_100536105 | 3300006800 | Soil | FAVPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSRPSDQTA*
Ga0066660_103847662 | 3300006800 | Soil | DHPGRRVLPGLMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAIDRTNVRNRPN*
Ga0099793_100883912 | 3300007258 | Vadose Zone Soil | MCKYFTATDEFAVPTTQDMDPEHRGTRLMPGRMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPESGLLSYALLGPRLTIDPAPLLKLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQTA
Ga0066710_1023979602 | 3300009012 | Grasslands Soil | RLMPGLMKAMVKVYEPGEVADPSLPTLREFLAKHTDPKSGLVSYDYAGPTLTIDPRPLLQLPPEQLAYRVGPGSSFVTHEALAHFT
Ga0066710_1028164092 | 3300009012 | Grasslands Soil | MCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAVVKIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Ga0066710_1033708531 | 3300009012 | Grasslands Soil | MCKYFTATDEFAVPTTQDMDPEHRGTRLMPGLMKAFVTIYEPNDDAHPSLPTLREFLKKNTDPKSGLVSYALLGPRLTIDPAPLLGLPPEYLDYRVGPGASFVTDEAALARSRSRPSDQT
Ga0099829_106964021 | 3300009038 | Vadose Zone Soil | MCKYFTATDEFAVRTTRDMDPEHRGTHLMPGLMKVFVTIYEPNDDAHPSLPTLREFLAKNTDRKSGLVSYALLGPRLTIDPAPLLQLPPKYLDYRVGPGASFVTDEALAW
Ga0099830_100787151 | 3300009088 | Vadose Zone Soil | PGLMRAMVTVYEPADEAHPSLPTLREFLAKHTNPESGLVSFDYLGPTLTIDPTPLLQLPPEYLDYRVGPGSSFVTYEGLARSGSRESGQSA*
Ga0099828_100236164 | 3300009089 | Vadose Zone Soil | MCKDFTATDDLAVETTHDMEPARPGTRLMPGLMRAMVTVYEPADEAHPSLPTLREFLAKHTNPESGLVSFDYLGPTLTIDPTPLLQLPPEYLDYRVGPGSSFVTYEGLARSGSRESGQSA
Ga0099827_103259981 | 3300009090 | Vadose Zone Soil | MCRYFTATDELAVETTQDMDPEHRGTRPMPGLMKAMVTIYEPVDDAHPSLPTLREFLAKHTDTESGLVRYDFFGPTLTIDPTPLLELPPEYLDYRVGPGCSFVTHEALTRSRSH
Ga0066709_1031479272 | 3300009137 | Grasslands Soil | MCKYFTATDEFAVPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPLEYLDYRVGPGASFVSDEALAWSRSRPSDQTA
Ga0066709_1031768582 | 3300009137 | Grasslands Soil | MCRDFTATDEFAVPTTQDMDPEHRGTRLMPGLMKAFVTIYEPNDDAHPSLPTLREFLAKNTDRESGLVSYALLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALALSRSRPHDQT
Ga0114129_105631053 | 3300009147 | Populus Rhizosphere | MCKYFTATDEFAVETTHDMDPEHRGTRIVPGLMKAMIAVYEPGDYAHPSLPTLREFLAVHTDPESGLLSYDRLGPTLTIDPTPLLQLPPEYLGYRVSLGSSFVTLESLARSGSRRDDPTP
Ga0134067_103047241 | 3300010321 | Grasslands Soil | EPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA*
Ga0134084_101287921 | 3300010322 | Grasslands Soil | HPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSLSDQTA*
Ga0137399_101456252 | 3300012203 | Vadose Zone Soil | MCKYFTATDEFAVPTTQDMDPEHRGTRLMPGRMKAFLTIYEPNDDGHPSLPTLREFLAKNTDTKSGLVSYALQGPTLTIDPAPLLKLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQTA
Ga0137399_103211742 | 3300012203 | Vadose Zone Soil | MCKYFTATDEFAVKTTRDMDPEHRGSRLMPGLMKAMVTIYEPNDDPHPSLPTLREFLATHTDPESGLVRYDRFGPMLSIDPAPLLQLPPEYLDYRVGPGSSFVTDDALALARSKGSH*
Ga0137399_103743231 | 3300012203 | Vadose Zone Soil | MCKYFSAGDEFAVETTRDMDPERKGTRVMSGLMKAHITVYEPGEYVHPSLPTLRDSLVTHTDPNTGLLSYDRLGPTLVIDPAPLIALPTEYLDYRVSPGASFVTHEALRSVRGPS*
Ga0137399_116026541 | 3300012203 | Vadose Zone Soil | RGTRLMPGLMKTFVTIYEPNDDAHPSLQTLREFLAKNTDPTSGLVSYALQGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQTA*
Ga0137380_103101082 | 3300012206 | Vadose Zone Soil | MCKYFTATDEFAVRTTQDMDPEHRGTHLMPGLMKALVTIYEPNDDAHPSLPTLREFLAKNTEPKSGLVRYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPNDQTA
Ga0137376_106401861 | 3300012208 | Vadose Zone Soil | MCKYFTATDDLAVETTKDMDPERRGNRWMPGLMKAMVKVYEPGEEADPSLPTLREFLAKHTDPESGLVNYDYAGPTLTIDPKPLLQLPPEYLDYPVGPGSS
Ga0137379_107598552 | 3300012209 | Vadose Zone Soil | MPGLMKAMVTIYEPNDDPHPSLLTLREFLAKHRDPQSGLVSYDFYGPTLTIDPSPFLQLPSEYLDYRVGPGASFVPLEVLARSGSGKVT*
Ga0137377_115557872 | 3300012211 | Vadose Zone Soil | MCKYFTATDDLAVETTKDMDPERRSNRWMPGLMKAMVKVYEPGEEADPSLPTLREFLAKHTDPESGLVNYDYAGPTLTIDPKPLLQLPPEYLDYPVGPGSTFVTYEALAHSL*
Ga0137385_105490222 | 3300012359 | Vadose Zone Soil | MCKYFTATDEFAVRTTQDMDPEHRGTHLMPGLMKALVTIYEPNDDAHPSLPTLREFLAKNTEPKSGLVRYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALA
Ga0137397_104958232 | 3300012685 | Vadose Zone Soil | LSKWFTATDDLAVETKKDMNPAAPGTRLMPGLMKAMVKVYEPGEVADPSLPTLREFLAKHTDPKSGLVSYDYAGPILMIDSRPLLQLPPEYLDYRVGPGSSFVTHEALAHFI*
Ga0137396_102252402 | 3300012918 | Vadose Zone Soil | MCKYFSAGDEFAVKTTRDMDPERKGTRVMSGLMKAHITVYEPGEYAHPSLLTLRDFLVTHTDPNTGLLSYDRLGPTLVIDPAPLIALPTEYLDYRVSPGASFVTHEALRSVRGPS*
Ga0137396_103209002 | 3300012918 | Vadose Zone Soil | VCKYFTATDEFAVPTTQDMDPEHRGTRLMPGLMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPTSGLVSYALQGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQTA
Ga0137396_105366642 | 3300012918 | Vadose Zone Soil | MCKYFTATDEFAVKTTRDMDPEHRGTRLMPGLMKAMVTIYEPNDDPHPSLPTLREFLATHTDPESGLVRYDRFGPMLSIDPAPLLQLPPEYLDYRVGPGSSFVTDDALARSRSGKIT*
Ga0137396_106579462 | 3300012918 | Vadose Zone Soil | GDAPHPSLPTLREFLAKHTDPESGIVSYDFLGPTLMIDPTPLLQLPPEYLDYRAGPGSSFVAHQALARSGSHPSGQSA*
Ga0137396_112362761 | 3300012918 | Vadose Zone Soil | GDAPHPSLPTLREFLAKHTDPESGIVSYDFLGPTLMIDPTPLLQLPPEYLDYRVGPGSSFVTHEALARSRSHPGGQSA*
Ga0137359_107540612 | 3300012923 | Vadose Zone Soil | TDDLAVETTKDMNPAGPGTRLMPGLMKAMVKVYEPGEVADPSLPTLREFLAKHTDPKSGLVSYDYAGPTLTIDPRPLLQLPPAYLDCRVGPGSSFVTHEALAHFI*
Ga0137419_118820522 | 3300012925 | Vadose Zone Soil | MSKWFTARDDLAVETKKDMNPAGPGHRLMPGLMKAMVKVYEPGEVADPSLPTLREFLAKHTDPKSGLVSYDYAGPILTIDPRPLLQLPPEHLDYRVGPGSSFVTHEALAHFT*
Ga0137416_103215332 | 3300012927 | Vadose Zone Soil | MCKYFTATDDLGVKAIKDMDPERRSTRVMAGLMKAMVTIYEPGDAPHPSLPTLREFLAKHTDPETGLVSYGYAGPTLAIDPTPLLQLPPEYLDYRVGPGSSFVTHEALAHSRSRTSGQSA
Ga0137416_103593412 | 3300012927 | Vadose Zone Soil | MCKYFTATDEFAVKTTRDMDPEHRGTRLRPGLMKAMVTIYEPNDDPHPSLPTLREFLATHTDPESGLVRYDRLGPMLSIDPAPLLQLPPEYLDYRVGPGSSFVTDDALALARSKGSH*
Ga0137416_120490931 | 3300012927 | Vadose Zone Soil | NDDAHPSLPTLREFLARNTDPKSGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPIDQTA*
Ga0134087_101997622 | 3300012977 | Grasslands Soil | MPGLMKAFVTIYEPDDDAHPSLPTLREFLAENTDPEPGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA*
Ga0120158_104197622 | 3300013772 | Permafrost | MPGLMKAMVTIYEPNDDPHPSLTTLREFLAKHTDPTSGLVSYEFVGPTLTIDPTPLLQLPAEYLDYRVGPGSSFVTNEALAWSRSRRSDQTA*
Ga0134072_102390822 | 3300015357 | Grasslands Soil | MDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRP
Ga0184604_100216241 | 3300018000 | Groundwater Sediment | MCKYFSAGDEYAVETTRDMDPEQRGTHVMSGRMKAHLTVYEPGDSCSPTLPTLRQFLADYTDSKTGLLSYQILGPKLTIDPAPLLALPEEYLDYGVSPGASFVTHEALSGSRRPAPRTPGQA
Ga0184605_104119361 | 3300018027 | Groundwater Sediment | MCKYFSAGDEYAVETTRDMDPEQKGTHVMPGRMKAHLTVYEPNDFAHPSLPTLRDFLADYTNPDTGLLTYEMLGPKLTIDPAPLRALPEEYLDYRVSPGASFVTQEALSG
Ga0066667_112847322 | 3300018433 | Grasslands Soil | MCKYFEADDALAVETTRDMDPDHPGRRVLPGLMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAIDRTNVRNRPN
Ga0066667_114804472 | 3300018433 | Grasslands Soil | MPGLMKAVVKIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Ga0066662_1007783733300018468Grasslands SoilMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPHDQTA
Ga0066662_1293057313300018468Grasslands SoilMCRYFTATDELAVETTQDMDPEHRGTRLMPGLMKAMVTIYEPVDDAHPSLPTLREFLAKHTDTESGLVRYDFFGPTLTIDPTPLLELPPEYLDYLVGPGCSFVTHEALTRSRSHPSGQPA
Ga0193747_100473913300019885SoilPRIHRSMCKYFTATDEFAVPTTQDMDPEHRGTRLMPGRMKAFLTIYEPNDDAHPSLLTLREFLAENTDRESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQSA
Ga0193735_100442343300020006SoilMCKYFTATDEFAVPTTQDMDPEHRGTRLMPGRMKAFLTIYEPNDDAHPSLLTLREFLAENTDRESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQSA
Ga0193721_112796523300020018SoilMCKYFTATDEFAVPTTQDMDPEHRGTRLMPGRMKAFLTIYEPNDDAHPSLLTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQSA
Ga0193719_1025121123300021344SoilMCKYFTATDEFAVPTTQDMDPEHRGTRLMPGLMKAFLTIYEPNDDAHPSLLTLREFLAENTDPESGLVSYALPGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSR
Ga0222622_1105998313300022756Groundwater SedimentMCKYFSAGDEYAVETTRDMDPEQRGTHVMSGRMKAHLTVYEPGDSCSPTLPTLRQFLADYTDPRTGLLSYQILGPKLTIDPAPLLALPKEYLD
Ga0207684_1061419223300025910Corn, Switchgrass And Miscanthus RhizosphereAVETKKDMNPARPGTRLMSGLMKAMVKVYESGEVADPSLPTLREFLAKHTDPESGLVSYDYAGPTLTIDPRPLLQLPPEYLDYRVGPGSSFVTHEALAHSGSHPSARST
Ga0209237_126060723300026297Grasslands SoilMCKYFTATDEFAVKTTQDMDPERRGTHLMPGLMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALALSRSRPHDQTA
Ga0209239_118722923300026310Grasslands SoilDHPGRRVLPGRMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAIDRTDVRNRPN
Ga0209471_101045113300026318SoilRRVLPGLMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAIDRTNVRNRPN
Ga0209472_131459823300026323SoilMCKYFTATDEFAVPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSLSDQTA
Ga0209152_1044457723300026325SoilMCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASF
Ga0209801_1000810263300026326SoilMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Ga0209802_103722723300026328SoilMPGLMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALALSRSRPHDQTA
Ga0209802_111237023300026328SoilMCRYFTATDELAVETTQDMDPEHRGTRLMPGLMKAMVTIYEPVDDAHPSLPTLREFLAKHTDTESGLVRYDFFGPTLTIDPTPLLELPPEYLDYRVGPGCSFVTHEALTRSRSHPSGQPA
Ga0209158_125584923300026333SoilCRYFTATDEFAVPTTQDMDPEHRGTRLMPGLMKAFVTIYEPNDDAHPSLPTLREFLAKNTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALALSRSRPHDQTA
Ga0209377_104775413300026334SoilMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALALSRS
Ga0209808_109795613300026523SoilMCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Ga0209378_112818823300026528SoilMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Ga0209806_102867343300026529SoilMCKYFTATDEFAVPTTQDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Ga0209807_101114943300026530SoilMCRYFTATDEFAVRTTQDMDPEHRGTHLMPGLMKAFVTIYEPNDDAHPSLSTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPHDQTA
Ga0209160_132627013300026532SoilYEPNDDAHPSLPTLREFLAKNTDPESGLVSYALLGPRLTIDPAPLLQLPPEYFEYRVGPGASFVTDEALALSRSRPHDQTA
Ga0209058_112439623300026536SoilMCKYFTATDELAVETTHDMNPEHRGTRLMPGLMKAMVTIYEPADDAHPSLPTLREFLAKHTDPESGLVSYDFFGPTLRIDPAPFLQLPPEYLDYRVGPGSSFVTNEPLARSRSRPSGQSA
Ga0209376_1011375103300026540SoilPTTRDMDPEHRGTRLMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPSDQTA
Ga0209474_1001599223300026550SoilMPGLMKAFVTIYEPDDDAHPSLSTLREFLAENTDPESGLVSYARLGPRLTIDPAPLLQLPPEYLDYRIGPGASFVTDEALALSRSRPSDQTA
Ga0209577_1035478513300026552SoilLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSYDLLGPALSIDPAPLRALPPEYLDYRVSLGASFVTDEAIDRTNVRNRPN
Ga0209689_1006082103300027748SoilMCKYFTATDEFAVPTTQDMDPEHRGTRFMPGVMKAFVTIYEPNDDSHPSLPTLREFLAKNTDPKSGLVSYAVLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVSDEALAWSRSRPSDQTA
Ga0209283_1005583413300027875Vadose Zone SoilMCKDFTATDDLAVETTHDMEPARPGTRLMPGLMRAMVTVYEPADEAHPSLPTLREFLAKHTNPESGLVSFDYLGPTLTIDPTPLLQLPPEYLDYRVGPGSSFVTYE
Ga0137415_1034104923300028536Vadose Zone SoilVYEPGEVADPSLPTLREFLAKHTDPKSGLVSYDYAGPTLTIDPRPLLQLPPEYLDCRVGPGSSFVTHEALAHSRSRTSGQSA
Ga0137415_1044208423300028536Vadose Zone SoilMKAMVTIYEPNDDPHPSLPTLREFLATHTDPESGLVRYDRFGPMLSIDPAPLLQLPPEYLDYRVGPGSSFVTDDALALARSKGSH
Ga0137415_1064347923300028536Vadose Zone SoilMCKYFTATDDLGVKAIKDMDPERRSTRVMAGLMKAMVTIYEPGDAPHPSLPTLREFLAKNTHPKSGLVSYALQGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALALSRSRPIDQTA
Ga0307307_1003977623300028718SoilMPGLMKAFLTIFEPNDDAHPSLLTLREFLAENTDRESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQSA
Ga0307280_1036761023300028768SoilMCKYFTATDEFAVPTTQDMDPEHRGTRLMPGRMKAFLTIYEPNDDAHSSLLTLREFLAENTDPESGLVSYALPGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQSA
Ga0307282_1001162763300028784SoilMCRYFTATDEFAVPTTQDMDPEHRGTRLMPGRMKAFLTIYEPNDDAHPSLLTLREFLAENTDPESGLVSYALPGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQTA
Ga0307284_1000048033300028799SoilMCKYFTATDEFAVPTTQDMDPEHRGTRLMPGRMKAFLTIYEPNDDAHPSLLTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQTA
Ga0307305_1000209013300028807SoilRLMPGRMKAFLTIYEPNDDAHPSLLTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQSA
Ga0307292_1033006223300028811SoilMCKYFSAGDEYAVETTRDMDPEQRGTHVMSGRMKAHLTVYEPGDSCSPTLPTLRQFLADYTDPRTGLLSYQILGPKLTIDPAPLLALPKEYLDYGVSPCASFVTHEALSGSRRPAPRTPGQA
Ga0307312_1001704023300028828SoilMCKYFTATDEFAVPTTQDMDPEHRGTRLMPGRMKAFLTIYEPNDDAHPSLLTLRDFLAENTDPESGLVSYALPGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSEQSA
Ga0307312_1017342233300028828SoilMCKYFSAGDEYAVETTRDMDPEQRGTHVMSGRMKAHLTVYEPGDSCSPTLPTLRQFLADYTDPRTGLLSYQILGPKLTIDPAPLLALPKEYLDYGV
Ga0307308_1001037423300028884SoilMPGLMKAFLTIFEPNDDAHPSLLTLREFLAENTDRESGLVSYALLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALAWSRSRPSDQPRSDGPVVRAA
Ga0307308_1038887923300028884SoilFAVPTTQDMDPEHRGTRLMPGRMKAFLTIYEPNDDAHSSLLTLREFLAENTDPESGLVSYALLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALAWSRSRPSDQTA
Ga0307304_1014892023300028885SoilMCKYFTATDEFAVPTTQDMDPEHRGTRLMPGLMKAFLTIYEPNDDAHPSLLTLREFLAENTDPESGLVSYALPGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQTA
Ga0307304_1059989013300028885SoilMPKYFTATDEFAVSTTQDMDPERRGTRLMSGQMKAMITIFEPIDDPHPSLPTLREFLAKHTDSKSGLVSYELRGPKLTIDPAPLLQLPPEYLGYRVGPGASFVTDEALAWSRSRTGQQTA
Ga0308194_1002896013300031421SoilIYEPNDDAHSSLLTLREFLAENTDRESGLVSYALLGPRLTIDPAPLLQLPPEYLEYRVGPGASFVTDEALAWSRSRPSDQPRSDGPVVRAA
Ga0308194_1010264913300031421SoilMCKYFTATDEFAVPTTRDMDPEHRGTRLMPGLMKAFLTIFEPNDDAHPSLLTLREFLAENTDPESGLVSYALPGPRLTIDPAPLLQLPPEYLDYRVGPGASFVTDEALAWSRSRPSDQSA
Ga0307471_10405114913300032180Hardwood Forest SoilVCKYFEADDALAVETTRDMDPDHPGRRVLPGRMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSFDLLGPALTIDPAPLRALPPEYLDYRISLGASFVTDEAIDRA
Ga0307472_10003780123300032205Hardwood Forest SoilVCKYFEADDALAVETTRDMDPDNPGKRVLPGRMKAFLTVYEPGDDPHPSLPTLREFLATLTDPATGLVSFDLLGPALTIDPAPLRALPPEYLDYRISLGASFVTDEAIDRASARKRPN




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice