NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F096797

Metagenome Family F096797

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F096797
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 200 residues
Representative Sequence VTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFVAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQQATGHDPVQWVDFDPRIRPFSSF
Number of Associated Samples 94
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 33.33 %
% of genes near scaffold ends (potentially truncated) 5.77 %
% of genes from short scaffolds (< 2000 bps) 4.81 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score0.38

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (94.231 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(27.885 % of family members)
Environment Ontology (ENVO) Unclassified
(47.115 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(59.615 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 68.89%    β-sheet: 0.00%    Coil/Unstructured: 31.11%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.38
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 104 Family Scaffolds
PF01061ABC2_membrane 36.54
PF12697Abhydrolase_6 2.88
PF01909NTP_transf_2 0.96



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A94.23 %
All OrganismsrootAll Organisms5.77 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005467|Ga0070706_102140202All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium505Open in IMG/M
3300005554|Ga0066661_10019093All Organisms → cellular organisms → Bacteria3609Open in IMG/M
3300005556|Ga0066707_10185822All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1334Open in IMG/M
3300009147|Ga0114129_12800289All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium580Open in IMG/M
3300009162|Ga0075423_11789370All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium662Open in IMG/M
3300031720|Ga0307469_11777453All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium595Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil27.88%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil18.27%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil13.46%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil11.54%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil7.69%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.73%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.81%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere4.81%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost2.88%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.96%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.96%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012014Permafrost microbial communities from Nunavut, Canada - A10_80cm_6MEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300013294Permafrost microbial communities from Nunavut, Canada - A3_65cm_0MEnvironmentalOpen in IMG/M
3300014056Permafrost microbial communities from Nunavut, Canada - A20_5cm_0MEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019866Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1m1EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020008Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1m2EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026308Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300028711Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150EnvironmentalOpen in IMG/M
3300028716Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_198EnvironmentalOpen in IMG/M
3300028784Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028872Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_204EnvironmentalOpen in IMG/M
3300028875Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_143EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0066674_1006722313300005166SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFVAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQQATGHDPVQWVDFDPRIRPF
Ga0066679_1014682623300005176SoilVAQARKKARRERPVTRGVFWRELALLIVCVEIAIFILSFDASILNVFDLTKASFTHGLAWALLGAIVVIALSDGVRVPASPLFLAFFAVIATEVIATATAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGAGATLAGVYAIQQATGHDPVQWVDLDPRFRPF
Ga0066688_1071119013300005178SoilVAQARAARRAARATKADRKAGRERPVVRGVFWRELALVLVCVEIAIFILGFDPTVLNVFDLTKASFTHALAWGLLGALIVVALGDGIRIPVSPVFVTFYAVVAVEALTTLTAENQYVAIYGEVGRYLGLTTHAVLGLIALAIAVSIDYPRRTPWLAWTIGTAAALAGLYAIQQALGRDPIQWVDFDPRVRPFSSFGNADFYGQFL
Ga0066688_1097908613300005178SoilQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSALNVFNLTKASLTHALAWALLGVLLVIALGDGIRIPASPVFLAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALMSVAIGVSIDYPRRASWLAWTVGVGAVLAGSYAI
Ga0070694_10055763523300005444Corn, Switchgrass And Miscanthus RhizosphereVAQARKKARRQQPVARGVFWRELALFIVCAEILIFILSFDPSVLNVFDLTKASFTHALAWALLGAVLVIGLSDGVRIPASPIFVAFFAVIATEVITTITAENQYVAMYGEVGRYLGLTTHAVLALISVAIAVSLDYPRRTAWLGWTIGIGATLAGLYAIQQATGHD
Ga0070694_10106768813300005444Corn, Switchgrass And Miscanthus RhizosphereVLVCAEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGTLVVIALGDGFRIPISPVFLAFYAVVAVEVLTTFTAENQYVAVYGEVGRYLGLTTHAVLAVLAVSIAISIDYPRRLSWLAWTIGGAATIAGLYAVQQALGRDPVQWVDLDPRLRPFSTFGNPDFYGQFLAVVAIACAAIL
Ga0070708_10019448113300005445Corn, Switchgrass And Miscanthus RhizosphereVAEARAARRAARENKVDARKKARRERPVVRGVFWRELALVIVCVEIGIFILSFDPSVLNVFDLTKASFTHALAWALLGALIVIALGDGVRIPASPLFLAFFAVIATELITTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGIGAALAGAYAIQQWTGHDPVQWVDFDPRFRPFSSFGNADFYGQFLAVVATGCAAVLVFTRQRLWLTAVVVLLGVLNVGLMLIVQTRGSFLGIVA
Ga0066686_1017538113300005446SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATVAGLYAIQQATGHDPVQWVDLDPRVRPFATFGNPDFYGQFLAVVATGCAAVLVFARQRLWLMGVVALLAVLNVALMLVV
Ga0066689_1022973823300005447SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATVAGLYAIQQATGHDPVQWVDLDPRVRPFATFGNPDFYGQFLAVVATGCAAVLVFARQRLWLMGVVALLAVLNVALMLVVQTRGSFLGIA
Ga0066687_1014143813300005454SoilVAQARKKPRRERPVARGVFWRELALLIVCVEIAIFILSFDPSILNVFDLTKASFTHGLAWALLGAIVVIALSDGIRVPASPLFLAFFAVIATEVIATATAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGAGATLAGVYAIQQATGHDPVRWVDLDPRVRPFATFGNPDFYGQFLAVVATGCAAVLVFARQRLWLMGIVALLAVLNVALMLVVQ
Ga0070706_10214020213300005467Corn, Switchgrass And Miscanthus RhizosphereVFWRELALFIVCAEILIFILSFDPSVLNVFDLTKASFTHALAWALLGAVLVIGLSDGVRIPSSPVFVAFFAVIATEVITTITAENQYVAMYGEVGRYLGLTTHAVLALIAVAIAVSLDYPRRTAWLGWTIGIGATLAGLYAIQQATGHD
Ga0070698_10082320923300005471Corn, Switchgrass And Miscanthus RhizosphereVAEARAARRAARENKVDARKKARRERPVVRGVFWRELALVIVCVEIGIFILSFDPSVQNVFDLTKASFTHALAWALLGALIVIALGDGVRIPASPLFLAFFAVIATELITTMTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGIGAAIAGAYAIQQWTGHDPVQWVDFDPR
Ga0070699_10023749033300005518Corn, Switchgrass And Miscanthus RhizosphereVFWREVALLLVCAEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGALIVIALAEGLRIPLSPVFLAFYAVVAVEALTTVTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSIDYPRRNSWLAWTIGAVATLAGLYAVQQALGRDPVQWVDFDPRVRPFSTFGNPDFYGQFLAVVAIGCAAVLVFVKQ
Ga0066661_1001909313300005554SoilVAQARAARRAARATKADRKAGRERPVVRGVFWRELALVLVCVEIAIFILGFDPTVLNVFDLTKASFTHALAWGLLGALIVIALGDGIRIPVSPVFVTFYAVVAVEALTTLTAENQYVAIYGEVGRYLGLTTHAVLGLIALAIAVSIDYPRRTPWLAWTIGTAAALAGLYAIQQALGRDPIQWVDFDPRVRPF
Ga0066692_1034153423300005555SoilVADARVARRGAQKKPRRDRPVERPVVRGVFWREVALLVVCAEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGTLVVIGIGNGLRVPLSPIFVAFYLVVAVEVLTTVTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVTIDYPRRTSWLAWTIGAAAALAGLYAVQQALGRDPVQWVDFDPRIRPFS
Ga0066707_1018582213300005556SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCMEVAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGAT
Ga0066699_1071897613300005561SoilVARRRAARRERPVVRGVFWRELALVIVCVETLIFILSFDPTVLNVFDLTKASFTHGVAWALLGALIVIALGDGVRIPASPLFIAFFAVVATEIVTTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGAGATLAGLYAIQQATGHDPVRW
Ga0066693_1021985913300005566SoilLLVFRPREGRLRERPVVARARAARRGRPVARGVFWREVALVLVCVEIAIFILAFDPTVLNVFDLTKASFTHALAWGLLGALVVIALGDGFRIPLSPVFVAFYAVVAVEVVTTFTAENQYVAIYGEVGRYLGLTTHLVLALIAVAIAVSIDYPRRSSWLAWTIGAAAAVAGLYAVQQALGLDPVQWADQDPRARPFSTFGNPDFYGQFLAVVFIGCVAALAFARQRLWLAALVGLLGLSSLGLMLTVA
Ga0066703_1002643843300005568SoilVAQARAARRAARATKAERKAGRERPVVRGVFWRELALVLVCVEIAIFILGFDPTVLNVFDLTKASFTHALAWGLLGALIVIALGDGIRIPVSPIFVTFYAVVAVEALTTLTAENQYVAIYGEVGRYLGLTTHAVLGLIALAIAVSIDYPRRTPWLAWTIGTAAALAGLYAIQQALGRDPIQWVDFDPRVRPFSSFGNADFYGQFLAVVAIACGA
Ga0066703_1050440713300005568SoilVAQARKKPRRERPVARGVFWRELALLIVCVEIAIFILSFDPSILNVFDLTKASFTHGLAWALLGAIVVIALSDGVRVPASPLFLAFFAVIATEVIATATAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGAGATLAGVYAIQQATGHDPVQWVDLD
Ga0066705_1075382213300005569SoilVARARAARRGRPVSRGVFWREVALVLVCVEIAIFILAFDPTVLNVFDLTKASFTHALAWGLLGALVAIALGDGFRIPLSPVFVAFYAVVAVEVVTTFTAENQYVALYGEVGRYLGLTTHAVLALIAVGIAVSIDYPRRTSWLAWTIGAAAALAGVYAVQQVTGHDPVQWVDSDPRA
Ga0066708_1005697833300005576SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALSIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATVAGLYAIQQATGHDPVQWVDLDPRVRPFATFGNPDFYGQFLAVVATGCAAVLVFARQRLWLMGVVALLAVLNVALMLVVQTRGSFLGIA
Ga0066654_1061770813300005587SoilIVCVEALIFILSFDPTVLNVFDLTKASFTHGVAWALLGALIVIALGDGVRIPASPLFIAFFALIATEIVTTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGAGATLAGLYAIQQATGHDPVRWVDLDPRFRPFATFGNPDFYGQFLAVVATACAAVLVFARQRVWVVGIVVLLAVLNVA
Ga0066706_1009912933300005598SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATVAGLYAIQQATGHDPVQWVDLDPRVRPFATFGNPDFYGQFLAVVATGCAAVLVFARQRL
Ga0066706_1061348013300005598SoilVAQARTARRAARAQKVDVRKKAQRERPVLRGVFWREVALSIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGATLVIALGDGVRIPASPLFVAFFAVVATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALMSVAIAVSIDYPRRALWLAWTVGIGATLAGLYAIQQATGHDPVQWVDLDPRVRPFATFGNPDFYGQFLAVVATGCAAVLVFARQRL
Ga0066651_1056997313300006031SoilEGRLRERPVVPRARASRRARPVARGVFWREVALVLVCVEIAIFILAFDPTVLNVFDLTKASFTHALAWGLLGALVVIALGDGFRIPLSPVFVAFYAVVAVEVVTTFTAENQYVAIYGEVGRYLGLTTHLVLALIAVAIAVSIDYPRRSSWLAWTIGAAAAVAGLYAVQQALGLDPVQWADQDPRARPFSTFGNPDFYGQ
Ga0066652_10146579613300006046SoilPVVAQARAARRAARESRLDAQKKPPRGRTVARGVFWRELALVVVCVEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGALVVIALGDGFRVPLSPVFVAFYAVVAVEILTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRVSWLAWTIAAASTLAGVYAVQQALGLDPVQWIDFDPRIRPFSTFGNADFYGQFLAV
Ga0075417_1056609113300006049Populus RhizosphereRSVFLRELALAIVCGQIAIFILAFDPTLLNAFDLTKASYTHALAWGLLGALLAIALAAGVRVPVSALFVGFYAVLAVEALTTLTAENQYVAVYGEVGRYLGLTTHAVLALMTIAIAISLDYPRRAPWLGWVIGFAAGVAGLYAVQQALGGDPVQWADVDPRLRPFSSFGNADFYGQFLAVIATGCAALLVFA
Ga0075425_10017203743300006854Populus RhizosphereVAQARKKARRQQPVARGVFWRELALFIVCAEILIFILSFDPAVLNVFDLTKASFTHALAWALLGAVLVVGLSDGVRIPASPIFIAFFAVIATEVITTITAENQYVAMYGEVGRYLGLTTHAVLALISVAIAVSLDYPRRTAWLGWTIGIGATLAGLYAIQQATGHDPVQWVDLDPRIRPFATFGNPDFYGQFLAVVASGCAAVLVFARQRLWLTLAVALLAVVNVG
Ga0066710_10382634113300009012Grasslands SoilERPLVRGVFWREIALLLVCAEISIFILAFDPSVLNVFDLTKASFTHALAWGLLGTLVVIGLGGGLRVPLPLIFVAFYAVVAIEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGAAAALAGLYAIQQALGRDPVQWSDLNSRARPFATFGNPDFYGQFLAAVFI
Ga0099829_1074115823300009038Vadose Zone SoilVAEARAARRAARENKVDARKKARRERPVVRGVFWRELALVIVCVEIGIFILSFDPSVLNVFDLTKASFTHALAWALLGALIVSALGDGVRIPASPLFLAFFAVIATEVITTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWIAWTIGIGAALAGAYAIQQWTG
Ga0099828_1100206713300009089Vadose Zone SoilAARENKVDARKKARRERPVVRGVFWRELALVIVCVEIGIFILSFDPSVLNVFDLTKASFTHALAWALLGALIVIALGDGVRIPASPLFLAFFAVIATEVITTITAENQYVALYGEVGRYLGLTTHAVLALMAVAIAVSIDYPRRTSWLAWTIGIGAALAGAYAIQQWTGHDPVQWVDFDPRVRPFSSFGNADFYGQFLAVVATACAAVLVFARQRLWLTAVVVLLAVLNVALMLIVQTRGSFLGIVA
Ga0099827_1157471213300009090Vadose Zone SoilDARKKARRERPVVRGVFWRELALVIVCVEIGIFILSFDPSVLNVFDLTKASFTHALAWALLGALIVIALGDGVRIPASPLFLAFFAVIATEVITTITAENQYVALYGEVGRYLGMTTHAVLALIAVAIAVSIDYPRRTSWIGWTVGVVALVAGLYAVQQALGRDPVQWLDADPHARPFSSFGNADFYGQF
Ga0066709_10095160113300009137Grasslands SoilVAEARAARRAAQRDRSQKKTRPQQTVARDVFWREIALLLVCAEILIFILSFDAAVLNVFDLTKASFTHALAWGLLGTLLVIGLGDGVRVPLSAIFVAFYAVVAIEVLTTITAENQYVALYGEVGRYLGLTTHAALALIAVAIAVSIDYPRRTSWLAWTIGAAAVLAGLYAIQQALGRDPVHWIDLDPRARPFATFGNPDFYGQFLAVVAVACAAFPVFVRPGLWLVIVVALLAPIDGSLL
Ga0114129_1280028913300009147Populus RhizosphereVFLRELALAIVCGQIAIFILAFDPTLLNAFDLTKASYTHAFAWGLLGAVVAIALRDGVRVPVSALFVAFYAVLAVEALTTLTAENQYIAVYGEVGRYLGLTTHAVLALMTVAIAMSLDYPRRAPWLGWVIGFAAAVAGLYAVQQA
Ga0075423_1109270523300009162Populus RhizosphereVAQARKKARRQQPVARGVFWRELALFIVCVEILIFILSFDPAVLNVFDLTKASFTHALAWALLGAVLVIGLSDGVRIPASPIFVAFFAVIATEVITTITAENQYVAIYGEVGRYLGLTTHAVLALISVAIAVSLDYPRRTAWLGWTIGLGATLAGLYAIQQATGHDPVQWVDLDPRFRPFATFGNPDFYGQFLAVVASGCAAVLVF
Ga0075423_1178937013300009162Populus RhizosphereVFLRELALAIVCGQIAIFILAFDPTLLNAFDLTKASYTHALAWGLLGAVVAIALRDGVRVPVSALFVAFYAVLAVEALTTLTAENQYIAVYGEVGRYLGLTTHAVLALMTVAIAISLDYPRRAPWLGWVIGFAAGVAGLYAVQQALGGD
Ga0134070_1007245523300010301Grasslands SoilVAQTRTARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLDALVVIALGDGIRIPASPLFVAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQQATGHDPVQWVDFDPRIRPFSTFGNADFYGQFLAVVAVACGAVLMFVR
Ga0134088_1027492023300010304Grasslands SoilVADARAARRTAQKRVRRDRPVVRGVFWRELALLLVCAEIAIFVLAFDPSILNVFDLTKASFTHGLAWGLLGVLVVIALGDGLRIPLSPLCIAFYAVIAVEILTTVTAENQYVAFYGEVGRYLGLTTHAVLALLAVAIAVSIDYPRRTSWLAWTIGAAAAIAGAYAIEQFV
Ga0134109_1021723023300010320Grasslands SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFVAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQQATGHDPVQWVDFDPRIRPFSSF
Ga0134065_1038615613300010326Grasslands SoilQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFVAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATVAGLYAIQQATGHDPVQWVDLDPRVRPFATFGN
Ga0134111_1019187523300010329Grasslands SoilVAQARTARRVARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSGLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQQATGHDPVQWVDFDPRIHPFSSFG
Ga0134063_1010758923300010335Grasslands SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFVAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQQATGHDPVQWVDFDPRIRPFSSFGNADFYGQFLAVVATACAAVLVFARQRLWLTAVVVLLAVLNVALMLVVQTRGSFLGIAAG
Ga0134062_1029103013300010337Grasslands SoilVAQARAARRAARESRLDAQKKPPRGRTVARGVFWRELALVVVCVEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGALVVIALGDGFRVPLSPVFVAFYAVVAVEILTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRVSWLAWTIAAASTLAGVYAVQ
Ga0134127_1333151213300010399Terrestrial SoilEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGTLVVIALGDGLRIPISPVFLAFYAVVAVEVLTTFTAENQYVAVYGEVGRYLGLTTHAVLAVLAVSIAISIDYPRRLSWLAWTIGGAATIAGLYAVQQALGRDPVQWVDLDPRLRPFSTFGNPDFYGQFLAVVAIACAAILV
Ga0137391_1065934523300011270Vadose Zone SoilVAEARAARRAARENKVDARKKARRERPVVRGVFWRELALVIVCVEIGIFILSFDPSVLNVFDLTKASFTHALAWALLGALIVIALGDGVRIPASPLFLAFFAVIATEVITTITAENQYVALYGEVGRYLGLTTHAVLALMAVAIAVSIDYPRRTSWLAWTIGIGAALAGAYAIQQWTGHDPVQWVDFDPRVRPFSSFGNADFYGQFLAVVATGC
Ga0120159_118982113300012014PermafrostCAEIGVFVLAFDPSVLNVFDLTKASFTHALAWGLLGALIVIAIGDGFRVPLSPIFLAFYAVVAVEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALLAIAIAVSIDYPRRTSWLAWTIGAAATLAGLYAVQQALGRDPVQWVDFDPRVRPFSTFGNADFYGQFLAVVAIACAAVLVFVRQRLW
Ga0137389_1143383313300012096Vadose Zone SoilRAARENKVDARKKARRERPVVRGVFWRELALVIVCVEIGIFILSFDPSVLNVFDLTKASFTHALAWALLGALIVIALGDGVRMPASPLFLAFFAVIATEVITTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGIGAALAGAYAIQQWTGHDPVQWVDFDPRVRPFSSFGNADF
Ga0137364_1065667013300012198Vadose Zone SoilVADTRAARRAAQKRVRRERPVVRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFVAFFAVIATAVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQQATGHDPVQWVDFDPRIRPFSSFGNADFYGQFLAVVATACAAVLVFARQRLWLTAVVVLLAVLNVALMLVVQTRGSFLGIAA
Ga0137382_1098219413300012200Vadose Zone SoilEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGTLVVIGIGNGLRVPLSPIFVAFYLVVAVEVLTTVTAENQYVALYGEVGRYLGLTTHAVLARIAVAIAVTIDYPRRTSWLAWTIGSAAALAGLYAVQQALGRDPVQWVDFDPRIRPFSTFGNADFYGQFLAVVATACAAVLVFVRQRLWLRGVVALLAVFNVALMLIV
Ga0137399_1139426313300012203Vadose Zone SoilVAQARAARRAARASKVERKARRERPLVRGVLWRELALVLVCVEIAIFILGFDPTVLNVFDLTKASFTHALAWGLLGALIVVALGDGLRIPVSPVFVAFYAVVAVEALTTLTAENQYVAIYGEVGRYLGLTTHAVLGLIAVAIAVSIDYPRRVPWLAWTIGIAAALAGLYAIQQALGRDPVQWVDLDPRVRPFATF
Ga0137370_1051799713300012285Vadose Zone SoilVARARAARRARPVARGVFWREVALVLVCVEIAIFILAFDPTVLNVFDLTKASFTHALAWGLLGVLIVIALGDGFRIPLSPVFVAFYAVVAVEVVTTFTAENQYVAVYGEVGRYLGLTTHLVLALIGVAIAVSIDYPRRTSWVGWTIGAAATVAGLYAVQQALGHDPVQWADLDSRARPFSTFGNPDFYGQF
Ga0137371_1072666213300012356Vadose Zone SoilVARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALLVIALGDGIRIPASPLFVAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQQATGHDPVQWVDFDPRIRPFSSFGNADFYGQFLAVVATACAAVLVFARQRLWLTAVVVLLAVL
Ga0137361_1015367913300012362Vadose Zone SoilVADARAARRGAQKKPRRDRPVERPVVRGVFWREVALLVVCAEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGTLVVIGIGNGLRVPLSPIFVAFYAVVAVEVLTTVTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVTIDYPRRTSWLAWTIGAAAALAGLYAVQQALGRDP
Ga0137361_1142855713300012362Vadose Zone SoilVVADARAARRAAQKRVRRERPVVRGVFWREVALLLVCAEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGALVVIALGDGQRIPLSPLFIAFYAVVAVEVVTTFTAENQYVAIYGEVGRYLGLTTHAVLALIAVAVAVSIDYPRRTSWLAWTIGAAAAIAGLYAVQQALGRDPVQWVDADPRARPFATFGNADFYGQFVAVV
Ga0137419_1014729933300012925Vadose Zone SoilVAQARAARRAARANKVDAQKKTARGRPVARGVFWRELALVLVCVEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGALIVIALGDGFRVPVSPVFVAFYAVVAVEIVTTVTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRVSWLAWTIAAASTLAGVYAVQQALGLDPV
Ga0137416_1070494623300012927Vadose Zone SoilVFFREVALVLVCVEILIFILSFDPSILNVFDLTKASFTHALAWALLGSLLVIALGDGVRIPVSPLFIAFFAVVATEVVTTITAENQYVALYGEVGRYLGLTTHAVLGLIAVAIAVSIDYPRRTSWLAWTVGAGATVAGLYAIQQATGHDPVQWLDADPRTRPFATFGNADFYGQFLAVVATACVGVLVFARQRLWLTAVAVLLGLLNVALLVVVQTR
Ga0137416_1108059913300012927Vadose Zone SoilAEARAARRAARAATKAQRKAARDRPVVRGVFWRELALLVVCAEILIFILGFDPTVLNGFDLTKASFTHALAWGLLGTLIVIALGDGLRVSLSPIFVAFYAVVAIEVLTTVTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGGGALLAGLYAIQQATGHDPVQWIDFDPRNRPFSTFGNADFYGQFLAVVAVAGAAVLVFVRQRLWLMVVVALLAVLNVAL
Ga0137410_1025062013300012944Vadose Zone SoilVAQARAARRAARANRVDAQKKTPRGRPVARGVFWRELALVVVCVEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGALIVIALGDGFRIPVSPIFVAFYAVVAVAILTTVTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRVSWLAWTIAAASTLAGVYAVQQALGLDPVQWIDFDPRVRPFSTFGNADF
Ga0134077_1027596923300012972Grasslands SoilVFWREIALLLVCAEILIFILSFDAAVLNVFDLTKASFTHALAWGLLGTLLVIGLGDGVRVPLSAIFVAFYAVVAIEVLTTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGAAAVLAGLYAVQQALGRDP
Ga0134110_1018428213300012975Grasslands SoilVAQARTARRVARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIVYPRRAPWLAWTVGIGATVAGLYAIQQATGHDPVQWVDLDPRVRPFATF
Ga0134087_1028459913300012977Grasslands SoilVAQARTARRVARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATVAGLYAIQQATVHDPVQWVDFDPRIRPFSSFGNADFYGQFLAVVATACAAV
Ga0120150_103308323300013294PermafrostVAEARAARRANKAQKGRRDRPVVRGVFWRELALILVCVEIGVFVLAFDPSVLNVFDLTKASFTHALAWGLLGALIVIAIGDGFRVPLSPIFLAFYAVVAVEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALLAIAIAVSIDYPRRTSWLAWTIGAAATLAGLYAVQQ
Ga0120125_101191813300014056PermafrostVAQARAARRANKAQTKAQKARRDRPVVRGVFWRELALVLVCAEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGTLIVIALGDGLRIPLSPVFLAFYAVVAVEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAISIDYPRRTSWLAWTIGAAATVAGLYAVQQALGLDPVQWVDFDPRIRPFSTFGNADFYGQFLAVVAVACAAVLVFVRQRLWLLGVVALLGVFNVALM
Ga0134075_1005278513300014154Grasslands SoilVAEARAARRAAQRDRSQKKTRPQQTVARDVFWREIALLLVCAEILIFILSFDAAVLNVFDLTKASFTHALAWGLLGTLLVIGLGDGVRVPLSAIFVAFYAVVAIEVLTTITAENQYVALYGEVGRYLGLTTHAVLALIGVAIAVSIDYPRRTSWLAWTIGAAAVLAGLYAIQQALGRDPV
Ga0137418_1083727413300015241Vadose Zone SoilPVVRGVFWRELALLLVCAEMAIFILAFDPSVLNVFDLTKASFTHALAWALLGTLVVIALGDGFRVPRSPLFLAFYAVVAVEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRTSWLAWTIGGAATLAGLYAVQQALGRDPVQWIDFDPRIRPFATFGNADFYGQFLAVVAIACAAVLVFARQRLWMMGAIALLAIFNVALMLVVQTRGSFLGI
Ga0137409_1024389713300015245Vadose Zone SoilVAQARAARRAARANRVDAQKKTPRGRPVARGVFWRELALVVVCVEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGALIVIALGDGFRIPVSPIFVAFYAVVAVAILTTVTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRVSWLAWTIAAASTLAGLYAVQQALGLDPVQWIDFDPRVRPFSTFGN
Ga0137409_1121868413300015245Vadose Zone SoilVAEARAARRAARAAKAERKAGRERPIVRGVFWREIALLLVCAEISIFILSFDPSVLNVFDLTKASFTHALAWGLLGTLVVIALGNGLRVPLSPIFVAFYAVVAIEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALMAVAIAVSIDYPRRTSWLGWTIGIAATLAGLY
Ga0134069_125128913300017654Grasslands SoilVAQARTARRVARAQKVDVRRKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFALIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATVAGLYA
Ga0184621_1002665633300018054Groundwater SedimentVAQARAARRAARADRLDAQKKAPRGRPVARGVFWRELALVLVCVEIAIFVLAFDTSVLNVFDLTKASFTHALAWGLLGTLIVIALGDGFRVPLSPVFVAFYAVVAVEILTTITAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRVSWLAWTTGAAAVIAGTYALEQFVGLDPVHWA
Ga0066655_1135314813300018431Grasslands SoilFWGELALVIVCVEALIFILSFDPTVLNVFDLTKASFTHGVAWALLGALIVIALGDGVRIPASPLFIAFFALIATEIVTTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGGGATLAGLYAIQQATGHDPVRWVDLDPRFRPFATFGNPDF
Ga0066667_1060225123300018433Grasslands SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALSIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGATLVIALGDGVRIPASPLFVAFFAVVATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALMSVAIAVSIDYPRRALWLAWTVGIGATLAGLYAIQQATG
Ga0066667_1092054023300018433Grasslands SoilVAQARKKARRQEPVARGVFWRELALVLVCVETLIFILAFDPTVLNVFDLTKASFTHGLAWALLGVLIVIALGDGVRVPASPLFIAFFAVLATEVVTTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSLDYPRRASWLAWTIGGGATLAGLYAVQQATGHDPVQWVDLDPRFRPFATFGNPDFYGQFLAVVATACAAVLVFTRQRL
Ga0066662_1235455613300018468Grasslands SoilVAQARAARRAARATKAERKAGRERPVVRGVFWRELALVLVCVEIAIFILGFDPTVLNVFDLTKASFTHALAWGLLGVLIVIALGDGLRIPVSPVFVTFYAVVAVEALTTVTAENQYVAIYGEVGRYLGLTTHAVLGLIALAIAVSIDYPRRTPWLAWTIGTAAALAGLYAIQQALGRDPIQWVDFD
Ga0066669_1170753613300018482Grasslands SoilEGRLRERPVVAQARAARRAARESRLEAQKKPPRGRTVARGVFWRELALVVVCVEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGALIVIALGDGFRVPLSPVFVAFYAVVAVEILTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRVSWLAWTIAAASTLAGVYAVEQSLGLDPVQ
Ga0193756_104026113300019866SoilAARTNKGKARRDRPVVRGVFWRELALLLVCAEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGALVVIALGDGFRIPLSPIFLAFYAVVAIEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVGIAVSIDYPRRASWLAWTIGAAATLAGLYAVQQALGRDPVQWVDFDPRVRPFSTFGNADFYGQFLAVVAIACAAVLVFVRQRLWLTGVV
Ga0193723_102743913300019879SoilVFWREVALLVVCLEIAIFILAFDPSALNVFDLTKASFSHALAWGLLGTLIVIALGDGFRIPLSPIFLAFYAVVAVEVLTTVTAENQYVALYGEVGRYLGLTTHAVLALIAVAVAVSIDYPRRTSWLAWTIGAAATLAGLYAVEQFRGLDPIQWTDPNARLRPFSTFGNADFYGQFLAVVATGCAAALVFIRQRRRLTGAVALLAVFSVALMLVVQTRGSFLGIVAG
Ga0193755_106806623300020004SoilVFWREVALLVVCLEIAIFILAFDPSALNVFDLTKASFSHALAWGLLGTLIVIALGDGFRIPLSPIFLAFYAVVAVEVLTTVTAENQYVALYGEVGRYLGLTTHAVLALIAVAVAVSIDYPRRTSWLAWTIGAAATLAGLYAVEQFRGLDPIQWTDPNARLRPFSTFGNADFYGQFLAVVATGCAAALVVIRQRRRLTGAVALLAVFSVALMLVVQTRGSFLGIVAG
Ga0193757_102653413300020008SoilKGKARRDRPVVRGVFWRELALLLVCAEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGALVVIALGDGFRIPLSPIFLAFYAVVAIEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVGIAVSIDYPRRASWLAWTIGAAATLAGLYAVQQALGRDPVQWVDFDPRVRPFSTFGNADFYGQFLAVV
Ga0193733_103802413300020022SoilVARARAARRGRPVARGVFWREVTLVLVCVEIAIFILAFDPTVLNVFDLTKASFTHALAWGLLGALVVIALGDGLRIPLSPVFVAFYAVVAVEVLTTFTAENQYVAVYGEVGRYLGLTTHLVLALIAVAIAVSIDYPRRTSWLGWTIGAAASLAGLYAVQQALGHDPVQWADLDSRARPFSTFGNPDFYG
Ga0207665_1114257013300025939Corn, Switchgrass And Miscanthus RhizosphereAEARAARRAARENKVDARKKARRERPAVRGVFWRELALVIVCVEIGIFILSFDPSVLNVFDLTKASFTHALAWALLGALIVIALGDGVRIPASPLFLAFFAVIATELITTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGIGAALAGAYAIQQWTGHDPVQWVDFDPRVRPFSSFGNADFYGQFL
Ga0209469_114968813300026307SoilARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFVAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQ
Ga0209265_115520013300026308SoilLALVIVCVEALIFILSFDPTVLNVFDLTKASFTHGVAWALLGALIVIALGDGVRIPASPLFIAFFALIATEIVTTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGAGATLAGLYAIQQATGHDPVRWVDLDPRFRPFATFGNPDFYGQFLAVVATACAAVLVFARQRVWVVGIVVL
Ga0209239_108916613300026310Grasslands SoilVAQARTARRVARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFVAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATLAGLYAIQQATGHDPVQWVDFDPRIRPFSSFGNADFYGQFLAVVATACAAVLVFARQRLWLTAVVVLLAVLNVALMLVVQTRG
Ga0209267_110561923300026331SoilVAQARAARRAARATKADRKAGRERPVVRGVFWRELALVLVCVEIAIFILGFDPTVLNVFDLTKASFTHALAWGLLGALIVVALGDGIRIPVSPVFVTFYAVVAVEALTTLTAENQYVAIYGEVGRYLGLTTHAVLGLIALAIAVSIDYPRRTPWLAWTIGTAAALAGLYAIQQALGRDPIQWVDFDPRVRPFSSFGNADFYGQFLAVVA
Ga0209158_103634533300026333SoilVAQARAARRAARATKADRKAGRERPVVRGVFWRELALVLVCVEIAIFILGFDPTVLNVFDLTKASFTHALAWGLLGALIVIALGDGIRIPVSPIFVTFYAVVAVEALTTLTAENQYVAIYGEVGRYLGLTTHAVLGLIALAIAVSIDYPRRTPWLAWTIGTAAALAGLYAIQQALGRDPIQWVDFDPRVRPFSSFGNADFYGQFLAV
Ga0209057_102127553300026342SoilVARRRAARRERPVVRGVFWRELALVIVCVEALIFILSFDPTVLNVFDLTKASFTHGVAWALLGALIVIALGDGVRIPASPLFIAFFALIATEIVTTITAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRTSWLAWTIGAGATLAGLYAIQQATGHDPVRWVDLDPRFRPFATFGNPDFYGQFLAVVATACA
Ga0209160_111969123300026532SoilVAQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATVAGLYAIQQATGHDPVQWVDLDPRVRPFATFGNPDFYGQFLAVVATGCAAVL
Ga0209058_100641883300026536SoilVTQARAARRAARAQKVDVRKKAQRERPVLRGVFWREVALGIVCIEIAIFILSFDPSVLNVFDLTKASFTHALAWALLGALVVIALGDGIRIPASPLFGAFFAVIATEVITTFTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVSIDYPRRAPWLAWTVGIGATVAGLYAIQQATGHDPVQWVDLDPRVRPFATFGNPDFYGQFLAVVATGCAAVLVFARQRLWLMGVVALLAVLNVALMLVVQTR
Ga0209577_1022912823300026552SoilVPQARAARRAARATKADRKAGRERPVVRGVFWRELALVLVCVEIAIFILGFDPTVLNVFDLTKASFTHALAWGLLGALIVIALGDGIRIPVSPVFVTFYAVVAVEALTTLTAENQYVAIYGEVGRYLGLTTHAVLGLIALAIAVSIDYPRRTPWLAWTIGTATALAGLYAIQQALGRDPIQWVDFDPRV
Ga0307293_1007647923300028711SoilVADARAARRAARASKGKARRDRPVGRGVFWRELALVLVCAEIAIFVLAFDPSVLNVFDLTKASFTHALAWGVLGALVVIAFGDGVRIPVSPVFLAFYAVVAVEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRTSWLAWTIAAAATLAGLYAVQQALGRDPVQWVDFDPRIRPFSTFGNADFYGQFLAVVAIACAAV
Ga0307311_1005734013300028716SoilVADARAARRAARASKGKARRDRPVGRGVFWRELALVLVCAEIAIFVLAFDPSVLNVFDLTKASFTHALAWGVLGALVVIAFGDGVRIPVSPVFLAFYAVVAVEALTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRTSWLAWTIAAAATLAGLYAVQQALGRDPVQWVDFDPRIRPFSTFGNADFYGQFLAVVAIACAAVLVFARQRLWLVGAVALLAVFNVALMLVVQTRGSFLGI
Ga0307282_1053544213300028784SoilRDVFWREIALLVVCVEIAIFILAFDPSALNVFDLTKASFTHALAWGLLGTLIVIALGDGFRIPLSPIFLAFYAVVAVEVLTTFTAENQYVAVYGEVGRYLGLTTHAVLALLAVAIAVSTDYPRRLSWLAWTIAAAATIAGLYAVQQALGRDPVQWVDLDPHLRPFSTFGNPDFYGQFLAVVAIACAAIL
Ga0307504_1039917013300028792SoilPVVRGVFWREVALLLVCVEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGALIVIALAEGLRIPRSPLFLAFYAVVAVEALTTVTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSIDYPRRNSWLAWTIGAAATLAGLYAVQQALGLDPVQWVDLDPRIRPFSTFGNPDFYGQ
Ga0307296_1027638213300028819SoilVADARAARRAARASKAQKRVRREPPVVRGVFWREIALLLVCGEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGALLVIALGDGLRIPLSPVFLAFYAVVAVEVITTLTAENQYVALYGEVGRYLGLTTHAVLALLAVGIAVSTDYPRRTSWVAWTIAAAATLAGLYAVQQALGL
Ga0307310_1056497713300028824SoilARRAARASKAQKRVRREPPVVRGVFWREIALLLVCGEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGALLVIALGDGLRIPLSPVFLAFYAVVAVEVITTLTAENQYVALYGEVGRYLGLTTHAVLALLAVGIAVSSDYPRRTSWVAWTIAAAATLAGLYAVQQALGLDPVQWVDFDPRVRPFSSFGNA
Ga0307312_1032872023300028828SoilVADARAARRAARASKAQKRVRREPPVVRGVFWREIALLLVCGEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGALLVIALGDGLRIPLSPVFLAFYAVVAVEVITTLTAENQYVALYGEVGRYLGLTTHAVLALLAVGIAVSSDYPRRTSWVAWTIAAAATLAGLYAVQQALGLDPVQWVDFDPRIRPFSSFGNADFYGQFLAVVAIGCAAVLAFVRKRWWLMGAVALLGV
Ga0307314_1031134713300028872SoilADARAARRAARASKGKARRDRPVGRGVFWRELALVLVCAEIAIFVLAFDPSVLNVFDLTKASFTHALAWGVLGALVVIAFGDGVRIPVSPVFLAFYAVVAVEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRTSWLAWTIAAAATLAGLYAV
Ga0307289_1018285323300028875SoilVADARAARRAARASKGKARRDRPVGRGVFWRELALVLVCAEIAIFVLAFDPSVLNVFDLTKASFTHALAWGVLGALVVIAFGDGVRIPVSPVFLAFYAVVAVEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSVDYPRRTSWLAWTIAAAATLAGLYAVQQALGRDPVQWVDFDPRIRPFSTFGNADFYGQFLAVVAIACAAVLVFARQRLWLVGAVALLAVFNV
Ga0307469_1120374713300031720Hardwood Forest SoilVFWRELALILVCVEIGVFVLAFDPSVLNVFDLTKASFTHALAWGLLGALIVIAFGDGLRVPLSPVFIAFYAVVAVEVLTTFTAENQYVALYGEVGRYLGLTTHAVLALVAVGIAVSIDYPRRAPWLGWTIGGIGLLAGLYAIEQAVGADPVQWVDFDPRVRPFSSFGNADFYGQFLGVMVIGCAAVLVFMRQRLWLKVLVALLGVMSLWLMVIVKTRGSFIGVVAGGIIIAALWLRRS
Ga0307469_1177745323300031720Hardwood Forest SoilVFWREVALLLVCVEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGVLIVIALVEGLRIPLSPLFLAFYAVVAIEALTTVTAENQYVALYGEVGRYLGLTTHAVLALLAAAIAVSIDYPRRNSWLAWTIGAAGTLAGLYAVQQA
Ga0307473_1043931723300031820Hardwood Forest SoilVFWREVALLLVCVEIAIFVLAFDPSVLNVFDLTKASFTHALAWGLLGVLIVIALAEGLRIPLSPLFLAFYAVVAVEALTTVTAENQYVALYGEVGRYLGLTTHAVLALLAVAIAVSIDYPRRNSWLAWTIGAAATLAGLYAVQQALGLDPVQWVDLDPRVRPFSTFGNPDFYGQFLAVV
Ga0307470_1050138223300032174Hardwood Forest SoilVAEARAARRAKRTPNRAPNKAPNKAQKERRDRPIARGVFWRELALVLACAEIGVFVLAFDPSVLNVFDLTKASFTHALAWGLLGTLIVIALGDGFRIPLSPVFLAFYAVVAVEVLTTFTAENQYVALYGEVGRYLGLTTHLVLALIAVAIAVSIDYPRRISWVAWAIGAAATIA
Ga0307471_10083584423300032180Hardwood Forest SoilVADARAARRAAQKKLRRDRPVDRPVVRGVFWREVALLVVCVEIAIFILAFDPSVLNVFDLTKASFTHALAWGLLGTLVVIGLGNGLRVPLSPIFVAFYVVVAIEALTTVTAENQYVALYGEVGRYLGLTTHAVLALIAVAIAVTIDYPRRTSWLAWAIGAAAALAGLYAVQQALGRDPVQWVDFDPRIRPFSTFGNADFYGQFLAVVAT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.