NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F081124

Metagenome / Metatranscriptome Family F081124

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F081124
Family Type Metagenome / Metatranscriptome
Number of Sequences 114
Average Sequence Length 127 residues
Representative Sequence MKFLPLVLTLLLAQIPKYNPNGIWESESGSQYELLLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK
Number of Associated Samples 88
Number of Associated Scaffolds 114

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction Yes
3D model pTM-score0.80

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil
(37.719 % of family members)
Environment Ontology (ENVO) Unclassified
(66.667 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(82.456 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 5.66%    β-sheet: 47.80%    Coil/Unstructured: 46.54%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.80
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
f.4.1.4: OMPA-liked5b5eo_5b5e0.7
f.4.1.1: OMPA-liked1p4ta_1p4t0.69
b.61.1.0: Avidin/streptavidind4z2oa_4z2o0.67
b.61.1.1: Avidin/streptavidind1y55x11y550.66
b.61.1.0: Avidin/streptavidind4bj8a_4bj80.66


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 114 Family Scaffolds
PF10066DUF2304 15.79
PF13439Glyco_transf_4 13.16
PF13432TPR_16 4.39
PF13579Glyco_trans_4_4 4.39
PF00535Glycos_transf_2 1.75
PF07719TPR_2 1.75
PF00664ABC_membrane 0.88
PF04321RmlD_sub_bind 0.88
PF13231PMT_2 0.88
PF08409TMTC_DUF1736 0.88
PF00534Glycos_transf_1 0.88
PF13181TPR_8 0.88
PF01370Epimerase 0.88
PF00288GHMP_kinases_N 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 114 Family Scaffolds
COG0451Nucleoside-diphosphate-sugar epimeraseCell wall/membrane/envelope biogenesis [M] 1.75
COG0702Uncharacterized conserved protein YbjT, contains NAD(P)-binding and DUF2867 domainsGeneral function prediction only [R] 1.75
COG1086NDP-sugar epimerase, includes UDP-GlcNAc-inverting 4,6-dehydratase FlaA1 and capsular polysaccharide biosynthesis protein EpsCCell wall/membrane/envelope biogenesis [M] 1.75
COG1087UDP-glucose 4-epimeraseCell wall/membrane/envelope biogenesis [M] 0.88
COG1088dTDP-D-glucose 4,6-dehydrataseCell wall/membrane/envelope biogenesis [M] 0.88
COG1089GDP-D-mannose dehydrataseCell wall/membrane/envelope biogenesis [M] 0.88
COG1090NAD dependent epimerase/dehydratase family enzymeGeneral function prediction only [R] 0.88
COG1091dTDP-4-dehydrorhamnose reductaseCell wall/membrane/envelope biogenesis [M] 0.88


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil37.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil17.54%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil11.40%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil9.65%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil7.02%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil4.39%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.63%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.75%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.75%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.75%
Thermal SpringsEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs0.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.88%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.88%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.88%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300004104Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF218 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004139Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF230 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005538Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009444Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010085Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010103Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010107Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010109Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010115Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010122Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010124Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010136Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010139Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_2_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010140Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012386Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012392Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012393Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012397Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012398Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012399Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012401Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012402Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012403Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012405Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012407Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012409Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012410Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300022533Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300027869Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300034643Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_120 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
soilH2_1040002423300003324Sugarcane Root And Bulk SoilMKLSLLALVFLLAQIPKNSANGIWESTSGTRYELRQNGPNLQVKLVPGSNPKYVQYEVSLKNQDEINTYKGTGSFVAKMEGGKECKFETEWTLVVVSAERILGDSTNIVADKNTCEIKQKNQGQLDLKKKK*
Ga0058891_156683423300004104Forest SoilTQIPKNSPNGIWESTSGVQYEIRQNGADVRVKMVPGSNSKYLQYDVTLKNQDEINTYKGTGTFVAKMESGKECKFETEWQLVVVSPDRILGSTTGVQADKNTCAIKEKNEAQLDLKKKK*
Ga0058897_1095639613300004139Forest SoilLILLLTQIPKNNATGVWESNSGARYEIRQNGSDLQVKLVPGSNPKYVQYDVTLKNQDEINSYKGTGTFVAKMEGGKECKFDTEWQLVVVAPDRILGVTTGVQADKNTCAIKEKDQAQLDLKKKK*
Ga0058897_1098763223300004139Forest SoilLILLFAQIPRNNPNGIWESTSGVRYEIRQNGADLQVKLVPGSNAKYVQYDVTLKNQDEINTYKGTGTFVAKMESGKECKFDTEWQLVVVSAERILGVTTGVEADKKTCAIKEKNQAQLDLKKKK*
Ga0066674_1001600213300005166SoilMRLPVLALTLLVVQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0066674_1011537423300005166SoilMILLPLIAGLLLQQLPKNNPNGIWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTIIADKETCRVQEKNQTQLDLKRKH*
Ga0066674_1012669623300005166SoilMKFLPLVLTLLLAQIPKYNPNGVWESESGSQYELVLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0066672_1022930923300005167SoilMILLPLIAGLLLQQLPKNNPNGIWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTVIADKETCRVREKNQTQLDLKRKH*
Ga0066679_1095990113300005176SoilMILLPLIAGLLLQQLPKNNPNGIWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTIIADKETCRVREKNQTQLDLKRKH*
Ga0066671_1040867513300005184SoilMIGTMKFFLLPLALLMTQAPIPKNVPNGLWEAKSGSKYEIRQNGTDVLVKLVPGSNPKFINYEVTLKNQGEINTYKGTGTFVAKMESGKECKFDTEWMFVVVSPDRIIGSATGIQADKNTCKVTEKNQLQLDLKKK*
Ga0066676_1055379323300005186SoilMKFLPLVLTLLLGQIPKYNPNGIWESESGSQYELLLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKVDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0066676_1061050623300005186SoilVADTGSQYEIHQDGAGVQVTLVPGSNPKFLKYDVALKSQQEINTYKGTGTFTAKMEGGKECKFDTEWMFVVVTPDRILGSTTNIVADSKTCAIRQKNQLQLDLKKKK*
Ga0066686_1028285913300005446SoilMILLALIAGLLLQQLPKNNPNGVWESESGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTVIADKETCRVREKNQTQLDLKRKH*
Ga0070707_10000171093300005468Corn, Switchgrass And Miscanthus RhizosphereLALLVAQKVPKNDPNGVWQADTGSQYELHLTGANLQVKLVPGSNPKFLSYEVTLSNQDEINTYKGTGTFVAKMEGGKECKFETEWQLVVVSADRILGAATGILADKQTCAIREKNQLQLDLKKKK*
Ga0070741_1069236523300005529Surface SoilMKLTFLALALLLAQVPKHDPNGVWVSESGAQYEIHQDGSNVQVKLVPGSNPKFLQYEVALKNQEEVNTYKGTGSFTAKMEGGKECKFTTDWMFIVVTPDRILGSSTGVVADSKTCAIKEKNQVQLDLKKKK*
Ga0070731_1023692833300005538Surface SoilMKLTLLALAFLLTQVPKHDPNGVWVADSGSQYEIRQDGPNVQVKLVPGSNPKFLQYEVALKNQEEINTYKGTGTFVAKMQGGKECKFDTEWMFIVVTPERILGSSTNIVADSKTCAIQEKNQAQLDLKKKK*
Ga0066701_1022933623300005552SoilLTLLLGQIPKYNPNGIWESESGSQYELLLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0066695_1081258913300005553SoilTLLLGQIPKYNPNGIWESESGSQYELVLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0066707_1071861613300005556SoilMKIVALVLALLVAQRVPRNDPNGTWESSTGSKYELRLNGSNLQVKLVPGSNSKYITYEVALKNQEEINSYKGAGTFVAKMEGGKECKFETEWQLVVVSADRILGATTGVLADKETCAIKEKNQVQLDLKKVK*
Ga0066698_1071671013300005558SoilMKLLPLALTLLLAQIPKYNPTGVWEADTGSQFELRLDGSDVHVKIVPGSNPKFLEYELETKSQEEANSYKGSGFFVAKMEGGKECKLPTEWQFVVVSPERIIGVASSIIADRETCEVKEKSQVQLDLKKKK*
Ga0066698_1085308823300005558SoilLPLIAGLLLQQLPKNNPNGIWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTIIADKETCRVREKNQTQLDLKRKH*
Ga0066706_1088773723300005598SoilMILLPLIAGLLLLQIPKNNPNGIWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTIIADKETCRVREKNQTQLDLKRKH*
Ga0066656_1003333923300006034SoilLIFRFFLHNFHLSLHGMKFLPLVLTLLLAQIPKYNPNGVWESESGSQYELVLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0066656_1024570823300006034SoilPMILLPLIAGLLLQQLPKNNPNGIWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTVIADKETCRVREKNQTQLDLKRKH*
Ga0066652_10109234313300006046SoilMKIVALILTLLVAQKVPRNDPNGTWESSTGSKYELRLNGSNLQVKLVPGSNSKYIIYDVTLKNQEEINSYKGAGTFVAKMEGGKECKFETEWQLVVVSADRILGATSGVLADKETCAIKEKNQVQLDLKRVK*
Ga0066665_1047079723300006796SoilMKFLTFVLTLLTAQIPKYNPTGVWEAETGSQFELRLSGSDLHVKIVPGSNPKFLEYELDMKSQDEVNTYKGSGFFVAKMEGGKECKLPTEWEFVVVSPDRIIGAATLVMANRETCEITEKSQGQLDLKKKK*
Ga0075434_10024163813300006871Populus RhizosphereMKLFALALALLVAQKVPKNDPNGLWLADSGSQYELHLNGANLQVKLVSGSNPKFLSYEVTLTNQDEINTYKGTGTFVAKMEGGKECKFETEWQLVVVSPDRILGGATGILADSKTCAIKEKNQLQLDLKKKK*
Ga0075426_1128082313300006903Populus RhizosphereLLAQIPKHNPAGLWQADTGSQFEIRLANGDVQVKLVPGSNPKFIRYEVALKNQQEINTYKGTGTFVAKMDNGKQCMFDIEWTFVVVSPERIIGVASDIFPDPNTCAIKQKSQVQLDLKKKK*
Ga0066710_10016864143300009012Grasslands SoilGVWEADTGSQFELRLDGSDVHVKIVPGSNPKFLEYELETKSQEEANSYKGSGFFVAKMEGGKECKLPTEWQFVVVSPERIIGVASSIIADRETCEVKEKSQVQLDLKKKK
Ga0066710_10037787533300009012Grasslands SoilMNLFILALALLQAPIPRNNPAGIWDATSGSKYEIHQNGADVQVTLVPGSNPKFTKYEVTLKNQSEPNTYKGTGTFTAKMESGKECKFDTEWMFVIVSPDRILGTATGIIADKNTCRITEKNQLQLDLKKKK
Ga0066710_10041327633300009012Grasslands SoilMKFLTFVLTLLTAQIPKYNPTGVWEAETGSQFELRLSGSDLHVKIVPGSNPKFLEYELDMKNQDEVNTYKGSGFFVAKMEGGKECKLPTEWEFVVVSPDRIIGAATLVMANRETCEITEKSQGQLDLKKKK
Ga0066710_10047634023300009012Grasslands SoilLPLIAGLLLQQLPKNNPNGIWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTVIADKETCRVREKNQTQLDLKRKH
Ga0066710_10080985723300009012Grasslands SoilMKLVVLALSLLLSQIPKHDPNGIWEAATGSQFDIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQDEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTREIRRKDQAQLELKKTKR
Ga0066710_10457812513300009012Grasslands SoilMKFTVLVLALLAAQGPKNDPSGVWVADTGSQYEIHQDGAGVQVTLVPGSNPKFLKYDVALKSQQEINTYKGTGTFTAKMEGGKECKFDTEWMFVVVTPDRILGSTTNIVADSKTCAIRQKNQLQLDLKKKK
Ga0066709_10005450033300009137Grasslands SoilMKLSILALALLLAQIPKYNPNGVWQADSGSQYDIRLTGSNIHVQMVAGSNPKFLRYEVDMKNQDEVNTYKGNGTFVAKMEGGKECKFDTEWQFVVVSPDRIIGVTTGITADKNSCEIKQKDQLQLDLKKKK*
Ga0066709_10024570223300009137Grasslands SoilMKLLPLALTLLLAQIPKYNPTGVWEADTGSQFELRLDGSDVHVKIVPGSNPKFLEYELETKNQEEANSYKGSGFFVAKMEGGKECKLPTEWQFVVVSPERIIGVASSIIADRETCEVKEKSQVQLDLKKKK*
Ga0066709_10161014923300009137Grasslands SoilMKFLTFVLTLLTAQIPKYNPTGVWEAETGSQFELRLSGSDLHVKIVPGSNPKFLEYELDMKNQDEVNTYKGSGFFVAKMEGGKECKLPTEWEFVVVSPDRIIGAATLVMANRETCEITEKSQGQLDLKKKK*
Ga0066709_10269852223300009137Grasslands SoilMKLVVLALSLLLSQIPKHDPNGIWEAATGSQFDIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTVSFVAKMQSGKECKFETEWQFVVVSPERIIGASTNVTADKNTCEIRRKDQAQLELKKTKR*
Ga0114945_1036339823300009444Thermal SpringsQIPKYSPKGIWETETGSQYELRLTGSDLHVKIVPGSNPKYLQYEVDMKNQEELNTYKGTGFFVAKMEGGKECKFETEWLLVVVSSDRIVGGGTNIIADKETCEIKEKAQVQLNLKKKT*
Ga0126384_1031985423300010046Tropical Forest SoilMRLSVFALTLLVAQIPNYNPTGIWESETGVEYQILQNGADLQVKLVPGSNTKYLQYDVSLKKQDEINTYKGTGTFTAKMEGGKECKFDTEWQLVIVSPARMFGTTTGIQADKKTCVVKEKSQEPVDLKKKK*
Ga0126382_1002164733300010047Tropical Forest SoilMRLSVFALTLLVAQIPNYNPAGIWESETGVEYQILQNGADLQLKLVPGSNTKYLQYDVSLKKQDEINTYKGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGIQADKKTCVVKEKSQEPVDLKKKK*
Ga0127445_100906923300010085Grasslands SoilLALSLLLSQIPKHDPNGIWEAATGSQFDIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0127500_113796013300010103Grasslands SoilLSLLLSQIPKHDPNGIWEAATGSQFDIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQDEVNTYKGTGSFVAKMESGKECKFETEWHFVVVSPERIIGASTNITADKNTCEIRRKDQAQLELKKTKR*
Ga0127494_109185113300010107Grasslands SoilWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0127494_109963923300010107Grasslands SoilLSLLLSQIPKHDPNGIWEAATGSQFDIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQDEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQAQLELKKTKR*
Ga0127497_111765513300010109Grasslands SoilQIPKHDPNGIWEAATGSQFDIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQDEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0127495_108863813300010115Grasslands SoilQIPKHDPNGIWEAVTGSQFEIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQDEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0127488_114082123300010122Grasslands SoilLSQIPKHDPNGTWEAVTGSQFEIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQDEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0127498_109660813300010124Grasslands SoilALTLLVAQIPKYNPAGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0127447_100985323300010136Grasslands SoilVVLALSLLLSQIPKHDPNGVWEAVTGSQFEIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0127464_104734623300010139Grasslands SoilVQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0127456_113378213300010140Grasslands SoilLSQIPKHDPNGIWEAVTGSQFEIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQDEVNTYKGTGSFVAKMESGKECKFETEWHFVVVSPERIIGASTNITADKNTCEIRRKDQAQLELKKTKR*
Ga0134070_1004986323300010301Grasslands SoilMRLPVLALTLLVVQIPKYNPTGIWESETGVEYQILENGPDLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0134088_1000246813300010304Grasslands SoilMRLPVLALTLLVAQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0134088_1003708623300010304Grasslands SoilMILLPLIAGLLLLQIPKNNPNGVWESESGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGTGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTIIADKQTCQVQERNQAQLDLKRKH*
Ga0134088_1025253323300010304Grasslands SoilMKFLPLVLTLLLGQIPKYNPNGIWESESGSQYELLLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0134109_1039737923300010320Grasslands SoilMILLPLIAGLLLQQLPKNNPNGIWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTIIADKETCRV
Ga0134111_1012704923300010329Grasslands SoilMKFLPLVLTLLLAQIPKYNPNGVWESESGSQYELLLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTSKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0134111_1032401913300010329Grasslands SoilMRLPVLALTLLVVQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCA
Ga0134080_1024339223300010333Grasslands SoilMKLLPLALTLLLAQIPKYNPTGVWEADTGSQFELRLDGSDVHVKIVPGSNPKFLEYELETKSQEEANSYKGSGFFVAKMEGGKECKLPTEWQFVVVSPERIIGVASSIIADRE
Ga0134080_1054664323300010333Grasslands SoilMRLPVLALTLLVVQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAV
Ga0134071_1000357333300010336Grasslands SoilMILLALIAGLLLQQLPKNNPNGVWESESGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGTGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTIIADKQTCQVQERNQAQLDLKRKH*
Ga0134071_1023664823300010336Grasslands SoilMRLSVFALTLLVAQIPKYNPAGIWESETGVEYQILQNGDDLQVKLAPGSNTKYLQYDVSLKKQDEINTYKGTGTFTAKMEGGKECQFDTEWLLVIVSPARIFGTTTGIQADKKTCAVKEKNQEQMDLKKKK*
Ga0126370_1134139313300010358Tropical Forest SoilMKYIVLALALLQAPLSKNSPNGAWQANSGSVYDIKQNGTDVQVVMVPGTNAKLRNYEVTLKNQDEPNTYKGTGTFIATMEGGKECKFTTEWMFVVVSPDRIIGTATGINADSKTCEIKERPQLQLDLKKKK*
Ga0126376_1057675213300010359Tropical Forest SoilMRLSVLALTLLVAQIPNYNATGIWESETGVQYQILQNAAHLQVKLAPGSNTKYLQYDVSLKKQDEINTYKGTGTFTAKMEGGKECKFDTEWQLVIVSPARMFGTTTGIQADKKTCAVKEKSQEPVDLKKKK*
Ga0126372_1225753223300010360Tropical Forest SoilMKFFVLALALLQAPVPKNNPNGTWQAVSGSAYEIKQNGTSVQVALVAGSNAKFKNYEVTLMNQDEANTYKGTGTFIAKMESGKECKFTTEWMFVVVSPDRIIGTATGINADKNTCEIKERPQLQLDLKKKK*
Ga0126383_1131521413300010398Tropical Forest SoilMRLSVFALTLLVAQIPNYNPAGIWESETGVEYQILQNGADLQVKLVPGSNTKYLQYDVSLKKQDEINTYKGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGIQADKKTCVVKEKSQEPVDLKKKK*
Ga0150983_1183641213300011120Forest SoilLTLLLAQVPKHDPNGIWVADTGSQYQIQQNGSNVQVKLVPGSNPKFLQYEVALKNQEEINTYKGTGTFVAKMEGGKECKFETEWMFVVVTPERILGSATGIVADSKTCAIREKNQLQLDLKKKK*
Ga0150983_1297231113300011120Forest SoilPKNSPAGIWESTSGVQYEIRQNGANVQVKMVPGSNSKYLQYDVTLKNQDEINTYKGTGTFVAKMESGKECKFETEWQLVVVSPDRILGSTTGVQADKNTCAIKEKNEAQLDLKKKK*
Ga0137364_1020353123300012198Vadose Zone SoilMKLSLLALTLLLAQVPKNNPNGIWESTSGVKYEIRQNGANLQVKMVPGSNPKYVQYDVTLKNQDEINTYKGTGTFVAKMEGGQECKFDTEWQLVVVAAERILGTTTGVEADKKTCAIKEKNQALLDLKKKK*
Ga0137383_1044759613300012199Vadose Zone SoilMKFLPLVLTLLLAQIPKYNPNGIWESESGSQYELLLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0137399_1043564523300012203Vadose Zone SoilMKLLALALTLLFSQQIPKNNPNGVWEAESGSQFELRLAGSDLHVKLVPRSNAKFLQYEIEMKSEEEVNTYSGKGFFVAKMEGGKECKLPAQWRLIVVSPDRIIGIASTVTADQETCEVKETGQAQLDLKKKK*
Ga0137381_1035866623300012207Vadose Zone SoilMRLSVLALTLLVAQIPNYNPTGIWESETGVEYQMLQNGADLQVKLVAGSNTKYLQYDVSLKKQDEINTYKGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGIQADKKTCAVKEKNQVQLDLKKKK*
Ga0150985_10280544413300012212Avena Fatua RhizosphereMKIGFLGLILLLAQVPKNNATGVWESLSGAKYEIRQNGDSLQVKLVPGSNPKYIQYEVTLKNQDELNSYKGAGTFVAKMETGKECKFDTEWVMVIVSPDRIVGTSTGFLADKKTCAIQEKNQAQLDLRKKK*
Ga0137387_1007799013300012349Vadose Zone SoilMKFLPLVLTLLLAQIPKYNPNGIWESESGSQYELVLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0137386_1043813213300012351Vadose Zone SoilTMRLSVLALTLLVAQIPNYNPTGIWESETGVEYQMLQNGADLQVKLVAGSNTKYLQYDVSLKKQDEINTYKGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGIQADKKTCAVKEKNQEPMDLKKKK*
Ga0137371_1009342023300012356Vadose Zone SoilMKLLPLALTLLLAQIPKYNPTGVWEADTGSQFELRLDGSDVHVKMVPGSNPKFLEYELETKNQEEANSYKGSGFFVAKMEGGKECKLPTEWQFVVVSPERIIGVASSIIADRETCEVKEKSQVQLDLKKKK*
Ga0137371_1075330413300012356Vadose Zone SoilRRPGSLIFRFFLHNFHLSLHGMKFLPLVLTLLLAQIPKYNPNGIWESESGSQYELLLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0137384_1051126423300012357Vadose Zone SoilMKFLPLVLTLLLAQIPKYNPDGVWESESGSQYELVLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK*
Ga0134046_109668423300012386Grasslands SoilLLLSQIPKHDPNGIWEAVTGSQFEIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0134046_129424323300012386Grasslands SoilLTLLVVQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0134043_128753113300012392Grasslands SoilIAGLLLQQLPKNNPNGIWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNMKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTVIADKETCRVREKNQTQLDLKRKH*
Ga0134052_112855323300012393Grasslands SoilLSLLLSQIPKHDPNGIWEAATGSQFDIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQAQLELKKTKR*
Ga0134056_127833313300012397Grasslands SoilALTLLVVQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGIQADKKTCALKKRIRSR*
Ga0134051_118570713300012398Grasslands SoilLLLSQIPKHDPNGVWEAVTGSQFEIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0134061_137202823300012399Grasslands SoilVLALSLLLSQIPKHDPNGIWEAATGSQFDIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQAQLELKKTKR*
Ga0134055_118622413300012401Grasslands SoilQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECQFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0134059_129623613300012402Grasslands SoilALTLLVVQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0134049_117758923300012403Grasslands SoilLSQIPKHDPNGVWEAVTGSQFEIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0134041_104007013300012405Grasslands SoilWESDSGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGAGFFVAKMESGKECKFDTEWQFVVVSPERIIGGATTVIADKETCRVRRKIRRSST*
Ga0134050_116312613300012407Grasslands SoilTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0134050_140508133300012407Grasslands SoilQIPKHDPNGVWEAVTGSQFEIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0134045_119180223300012409Grasslands SoilLSQIPKHDPNGIWEAVTGSQFEIRLTGSNLHVKLVPGSNPKFLQYEVDMKNQEEVNTYKGTGSFVAKMESGKECKFETEWQFVVVSPERIIGASTNITADKNTCEIRRKDQTQLELKKTKR*
Ga0134060_108740513300012410Grasslands SoilALIAGLLLQQLPKNNPNGVWESESGTQYELRLNGADLQVKLVPGSNQKFLQYEVNLKNQEEINTYKGTGFFVAEMESGKECKFDTEWQFVVVSPERIIGGATTIIADKQTCQVQERNQAQLDLKRKH*
Ga0137397_1000042523300012685Vadose Zone SoilMKLLALVLTFLISQQIPKNNPNGVWEADTGSQYELRLSGSDLHVKMVSGSNSKFLQYEVDMTNEKEINTYKGTGFFVAKMEGGKECKLATEWQFVVVSNDRIIGAATAVIADQKTCQVLEKNQVQIDLKKRK*
Ga0137359_1107989823300012923Vadose Zone SoilMKFIVLVLALLAAQGPKNDPSGVWVADTGSQYAIHQDGAGVQVTLVPGSNPKFLKYDVALKSQQEINTYKGTGTFTAKMEGGKECKFDTEWMFVVVTPDRILGSTTNIVADSKTCAIRQKNQLQLDLKKKK*
Ga0126375_1003945213300012948Tropical Forest SoilWESETGVEYQILQNGADLQVKLVPGSNTKYLQYDVSLKKQDEINTYKGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGIQADKKTCAVKEKSQEPVDLKKKK*
Ga0126369_1291740523300012971Tropical Forest SoilMKYGITNATGTVVSSHSMKLFVLALALLQAPIPKNSPNGTWQADTGSVYEIKQNGADVQVAMVPGSNAKLRSYEVTLTNQSEANTYKGTGTFIAKMESGKECKFTTEWMFVVVSPDRIIGTVTGINANKDTCEIKERPQLQLDL
Ga0134077_1017908823300012972Grasslands SoilMNLLPIVLTLLLAQIPKHDPTGVWEADTGSQFELRLTGSGLHVKIVPGSNPKFLDYELDMKNEDEVNTYKGSGFFVAKMEGGKECKLPTEWQFVVVSPERIIGVATLVMANKETCEITEKGQGQLDLKKKK*
Ga0134076_1006207533300012976Grasslands SoilMRLSVFALTLLVAQIPKYNPAGIWESETGVEYQILQNGDDLQVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK*
Ga0132258_1058291433300015371Arabidopsis RhizosphereMKLSLLALVLLVAQIPKNNATGVWQESNSGSKYEIHQNGQNVQVKLVAGSNPKFLQYEVALKNQDEVNTYKGTGTFVAKMESGKECKFETEWQFVVVSSERILGTTTRVLADKNTCAIKEKSQTQLDLKKQK*
Ga0132257_10143031723300015373Arabidopsis RhizosphereMKLSLLALVLLVAQIPKNNATGVWQESNSGSKYEIHQNGQNVQVKLVAGSNPKFLQYEVALKNQDEVNTYKGTGTFVAKMESGKECKFDSDWQFVVVSMERIIGSSTNIIADKDTCEIKEKSQVQIDLKKKK*
Ga0134074_112570113300017657Grasslands SoilMKLLPLALTLLLAQIPKYNPTGVWEADTGSQFELRLDGSDVHVKIVPGSNPKFLEYELETKSQEEANSYKGSGFFVAKMEGGKECKLPTEWQFVVVSPERIIGVASSIIADRETCEVKEKSQVQLDLKKKK
Ga0134074_118728223300017657Grasslands SoilMRLPVLALTLLVVQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQM
Ga0134083_1001468723300017659Grasslands SoilMRLSVFALTLLVAQIPKYNPAGIWESETGVAYQILQNGDDLQVKLAPGSNTKYLQYDVSLKKQDEINTYKGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGIQADKKTCAVKEKNQEQMDLKKKK
Ga0134083_1024247113300017659Grasslands SoilQIPKHDPTGVWEADTGSQFELRLTGSGLHVKIVPGSNPKFLDYELDMKNEDEVNTYKGSGFFVAKMEGGKECKLPTEWQFVVVSPERIIGVATLVMANKETCEITEKGQGQLDLKKKK
Ga0066655_1001731523300018431Grasslands SoilMRLPVLALTLLVVQIPKYNPTGIWESETGVEYQILENGADLEVKLVPGSNTKYLQYDVSLKKQDEINTYRGTGTFTAKMEGGKECKFDTEWLLVIVSPARMFGTTTGILADKKTCAVKEKNQEQMDLKKKK
Ga0066662_1124758523300018468Grasslands SoilMKFLPLVLTLLLGQIPKYNPNGIWESESGSQYELLLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKK
Ga0242662_1019900513300022533SoilAQAPIPRNNATGIWQSDTGTSWEIRQNGADLQVKLAPGSNPKYLQYEVTLKNQDEINTYKGTGSFVAKMEGGKECKFETEWQLVVVSSDRILGVTTGVQADKDTCAIKEKNQLQLDLKKK
Ga0207646_1000966653300025922Corn, Switchgrass And Miscanthus RhizosphereLALLVAQKVPKNDPNGVWQADTGSQYELHLTGANLQVKLVPGSNPKFLSYEVTLSNQDEINTYKGTGTFVAKMEGGKECKFETEWQLVVVSADRILGAATGILADKQTCAIREKNQLQLDLKKKK
Ga0209761_112460823300026313Grasslands SoilMKLLALALTFLISQQIPKNNPNGVWEADTGSQYELRLSGSDLHVKMVPGSNPKFLQYEVDMKNQEEINSYKGTGFFVAKMEGGKECKLPTEWQFVVVSNDRIIGAVTSVVADQQTCQVREKTQVQLDLKKKK
Ga0209266_105322423300026327SoilMKFLPLVLTLLLAQIPKYNPNGVWESESGSQYELVLNGSDLHVKLVPGSNPKFLQYEVDTKNQEEVNTYKGTGFFVAKMDTGKECKLSTEWQFVVVSPERIIGITTSITADQNTCEVKEKSQTQLDLKKKK
Ga0209579_1025776723300027869Surface SoilMKLTLLALAFLLTQVPKHDPNGVWVADSGSQYEIRQDGPNVQVKLVPGSNPKFLQYEVALKNQEEINTYKGTGTFVAKMQGGKECKFDTEWMFIVVTPERILGSSTNIVADSKTCAIQEKNQAQLDLKKKK
Ga0370545_120528_160_5583300034643SoilMKVLALAFTLLVSQIIPKNNPTGIWEADTGSQFELRLEGSDLHVTIVPGSSPKFLQYQVEMKNLDEINSYKGTGFFVAKMESGKECKFETEWQLIVVSPDLIIGGGTSVTADKDTCKIEETTQIQLRLNKKK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.