NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F088469

Overview

Basic Information
Family ID: F088469
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 109
Average Sequence Length: 179 residues
Representative Sequence: MAAKLSFVPALVAGLLFPLLAQAQDRDDSELARALAQRGWFDLAEEICDRLEKGSSRALVNYIRAEIQLGKVDRESEFAKASEGLASAAGFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAVDAIDMESDATKHAELQKQAIQSYADAGKYYQDTIEKLKKEKSER
Number of Associated Samples: 90
Number of Associated Scaffolds: 109

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 100.00 %
% of genes near scaffold ends (potentially truncated): 0.92 %
% of genes from short scaffolds (< 2000 bps): 0.00 %
Associated GOLD sequencing projects: 89
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.77

Most Common Taxonomy
Group: Unclassified (99.083 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere (13.761 % of family members)
Environment Ontology (ENVO): Unclassified (27.523 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (33.945 % of family members)
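
The percentages above can be turned back into approximate member counts. The sketch below assumes each percentage is taken over the 109 family sequences reported in the overview; the page does not state the denominator explicitly, and the labels and the helper name percent_to_count are illustrative only.

```python
# Convert reported percentages back to approximate member counts.
# Assumption: every percentage is relative to the 109 family sequences.
FAMILY_SIZE = 109  # "Number of Sequences" from the overview

reported_percentages = {
    "Taxonomy: Unclassified": 99.083,
    "GOLD: Populus Rhizosphere": 13.761,
    "ENVO: Unclassified": 27.523,
    "EMPO: Plant rhizosphere": 33.945,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a reported percentage."""
    return round(percent / 100 * total)


counts = {label: percent_to_count(p) for label, p in reported_percentages.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} members")
```

Under that assumption, the smallest non-zero value seen elsewhere on the page, 0.92 %, corresponds to a single member.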



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix 71.43%, β-sheet 0.00%, Coil/Unstructured 28.57%

Predicted 3D Structure

pTM-score: 0.77

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
a.118.8.0: automated matches | d5mjza_ | 5mjz | 0.72216
a.118.8.1: Tetratricopeptide repeat (TPR) | d1zb1a_ | 1zb1 | 0.70327
a.118.1.30: FAT domain | d4jspa3 | 4jsp | 0.69059
a.118.7.1: 14-3-3 protein | d3efza1 | 3efz | 0.68766
a.118.8.0: automated matches | d6kp3a_ | 6kp3 | 0.68197


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 109 Family Scaffolds
PF00432 | Prenyltrans | 3.67
PF13243 | SQHop_cyclase_C | 2.75
PF00005 | ABC_tran | 0.92
PF13249 | SQHop_cyclase_N | 0.92



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 99.08 %
All Organisms | root | All Organisms | 0.92 %


Associated Scaffolds


Scaffold: 3300012944|Ga0137410_10000033
Taxonomy: All Organisms → cellular organisms → Bacteria
Length: 94483

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 13.76%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.01%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 10.09%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 7.34%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 6.42%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 6.42%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 5.50%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 4.59%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 3.67%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.75%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 2.75%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.83%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.83%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 1.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.83%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.92%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.92%
Sediment (Intertidal) | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal) | 0.92%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.92%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.92%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.92%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.92%
Permafrost Soil | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost Soil | 0.92%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.92%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.92%
Soil | Environmental → Terrestrial → Soil → Sand → Desert → Soil | 0.92%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.92%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere | 0.92%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.92%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.92%

Associated Samples

Taxon OID | Sample Name | Habitat Type
3300003321 | Sugarcane bulk soil Sample H1 | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005333 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-3 metaG | Host-Associated
3300005340 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG | Environmental
3300005438 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005456 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaG | Host-Associated
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005829 | Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.190_CBC | Environmental
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300006853 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4 | Host-Associated
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009661 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil DNA_2013-062 | Environmental
3300009678 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 | Environmental
3300010364 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011439 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT820_2 | Environmental
3300011440 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT840_2 | Environmental
3300011445 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT700_2 | Environmental
3300012039 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT534_2 | Environmental
3300012041 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT754_2 | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012902 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S169-409C-1 | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300015372 | Soil combined assembly | Host-Associated
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300019360 | White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaG | Environmental
3300019362 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2) | Environmental
3300019487 | White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaG | Environmental
3300020067 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLIBT47_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300021062 | Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_10-13C | Environmental
3300025936 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes) | Environmental
3300027675 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015 (SPAdes) | Environmental
3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated
3300027965 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2 (SPAdes) | Environmental
3300028802 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 17_S | Environmental
3300031099 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_152 (Metagenome Metatranscriptome) | Environmental
3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental
3300031668 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f23 | Environmental
3300031716 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031908 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1 | Environmental
3300031944 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D1 | Environmental
3300031945 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032001 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2) | Environmental
3300032013 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D3 | Environmental
3300032075 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 | Environmental
3300032157 | Garden soil microbial communities collected in Santa Monica, California, United States - V. faba soil | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental
3300032421 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3 | Environmental
3300033412 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NC | Environmental
3300034660 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R2 (Metagenome Metatranscriptome) | Environmental
3300034663 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R1 (Metagenome Metatranscriptome) | Environmental
3300034664 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R3 (Metagenome Metatranscriptome) | Environmental
3300034667 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R1 (Metagenome Metatranscriptome) | Environmental
3300034668 | Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0R1 (Metagenome Metatranscriptome) | Environmental
3300034670 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R4 (Metagenome Metatranscriptome) | Environmental
3300034673 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24R3 (Metagenome Metatranscriptome) | Environmental

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
soilH1_101133701 | 3300003321 | Sugarcane Root And Bulk Soil | MAAKLSFIPALVAGLLFPLLAQAQDRDDSELARALAQRGWFDLAEEICDRLEKGSSRALVNYIRAEIQLGKVDRESEYAKASEGLASAASFLKKFLDENPTHALALEAQTSIGWVQARKGRLAIDAIDMESDASKHAELQKQAIQAYADAGKYYMDTIEKLKKEKSERAMDALMDARLELPRVLIDHAKISSVDDATKKKLLTQANALLVDFEFDYGDRPIAFEAMLEGGKCLTELGEYKQAESKLRATFALRKRLAEAKIKPNDYHNKIIYGAYIALA
Ga0062593_1024980941 | 3300004114 | Soil | MLAKLSIVPALAAALLVPVLAHAQDRDDSELARVLAQRGWFDLAEEICDKLDKGSARALVPYIRADIQLGKVERETEFPKAVEGLTAATGFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDALEIE
Ga0062589_1013759381 | 3300004156 | Soil | MLAKLSFVPALAAALLVPLVAHAQDRDDSELARVLAQRGWFDLAEEICDKLDKGSARALVPYIRADIQLGKVERETEFPKAVEGLTAATGFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDALEIE
Ga0063356_1014972661 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MLAKLSFVPALAAALLVPVLAHAQDRDDSELARVLAQRGWFDLAEEICDKLDKGGARALVPYIRAEIQLGKVERETDFNTAVKGLTEAASFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHSELQKLAIQAYSDAGKYYEDTISKLKKEKQTDRVLDALMDARLELPRVMIDHAKVTGVDEVAKKRLLTGANTLLVDFEFDYGDRPIAFEAMLEG
Ga0063356_1028212802 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MIARVLSMGAFLLASLLMPVVVLAQDEVELARVLAQRGWFDLAEEICQRIEKGPSRSMASFIRAEIKLGAVDRDSNFDKSLDGLAEAALLLKKFLADSPNHPLALDAQTTIGWVEGRKGRLALDAMEVESDPARHSDLQKMAIQAYSD
Ga0063356_1028716681 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MAAKLSFVPALVAGLLFPLLAQAQDRDDSELARALAQRGWFDLAEEICDRLEKGSSRAMVNYIRAEVELGKVDRESEFAKASDGLAKAAGFLKKFLDENGTHPLALEALTSIGWVQARKGRLAIDAIDMESDASKHADLQKQAVQAYSDAAKYYADTVEKLKKEKSERAMDALMDARLQLPRV
Ga0063356_1032551211 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MLAKLSFVPALVAALLVPLIAHAQERDDSELARVLAQRGWFDLAEEICDKLDKGAARALVPYIRAEIQLGKVERESDYATADKGLTDAAGFLKKFLDENPSHAMALEAQTSIGWVQARKGRLAIDALEIESD
Ga0070690_1017689801 | 3300005330 | Switchgrass Rhizosphere | RVLAQRGWFDLAEEICDRLDKGGARALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHAMALEAQTSIGWVQARKGRLAIDALEIESDASKHAELQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDARLELPRVLIDHAKVTGVDDIAK
Ga0066388_1010841582 | 3300005332 | Tropical Forest Soil | MAAKLSIVPAFLAGLLIPLLAHAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSNRATVNYIRAEVELGKVDRESEYPKASEGLAKAAGFLKKFLDENGSHPLALEALTSIGWVQARKGRLAIDAIDMESDPSKHADLQKAAVQAYSDAAKYYQDTVEKLKKEKSERAMDALMDARLQLPRVLIDHARISTVDDATKKKLLKQAND
Ga0070677_107535691 | 3300005333 | Miscanthus Rhizosphere | VAGLLFSLTAYAQDDSDLARALAQRGWFDLAEEICDRLDKGAARNMVPFIRAEIKLGQVDRETDFAKSSQGLADAAGLYKKFLDESPTHPNALEAQTNIGWVLARKGRLAVDAIDLESDATKHADLQKQAIQAYTDAEKYYGDTIEKLKKEKSDRAQDALMDARLEMPRILIDHAKLSGVDDAS
Ga0070689_1011782691 | 3300005340 | Switchgrass Rhizosphere | MAAKLSFVPALVAGLLFPLLAQAQDRDDSELARALAQRGWFDLAEEICDRLEKGSSRALVNYIRAEIQLGKVDRESEFAKASEGLASAATFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDAIDLESDPSKHADLQKQAIQAYADAGKYYQDTIEKL
Ga0070701_112859581 | 3300005438 | Corn, Switchgrass And Miscanthus Rhizosphere | MLAKLSFVPALAAALLVPLVAHAQERDDSELARVLAQRGWFDLAEEICDRLDKGSARALVPYIRAEIQLGKVERETEFPKAVEGLTTASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESD
Ga0070705_1013337611 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MLAKLSFVPALVAALLVPLLAHAQERDDSELARVLAQRGWFDLAEEICDRLDKGSARALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDATKHSELQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDAL
Ga0070694_1010128541 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | AEEICDRLDKGSARALVPYIRAEIQLGKVERETEFPKAVEGLTTASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHAELQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDARLELPRVLIDHAKVTGVDDIAKKKLLTQANSLLVDFEFDYGDRPIAFEAMLEGGKCLTELGEYKQAESKLRATFALRKRLAEAKIKPNDYHNKIIF
Ga0070708_1007621451 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MVAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSARSMVNYIRAEIQLGKVERETEFPKASQGLTDAAAFLKKFLDENPTHAMALEAQTSIGWVQARKGRLAMDAVELESDASKHSDLQKMAIQAYSDAGKYYQDTIEKLKKEKGERAQDALMDARLELPRVLIDHARISSVDDASKKKLLTQANALLVDFEFDYGDRPIAFEAMLEGGKCLTELGEFKQAE
Ga0066687_102190522 | 3300005454 | Soil | MVAKLSFVPALVAGFLFPLLAQAQDRDDTELARALAQRGWFDLAEEICDRLEKGSNKSMVNYIRAEIQLGKVDRETEFPKASQGLADAAGFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDAVELESDASKHAELQKMAIQAYSDAGKYYSDTIDKLRKEKQSDRVLDALMDARLELPRVLIDHAKISG
Ga0070678_1011607621 | 3300005456 | Miscanthus Rhizosphere | MSAKLSFVPALVAGLLFPLMAQAQDRDDSELARALAQRGWFDLAEEICDRLEKGSSRALVNYIRAEIQLGKVDRESEFAKASEGLASAASFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDAIDLESDPSKHADLQKQAIQAYADAGKYYQDTIEKLK
Ga0070698_1002444312 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MVAKLSIVTAFLAGLLFPLIAHAQDRDDTELARALAQRGWFDLAEEICDRLEKGSSRAMVNYIRAEVELGKVDRESEFAKASEGLAKAAEFLKKFLSENATHPLALEAQTSIGWVQARKGRLAIDAIELESDASKHADLQKQAIQAYSDAAKYYQDTIEKLKKEKSDRAQD
Ga0070699_1009011121 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | LVTALLFPLLAQAQDRDDTELARALAQRGWFDLAEEICDRLEKGASRAMVNYIRAEVELGKVDRETEFPKASEGLAKAAGFLKKFLDENGTHPLALEAQTSIGWVQARKGRLAIDAIDLESDASKHADLQKQAIQAYSDAAKYYQDTIEKLKKEKSDRAQDALMDARLELPRVLIDHAKISAVDDASKKRLLTQANALLVDFEFDYGDRPIAFEAMLEGGKCLTELGDYKQAETKLKATFALRKRLAEAKIKPNEYHNKIIWGAYIALA
Ga0070733_103786202 | 3300005541 | Surface Soil | MVAKLSMVPALVVGLLFPLLAQAQDRDDTELARALAQRGWFDLAEEICDRLDKGSARSMVNYIRAEIQLGKVDRESEFQKASDGLAQAAGYLKKFIDENPSHPMALEAQTTIGWVQARKGRLAVDAMEIESDPGKHAEL
Ga0070695_1012162461 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | AKLSFVPALVAGLLFPLLAQAQDRDDSELARSLAQRGWFDLAEEICDRLEKSGGARATVPYIRAEIELGKVDRETDFAKSAEGLAKAVGLFKKFLEENPTHAMALEAQTNIGWVQARKGRLAMDAVELEADAAKHADLQKQAIQAYSDAGKYYKDTIEKLKKEKQTDRVLDALMDARLELPRVLIDHAKVSGVDDVAKKRLLTEAN
Ga0070695_1015861241 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MLAKLSFVPALVAALLVPLLAHAQDRDDSELARVLAQRGWFDLAEEICDRLDKGSARALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHSDLQK
Ga0068861_1003541112 | 3300005719 | Switchgrass Rhizosphere | MAAKLSIVSAFLAGLLIPLLAHAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSNRATVNYIRAEVELGKVDRESEYPKASEGLAKAAGFLKKFLDENANHPLALEALTSIGWVQARKGRLAIDAIDLESDPSKHADLQKAAVQAYSDAAKYYMDTVEKLKKE*
Ga0066903_1062078871 | 3300005764 | Tropical Forest Soil | MAAKLSIVPAFLAGLLIPLLAHAQERDDSELARALAQRGWFDLAEEICDRLEKGSNRATVNYIRAEVELGKVDRESEYPKASEGLAKAAGFLKKFLDENGSHPLALEALTSIGWVQARKGRLAIDAIDMESDPSKHADLQKAAVQAYSDAAKYYQDTVEKLKKEKSERAMDA
Ga0066903_1080445511 | 3300005764 | Tropical Forest Soil | MAAKLSFVPALVAGLLFPLLAQAQDRDDSELARALAQRGWFDLAEEICDRLEKGSSRALVNYIRAEIQLGKVDRESEFQKASEGLASAAGFLKKFLDENGSHPLALEALTSIGWVQARKGRLAIDAIDMESDPSKHAD
Ga0074479_109089311 | 3300005829 | Sediment (Intertidal) | MAAKLSFVPALVAGLLFPLLAAQAQDRDDSELARSLAQRGWFDLAEEICDRLEKGSGGSRATVSYIRAEIQLGKVDRETDFAKSSQGLADAVALFKKFLEESPNHPMALEAQTNIGWVQARKGRLAMDAVEVESDAGKHADLQKQAIQAYSDAGKYYMDTIEKLKKEKQTDRVLDALMDARLELPRVL
Ga0068860_1024095851 | 3300005843 | Switchgrass Rhizosphere | MLAKLSFVPALAAALLVPLVAHAQDRDDSELARVLAQRGWFDLAEEICDKLDKGSARALVPYIRADIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDATKHSELQK
Ga0075428_1002274051 | 3300006844 | Populus Rhizosphere | MVAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSSKAMVNYIRAEIKLGQVDREVEFEKSTKGLADAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLAIDAIEVESDASKHAELQKLAIQAYSD
Ga0075421_1003597822 | 3300006845 | Populus Rhizosphere | MLAKLSFVPALAAALLVPLVAHAQERDDSELARVLAQRGWFDLAEEICDRLEKGGSRALVPYIRAEIQLGKVERETEFPKAVEGLTTASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHAELQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDA
Ga0075431_1013675631 | 3300006847 | Populus Rhizosphere | LAACLAAGFLVPLAVHAQDEVELARVLAQRGWFDLAEEICDRLEKGPHRSMVSYVRAEIQLGKVERETEFPKAVEGLTSAATFLKKFIDENSSHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHSDLQKMAIQAYADAGKYYEDTIAKLKKEKQSDRVLDALMDARLELPRVMIDHAKVTGVDDLAKKRLLTQANTLLVDFEFDYGDRPIAFEAMLEG
Ga0075420_1011578141 | 3300006853 | Populus Rhizosphere | MVAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSSKAMVNYIRAEIKLGQVDREVEFEKSTKGLADAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLAIDAIEVESDASKHAELQKLAIQAYSDAAKFYQDTIEKLKKDKSEKAQDSLMDCRLELPRIMIDHARISSVDDTSKKKLLTAANALLVDFEFDY
Ga0075425_1009331792 | 3300006854 | Populus Rhizosphere | MLAKLSIVPAVAAALLFPLVAQAQDRDDSELARVLAQRGWFDLAEEICDRLDKGASRALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDATKHSELQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDARLELPRVLIDHAKVTGVDDIAKKKLLTQAN
Ga0075434_1006776242 | 3300006871 | Populus Rhizosphere | MMAKLSFVPALAAAFLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDKLDKGSARALVPYIRADIQLGKVERETEFPKAVEGLTAATGFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHSDLQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDARLELPRVMIDHAKVTGVDEVAK
Ga0075424_1027804231 | 3300006904 | Populus Rhizosphere | MLAKLSIVPAVAAALLFPLVAQAQDRDDSELARVLAQRGWFDLAEEICDRLDKGASRALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHPMALEAQTSIGWVQARKG
Ga0075436_1003165111 | 3300006914 | Populus Rhizosphere | MLAKLSIVPAVAAALLFPLVAQAQDRDDSELARVLAQRGWFDLAEEICDRLDKGASRALVPYIRAEIQLGKVERETEFPKAVEGLTAATGFLKKFLDENPSHPMALEAQTSIG
Ga0075435_1014138751 | 3300007076 | Populus Rhizosphere | KLSFVPTLVAALLVPLLAHAQDRDDSELARALAQRGWFDLAEEICDRLDKGGARALVPYIRAEIQLGKVERETEFPKAVEGLTTASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDATKHSELQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDARLELPRVLIDHAKVTGVDDIAKKKLLTQ
Ga0075435_1014875681 | 3300007076 | Populus Rhizosphere | MAAKLSIVSAFLAGLLIPLLAHAQDRDDSELARVLAQRGWFDLAEEICDKLDKGSARALVPYIRADIQLGKVERETEFPKAVEGLTAATGFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHSDLQKMAIQSYADAGKYYEDTIAKLKKEKQTDRVLDALMDARLELPRVMIDHA
Ga0075418_115402512 | 3300009100 | Populus Rhizosphere | MVAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSSKAMVNYIRAEIKLGQVDREVEFEKSTKGLADAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLAIDAIEVESDASKHAELQKLAI
Ga0075418_115601701 | 3300009100 | Populus Rhizosphere | LARVLVQRGWFDLGEEICDRLEKGGSRSLVPYVRAEIQLGKVERETEFPKAVEGLTTASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHAELQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDARLELPRVLIDHAKVTGVDDVAKKRLLTQANALLVDFEFDYGDRPIAFEAMLEGGKCLTELGDFKQAESKLRATFALRKRLAEAKIKPNEYH
Ga0066709_1042161451 | 3300009137 | Grasslands Soil | MTAKLSIVPAFLAGLLIPLLAQAQDRDDSDLARALAQRGWFDLAEEICDRLEKGSSRAMVNYIRAEVELGKVDRESEFAKASDGLAKAAGFLKKFLDENATHPLALEAQTSIGWVQARKGRLAIDAIDMESDASKHSDLQKQAVQAYS
Ga0111538_115756502 | 3300009156 | Populus Rhizosphere | MLAKLSFVPALVAALLVPLLAHAQERDDSELARVLAQRGWFDLAEEICDRLDKGSARALVPYIRAEIQLGKVERETEFPKAVEGLTTASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAI
Ga0075423_111820251 | 3300009162 | Populus Rhizosphere | MMAKLSFVPALAAAFLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDKLDKGASRALVPYIRAEIQLGKVERETEFPKAVEGLTAASTFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDALEIESDPSKHSDLQKMAIQSY
Ga0105858_11003771 | 3300009661 | Permafrost Soil | MMAKLSSVTALVAGLLISLTAHAQDDSDLARALAQRGWFDLADEICDRLDKGSARNMVPFIRAEIKLGQVDRETEFTKASQGLADAAALYKKFLDESPTHPNALEAQTNIGWVLARKGRLAVDAIDLESDATKHADLQKQAIQAYSDAEKYYGETIEKLKKEKSDRAQDALMDARLEMPRILIDHAKLSGVDDASKKRMLTQAKTLLV
Ga0105252_100441062 | 3300009678 | Soil | MVAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSSKAMVNYIRAEIKLGQVDREVEFEKSTKGLADAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLAIDAIELESDASKHAELQKLAIQAYSDAAKFYLDTIEKLKKDKSERAQDSLMDCRLELPRIMIDHAKISSVDDATKKKLLTQANALLI
Ga0134066_102381401 | 3300010364 | Grasslands Soil | MAAKLSFVPALVAGLLFPLLAQAQDRDDSELARALAQRGWFDLAEEICDRLEKGSSRALVNYIRAEIQLGKVDRESEFAKASEGLASAAGFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAVDAIDMESDATKHAELQKQAIQSYADAGKYYQDTIEKLKKEKSER
Ga0126383_114530291 | 3300010398 | Tropical Forest Soil | IRSGAEGVAVSVRCACSVRDRADRLLRAARIPRRYEHCALDNFDPVNPELARALAQRGWFDLAEEICDRMDKSASRALVPYIRAEIQLGKVERETEFPKAVEGLTSASSFLKKFLDENPTHAMALEAQTSIGWVQARKGRLAIDALEIESDASKHADLKKMAIQAYADAGKYYEDTIAKLKKEKQTDRVLDALMDARLELPRVMIDHAKITGVDDLQKKKLLSAANALLIDFEFDYGDRPIAFEAMLEGGKCLTELG
Ga0134122_105006032 | 3300010400 | Terrestrial Soil | MYWGFDSGFRAGFSFSRSNRMLAKLSIVPAVAAALLFPLVAQAQDRDDSELARVLAQRGWFDLAEEICDRLDKGASRALVPYIPAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHSDLQKMA
Ga0134122_111701381 | 3300010400 | Terrestrial Soil | MMAKLSTVPALVAGLLFSLTAYAQDDSDLARALAQRGWFDLAEEICDRLDKGSARNMVPFIRAEIKLGQVDRETDFAKSSQGLADAAALYKKFLDESPTHPNALEAQTNIGWVLARKGRLAVDAIDLESDATKHADLQKQAIQAYSDAEKYYGDTIEKLKKEKSDRAQDALMDARLEMPRILIDHARLSGVDDASKKKMLTQAKTLLVDFEFDYGDRPIAFEA
Ga0134122_112142172 | 3300010400 | Terrestrial Soil | MVAKLSFVPALVAGFLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSSRAMVSYIRAEIKLGQVDRESDFQKSTQGLAEAVELLKKFLADSPNHPMALEAQTSIGWVQARKGRLATDAIEVESDASKHADLQKMAIQA
Ga0134122_114474821 | 3300010400 | Terrestrial Soil | MAAKLSFVPALVAGLLFPLLAQAQDRDDTELARVLAQRGWFDLAEEICDRLDKGSSRAMVNYIRAEIKLGQVDRETEFAKASEGLASAAGFLKKFLDENPNHAMALEAQTTIGWVQARKGRLAIDAIEMES
Ga0134121_104014392 | 3300010401 | Terrestrial Soil | MVAKLSFVPALVAGFLFPLFAQAQDRDDTELARALAQRGWFDLAEEICDRLEKGSNKSMVNYIRAEIQLGKVDRETEFPKASQGLADAAGFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDAVELESDASKHAELQKMAIQAYSDAGKYYSDTIDKLRKEKQSDRVLDALMDARLELPRVLIDHAKISGVDDSTKKKLLTQ
Ga0134121_112108771 | 3300010401 | Terrestrial Soil | MMAKLSTVPALVAGLLFSLTAYAQDDSDLARALAQRGWFDLAEEICDRLDKGSARNMVPFIRAEIKLGQVDRETDFAKSSQGLADAAALYKKFLDESPTHPNALEAQTNIGWVLARKGRLAVDAIDLESDATKHADLQKQAIQAYSDAEKYYGDTIEKLKKEKSDRAQDALMDARLEMPRILIDHSKLSGVDDASKKRMLTQAKTLLVDFEFDYGDRPIAFEAMLEGGKCLTE
Ga0134123_1093949023300010403Terrestrial SoilMLAKLSFVPALVAALLVPLLAHAQDRDDSELARVLAQRGWFDLAEEICDRLDKGSARALVPYIRAEIQLGKVERETEFPKAVEGLTTASTFLKKFLDENPTHPMALEAQTSIGWVQARKG
Ga0150983_1026024723300011120Forest SoilMAAKLSTVTAFGAALLVSLTAHAQEDSDLARALAQRGWFDLAEDICDRMDKGAGRSMVPFIRAEIRLGQVDRETEYAKSSAGLAEAAALYKKFVDENPAHPMALEAQTNIGWIQSRRGSLAIEAIEVESDATKHADLQKQAMQSYSDAEKYYQDTIEKLKKEKGDRAQDALMDARLALPRVLIDHAKISGVDDATKKRMLNQAKTLLVDF
Ga0137432_102959513300011439SoilMVAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLDKGAARALVPYIRAEIQLGKVERETEFPKAVECLTGAATFLKKFIDENPGHPMALEAQTS
Ga0137433_118647913300011440SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSSKGIASYIRAEIKLGQVDREVEFEKASKGLAEAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLA
Ga0137427_1030433313300011445SoilMVAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSSKAMVNYIRAEIKLGQVDREVEFEKSTKGLADAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLAIDAIELESDASKHAELQKLAIQAYSDAAKFYLDTIEKLKKDKSEKAQDSLMDCRLELPRIMIDHAKISSVDDASRKKMLTAANALLIDFEFDY
Ga0137427_1031506313300011445SoilNRMLAKLSFLPALAAALLVPVLAHAQDRDDSELARVLAQRGWFDLAEEICDRLDKGGGRALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKLLDENPSHPMALEAQTSIGWVQARKGRLAIDALEMESDAGKHSDLQKMAIQAYADAGKYYEDTIVKLKKEKQTDRVLDALMDARLELPRVMIDHAKVSGVDDIAKKRLLTAANTLLIDFEFDY
Ga0137421_106415923300012039SoilMVAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSSKAMVNYIRAEIKLGQVDREVEFEKSTKGLADAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLAIDAIELESDASKHAELQKLAIQAYSDAGKFYLDTIEKLKKDKSERAQDSLMDCRLELPRIMIDHAKISSVDDSSRKKMLTQAN
Ga0137430_115080713300012041SoilWFDLAEEICDRLEKGSSKAMVNYIRAEIKLGQVDREVEFEKTTKGLADAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLAIDAIELESDASKHAELQKLAIQAYSDAAKFYLDTIEKLKKDKSEKAQDSLMDCRLELPRIMIDHAKISSVDDSSRKKMLTQANALLIDFEFDYGDRPIAFEAMLEGGKCLTELGDFKQAESKLRATFALRKRLAEAKIK
Ga0150985_11148233613300012212Avena Fatua RhizosphereMKAKLSSVPALVAGLLFSLTAYAQDDSDLARALAQRGWFDLAEEICDRLDKGSARNMVPFIRAEIKLGQVDRETDFAKSSQGLADAAGLYKKFLDESPTHPNALEAQTNIGWVLARKGRLAVDAIDLESDATKHADLQKQAIQAYTDAEKYYGDTIEK
Ga0157291_1037174613300012902SoilLAQRGWFDLAEEICDKLDKGGARALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHSDLQKMAIQAYADAGKYYEDTIAKLKKEKQTDRVLDALMDARLELPRVMIDHARVTGVDDVAKKRLLTQAN
Ga0137410_1000003313300012944Vadose Zone SoilMTAKLSSVAAFAAALLCSLAAHAQDDSDLARALAQRGWFDLAEEICDKLDKGAARNMVPFIRAEIKLGQVDRETDYAKSSQGLADAVGLYKKFLDENPTHPMALEAQTNIGWVQARKGRLAVDAIEVESDATKHADLQKQAAGAYGDAEKYYLETIEKLKKEKSDKAQDALMDARLELPRVLIDHARLSSVDDATKKRLLGQAKTLLVDFEFDYGDRPIAFEAMLEGGKCLTELDEYKQAESKLRATFALRKRLAEAKIKPNDYHNKIIFGAYIA
Ga0126369_1193810913300012971Tropical Forest SoilMLAKLSFVPALAAALLVPLLAHAQERDDSELARVLAQRGWFDLAEEICDKLDKGAARALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAIDALEIESDATKHADLQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDARLELPRVMIDHAKVTGVDDIAKKRLLTGANTLLVDFEFDYGDR
Ga0132256_10279709013300015372Arabidopsis RhizosphereMLAKLSFVPALAAALLVPLVAHAQDRDDSELARVLAQRGWFDLAEEICDKLDKGSARALVPYIRADIQLGKVERETEFPKAVEGLTAATGFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHSDLQKMAIQAYADAGKYYEDTIAKLRKEKQTDR
Ga0182036_1117514713300016270SoilMAAKLSSVTAFMAAMLLSLTAHAQDDSDLARALAQRGWFDLAEEICDRMDKGSGRSMVPFIRAEILLGQVDRETEYVKASAGLKEAADLYKKFVDENPTHAMALEAQTNIGWIQSRRGSLAVEAIEMESDASKHADLQKQAIQSYTD
Ga0182036_1122329113300016270SoilHAQERDDSELARVLAQRGWFDLAEEICDRLDKGGSRALVPYIRAEIQLGKVERETEFPKAVEGLNGASGFLKKFLDENPNHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHADLQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDARLELPRVMIDHAKVTGVDDVAKKRLLSSANALLVDFEFDYGDRPIAFEAML
Ga0187894_1045488513300019360Microbial Mat On RocksMRGTVGFAAALLLAGASQGQDEAQLARALAQRGWFDLAEDLCDRLEKGPSRAVAGFIRAEIKLGQVDRETEFGKASQGLEEAVALLKKFVADSPTHPMALEARTTIGWVQGRKGQMAIDAIELEADPARHADLQKQ
Ga0173479_1006483523300019362SoilMMAKLSTVPALVAGLLFSLTAYAQDDSDLARALAQRGWFDLAEEICDRLDKGSARNMVPFIRAEIKLGQVDRESDFAKSSQGLADAAALYKKFLDESPTHPNALEAQTNIGWVLARKGRLAVDAIDLESDATKHADLQKQAIQAYTDAEKYYGDTIEKLKKEKSDRAQDALMDARLEMPRILID
Ga0187893_1081164013300019487Microbial Mat On RocksELARVLAQRGWFDLAEEICDRLEKGASRSMVSYIRADIQLGKVERETEFQKAVEGLTSATTFLKKFLDENPNHALSLEAQTSIGWVQARKGRLAVDAIEIESDASKHAELQKIAIGSYSDAAKYYEETIEKLKKDKSERAQDSLMDARLQLPVVLIDHARISGVDDVAKKKLLTKANALLLDFEFDYG
Ga0180109_139031213300020067Groundwater SedimentMVAKLSFVPALVAGLLFPLFAQAQDDTELARALAQRGWFDLAEEICDRLEKTAARSMVNYIRAEIKLGQVDRETDFQKSTQGLADAVVLLKKFLDESPAHPMALEAQTSIGWVQARKGRL
Ga0196974_108356113300021062SoilMSAKLSFVPALVAGLLFPLAAQAQDRDDSELARALAQRGWFDLAEEICDRLDKGSSKGLVSYIRAEIKLGQVDREVEFDKATKGLADAVDLLKKFLSESPNHPMALEAQTSIGWVQARKGRLAIDTLEMESD
Ga0207670_1068739513300025936Switchgrass RhizosphereMLAKLSIVPAVAAALLFPLVAQAQDRDDSELARVLAQRGWFDLAEEICDRLDKGASRALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHPMALEAQTS
Ga0209077_103164213300027675Freshwater SedimentMAAKLSFVPALVAGLLFPLLAQAQDRDDSELARVLAQRGWFDLAEEICDRLEKGSSKGIANYIRAEIKLGQVDREVEFEKASKGLAEAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLAVEAIDMESDASKHAELQKLAIQAYADAGKFYADTIEKLKKDKSERAQD
Ga0207428_1078563123300027907Populus RhizosphereMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYEKASKGLADAVELLKKFLAESPNHAMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSD
Ga0209062_107795923300027965Surface SoilMAAKLSSVTAFTAAMLLSLTARAQDDSDLARALAQRGWFDLAEEICDRMDKGSGRSMVPFIRAEIKLGQVDRETEYAKSSAGLAEAAALYKKFVDENPTHVMALEAQTNIGWIQSRRGSLAMEAIEVESDASKHADLQKQAIQAYTDAEKYYHDTIEKLRKEKSDRAQDALMDARLALPRVLIDHSKISGVDDATRKRMLNEAKTLLVDFEF
Ga0307503_1064320413300028802SoilMAAKLSTASALVAGLLFSLTLHAQDDSDLARALAQRGWFDLAEEICDKLDKGASRSMVPFIRAEIKLGQVDRETEYAKSSQGLADAVTLYKKFLDENPTHAMALEAQTNIGWIQSRKGSLAMEAIEVESDAT
Ga0308181_112860413300031099SoilMAAKLSFVPALVAGLLFPLFAQAQERDDTELARALAQRGWFDLAEEICDRLDKGSSRAMVNYIRAEIKLGQVDRETEFAKASEGLAGAAGFLKKFLDENPTHAMALEAQTTIGWVQARKGRLAIDAIDMESDASKHADLQKAAIQAYADAGKYYMDTIEKLRKEKSERAQDA
(restricted) Ga0255310_1017205613300031197Sandy SoilMSAKLSFVPALVTALLFPLLAQAQDRDDTELARALAQRGWFDLAEEICDRLEKGASRAMVNYIRAEVELGKVDRETEFPKASEGLAKAAGFLKKFLDENGTHPLALEAQTSIGWVQARKGRLAIDAIDLESDASKHADLQKQAIQAYSDAAKYYQDTIEKLKKEKSDR
Ga0318542_1060647413300031668SoilMAAKLSSVTAFTAALLVSLTAHAQDDSDLARALAQRGWFDLAEEICDRMDKGSGRSMVPFVRAEIKLGQVDRETEYVKASAGLKEAADLYKKFVDENPTHAMALEAQTNIGWIQSRRGSLAVEAIEVESDASKHADLQKQAIQSYTDAEKYYHDTIEKLKKEKGD
Ga0310813_1133316513300031716SoilLAQAQERDDSDLARALAQRGWFDLAEEICDRLEKGSSRAMVNYIRAEVELGKVDRESEFAKASEGLAKAAGFLKKFLDENATHPLALEALTSIGWVQARKGRLAIDAIDMESDASKHADLQKQAIQAYSDAAKYYQDTVEKLKKEKSERAMDALMDARLQLPRVLIDHAKISSVDDASKKKLLTQANALLVDFEFDYGDRPIAFEAMLEGGKCLTELGEY
Ga0310813_1198550013300031716SoilDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYDKASKGLAEAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYNDTIEKLKKDKSERAQDSLMDCRLALPRIMIDHARISSVDDSSKKKLLTAAN
Ga0307469_1022487513300031720Hardwood Forest SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDTELARALAQRGWFDLAEEICDRLEKGSSKGMVNYLRAEIQLGKVDRETEFPKASQGLADAAAFLKKFLDENPTHPMALEAQTSIGWVQARKGRLA
Ga0307469_1253908513300031720Hardwood Forest SoilMAAKLSTVPAFVAGLLLSLSLHAQDESDLARALAQRGWFDLAEEICDKLDKGSSRAMVNYIRAEIKLGQVDRETEFAKASEGLAGAAGFLKKFLDENPNHPMALEAQTTIGWVQARKGRLAIDALEIESDASKHSDLQKMAIQAYADAGKYYED
Ga0307468_10222361613300031740Hardwood Forest SoilRVLAQRGWFDLAEEICDRLDKGSSRAMVNYIRAEIKLGQVDRETEFAKASEGLAGAAGFLKKFLDENPNHAMALEAQTTIGWVQARKGRLAVDAIEMESDASKHADLQKTAIQAYSDAGKFYQDTIEKLKKDKSERAQDSLMDCRLELPRIMIDHARVTGVDDGAKKRLLQQANTLL
Ga0310900_1167700113300031908SoilDLARVLAQRGWFDLAEEICDRLEKGSGKAIANYIRAEIKLGQVDREVEYEKASKGLADAVELLKKFLADSPNHAMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYTDTIEKLKKDKSERAQDSLMDCRLALPRIMIDHARISSVDDASKKKLLTAANAQ
Ga0310884_1046889623300031944SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYDKASKGLAEAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAA
Ga0310913_1021177923300031945SoilMAAKLSSVTAFTAALLVSLTAHAQDDSDLARALAQRGWFDLAEEICDRMDKGSGRSMVPFVRAEIKLGQVDRETEYVKASAGLKEAADLYKKFVDENPTHAMALEAQTNIGWIQSRRGSLAVEAIEVESDASKHADLQKQAIQSYTDAEKYYHDTIEKLKKEKGDRAQDALMDARLALPRVLIDHSKISGVDDASRKRMLGEAKTLLVDFEFDYGDRPIAFEAMLEGGKCLTELAEYKQAESKLRATFALRKRLAEAKIKPNDYHN
Ga0307479_1080616113300031962Hardwood Forest SoilMAAKLSTVTAFVAALLVSLTAHAQDDSDLARSLAQRGWFDLAEEICDRMDKGAGRSMVPFIRAEIRLGQVDRETEYAKSSAGLADAAQLYKKFVDENPTHPMALEAQTNIGWIQARRGSLAIEAIDLESDATKHADLQKQAIQSYTDAEKYYQDTIEKLKKEKSERAQDALMDARLALPRVLIDHAKISGVDDATKKKMLTQAKTLLVDFEFDYGDRPIAFEAMLEGGKCLTELGEWKQAESKLRATFALRKRLAEAKLKPNDY
Ga0306922_1132999613300032001SoilMLAKLSFVPALAAALLVPVLAHAQERDDSELARVLAQRGWFDLAEEICDRLDKGGSRALVPYIRAEIQLGKVERETEFPKAVEGLNGASGFLKKFLDENPNHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHADLQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDARLELPRVMIDHAKV
Ga0310906_1109784813300032013SoilDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYEKASKGLADAVELLKKFLADSPNHAMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYTDTIEKLKKDKSERAQDSLMDCRLALPRIMIDHARISSVDDASKKKLLTAANAQLVDFEFDFGD
Ga0310890_1048354223300032075SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYEKASKGLADAVELLKKFLADSPNHAMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYTDTIEKLKKDKSERAQDSLMDCRLALPRIMIDHARISSVDDSSKKKLL
Ga0315912_1099151523300032157SoilMAAKLSTVSALVAGLLFSLTVHAQDDSDLARALAQRGWFDLADEICDRMDKGAGRNMVPFIRAEIKLGQVDRETEYAKASAGLAEAVALYKKFLDENPTHAMALEAQTNIGWIQSRKGSLAIEAIDLESDASKHAELQKQAIQSYTDAEKYYQDTIEKLKKEKSER
Ga0307470_1139046513300032174Hardwood Forest SoilMVAKLSFVPALVAGLLFPLLAQAQDRDDTELARVLAQRGWFDLAEEICDKLDKGGARALVPYIRAEIQLGKVERETEFPKAVEGLTAASTFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDAIDLESDASKHADLQKAAIQAYSDAGKYYMDTIEKLRKEKSERAQDA
Ga0307471_10105084923300032180Hardwood Forest SoilMAAKLSFVPALAAAFLFPLIAQAQDRDDTELARALAQRGWFDLAEEICDRLDKSASRAMVNYIRAEIQLGKVDRETEFAKSSAGLDTAVTLFKKFLDENPTHAMALEAQTNIGWVQARKGRLAIEALEMETDSTKHAELQKQAIQAYADAGKYYKDTIEKLKKEKQTDRVLDALMDARLELPRVLIDHARISGVDDVAKKRLLTEANAFLIDFEFDYGD
Ga0306920_10158183323300032261SoilMLAKLSFVPALAAALLVPVLAHAQERDDSELARVLAQRGWFDLAEEICDRLDKGGSRALVPYIRAEIQLGKVERETEFPKAVEGLNGASGFLKKFLDENPNHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHADLQKMAIQAYA
Ga0306920_10218785113300032261SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSELARALAQRGWFDLAEEICDRLDKGSSRALVNYIRAEIQLGKVDRESEFAKASEGLANAAGFLKKFLDENPTHPLALEAQTSIGWVQARKGRLAIDAIDLESDASKHADLQKQAIQAYSDAGKYYQDTIEKLKKEKSERAMDALMDARLELPRVLIDHAKISSVDDASKKKLLTQANALLVDFEFDYGDRPIAFE
Ga0310812_1013443813300032421SoilMVAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEFEKASKGLADAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGRLAIDAIEVESD
Ga0310812_1024421013300032421SoilMLAKLSFVPAVAAALLFPLVAHAQDRDDSELARVLAQRGWFDLAEEICDRLDKGASRALVPYIRAEIQLGKVERETEFPKAVEGLTSASTFLKKFLDENPTHAMALEAQTSIGWVQARKGRLAIDALEIESDASKHSELQKMAIQAYADAGKYYEDTIAKLRKEKQ
Ga0310810_1020246913300033412SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSELARALAQRGWFDLAEEICDRLDKGSSRALVNYIRAEIQLGKVDRESEFAKASEGLASAAGFLKKFLDENPTHPMALEAQTSIGWVQARKGRLAVDAIDMESDASKHADLQKQAIQAYSDAGKYYQDTIEKLKKEKSERAMDALMDARLELPRVLIDHAKISSVDDASKKKLLTQANALLVDFEFDYGDRPIAFEAMLEGGK
Ga0310810_1027475913300033412SoilMLAKLSFVPALAAALLVPLVVHAQDRDDSELARVLAQRGWFDLAEEICDKLDKGSARALVPYIRADIQLGKVERETEFPKAVEGLTAATGFLKKFLDENPSHPMALEAQTSIGWVQARKGRLAIDALEIESDASKHSDLQKMAIQAYADAGKYYEDTIAKLRKEKQTDRVLDALMDAR
Ga0310810_1074881413300033412SoilMMAKLSTVPALVAGLLFSLTAYAQDDSDLARALAQRGWFDLAEEICDRLDKGSARNMVPFIRAEIKLGQVDRETDFAKSSQGLADAAALYKKFLDESPTHPNALEAQTNIGWVLARKGRLAVDAIDLESDATKHADLQKQAIQAYSDAEKYYGDTIEKLKKEKSDRAQDALMDARLEMPRILIDHAKLSGVDDASKKKMLTQAKTLLVDFEFDYGDRPIAFEAMLEGGKCLTELGEYKQAESKLRATFALRKRLSEAKIKPNDY
Ga0314781_013530_644_11863300034660SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYEKASKGLADAVELLKKFLADSPNHAMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYTDTIEKLKKDKSERAQDSLMDCRLAL
Ga0314784_012900_654_12113300034663SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYDKASKGLAEAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYNDTIEKLKKDKSERAQDSLMDCRLALPRIMID
Ga0314786_044790_315_8123300034664SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYEKASKGLADAVELLKKFLADSPNHAMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYTDTIEKLKKDKS
Ga0314792_180780_114_5813300034667SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYEKASKGLADAVELLKKVLADSPNHAMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYTD
Ga0314793_009744_2_7243300034668SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYEKASKGLAEAVDLLKKFLAESPNHAMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYTDTIDKLKKDKSERAQDSLMDCRLALPRIMIDHARISSVDDSSKKKLLTAANALLVDFEFDFGDRPIAFEAMLEGGKCLTELGDFK
Ga0314795_055226_222_7133300034670SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYDKASKGLAEAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYNDTIEKLKKD
Ga0314798_006136_1_4683300034673SoilMAAKLSFVPALVAGLLFPLLAQAQDRDDSDLARVLAQRGWFDLAEEICDRLEKGSGKAIASYIRAEIKLGQVDREVEYEKASKGLAEAVDLLKKFLADSPNHPMALEAQTSIGWVQGRKGKLAVDAIEMETDASKHAELQKVAITAYSDAAKFYND




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice