NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105954

Metagenome / Metatranscriptome Family F105954

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105954
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 197 residues
Representative Sequence IVFALICSRFFEMSNRLSFFFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAIVQALSVLLIGFRMSFLPVVQACTILLPLIAFARCAFPALRKRSEARVPERGVLTTGLTHVIASIAMMFVTHGAYKYANGWLSNREPAYLYDAGAHLAAVWAPALEPSDASD
Number of Associated Samples 90
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 7.00 %
% of genes from short scaffolds (< 2000 bps) 7.00 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score0.30

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (93.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(18.000 % of family members)
Environment Ontology (ENVO) Unclassified
(33.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(45.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 75.48%    β-sheet: 0.00%    Coil/Unstructured: 24.52%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.30
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF12833HTH_18 2.00



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A93.00 %
All OrganismsrootAll Organisms7.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005093|Ga0062594_101877296All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae634Open in IMG/M
3300006852|Ga0075433_10881788All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae781Open in IMG/M
3300015374|Ga0132255_102730134All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium755Open in IMG/M
3300019865|Ga0193748_1012769All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium773Open in IMG/M
3300019866|Ga0193756_1026222All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium819Open in IMG/M
3300028717|Ga0307298_10121734All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium749Open in IMG/M
3300031231|Ga0170824_109544524All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1167Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil18.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil11.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil7.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil5.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil5.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere5.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere5.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere4.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere4.00%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere4.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil3.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.00%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil3.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil2.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil2.00%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil2.00%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere1.00%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.00%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.00%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere1.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000033Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000787Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005331Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaGHost-AssociatedOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005367Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaGHost-AssociatedOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005455Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaGHost-AssociatedOpen in IMG/M
3300005466Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3L metaGEnvironmentalOpen in IMG/M
3300005548Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaGHost-AssociatedOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009551Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaGHost-AssociatedOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300014969Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaGHost-AssociatedOpen in IMG/M
3300015262Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-113_1 MetaGHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300016341Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170EnvironmentalOpen in IMG/M
3300016371Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172EnvironmentalOpen in IMG/M
3300017792Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaGHost-AssociatedOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300019865Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1s1EnvironmentalOpen in IMG/M
3300019866Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1m1EnvironmentalOpen in IMG/M
3300019870Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m1EnvironmentalOpen in IMG/M
3300019875Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300020015Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m1EnvironmentalOpen in IMG/M
3300025898Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025909Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C1-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025925Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025945Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026685Grasslands soil microbial communities from Chapel Hill, North Carolina, USA that are Nitrogen fertilized -NN349 (SPAdes)EnvironmentalOpen in IMG/M
3300027907Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028717Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158EnvironmentalOpen in IMG/M
3300028719Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_182EnvironmentalOpen in IMG/M
3300028768Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_119EnvironmentalOpen in IMG/M
3300028791Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144EnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028876Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_140EnvironmentalOpen in IMG/M
3300030847Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA10 SO (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030916Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA12 EcM (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300031942Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032421Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_011459302088090014SoilFCSAVNRSGARQRCKAIVFALICSRFFEMSNSLSFLFGLLCALDPCQLLWERYVMTETFSLVVYVLVVYWSLTYLRDRRLWQLAVVQALSVLLIDFRMSCLLVVQVCTILLPLIAFARCALATMRSRSEVRASKTGVLTIRLTHLIASVAMMFVMHAPTSTRMAG
ICChiseqgaiiDRAFT_044371313300000033SoilWPHSFTPLVLIQTLASGGTAIVFALICRRFFELSNVLSFLFGFICALDPLQLVWERYVMTETLSLFFYVLVLYWSLAYLRERRLWQLGVGQALSVLLIGFRMSYLLVVQACTILLPVIAFTRCAFPLLRRRSDVRIPQAXVLXIGLTXLXASXAMMFVMHGAYKYA
JGI11643J11755_1174069813300000787SoilLCALDPLQLVWERYVMTETLSLLVYALVVYWSMAYLRDRRLWQLAVVQALSVLLIGFRMSYLLIVQACTILLPLIAFARCGFPLLHQRTSKTGVLTGLTHVIASVTMMLVMHGAYKYANGRLSSRKPVYMNETGAHLAAVWAPVLKPSDASDPRFADVIAKGDQFKIKDLQSR
JGI1027J12803_10072202313300000955SoilPLLVAQTLASAGTAIVFALVCSRLLELGNKISFLFGLICALDPLQLVWERYLMTETCSLLVYVLVLYWSLAYLRDRRLRQLAVVQALSVLLIGFRMSYLPVVQACTILLPLIAFARCGLPVVRKRSEPHTFQARLLTIGLLHLIASIAMMFVMHGAYKYAYGWLTKREPAYLHNTGDHLAAVWAPALQPSDASDPRFADIIANGDQFKIKDLQSRNAQQYAQGFLIKRWREIEK
F14TB_10010570113300001431SoilDRSYFYGYLVRWLAVWPHSFTPLLVVQALASAATAIVFALICSRFFEMSNRLSFLFGLMCALDPCQLLWERYVMTETFSLLVYVLVLYWSLAYLRNRRLWQLAVVQALSVLLIGFRMSYLLVVQACAILLPLIAFARCGLPALRNRSGARAPEAGVLTTGLTHVVASIAMMFIMHGAYKYANGWLSNRQPAYLYAAGSHLVAVWAPALQPSDATDSRFRDLIANGHQFKIEDLTLRNAQHFGEGFLIDR
F14TB_10032531023300001431SoilTAIVFALICSRFFEMSNTVSFLFGLLWALDPCQLVWERYVMTETFSLLVYALVLYWSLAYLRDRRLWQLAVVQALSVLLIGIRMSYLLLVQACTILLPLIAFARCALPVFRKRSEAPAPESRLLATGLTHVIAVLR*
F14TB_10091375613300001431SoilEMSNSLSFFFGLLCALDPCQLVWERYVMTETFSLLAYVVVLYWSLAYLRDRRLWQLAIVQALSVPLIGFRMSYLLVIQTCTILLPVIAFARCALLSFADRSEWRASAAGILITGFTHLLASIAMMLIMHGAYKQVNGWLSNREPAYLYNVGAHLASVWAPALQPSDATDP
F14TB_10462533213300001431SoilLSFLFGFLCAVDPCQLVWERYVMTETFSLLVYVAVLYLSFLYARDRRIWQLAVVQALSVVLIAFRISYLLVVQVCTVLLPIIAFARCVSPAFRNRSEARSSVTNLLRGFEHVVISVALMFVMHGAYKYANGWLSSREPAYLYDAGAHLAAVWAPALQPSDARDPRFRDLIANGDQFEIDNLRQRNAQQFGEGFLIDR
Ga0062595_10121419913300004479SoilPLLVAQTLASAGTAIVFALVCSRLLELGNKISFLLGLICALDPLQLVWERYLMTETCSLLVYVLVLYWSLAYLRDRRLWQLAVVQAFSVLLIGFRMSYLPVVQACTILLPLIAFARCGLPVVRKRSEPHTFQARLLTTRLLHLIASIAMMFVMHGAYKYAYGWLTKREPAYLHNTGDHLAAVWAPALQPSDASDPRFADIIANGDQFKIKDLQSRNAQQYAQG
Ga0062594_10187729613300005093SoilFEMSNTLSFLFGLLCALDPCQLVWERYVMTETFSLFVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLLVQVCTILLPLIAFARCALPVFRKRSEARAPEAGVLRTGLIHVVASIAMMLMMHGAYKYVNGWLSNREPAYLYGTGSHLAAVSAPALEPSDASDPRFGELIANGDQFKIKNLHFRNAQQYSGGFLIKRWRAIEKD
Ga0065705_1083161713300005294Switchgrass RhizosphereFLFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAIVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCDLPVLRNRLGARAPEAAVLTTGLTHVVVSVAMMFIMHGAYKYANGWLSRREPAYLYDSGAHLVAVWAPALEPSDATDPRFGEIIANGHQFKIKDLHSRNAQQFGNGLLVKRWRE
Ga0070670_10124799513300005331Switchgrass RhizosphereGYLLRWVALWPHSFTPLLLIQTLASGGTAIVFALICRRFFELSNVLSFLFGLLCALDPLQLVWERYVMTETLSLFFYVLVLYWSLAYLRDRRLWQLGVVQALSVLLISFRMSYLLVVQACTILLPVIAFARCAFPLLRQRSDARIPQASVLPIGLTHLVASVAMMFVMHGAYKYAYGWLSKSEPTYLYETGAHLVAVWAPALEPSDSSDPRFAELIANGDQFKIKD
Ga0066388_10483096213300005332Tropical Forest SoilLPHSFTPLLVIQALASGTTAIVFAVICSRFFELSNALSFLFGFICALDPLQLVWERYVMTETFSLLAYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCVFPVLPKRSESRILQPNVLTIGLTHVIASVAMMFLMHGAYKYANGWLSKREPGYLYATGDHLAAVWAPALEPSDASDPRFGELIANGDQFKIKDLHFRNAQQYAE
Ga0070667_10180411013300005367Switchgrass RhizosphereFLFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYCSLAYLRDRRLWQLAVVQALSVLLIGFRMKFLPVVQACTILLPLIAFARCGLPGLCNRSGAHAPAGGVLTTGLMHVAASIAMMFVMHGGYKYANGWLSKREPAYLHDSGTHLAAVWAPVLEPSDATDSRFRDLIANGHQFKIDDLTLRNAQHFGEGF
Ga0070709_1152752613300005434Corn, Switchgrass And Miscanthus RhizosphereIVFALICSRFFEMSNRLSFFFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAIVQALSVLLIGFRMSFLPVVQACTILLPLIAFARCAFPALRKRSEARVPERGVLTTGLTHVIASIAMMFVTHGAYKYANGWLSNREPAYLYDAGAHLAAVWAPALEPSDASD
Ga0070714_10144501813300005435Agricultural SoilGYLVRWLALFPHSFTPLLVIQALASGATAIVFALICSRFFEMSNGLSFLFGLLCVLDPLQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCALPVLGKRSEPRTLEARVLGIGLTHVIASTAMMFVMHGAYKYANGWLSIREPAYLYETGAHLAAVWAPAFEPSDASDPRFGELIANGNQFK
Ga0070663_10177787313300005455Corn RhizosphereIPSDRSYFYGYLVRWLAVWPHGFGPLLLSQMLVSAVTAIVFTLICSRFFRMSNRLSFLFGFLCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLIAFARCGLAGLRNRSGTHAPAGGVLTTGLMHVAASIAMMFVMHGAYKY
Ga0070685_1142825513300005466Switchgrass RhizosphereMTETFSLLVYMLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLLVVEGCTILLPLIAFARCALPALRERIEARVPQGSVLALALTHVAASIAMMFLMHGAYKYANGWLSNREPAYLHDAGAHLAAVWAPALDPSDATDSRFRDLIANGDQFKIDDLTL
Ga0070665_10173451513300005548Switchgrass RhizospherePHGFGPLLLSQMLVSAVTAIVFTLICSRFFRMSNRLSFLFGFLCALDPCQLVWERYVMTETFSLLVYMLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLLVVEGCTILLPLIAFARCALPALRERIEARVPQGSVLAMALTHVAASIMMMFLMHGAYKYGNGYLTHREPAYLYDAGAHLAAVWAPALQPSDATDPRFGEIIANGP
Ga0066702_1027811723300005575SoilLVRWLAVWPHSFLPLLAAQALASGVTAIVFARICSAFFGVSNRLSFLFGFLCALDPCQLVWERYVMTETFSLLVYVLVLYRSLLYVRDRRLWELAVVQALSVVLIAFRIGYLLVVQVCTVLLPIIAFARCASPAFRNRSEARSSVTNPFSTGFEHVVVSVALMNVMHDAYKHVNASFAKREPAYLYDAGAHLAAVWAPALQPSDATDPRFRDLIANGDQFEIEHWPFSCSFMLQS*
Ga0066903_10885708113300005764Tropical Forest SoilMTETFSLFVYALVVYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVIQACTILLPLIAFARCGLPLLRKRTSKTGVLTGLTHVIASVTMMLVMHGAYKYANGRLSNRKPAYLDETGAHLAAVWAPALKPSDSSDPRFADIIANGDQFKIKDLQSRNAQQYA
Ga0070715_1083644313300006163Corn, Switchgrass And Miscanthus RhizosphereMLVSAVTAIVFTLICSRFFRMSNRLSFFFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAIVQALSVLLIGFRMSFLPVVQACTILLPLIAFARCGLPVLRKRSEARAPEGGVLTTGLIHVIASIVMMFVTHGAYKYANGWLSNREPAYLYDAGAHLAAVWAPALEPSDAS
Ga0066665_1117060013300006796SoilIVFALICSRFFEMSNSLSFLIGLLCALDPCQLVWERYVMTETFSLLVYVLVLYWSMAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCAFPVLHKQSETRTPEAAVLTIGLTHVLASVGMMFVMHGAYKYANGWLSNREPAYLYDAGAHLAAVWAPALQPSDARDPRFGDIIANGDQFE
Ga0066659_1013197813300006797SoilMTETFSLLVYVLVLYRSLLYVRDRRLWELAVVQALSVVLIAFRIGYLLVVQVCTVLLPIIAFARCASPALRNRSEARSSVTNPFSTGFEHVVVSVALMNVMHDAYKHVNASFAKREPAYLYDAGAHLAAVWAPALQPSDATDPRFRDLIAN
Ga0075428_10212256213300006844Populus RhizosphereTPLLMTQALASGATAIVFALICSRFFGMSNGLSFLFGLLCALDPCQLVWERYVMTESFSLFVYVLVLYWSLAYLRDRRLWQLVVVQALAVLLIGLRISYLLVVQACAIALPLIAFGRCALSALRNRSGSRASELSVLTTPLTHVIASIAMMFVMHGAYKQLNGLLTKREPAYLYSAGTHLVAVWAPALEPSDA
Ga0075433_1088178813300006852Populus RhizosphereAPLLIVQALASGATAIVFALICSRFFQMPKRLSFLFGLLCALDPCQLVWERYVMTEAFSLLVYVLVLYWSLAYLRDRRLWQLVVVQALSVLLIGFRMSYLLVVQACAILLPLIAFARCALPMLHNRSESRASELSVLTTGLTHVIASIAMIFVMHGAYERVNGLLTKREPAYLYSAGTHLVAVWAPALEPSDATDSRFRDLIANGDQFKIHDLSLRNAQHFGKGFLIDRWTKIEKNRRKRERIARETAMNALHRRPLQIA
Ga0075435_10181402713300007076Populus RhizosphereDRSYFYGYLIRWLAVWPHSFAPLLVAQTLASAGTAIVFALVCSRLLELGNKISFLFGLICALDPLQLVWERYLMTETCSLLVYVLVLYWSLAYLRDRRLRQLAVVQALSVLLIGFRMSYLPVVQACTILLPLIAFARCGLPVVRKRSEPHTFQARLLTIGLLHLIASIAMMFVMHGA
Ga0066710_10004309883300009012Grasslands SoilMTETFSLLVYVLVLYRSLLYVRDRRLWELAVVQALSVVLIAFRIGYLLVVQVCTVLLPIIAFARCASPAFRNRSEARSSVTNPFSTGFEHVVVSVALMNVMHDAYKHVNASFAKREPAYLYDAGAHLAAVWAPALQPSDATDPRFRDLIANGDQFEI
Ga0111538_1324314513300009156Populus RhizosphereLGNKISFLFGLICALDPLQLVWERYLMTETCSLLVYVLVLYWSLAYLRDRRLRQLAVVQALSVLLIGFRMSYLPVVQACTILLPLIAFARCGLPVVRKRSEPHTFQARLLTIGLLHLIASIAMMFVMHGAYKYAYGWLTKREPAYLHNTGDHLAAVWAPALQPSDASDPRFADIIANWDQFKIKDLQS
Ga0105238_1231802413300009551Corn RhizosphereHSFTPLLVAQTLASGATAIVFALICSRLFRMSNRLSFLFGLMCALDPCQLVWERYVMTESYSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLIAFARCGLPGLCNRSGAHAPAGGVLTTGLMHVAASIAMMFVMHGGYKYANGWLSKREPAYLHDSGTHLAAV*
Ga0134067_1038403913300010321Grasslands SoilFALICSRFFRMSNRLSCFFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLLVVQACTILLPLIAFARCGLPVLRKRSEACAPEPTVLTTGLTHVVASITMMFVMHGAYKYANGWLSKREPAYLYDAGAHLAAVWAPVLEPSDASDPRFGELIA
Ga0126379_1347022113300010366Tropical Forest SoilTPLLMIQALASGATAIVFVLICSRFFELSNALSFLFGFICALDPLQLVWERYVMTETFSLLVYVLILYWSFAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCACPVLPKRSESRILQPNVLTIGLTHVIASVAMMFLMHGAYKYANGWLSKREPGYLYA
Ga0126383_1074635613300010398Tropical Forest SoilLAAQFCSAAIVQALASGATAIVFALICSRFFQMPKRLSFLFGLLCALDPCQLVWERYVMTEAFSLLVYVLVLYWSLAYLRDRRLWQLVVVQALSVLLIGFRMSYLLVVQACAILLPLIAFARCALPMLHNRSESRASELSVLTTGLTHVIASIAMIFVMHGAYERVNGLLTKREPAYLYSAGTHLVAVWAPALEPSDATDSHFRDLIANGDQFKIHDLSLRNAQHFAKGFLIDRWTKIEKNRGKRERIA
Ga0137383_1109926413300012199Vadose Zone SoilTPLLVIQSLASGTTAIVFALICSRFFEMSNSLSFLIGLLCALDPCQLVWERYVMTETFSLFVYALVVYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVIQACTILLPLIEFARCGLPLLHKRTSKTGVLTGLTHVVASVTMLVMHGAYKYANGRLSNREPAYLDETGAHLAAVWAPALKPSDASDPR
Ga0137378_1173270613300012210Vadose Zone SoilSRFFRMSNTLSISFGFICALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAIVQAVSVLLIGFRMSFLLVVQACTILLPLIAFARCGLPVLRKRSEACAPEATVLTGLTHVAASVAMMFVMHGAYKYANGWLSNREPAYLYDAGAHLAAVWAPALVPSDASDPRFR
Ga0137394_1158173713300012922Vadose Zone SoilAIVFALICSRLFRMSNRISFLFGLICALDPCQLVWERYVMTETFSLFVYVLVLYWSLLYLRDRRIWQLAVVQALSVVLIGFRMSYLLVVLACTILLPLIAFALAPMRAVRNQSDPRRLSPLTTGFAHVVASIAMMFIMHGAYKQANGWLSNRQPAYLYDAGSHLVAVWAPV
Ga0164303_1068194413300012957SoilDRSYFYGYLIRWLAVWPHSFTPLLVAQALASAATAIVFALICSRFFRMSNRLSFLFGLMCALDPCQLLWERYVMTETFSLLVYVLVLYWSLVYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLIAFAHCAFPALRKRSEAPALEGGVLTTGLTHVIASVAMMFVMHGAYKYANGWLSKREPAYLYNTGEHLAAVWAPALEPSDATDSRFRDLIASGDQFKI
Ga0164302_1115521413300012961SoilVFALICSRFFRMSNRLSFLFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLIAFAHCAFPALRKRSEAPALEGGVLTTGLTHVIASVAMMFVMHGAYRYSNGWLSNREPAYLYDAGAHLAAVWAPALEPSDASDLRFGEIIANGPQFKIQDLRSRNAQQY
Ga0164307_1128639313300012987SoilSIPSDRSYFYGYLVRWLAVWPRSFTPLLITQALASGATAIVFALVCSRFFGMSKRLSFLFGSMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVIQALSVLLIGFRMSFLPVVQACTILLPLIAFARGGLPGLRNRSEARAPEAGVLTIGLIHVIASITMMFVMHDAYKYANGWLSKREPAYLHDSGAH
Ga0164305_1092765513300012989SoilTALTGSIPSDRSYFYGYLVRWLAVWPHSFTPLLIVQALASGATAIVFVLICSRFFRMSKRLSFLFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYCSLAYLRDRRLWQLAVIQALSVLLIGFRMSFLPLVQACTILLPLIAFARCGLLALHKKSEARVMEGRVLTSGLTHVIASIAMMFVMHGAYKYANGWLSNREPAYLYDAGAHLAAVWSPILEPSDASDLRFGEIIANGSQFKIQDLLS
Ga0157374_1211216413300013296Miscanthus RhizosphereRFFRMSNRLCFLFGFLCALDPCQLVWERYVMTETFSLLVYMLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLLVVEGCTILLPLIAFARCALPALRERIEARVPQGSVLAMALTHVAASIMMMFLMHGAYKYGNGYLTHREPAYLYDAGAHLAAVWAPALQPSDATDPRFGEIIANGPQFKIKVLGLRNAQ
Ga0157374_1293365213300013296Miscanthus RhizosphereVFALICSRFFKMSNGLSFLFGLLCVLDPLQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCALPVLGKRSEPRTLEARVLGIGLTHVIASTAMMFVMHGAYKYANGWLSIREPAYLYETGAHLAAV
Ga0134078_1039267813300014157Grasslands SoilICSRFFEMSNRLSFLFGLLCALDPCQLVWERYIMTETFSLLVYVLVLYWSLTYLRDRRLWQLAMVQALSVLLIGFRMSYLLVVQACTILLPLIAFARYALPAFRNRSEARAPETDVLTTGLTHVMASVAMMFVMHGAYKYANGWLSNREPAYLYDAGAHLAAVWAPALQPSDARDPRFGDIIANGDQFEIDNLRQRNAQQFSEGFLI
Ga0157376_1152885913300014969Miscanthus RhizosphereTPLLVAQALASAMTAVVFALIYSRFFEMSKRLSFLFGSMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCGLPALRNRSGTRAPESAVLTTGLMHVAASIAMMFVMHGAYKYANGWLSKREPAYLHDSGLHLAAVWAPALEPSDATDSRFRDLIANGDQFKIDDLTLRNAQHFGEGFLIDRWEKI
Ga0157376_1216562913300014969Miscanthus RhizosphereELSNVLSFLFGLLCALDPLQLVWERYVMTETLSLFFYVLVLYWSLAYLRDRRLWQLGVVQALSVLLISFRMSYLLVVQACTILLPVIAFARCAFPLLRQRSDARIPQASVLPIGLTHLVASVAIMFVMHGAYKYAYGRLSKSEPAYLYETGAHLVAVWAPALEPSDTSDPRFAELIANGDQFKIKDFHSRNAQQYGEG
Ga0182007_1019593913300015262RhizosphereFTPLLVIQVLASTATAIVFALICSRLFEISNRLSFLFGLLCTVDPCQLVWERYVMTETFSLLVYVLVLYWSLTYLRNRRLWQLAIVQGLSVLLIGFRMSYLLVVQGCTILLPVIAFAHCALRVLRHRSGTRTPEARVLIIGSIHVIASIAMMFVMHGAYKYANGRLSHREPAYLYNTGDHLAAVWAPALKPSDATDPRFGEIIANGDQIKIKDLHFRNAQQYAEGLLIPRWHQI
Ga0132256_10308181313300015372Arabidopsis RhizosphereCSRFFEMSKRLSFLFGLMCALDPCQLVWERYVMTETFSLLLYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAIARCSLPALRMPSEARVMKGGVLTTGLTHVAVSIAMMFVMHDAYKYANGWLGKRESAYLHDSGIHLAAVWAPALEPSDATDSRFRDLIANGHQ
Ga0132257_10201044213300015373Arabidopsis RhizosphereLVRWLAVWPHSFTPLLVGQALASGATAIVFALICSRFFEMSNRLSFLFGLMCALDPCQLVWERYVMTETFSLLAYVLVLYWSLAYLRNRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCGLPALRNRSGARAPEAAVLATVLMHVVASVAMMFVMHGAYKYANGWLSKREPAYLHDAGTHLAAVWAPALEPSDATDSRFRDLIANGHQFKIDDPTLRNAQHFGEGFLIDRWKKIENN
Ga0132257_10230308813300015373Arabidopsis RhizospherePSDRSYFYGYLVRWLAVWPHSFTPLLIAQALASGATAIVFALVCSRFFGMSKRLSFLFGSMCALDPCQLVWERYVMTESFSLLVYVLVLYWSLAYLRDRRLWQLAVIQALSVLLIGFRMSFLPVVQACTILLPLIAFARCGLPGLRNRSEARAPESGVLTIGLIHVIASITMMFVMHDAYKYANGWLSKREPAYLHDSGAHLAAIWAPALEPSDATDPRFRDLIANGREFKI
Ga0132257_10355023413300015373Arabidopsis RhizosphereFGSVCALDPCQLVWERYVMTETFSLLIYVLVLYWSLAYLRDRRLWQLAVVQVLSVLLIGFRMSYLLVVQACTILLPVIAFARCGTPALRNRSGTRAPEAGVLKNGLTHVVASIAMMFIMHAAYKQANGWLSNRQPAYLYDAGTHLVAVWAPALQPSDATDSRFRDLITNGHQFKIDDPTLRNAQHFGE
Ga0132255_10273013413300015374Arabidopsis RhizosphereAVWPHSFTLLLLIQALASAATAIVFALICIRFLEISNPLSFVFGSLCALDPSQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQTLSVLLIGFRMSYLLVVQTCTILLPLIAFASCALPVFRKRPETCAPKAGILGTGLTHVAASIAMMLMMHGTYKYVNGWLSNREPAYLYDTGSHLVAVWAPALEPSDAHDPRFAELIANGAQFKIKNLHLRNAQQFGRDLLVNRWREIEKDLQKNERVA
Ga0132255_10571739813300015374Arabidopsis RhizosphereLICSRFFRMSNRLSFLFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCGLPALRNRLGTRAPEAAVLTTGLMHVAASIAMMFVMHGAYKYANGWLSKREPAYLHDSGLHLAAVWAPALEPSDA
Ga0182036_1176362513300016270SoilPHSFTPLLIVQALAGGATAIVFALICSRFFEISNTLSFLFGLLCALDPCQLVWERYVMTESFSLIVYVLVVYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFAHYAFPALRNRSEARGPESRFVTTALTHVIASIAMMFVMHGAYKYANGWLSNRE
Ga0182041_1143425413300016294SoilRSYFYGYLVRWLAVWPHSFTPLLLSQAIASGVIAITFVTICSRFFDISNRVSFLFGLLCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRIWQLAVVQALSVMLIGFRMSYLLVVQVCTVLLPVIAFAPCALSSFRNGSKARTSEAGFLKNGFAHLAASVAIMFVMHAAYKHVNGSLSKGEPAYLYNAGAHLVAVWAPALQPSD
Ga0182035_1169655413300016341SoilAIVFALICSRLFEMSNALSFLFGSLCALDPSQLVWERYVMTETFSLLIYVLVLYWSLAYLRRRRLWQLAIVQALSVLLIGFRMSYLLVVQACTVLLPLFAFTRCGLPALRSRSEAHASKSRVITTTLTHVIASIAMISVMHGAYKYANGWLSIREPAYLYDAGAHLASVWAPVLEPSDASDPRLAEIIAN
Ga0182034_1125431923300016371SoilMSNNLSFLFGLLCALDPLQLIWERYVMTETLSLLVYALVLYWSLAYLRDRRLWQLAVVQVLSVLLIGFRMSYLIVVQACTILLPLISFARCYLPALGNRFGARVPRATILMTGLMQVIAIIAMMFTMHGAYKYVNGRLSNREPAYLDETGAHLAAVW
Ga0163161_1140782213300017792Switchgrass RhizosphereRSYFYGYLIRWLAVWPHSFTPLLVAQALASGATAIVFVLICSRFFRMSNRLSFFFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLIAFARCGLPGLCNRSGAHAPAGGVLTTGLMHVAASIAMMFVMHGGYKYANGWLSKREPAYLHDSGTHLAAVW
Ga0066655_1070752313300018431Grasslands SoilYFYCYLVRWLAVWPHSLTPLLVIQSLASGATAIVFALICSRFFEMSNSLSFLIGLLCALDPCQLVWERYVMTETFSVLVYVLVLYWSMAYLRDRRLWQLAVVQALSVLLIGSRMSYLLVVQACTILLPLIAFARCALPALRKRSEARALEAGVLTQGLTHVIASIAMMFVMHGAYKYANGWLSNREPAYVYDAGAHIVAVWAPALERADASDARFGEVIAKGHRLKI
Ga0193748_101276913300019865SoilPLLVVQALASGATAIVFALICSRFFEMSNRLSFLFGLTCALDPCQLVWERYVMTETFSLLVYVVVLYWSLAYLRNRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCAFPVLRKRSETRAPEAGLLLTGLTHVIASVAMMFVMHGAYKYANGWLSSREPAYLYDAGAHLIAVWAPALEPSDASDPRFGAVIANGHQFKMKDLHLRNAQQYGDGLLVKRWREIEKDPRKNDRIARETAINAMRRRPLE
Ga0193756_102622213300019866SoilLAVWPHSFTTLLVAQALASGGTAIVFALICSRFFEMSNRLSFLFGLTCALDPCQLVWERYVMTETFSLLVYVVVLYWSLAYLRNRRLWRLAVVQALSVLLIGFRMSYLMVVQACTILLPLIAFACCGIPALRNRSGARPPEAGVLNTGLRHVVASIAMMFIMHGAYKQANGWLSNRQPAYLYDAGSHLVAVWAPALQPSDATDSRFRELIANGHQFKIDDPTLRNAQHFGKGFLIDRWREIEKDRRTRDRIARETAMNALRRRPLQIAGLAL
Ga0193746_102360513300019870SoilVSAVTAIMFALICSRFFEISKRLSFFFGLLCALDPCQLVWERYVMTETFSLLVYVLVLYCSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLIAFARCGLPGLRNRSGTHAPVAGVLMTGLMHVIASITMMLVMHGAYKYANGWLSKREPAYLYDSGTHLAAVWAPALEPSDATDPRFGELIGNGNQFDIKDLHSRNAQQY
Ga0193701_106960913300019875SoilPKDRSYFYGYLVRWIAVWPHSFTPLLVVQALASGTTAIVLALICSRFFEMSNTFSFSFGLLCALDPCQLVWERYIMTETFSLLLYVLVLYWSLLYVRDRRIWQLAVVQALSVMLFGFRISYLLVVQVCTILLPIIAFARCAMPVFGNRSETRSSTTYVLSTGFQHVVVSVALMFVMHGVYKQVNGSLTKREPAYLYGAGTHLVSVWAPALQPSDATDSRFRDLIANGHQF
Ga0193725_110056713300019883SoilVATAIVFALICSRFFEMSNRLSFLFGLLCAMDPCQLVWERYVMTETFSLLVYVVVLYWSLAYLRNRRLWRLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCGLPALRNRSGARAPEAGVLKTGLTHVVASIAMMFVMHGAYKQANGWLSNRQPAYLYDAGSHLVAVWAPALQPSDATDSRFRDLIANGHQFKIDELTLRNAQHFGKGFLIDRWREIEKDRR
Ga0193734_106176713300020015SoilGPLLLSQMLVSAVTAIMFALICSRFFEISKRLSFFFGLLCALDPCQLVWERYVMTETFSLLVYVLVLYCSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPLVQACTILLPLIAFARCGLLALHKRSEARVMEGRVLTTELTHVVASIAMMFVMHGAYKYANGWLSNREPAYLYDAGAHLAAVWSPVLEPSDASDLRFGEIIANGSQFKIQDLLSRNAQQYGKG
Ga0207692_1118667113300025898Corn, Switchgrass And Miscanthus RhizosphereNGLSFFFGLLCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPLVQACTILLPLIAFARCAFPVLRNSSEARVMEGRVLTTGLMHVTVSIAMMLVMHGAYKYANGWLSNREPGYLYDAGAHLAAVWAPALEPSDATDSR
Ga0207705_1148876413300025909Corn RhizosphereWLAVWPHSFTPLLVAQALASGATAIVFALICSRFFEMSNRLSFLFGLLCALDPCQLVWERYVMTETFSLLVYMLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVEACTILLPLIAFARCGFSALRERIEARVAQRGVLIGGITHVVASIMMMFLMHGAYKYA
Ga0207650_1163044013300025925Switchgrass RhizosphereFYGYLVRWLALFPHSFTPLLVIQALASGATATVFALICSRFFKMSNGLSFLFGLLCVLDPLQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCALPVLGKRSEPRTLEARVLGIGLTHVIASTAMMFVMHGAYKYANGWLS
Ga0207664_1108049813300025929Agricultural SoilYFYGYLVRWLALFPHSFTPLLVIQALASGATAIVFALICSRFFEMSNGLSFLFGLLCVLDPLQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCALPVLGKRSEPRTLEARVLGIGLTHVIASTAMMFVMHGAYKYANGWLSIREPAYLYETGAHLAAVWAPALEPSDASDLRFGEIIANGPQFKIQDLRSRNAQQYG
Ga0207706_1127881813300025933Corn RhizosphereKGWIPGDRSYFYGYLVRWAALWPHSFTPLLLIQALASGATAIVFALICRRFFELSNVLSFLFGFICALDPLQLVWERYVMTETLSLSFYVLVLYWSLAYLRDRRLWQLAVVQALSVLLVGFRISYLVVVQACTILLPVIAFVRCTLPVLRRPSDMSIPHVSVLAIGLTHLIASIAMMLVMHGAYKYAFGRLSKSEPAYLYE
Ga0207665_1127485813300025939Corn, Switchgrass And Miscanthus RhizosphereFEMSNRISFSFGLICALDPLQFVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCAFPALRNQSEARTPKGGVLTIGLANVAASIAMMFIMHGAYKYANGWLSEREPAYLYHSGAHLAA
Ga0207679_1206855013300025945Corn RhizospherePLLVAQALASAATAIVFALICSRFFEMSKRLSFLFGLMCALDPGQLVWERYVMTETFSLLVYMLVLYWSLAYLRDRRLWQLAVIQALSVLLIGFRMSYLLVVEASTILLPLIAFARCGLSALRERIEARVPQGSVLALALTHVAASIAMMFVMHGAYKYANGYLTHREPAYL
Ga0209027_127346113300026300Grasslands SoilMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVIQALSVLLIGFRMSFLLVVQACTILLPLIAFARCGLPVLRKRSEACAPEPTVLTTGLTHVVASITMMFVMHGAYKYANGWLSNREPAYLYDAGAHLAAVWAPVLEPSDAGDPRFGELIANGHQFKIKTLGLR
Ga0209472_128585613300026323SoilIVFALICSRFFRMSNRLSFFFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVIQALSVLLIGFRMSFLLVVQACTILLPLIAFARCGLPVLRKRSEACAPEATVLTTVLAHVVASITMMFVMHGAYKYANAWLSNREPAYLYDAGAHLAAVWAPVLEP
Ga0209267_121849013300026331SoilIVFARICSAFFGVSNRLSFLFGFLCALDPCQLVWERYVMTETFSLLVYVLVLYRSLLYVRDRRLWELAVVQALSVVLIAFRIGYLLVVQVCTVLLPIIAFARCASPAFRNRSEARSSVTNPFSTGFEHVVVSVALMNVMHDAYKHVNASFAKREPAYLYDAGAHLAAVWAPALQPSDATDPRFRDLIANGDQFEIDNLRQRNAQQFSDGFLMGRWSKIEKDPYKRDRIAKETAINA
Ga0209158_129263513300026333SoilWIPSDRSYLYGYLVRWLAVWPHSFLPLLAAQALASGVTAIVFARICSAFFGVSNRLSFLFGFLCALDPCQLVWERYVMTETFSLLVYVLVLYRSLLYVRDRRLWELAVVQALSVVLIAFRIGYLLVVQVCTVLLPIIAFARCASPAFRNRSEARSSVTNPFSTGFEHVVVSVALMNVMHDAYKQVN
Ga0209056_1033889413300026538SoilMTETFSLLVYVLVLYRSLLYVRDRRLWELAVVQALSVVLIAFRIGYLLVVQVCTVLLPIIAFARCASPALRNRSEARSSVTNPFSTGFEHVVVSVALMIVMHDAYKHVNASFAKREPAYLYDAGAHLAAVWAPALQPSDATDPRFRDLIA
Ga0208706_10261113300026685SoilAVWPHSFGPLLLSQMLVSAVTAIVFALICSRFFEMSNGLSFFFGLLCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAIVQALSVLLIGFRMSFLPLVQACTVLLPLIAFARCAFPVLRNSSEARVMGGRVLTTGLMHVTVSIAMMLVMHGAYKYANGWLSN
Ga0207428_1111730913300027907Populus RhizosphereLLIQAVASGVTAIVFALSCRRFFDISNNLSFLFGLLCALDPLQLVWERYVMTETFSLLVYALVVYWSMAYLRDRRLWQLAVVQALSVLLIGFRMSYLLIVQACTILLPLIAFARCGFPLLQQRTSKTGVLTGLTHVIASVTMMLVMHGAYKYANGRLSSRKPAYMNETGAHLAAVWAPVLKPS
Ga0268264_1246877913300028381Switchgrass RhizosphereFRMSNRLSFLFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLKAFARCGLPSLHNRPETRVPKGGVLTTGLMHVIASIAMMFVMHDAYKYANGWLSKREPAYLHDSGTHLAAVWAPVLEPSDATDSRFR
Ga0307298_1012173413300028717SoilPLLVIQALASSATAIVFVLVCSRFFGMSKRLSFLFGSMCALDPCQLVWERYVMTESFSLLVYVLVLYWSFAYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLIAFARCGLPGLRNRSEVHAPEAGVLTIGLMNVIASITMMFVMHDAYKYANGWLSKREPAYLHDAGTHLAAVWAPALEPSDATDPRFGELIATGHQFQIKDLHLRNAQQYREGFLIKRWRGIEKKRGKNERTLRETAIN
Ga0307301_1025487713300028719SoilSRFFEMSNRLSFLFGLMCALDPCQLLWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCGVPALRNRSGARVPEAGVLKTGLTHVVASIAMMFIMHGAYKHANGWLSNRQPAYLYDAGSHLVAVWAPALQPSDATDSRFRDLIANGHQFKIDD
Ga0307280_1026793313300028768SoilWTALTGSIPSDRSYFYGYLVRWLAVWPHSLGPLLLSQMLVSAVTAIMFALICSRFFEISKRLSFFFGSLCALDPCQLVWERYVMTEAFSLLVYVLVLYCSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPLVQACTILLPLIAFARCGLLALHKRSEGHVMERRVLTTGLTHVVASVAMMFVMHGAYKYANGWLSNREPAYLYD
Ga0307280_1038067013300028768SoilLQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLVVVQACTILLPLIAFARCALPVLGKRSDPDTLGAGVLTTGLTHVIASIVMMFVMHGAYKYANGWLSKREPAYLYETGAHLAAVWAPALEPSDATDPRFGELIATGHQFQIKDLHLRNA
Ga0307290_1039142113300028791SoilLASGATAIVFALICSRFFEMSNRLSFLFGLMCALDPCQLLWERYVMTETFSLLVYVLVLYWSLAYLRNRRLWQLVVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCGVPALRNRSGARVPEAGVLKTGLTHVVASIAMMFIMHGAYKHANGWLSNRQPAYLYDAG
Ga0307292_1049567013300028811SoilKALSFLFGLLCVLDPLQLVWERYVMTESFSLLVYVLVLYWSFAYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLIAFARCGLPGLRNRSEVHAPEAGVLTIGLMHVIASITMMFVMHDAYKYANGWLSKREPAYLHDAGTHLAAVWAPALEPSDATDPRFGELIA
Ga0307286_1027438713300028876SoilLTGSIPSDRSYFYGYLVRWLAVWPHSLGPLLLSQMLVSAVTAIMFALICSRFFEISKRLSFFFGLLCALDPCQLVWERYVMTETFSLLVYVLVLYCSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPLVQACTILLPLIAFARCGLLALHKRSEARVMEGRVLTTELTHVVASIAMMFVMHGAYKYANGWLSNREPAYLYDAGAH
Ga0075405_1177310113300030847SoilMTETFSLLVYVLVLYWSLAYLRDRRLWQLAIVQALSVVLIGFRMSFLPVVQACTILLPLIAFARCAFPALRKRSEARVPERGVLTTGLTHVIASIAMMFVTHGAYKYANGWLSNREPAYLYDAGAHLAAVWAPALEPSDASDLRFGEIIANGPQFKIQDLRSRNAQQYGK
Ga0075386_1219322213300030916SoilTGWIPSDRSYFYGYLIRWLAVWPHSFGPLLLSQMLVSAVTVIVFVLICSRFFRMSNRLSFLFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSFLPVVQACTILLPLITFAHCAFPALRKRSEARTPEGGVLTTGLTHVITSV
Ga0170824_10340388813300031231Forest SoilVFALICSRFFGLSNRLSFLFGLMCALDPCQLVWERYVMTETFSLFVYVLVLYWSLLYLRDCRIWQLAVVQAFSVVLIGFRMSYLLVVLTCTILLPLIAFALAPMRAVRNQSDPRRLSPLTTGFAHVVASLAIMLLMHGAYKEVYGKLRKRESAYLHDSGAHLVSVWAPVLEPSDATDSRLGELIANGHQFKIHDLTLRNAQHFGEDFLID
Ga0170824_10954452433300031231Forest SoilPLVYVLVLYWSLAYLRNRRLWQLAVVQALSVLLISFRMSFLPVVQACTILLPLIAFARCGLPASRKRLGARALEAGVLTKGLTHVVPSIAMMFVMHGAYKYANGWLSNREPAYLYDAGEHLAAVWAPALEPSDATDSRFRDLIAKGHQFKIDDLTLEKFPALRRRFSD
Ga0170820_1692925313300031446Forest SoilDRSYFYGYVVRWLALWPHSFTPLLVAQTLASGATAIVLALICSRFFGMSNRLSFLFGLLCALDPCQLVWERYVMTETFSLLVYVVVLYWSLAYLRNRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCGLPALRNRSGARAPEAAVLTTGLMQVVASIAMMFVMHGAYKY
Ga0307469_1186820713300031720Hardwood Forest SoilFALICSRLFRMSNRISFLFGLICALDPCQLVWERYVMTETFSLFVYVLVLYWSLLYLRDRRIWQLAVVQAFSVVLIGFRMSYLLVVLACTILLPLIAFALARMRAVRNQSNPRTLSPLTTGFAHLVASLAIMLLIHGAYKEVYGKLSKRESAYLHDSGAHLVSVWAPVLEPSDATDSRLGDLIANGHQFKIHD
Ga0306919_1142441713300031879SoilWCALDPLQLVWERYVMTETFSLFVYALVVYWSLAYLRDRRLWQLAVVQTLSVLLIGFRMSYLLVIQACTILLPLIAFTRCGLPLLHKRTSKTGVLTGLTHVIASVTMMLVMHGAYKYANGRLSNREPAYLDETGAHLAAVWAPVLEPSDATDSRFRDLIANGYQFKIKDLQSRN
Ga0310912_1122185113300031941SoilCSRFFDISNRVSFLFGLLCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRIWQLAVVQALSVMLIGFRMSYLLVVQVCTVLLPVIAFAPCALSSFRNGSKARTSEAGFLKNGFAHLAASVAIMFVMHAAYKHVNGSLSKGEPAYLYNAGAHLVAVWAPALQPSDATDPRLRDLIADGDQFKIN
Ga0310916_1049639013300031942SoilVVVILRGGFRHEAFTPLAGPNAKTLYGRFGGLRSDSADRLDSSGPIVLLRLFGTLLGLWLGSFTPLLLIQALASGVTAIVFALICGRFFEMSKSISFVFGLLCALDPLQLVWERYVMTETFSLFVYALVVYWSLAYLRDRRLWQLVVVQALSVLLIGFRMSYLLVVLACAILLPLIAFARCALPVLHNRSESRVSEPSALTTGLTHVMASIAVVFVMHGAYKCVNGLLTKREPAYLYSAGTHLVAVWAPALEPSDAMDLRFRDLIANGDQFKIHD
Ga0306922_1239201113300032001SoilFTPLLLSQAIASGVIAITFVTICSRFFDISNRVSFLFGLLCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRIWQLAVVQALSVMLIGFRMSYLLVVQVCTVLLPVIAFAPCALSSFRNGSKARTSEAGFLKNGFAHLAASVAIMFVMHAAYKHVNGSLSK
Ga0307470_1096677913300032174Hardwood Forest SoilTAIVFVLICSRFFRMSNRLSFLFGLMCALDPCQLVWERYVMTETFSLLVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLLVVQACTILLPLIAFARCSLPALRKRSGRHALAGGVLTTGLMHVAASIAMMFVMHGAYKYANGWLSKREPAYLHDSGIHLAAVWAPALEPSDATDSRFRDLIANGHQFKIDDPTLRNAQQFGDGFLIDRWREIEKDR
Ga0306920_10381223813300032261SoilPLLVVQTLASGATAIVFALICSRFFEMSNNLSFLFGLLCALDPLQLVWERYLMTETFSLFVYVLVLYWSLAYLRDRRLWQLAVVQALSVLLIGFRMSYLPVVQACTILLPLISFARCYLPALGNRFGARVPRATILMTGLMQVIASIAMMFTMHGAYKYVNGRLSNREPAYLEETGAHLAAVW
Ga0310812_1055988813300032421SoilGYLIRWLALWPHSFGPLLVMQTVASSATAIVFALICSRFFEMSNALSFLFGLLCVLDPLQLVWERYVMTEAFSLLVYVLVLYWSLAYLRDRRLWQLAVVQTLSVLLIGFRMSYLLLVQACTVLLPLIAFARCTFPMLSKRSEPRTVGAGVLKIGFIHLIASIAMMFVMHGA
Ga0310914_1097380413300033289SoilLRSDSADRLDSSGPIVLLRLFGTLLGLWLGSFTPLLLIQALASGVTAIVFALICGRFFEMSKSISFVFGLLCALDPLQLVWERYVMTETFSLFVYALVVYWSLAYLRDRRLWQLVVVQALSVLLIGFRMSYLLVVLACAILLPLIAFARCALPVLHNRSESRVSEPSALTTGLTHVMASIAVVFVMHGAYKCVNGLLTKREPAYLYSAGTHLVAVWAPALEPSDAMDLRFRDLIANGDQFKIHDLSLR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.