NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F103807

Metagenome Family F103807

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103807
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 85 residues
Representative Sequence MSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS
Number of Associated Samples 81
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 77.78 %
% of genes near scaffold ends (potentially truncated) 1.98 %
% of genes from short scaffolds (< 2000 bps) 1.98 %
Associated GOLD sequencing projects 73
AlphaFold2 3D model prediction Yes
3D model pTM-score0.56

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (91.089 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(31.683 % of family members)
Environment Ontology (ENVO) Unclassified
(53.465 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(54.455 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 56.67%    β-sheet: 1.67%    Coil/Unstructured: 41.67%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.56
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF01740STAS 26.73
PF01192RNA_pol_Rpb6 9.90
PF13443HTH_26 4.95
PF13282DUF4070 2.97
PF00291PALP 1.98
PF00072Response_reg 0.99
PF00118Cpn60_TCP1 0.99
PF00487FA_desaturase 0.99
PF02696SelO 0.99
PF09107SelB-wing_3 0.99
PF00266Aminotran_5 0.99
PF13248zf-ribbon_3 0.99
PF00303Thymidylat_synt 0.99
PF14534DUF4440 0.99
PF00571CBS 0.99
PF05977MFS_3 0.99
PF13483Lactamase_B_3 0.99
PF00578AhpC-TSA 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG1758DNA-directed RNA polymerase, subunit K/omegaTranscription [K] 9.90
COG0207Thymidylate synthaseNucleotide transport and metabolism [F] 0.99
COG0397Protein adenylyltransferase (AMPylase) SelO/YdiU (selenoprotein O)Posttranslational modification, protein turnover, chaperones [O] 0.99
COG0459Chaperonin GroEL (HSP60 family)Posttranslational modification, protein turnover, chaperones [O] 0.99
COG1398Fatty-acid desaturaseLipid transport and metabolism [I] 0.99
COG2814Predicted arabinose efflux permease AraJ, MFS familyCarbohydrate transport and metabolism [G] 0.99
COG3239Fatty acid desaturaseLipid transport and metabolism [I] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A91.09 %
All OrganismsrootAll Organisms8.91 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005167|Ga0066672_10003688All Organisms → cellular organisms → Bacteria6461Open in IMG/M
3300005171|Ga0066677_10144222All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1299Open in IMG/M
3300006173|Ga0070716_100000002All Organisms → cellular organisms → Bacteria396037Open in IMG/M
3300012206|Ga0137380_10153455All Organisms → cellular organisms → Bacteria → Acidobacteria2100Open in IMG/M
3300012355|Ga0137369_10038300All Organisms → cellular organisms → Bacteria → Acidobacteria4319Open in IMG/M
3300012975|Ga0134110_10072271All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1374Open in IMG/M
3300025910|Ga0207684_10084367All Organisms → cellular organisms → Bacteria → Acidobacteria2705Open in IMG/M
3300025939|Ga0207665_10000007All Organisms → cellular organisms → Bacteria177869Open in IMG/M
3300034174|Ga0334932_000002All Organisms → cellular organisms → Bacteria619337Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil31.68%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil20.79%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere14.85%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil11.88%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil9.90%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.98%
Sub-Biocrust SoilEnvironmental → Terrestrial → Soil → Unclassified → Desert → Sub-Biocrust Soil0.99%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.99%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.99%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.99%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300018920Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 ISEnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m2EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034174Sub-biocrust soil microbial communities from Mojave Desert, California, United States - 28HNSEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25384J37096_1023441013300002561Grasslands SoilMSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
JGI25382J43887_1003033343300002908Grasslands SoilMSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEXHITSPPPRRSLS*
JGI25382J43887_1014938723300002908Grasslands SoilMSRKKSRKNLLSGIIVVISIAVIAVWQFYLFVTFKNTNGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
JGI25382J43887_1017306023300002908Grasslands SoilMSRKKSRKNLLSGIIVVISIAVIAVWQFYLFVTFKNTDGIVDVQGGIQHLWWAIGFGLFACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0066672_1000368853300005167SoilMSRRKNRKNLLTAIIVVMLIAVIAVWQFYLFVTFKNISGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0066672_1092577813300005167SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYLFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0066677_1014422213300005171SoilFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0066683_1004835553300005172SoilMSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPPRRSLS*
Ga0066680_1047054213300005174SoilKNVLSAVIGVMFITAVAIWQFYQFVTFKNADGVVDLQGGTHHLWWAIGLGLIAFIVALLFFSVFLRYDRNDELHITSPS*
Ga0066679_1001781063300005176SoilMSRRKNRKNLLTAIIVVMLIAVIAVWQFYLFVTFKNISGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDSNDEMHITS
Ga0066679_1006549753300005176SoilMSRKKSRKNLLSAIIVVMSIAVIAVWQFYMFVTFKNTNGIVDAQGGIQHLWWAIGSGLLACTAAVLFFSVFLRYDSNDEMHITSPPPRGSLS*
Ga0066678_1075302813300005181SoilMSHKKNGKQVLSAVIGVISITAVAIWQFYLFVTFKNAQGVVDVQGGTYHLWWAIGLALVACIGSFLFFSVFLRYDRNDEMHITSLPS*
Ga0066671_1073840313300005184SoilVTFKNTNGIVDVQGGMQHLWWAIGFGLLACTGAVLFFSVFLRYDSSDEMHITSPPPRRNLS*
Ga0066676_1019757943300005186SoilMSRKKSRNNLLSGIIVVMSIAGIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0066675_1039042713300005187SoilVIAVWQFYMFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0070671_10003904953300005355Switchgrass RhizosphereMGYRNRGKNVLSAIIVVMLFAVVAVWQFYTFATFKDTNGILDKQGGTQHFWWAVGFGLLACAMAFLSFSVFLRHDANNDTHITSPPSRT*
Ga0070708_10008610933300005445Corn, Switchgrass And Miscanthus RhizosphereMSHKKSRTNILAGVVSVLALAGIGIWQFYQFVAFKNASGVVDLQGGTHHLWWAIGLGLTAFIVAFLFFSVFLRYERSDEIHITSVS*
Ga0070708_10011872933300005445Corn, Switchgrass And Miscanthus RhizosphereMNHKKTGKNVVSGVIGLMFITAVAVWQFYQFVTFKNADGVVDLQGGTYHLWWAIGLGLIAFIAALLFFSVFLRYDRNDELHITSPS*
Ga0070708_10019815623300005445Corn, Switchgrass And Miscanthus RhizosphereMSHKKNGKQVLSAVIGVISIAAIAIWQFYLFVTFKNAQGVVDVQGGTYHLWWAIGLALIACIGSFLFFSVFLRYDRNDEMHITSLPS*
Ga0070708_10062121023300005445Corn, Switchgrass And Miscanthus RhizosphereMRHRKSRRNVLSAVVVVMSLAAIAVWQFYLFVVFKNANGIADMQGGTQHLWWAIGLGLIAFIVAFVVFSVFLRYDGSDEMHITSQPS*
Ga0066686_1017362333300005446SoilLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPPPRRSLS*
Ga0066689_1012810813300005447SoilVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPPRRSLS*
Ga0066689_1016769943300005447SoilMSHQKNGKQVLSAVIGVISITAVAIWQFYLFVTFKNAQGVVDVQGGTYHLWWAIGLALVACIGSFLFFSVFLRYDRNDEMHITSLP
Ga0066689_1027352613300005447SoilVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0066681_1033216123300005451SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYMFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0070706_10000789573300005467Corn, Switchgrass And Miscanthus RhizosphereMSRRKSRKNLLSAVIGVMSIAAIAIWQFWLFVTFKNAAGVVDVQGGKQHLWWAIGFGLFACIAAFLFFSVFLRYDRNDEMHITSQPS*
Ga0070706_10014762343300005467Corn, Switchgrass And Miscanthus RhizosphereMSRKKSRKNLLSAIIVVMSIAVIAVWQFYMFVTFKNTNGIVDAQGGIQHLWWAIGFGLLACTAAVLFFSVFLRYDSNDEMHITSPPPRGSLS*
Ga0070706_10068961133300005467Corn, Switchgrass And Miscanthus RhizosphereMSHKKSRTNILAGVVSVLALAGIGIWQFYQFVAFKNASGVVDLQGGTHHLWWAIGLGLTAFIVAFLFFSVFLRYDRSDEIHITSVS*
Ga0070707_10199394823300005468Corn, Switchgrass And Miscanthus RhizosphereMRHRKSRRNVLSAVVVVMSLAAIAVWQFYLFVVFKNANGIADMQGGTQHLWWAVGLGLIACIIAFLFFSVFLRYDRSDEMHITSQPS*
Ga0070698_10004494533300005471Corn, Switchgrass And Miscanthus RhizosphereMSRRKSRKNLLLGIIVVISIAVIAVWQFYLFVTFKNTEGIVDVQGGVQHLWWAIGFALFACTAAFLFFSVFLRYDGNDEMHITSPPPRRSLS*
Ga0070698_10067099523300005471Corn, Switchgrass And Miscanthus RhizosphereMRHRKSRRNVLSAVVVVMSLAAIAVWQFYLFVVFKNANGIADMQGGTQHLWWAIGLGLIAFIVAFVVFSVFLRYDRSDEMHITSQPS*
Ga0066701_1005735023300005552SoilMSHRKSRKNVLSAVIGVMFITAVAIWQFYQFVTFKNADGVVDLQGGTHHLWWAIGLGLIAFIVALLFFSVFLRYDRNDELHITSPS*
Ga0066707_1068970233300005556SoilFYSFVTYRNPAGVVDLQGGAHHLWWAISFGLVACVAAFLIFSIFLRYDKNDELHITSPPAPREMIL*
Ga0066670_1016939823300005560SoilMSRKKSRNNLLSAIIVVTSIAVIAVWQFYIFVTFKNTNGIVDVQGGMQHLWWAIGFGLLACTGAVLFFSVFLRYDSSDEMHITSPPPRRNLS*
Ga0066654_1000642663300005587SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYLFVTFKNISGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0066706_1015571613300005598SoilMSRRKNRKNLLTAIIVVMLIAVIAVWQFYLFVTFKNLSGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0066706_1036753033300005598SoilLWQFYSFVTYRNPAGVVDLQGGAHHLWWAISFGLVACVAAFLIFSIFLRYDKNDELHITSPPAPREMIL*
Ga0066696_1053613523300006032SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYTFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0066652_10102279123300006046SoilMSRKKSRNNLLFAIIAVMSIGVVAVWQFYMFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0070716_1000000021533300006173Corn, Switchgrass And Miscanthus RhizosphereMSYKKSGKQVLSAAIGVISIATVAIWQFYLFVTFKNAEGVVDVQGGTYHLWWAVGLALIACIGSFLFFSVFLRYDRNDEMHITSLPS*
Ga0066660_1007835273300006800SoilAIIVVTSIAVIAVWQFYIFVTFKNTNGIVDVQGGIQHLWWAIGFGLLACTGAVLFFSVFLRYDSSDEIHITSPPPRRNLS*
Ga0075425_10128158123300006854Populus RhizosphereMSYQKSGKRVLSAAIGVISIATLAIWQFYLFVTFKNAEGVVDVQGGTYHLWWAVGLALIACIGSFLFFSVFLRYDRNDEMHITSLPS*
Ga0075426_1055278323300006903Populus RhizosphereMSRKKSRNNLLSAIIVVTSIAVIAVWQFYMFVTFKNTNGIVDVQGGIQHLWWAIGVGLLACTAAFLFFSVFLRYDRNDEIHITSPPPRSSLS*
Ga0075426_1091324023300006903Populus RhizosphereMSYKKSGKQVLSAAIGVISIATLAIWQFYLFVTFKNAEGVVDVQGGTYHLWWAVGLALIACIGSFLFFSVFLRYDRNDEMHITSLPS*
Ga0099828_1041665623300009089Vadose Zone SoilMSHKKNGKQVLSAVIGVISIAAIAIWQFYLFVTFKNAQGVVDVQGGTYHLWWAIGLALVACIGSFLFFSVFLRYDRNDEMHITSLPS*
Ga0066709_10274383913300009137Grasslands SoilVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0134088_1042632523300010304Grasslands SoilLFVTFKNINGIGDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPPRRSLS*
Ga0134109_1008924923300010320Grasslands SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYLFVTFKNTNGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0134084_1012744513300010322Grasslands SoilFYLFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0134063_1061098613300010335Grasslands SoilIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0137391_1123565323300011270Vadose Zone SoilLAISGIAIWQFYQFVTFKNAGVVDLQGGTHHLWWAIGLGLTAFIVAFLFFLVFLRYDRNDEMHITSQMS*
Ga0137389_1018330213300012096Vadose Zone SoilMSHKKSRTNILSGLFSVLAISGIAIWQFYQFVTFKNAGVVDLQGGTHHLWWAIGLGLTAFIVAFLFFLVFLRYDRNHEMDITSQMS*
Ga0137389_1172469713300012096Vadose Zone SoilMSVGKQRRKKVVSGVIAVMSVVTVAVWQFYLFARFRNEAGVVDVQGGTHHLWLAVGLGLIACIVAFLFFSVFLQHDRNDELHITS*
Ga0137364_1022081523300012198Vadose Zone SoilMSIGIIAVWQFYMFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSSDEMHITSPPPRRNLS*
Ga0137383_1004077353300012199Vadose Zone SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYVFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0137365_1077081523300012201Vadose Zone SoilMSRRKSRKNLLTEIIVVVSIAVIAVWQFYTFVTFKTTNGIVDAQGGLQHLWWAIGVGLLACIAAVLFFSVFLRYDSNDEMHITSPPPRGSLS*
Ga0137363_1140689513300012202Vadose Zone SoilRKNRKNLLTAIIVVMLIAVIAVWQFYLFVTFKNISGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0137399_1025722113300012203Vadose Zone SoilMSLAAIAVWQFYLFVVFKNANGIADMQGGTQHLWWAVGSGLIACIVAFLFFSVFLRYDRSDEMHITSQPS*
Ga0137380_1015345533300012206Vadose Zone SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYMFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSSDEMHITSPPPRRSLS*
Ga0137381_1003382063300012207Vadose Zone SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYVFVTFKNTNVIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0137378_10000827183300012210Vadose Zone SoilMLITALAVWQFYRFVTFETAGGVADVQGGTHHLWWAIGLGLIAFIAAFLFFSVFLRYDRSDEMHITSVS*
Ga0137378_1017306023300012210Vadose Zone SoilMSRKKSRKNLLSGIIVVMSIAVIAVWQFYLFVTFKNTNGIVDVQGGIQHLWWAIGFGLFACIAAFLSFSVFLRYDGNDEMHITSPPPRRSPS*
Ga0137387_1084139313300012349Vadose Zone SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYMFVTFKNTNGIVDAQGGIQHLCWAIGFGLLGFTAAVLFFSVFMRYDSSDEMHITSPPPRRSLS*
Ga0137372_1000901653300012350Vadose Zone SoilMLITALAVWQFYRFVTFETAGGVADVQGGTHHLWWAIGLGLTAFIAAFLFFSVFLRYDRSDEMHITSVS*
Ga0137386_1004742033300012351Vadose Zone SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYMFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDGSDEMHITSPPPRRSLS*
Ga0137367_1004609023300012353Vadose Zone SoilMSRRKSRKNLLTGIIVVVSIAVIAVWQFYTFVTFKNTNGIVDAQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0137366_1022487023300012354Vadose Zone SoilMLITALAVWQFYRFVTFETAEGVADVQGGTHHLWWAIGLGLTAFIAAFLFFSVFLRYDRSDEMHITSVS*
Ga0137369_1003830073300012355Vadose Zone SoilMSRRKSRKNLLTGIIVVVSIAVIAVWQFYTFVTFKNTNGIMDAQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS*
Ga0137384_1160743113300012357Vadose Zone SoilMLITALAVWQFYRFVTFETAGGVADVQGGTHHLWWAIGLGLIAFIAAFLFFSVFLRYDRSDE
Ga0137361_1003624633300012362Vadose Zone SoilMSRKKSRKNLLSAIIVVMSIAIIAIWQFYMFVTFKNTNGIVDAQGGLQHLWWATGVGLLACIAAVLFFSVFLRYDSNEEMHITSPPPRGSLS*
Ga0134110_1007227113300012975Grasslands SoilIAVWQFYLFVTFQNTNGIVDVQGGIKHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0134076_1022179113300012976Grasslands SoilMSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDSNDEMHITSPPPRGSLS*
Ga0134078_1019801413300014157Grasslands SoilRNNLLFAIIAVMSIGVIAVWQFYMFVTFKNTNGIVDVQGGIKHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0134079_1068593323300014166Grasslands SoilAVMSIGVVAVWQFYMFVTFKNTNGIVDVQGGIKHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS*
Ga0134073_1007187613300015356Grasslands SoilMSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHIT
Ga0134089_1038566513300015358Grasslands SoilNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPPRRSLS*
Ga0132256_10011294033300015372Arabidopsis RhizosphereMGYRNRGKNVLTAIIVVMLFAVVAVWQFYTFATFKDTNGILDKQGGTQHFWWAVGFGLLACAMAFLSFSVFLRHDANNDTHITSPPSRTRLS*
Ga0132257_10110308313300015373Arabidopsis RhizosphereMGSRKSRKNVLSAIIVMMSVVAVAIWQFYTFVTFKNTNGMFDLQGGTQHFWWAVGFGLVACIVAVLFFSIFLRYDRNDETHIILPPSRKRLS*
Ga0134069_115727113300017654Grasslands SoilQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPPRRSLS
Ga0134112_1013281123300017656Grasslands SoilLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPPRRSLS
Ga0066655_1004970523300018431Grasslands SoilMSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS
Ga0066667_1196932023300018433Grasslands SoilKKSRNNLLFAIIAVMSIGVIAVWQFYLFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS
Ga0066662_1009367823300018468Grasslands SoilMSRRKNRKNLLTAIIVVMLIAVIAVWQFYLFVTFKNISGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS
Ga0066662_1032863123300018468Grasslands SoilMSHRKNRKNILSGVIGVMFITAVAIWQFYQFVTFKNADGVVDLQGGTHHLWWAIGLGLIAFIVALLFFSVFLRYDRNDELHITSPS
Ga0066669_1082708823300018482Grasslands SoilMSRKKSRNNLLFAIIAVMSIGVIAVWQFYLFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS
Ga0190273_1013968033300018920SoilMNIRKSPVNVLAAVIGVITIATIAAWQFYLFVTFKNAQGLVDVQGGTHHLWWAIGAALVACLAAFLLLSSSLRYDKDNEMHITS
Ga0193747_101341433300019885SoilVTKGKSRMSRKKSRKNLLSAIIVVMSIAIIAIWQFYMFVTFKNTNGIVDAQGGLQHLWWAIGIGLFACIAAVLFFSVFLRYDSNDEMHITSPPPRGSLS
Ga0207684_1008436743300025910Corn, Switchgrass And Miscanthus RhizosphereMSRKKSRKNLLSAIIVVMSIAVIAVWQFYMFVTFKNTNGIVDAQGGIQHLWWAIGFGLLACTAAVLFFSVFLRYDSNDEMHITSPPPRGSLS
Ga0207684_1010420843300025910Corn, Switchgrass And Miscanthus RhizosphereMSRRKSRKNLLSAVIGVMSIAAIAIWQFWLFVTFKNAAGVVDVQGGKQHLWWAIGFGLFACIAAFLFFSVFLRYDRNDEMHITSQPS
Ga0207684_1061446733300025910Corn, Switchgrass And Miscanthus RhizosphereMSHKKSRTNILAGVVSVLALAGIGIWQFYQFVAFKNASGVVDLQGGTHHLWWAIGLGLTAFIVAFLFFSVFLRYDRSDEIHITSVS
Ga0207665_10000007403300025939Corn, Switchgrass And Miscanthus RhizosphereMSYKKSGKQVLSAAIGVISIATVAIWQFYLFVTFKNAEGVVDVQGGTYHLWWAVGLALIACIGSFLFFSVFLRYDRNDEMHITSLPS
Ga0209471_110294913300026318SoilMSRRKNRKNLLTAIIVVMLIAVIAVWQFYLFVTFKNISGIVDVQGGIQHLWWAIGSGLLACTAAVLFFSVFLRYDSNDEMHITSPPPRRSLS
Ga0209472_129471613300026323SoilMSRKKSRNNLLFAIIAVMSIGVVAVWQFYIFVTFKNTNGIVDVQGGIQHLWWAIGFGLLGFTAAVLFFSVFMRYDSNDEMHITSPPPRRSLS
Ga0209803_109160943300026332SoilMSHKKNGKQVLSAVIGVISITAVAIWQFYLFVTFKNAQGVVDVQGGTYHLWWAIGLALVACIGSFLFFSVFLRYDRNDEMHITSLPS
Ga0209803_119372623300026332SoilMSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPPRRSLS
Ga0209804_126169323300026335SoilVMSIGVIAVWQFYLFVTFKNTNGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDEMHITSPPPRRSLS
Ga0209690_102986353300026524SoilMSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPAPREMIL
Ga0209157_132855213300026537SoilMSRKKSRNNLLSGIIVVMSIAVIAVWQFYLFVTFKNINGIVDVQGGIQHLWWAIGFGLLACTAAFLFFSVFLRYDRNDELHITSPPPPRRSLS
Ga0209577_1085385513300026552SoilMSRKKSRNNLLSAIIVVTSIAVIAVWQFYIFVTFKNTNGIVDVQGGIQHLWWAIGFGLLACTGAVLLFSVFLRYDSSDEMHITSPPPRRNLS
Ga0307471_10013766243300032180Hardwood Forest SoilMVIACLGGAALWQFYSFVTYRNAVGVLDLQGGEHHLWWAIGFGLIACFAAFLMFSIFLRYDRNDELHITSPPPPREMIL
Ga0334932_000002_107106_1073633300034174Sub-Biocrust SoilMGEIKKGRTNIIAAVIGVLSIAAVAIWQFYLFVTFKSANGVVDVQGGTHHLWWAIGIGFIACLVGFLAFSVLLRYDKNDEMHITS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.