NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F102995


Overview

Basic Information
Family ID: F102995
Family Type: Metagenome
Number of Sequences: 101
Average Sequence Length: 94 residues
Representative Sequence: MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLK
Number of Associated Samples: 79
Number of Associated Scaffolds: 101
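
The representative sequence in this record can be inspected directly. The following is a minimal Python sketch and not part of NMPFamsDB: the function name `residue_composition` is hypothetical, and the sequence is copied from the Representative Sequence field of this record.

```python
from collections import Counter

# Representative sequence copied verbatim from the family record (F102995).
REPRESENTATIVE = (
    "MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTT"
    "LGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLK"
)


def residue_composition(seq: str) -> dict[str, float]:
    """Return the fraction of each residue in seq, most frequent first."""
    if not seq:
        return {}
    counts = Counter(seq)
    return {aa: n / len(seq) for aa, n in counts.most_common()}


if __name__ == "__main__":
    print("length:", len(REPRESENTATIVE))
    # Show the five most frequent residues.
    for aa, frac in list(residue_composition(REPRESENTATIVE).items())[:5]:
        print(f"{aa}: {frac:.1%}")
```

The same function can be applied to any member sequence of the family to compare its composition with the representative.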

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 0.00 %
% of genes from short scaffolds (< 2000 bps): 0.00 %
Associated GOLD sequencing projects: 73
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.36

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group: Unclassified (100.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (26.733 % of family members)
Environment Ontology (ENVO): Unclassified (71.287 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (80.198 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 48.74%, β-sheet: 0.00%, Coil/Unstructured: 51.26%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.36

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
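
The cutoff in the note above can be expressed as a small check. This is a minimal Python sketch under the assumption that the only criterion is the pTM < 0.7 threshold quoted in the note; the names `PTM_CUTOFF` and `is_low_confidence` are hypothetical and do not come from NMPFamsDB.

```python
# Threshold quoted in the note above; models below it are flagged as low confidence.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a pTM score falls strictly below the cutoff."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError(f"pTM score must lie in [0, 1], got {ptm}")
    return ptm < cutoff


if __name__ == "__main__":
    # pTM-score reported for family F102995.
    print(is_low_confidence(0.36))
```

A score exactly at the cutoff is treated as not low confidence here, because the note uses a strict inequality.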



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF12327 | FtsZ_C | 79.21
PF08478 | POTRA_1 | 0.99
PF10555 | MraY_sig1 | 0.99
PF05635 | 23S_rRNA_IVP | 0.99
PF14450 | FtsA | 0.99
PF08245 | Mur_ligase_M | 0.99
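
The frequencies above are percentages of the 101 family scaffolds, so they can be converted back to approximate scaffold counts. The sketch below assumes each percentage is simply count / 101 rounded to two decimals, which the page does not state explicitly; the function name `scaffolds_from_percent` is hypothetical.

```python
# Number of scaffolds in the family, taken from the table header.
FAMILY_SCAFFOLDS = 101


def scaffolds_from_percent(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Recover the nearest whole number of scaffolds from a percentage."""
    if not 0.0 <= percent <= 100.0:
        raise ValueError(f"percent must lie in [0, 100], got {percent}")
    return round(percent / 100 * total)


if __name__ == "__main__":
    # Frequencies copied from the Pfam neighborhood table.
    for pfam_id, pct in [("PF12327", 79.21), ("PF08478", 0.99)]:
        print(pfam_id, scaffolds_from_percent(pct))
```

Under that assumption, the FtsZ_C domain neighbors the family gene on 80 of the 101 scaffolds, and each of the remaining domains appears on a single scaffold.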

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds
COG1589 | Cell division septal protein FtsQ | Cell cycle control, cell division, chromosome partitioning [D] | 0.99
COG4775 | Outer membrane protein assembly factor BamA | Cell wall/membrane/envelope biogenesis [M] | 0.99



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %


Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 26.73%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 24.75%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 19.80%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 11.88%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.95%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.98%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.98%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.99%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.99%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.99%
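
Several habitat labels in this table occur more than once with different GOLD taxonomy paths, so per-label totals need to be summed. The following Python sketch shows one way to do that; the percentages are copied from the table, while `HABITAT_ROWS` and `totals_by_label` are hypothetical names introduced for illustration.

```python
from collections import defaultdict

# (habitat label, % of family members), copied from the habitat table.
HABITAT_ROWS = [
    ("Soil", 26.73),
    ("Grasslands Soil", 24.75),
    ("Grasslands Soil", 19.80),
    ("Vadose Zone Soil", 11.88),
    ("Hardwood Forest Soil", 4.95),
    ("Agricultural Soil", 3.96),
    ("Soil", 1.98),
    ("Soil", 1.98),
    ("Terrestrial Soil", 0.99),
    ("Soil", 0.99),
    ("Corn, Switchgrass And Miscanthus Rhizosphere", 0.99),
    ("Populus Rhizosphere", 0.99),
]


def totals_by_label(rows):
    """Sum percentages over rows that share the same habitat label."""
    totals = defaultdict(float)
    for label, pct in rows:
        totals[label] += pct
    # Round to two decimals to hide floating-point noise.
    return {label: round(total, 2) for label, total in totals.items()}


if __name__ == "__main__":
    ranked = sorted(totals_by_label(HABITAT_ROWS).items(), key=lambda kv: -kv[1])
    for label, pct in ranked:
        print(f"{label}: {pct:.2f}%")
```

Grouping by label alone discards the distinction between taxonomy paths; grouping by a path level instead would be the alternative if that distinction matters.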




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002909 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm | Environmental
3300002912 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG | Environmental
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental
3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300024275 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK15 | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental
3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental
3300026316 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026342 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes) | Environmental
3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental
3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental
3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300031152 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_S | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_100307101 | 3300002558 | Grasslands Soil | MALPGRARHYVAAGLLTAGGLLWATRDAWPWRRPLTAPPIVVTDAYAEFTETLGRRETLAEVLARAGITGRDYAGFLAAAQHLPVRRLRP
JGI25382J43887_101012161 | 3300002908 | Grasslands Soil | MTLRSRASQYVAAGLVTAGGLFWATRDAWPWRRPLTAEPIVITDAYAEFTETLGRRETLSDVLARAGITGRDYATFLAAAQHLPVRRLRPGLAFQVRRL
JGI25388J43891_10596111 | 3300002909 | Grasslands Soil | MGGRWRTSHYLAAGALTAAGLVWASREAWPWRQPLTAPAIVVDAAYSDFSDALGRRETVSDVLARAGVTGRDYAGFLAAAKSLPVRRLRPGLVFDFRRLKTE
JGI25386J43895_101868291 | 3300002912 | Grasslands Soil | MALPGRARHYVAAGLLTAGGLLWATRDAWPWRRPLTAPPIVVTDAYAEFTETLGRRETLAEVLARAGITGRDYAGFLAAAQHLPVRRLRPGL
Ga0066672_101817601 | 3300005167 | Soil | MGGRWRTSHYLAAGALTAAGLVWASREAWPWRQPLTAPAIVVDAAYSDFSDALGRRETVSDVLARAGVTGRDYAGFLAAAKSLPV
Ga0066677_102850922 | 3300005171 | Soil | MVGRWRTSHYLAAGALTAAGLVWASREAWPWRQPLTAPAIVVDAAYSDFSEALGRRETVSDVLARAGVTGRDYAAFLAAAKSLPVRR
Ga0066680_109583862 | 3300005174 | Soil | MNRRLSHYVVAGALVAGGLVWAATDAWPWRRPLTARPLMVDAAYVDFTETLGKRETLGDVLARGGVRGRDYVAFLAAAKTLPVRRLRPGLAFHFRRLK
Ga0066690_101518482 | 3300005177 | Soil | MGGRWRTSHYLVAGSLTVAGLVWASREAWPWRQTLTAPAIEVNAAYSDFSEVLGRRETVSDVLARAGVTGRDYGAFLAAAKSLPVRRLRPGLVF
Ga0066685_109073011 | 3300005180 | Soil | MTLFSHWRASHYVAAGVLSAGGLVWATREAWPWRRPLTAPPIVVSEAYAEVTETLGRREVLSDVLGRAGITGRDYVAFLAAARHLPARRLRPGLAF
Ga0066685_110813941 | 3300005180 | Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPTIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPG
Ga0066678_109394401 | 3300005181 | Soil | MPASWRASHYAVAGALVVGGLAWATREAWPWRRPLTASPIVVDGAYADVTETLGRRQTLSDVLARGGVTGRDYARFLAAAKSLPVRRLRPGLKFDLRRLKTDSIA
Ga0066676_104266571 | 3300005186 | Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPTIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLKTDSIA
Ga0066675_106433321 | 3300005187 | Soil | MGWRRNHYIAAGVLSAGGLLWASHDAWPWRQPLTAAPIVVNDAYAEVTETLGRREILADVLARAGITGRDYAAFLAAAKHL
Ga0070694_1001966982 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPVIVVDAAYADLTTALGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLRFDLRRLKTDS
Ga0066682_103056492 | 3300005450 | Soil | MPSSWRASHYIVAGVLVAGGLVWEGATSDAWPWRRPLTAPAIVVDAAYAEVTTRLGRREMLSDVLARGGITGRDYAAFLAAATALPVRRLRPGLEFQLR
Ga0066697_104944872 | 3300005540 | Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLK
Ga0066697_107088572 | 3300005540 | Soil | MTLFSHWRASHYVAAGVLSAGGLVWATREAWPWRRPLTAPPIVVSEAYAEVTETLGRREVLSDVLGRAGITGRDYVAFLAAAR
Ga0066695_109047312 | 3300005553 | Soil | MAWRTSHFATAGALVAGGLVWASRDAWPWRRPLTASPIVVDAAYADVTETLGRREILSDVLARGGVTGRDYAAFLAAAKSL
Ga0066698_107252181 | 3300005558 | Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPG
Ga0066693_102268412 | 3300005566 | Soil | MGGRWRTSHYLVAGALTAAGLAWASREAWPWRQPLTAPAIVVDAAYSDFSDALGRRETVSDVLARAGVTGRDYAGFLAAAKSLPVRRLRPGLVFDFRRLKTESAA
Ga0066708_101225791 | 3300005576 | Soil | MPSSWRASHSIVAGVLVAGGLVWATGVAWPWRRPLTAPAILVYAAYRDVDDALGRRERLADVLARAGVTGRDYANFLAAAKSLPVRRLRPGLAFQVRRLKTDSVARRITIRL
Ga0066696_102937992 | 3300006032 | Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPTIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFD
Ga0066656_106521781 | 3300006034 | Soil | MAWRTSHFVAASALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPG
Ga0066652_1002628481 | 3300006046 | Soil | MPSSWRASHSIVAGVLVAGGLVWATGVAWPWRRPLTAPAILVYAAYRDVDDALGRRERLADVLARAGVTGRDYANFLAAAKSLPVRRLRPGLAFQVRRL
Ga0066653_101886142 | 3300006791 | Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTTLGRRETLADVLARGGFRGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLKTD
Ga0066659_100339911 | 3300006797 | Soil | MAWRTSHFATAGALVAGGLVWASRDAWPWRRPLTASPIVVDAAYADVTETLGRREILSDVLARGGVTGRDYAAFLAAAKSLPVRRLRPGLKFDMRRLKT
Ga0066659_112257282 | 3300006797 | Soil | MPASWRASHYAVAGALVVGGLAWATRDAWPWRRPLTASPIVVDAAYADLTATLGRRETLSDVLARGGVTGRDYAEFLAAAKSLPVRRLRPGLKF
Ga0066659_116531161 | 3300006797 | Soil | MGGRWRTSHYLVAGSLTVAGLVWASREAWPWRQTLTAPAIEVNAAYSDFSEVLGRRETVSDVLARAGVTGRDYGAFLAAAKSLPVRRLRPGLVFAFRRLKT
Ga0079221_104350331 | 3300006804 | Agricultural Soil | MGGWRLSHYVAAGALVAGGLAWAARDAWPWRRPLTARPLVVNAAYVDFTETLGRRERLGDVLARGGVRGRDYVAFLTAAKHLPVRRLRPGLAFHFRRL
Ga0075434_1003418752 | 3300006871 | Populus Rhizosphere | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPVIVVDAAYADVTTSLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLRFDLRRLKTD
Ga0079218_118286341 | 3300007004 | Agricultural Soil | MPPLSWRASRYVAAGVLSAGGLVWATREAWPWRRPLTAPPIVVTAAYVEFADTLGRRETLLDVLGHARIRGRDYGAFLAAAPSLP
Ga0099793_102973941 | 3300007258 | Vadose Zone Soil | MAWRTSHFVAAGALVAGGLVWASRDAWPWRRPLTAPAILVDAAYADLTTALGRRETLAEVLARGGIRGRDYAAFLAAAKSLPVRRL
Ga0066710_1022151852 | 3300009012 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPTIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGL
Ga0066710_1027711891 | 3300009012 | Grasslands Soil | MAWRTSHFVAAGALVAGGLVWASRDAWPWRRPLTAPAIVVDAAYADWTTALGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPG
Ga0066709_1036853021 | 3300009137 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPTIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLR
Ga0134070_101568462 | 3300010301 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLKTD
Ga0134088_101631613 | 3300010304 | Grasslands Soil | LPSLPPLPPSLRRAGVVAAGLVAAGGLLWAARDAWPSRRRLTAAPIVITAAYAEFVDTLGRRETLSDALQRGGIVGRDYAAFLVAAKSLPVRRLRPG
Ga0134084_102896011 | 3300010322 | Grasslands Soil | MTLFSQWRASHYVAAGVLSAGGLVWATREAWPWHRPLTAPPIVVSEAYAEVTETLGRRETLSDVLARAGITGRDYAAFLGAARH
Ga0134086_104167702 | 3300010323 | Grasslands Soil | MPSSWRASHSIVAGVLVAGGLVWATGVAWPWRRPLTAPAILVYAAYHDVDDALGRRERLADVLARAGVTGRDYANFLAAAKSLPVRRLRPGLAFQVRRLKTDSVARRIT
Ga0134064_101939551 | 3300010325 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLADVLARGGIRGRDYTAFLAAAKSLPVRRLRPGLKFDLRRLKTDSIA
Ga0134064_104086452 | 3300010325 | Grasslands Soil | MTLVSHWRASHYVAAGVLSAGGLVWATRDAWPWRRPLTAPPIVVSEAYAEFTETLGRRETLSDVLARAGITGREYTALLAAARHLPARR
Ga0134063_101236131 | 3300010335 | Grasslands Soil | MAWRTSHFATAGALVAGGLVWASRDAWPWRRPLTASPIVVDAAYADVTETLGRREILSDVLARGGVTGRDYAAFLAAAKSLPVRRLRPGLKFDL
Ga0134062_103708461 | 3300010337 | Grasslands Soil | MARPWRTSHYVVAAALTAAGLVWATREAWPWRQPLTAPAIVVDAAYSEFSHSLGRRETLSDVLARAGVPGRDYAGFLAAAKALPVRRLRPGLV
Ga0134062_103819921 | 3300010337 | Grasslands Soil | MTLVSHWRASHYVAAGVLSAGGLVWATRDAWPWRRPLTAPPIVVSEAYAEFTETLGRRETLSDVLARAGITGREYTALLAAARHLPARRLRP
Ga0134127_106059772 | 3300010399 | Terrestrial Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLADVLARGGIRGRDYAAFLAAAKSLPV
Ga0137383_108054771 | 3300012199 | Vadose Zone Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLADVLARGGIRGRDYTAFLAAAKSLPVRRLRPGLKFD
Ga0137382_103603752 | 3300012200 | Vadose Zone Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLKTDSIARHVTV
Ga0137376_118155962 | 3300012208 | Vadose Zone Soil | MPSSWRASHSIVAGVLVAGGLVWATGVAWPWRRPLTAPTILVNAAYSDVDDALGRRERLADVLARAGVTGRDYASFLAAAKSLPVRRLRPGLAFQVRRLKTDSVA
Ga0137367_102481051 | 3300012353 | Vadose Zone Soil | MPASWRASHYAVAGALVVGGLVWTTRDAWPWRRPLTASPIVVDAAYADVTETLGRRETLSDVLARGGVTGRDYAAFLAAATSLPVR
Ga0137384_105565201 | 3300012357 | Vadose Zone Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLKTDSIARRV
Ga0137395_111880372 | 3300012917 | Vadose Zone Soil | MAWRTSHFVAAGALVAGGLVWASRDAWPWRRPLTAPAILVDAAYADLTTALGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLKTDSIAR
Ga0137419_102189051 | 3300012925 | Vadose Zone Soil | MPASWRASHYAVAGALVVVGLAWATRDAWPWRRPLTAPAIVVNAAYADLTATLGRRETLSEVLARGRVTGRDYAAFLAAAKS
Ga0137404_105772652 | 3300012929 | Vadose Zone Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLR
Ga0137407_115887062 | 3300012930 | Vadose Zone Soil | MAWRTSHFVAAGALVAGGLVWASRDAWPWRRPLTAPAILVDAAYADLTTALGRRETLAEVLARGGIRGRDYAAFLAAAKSLPVRRLR
Ga0134077_100000911 | 3300012972 | Grasslands Soil | MAWRGSHYVVAGALAAGGLAWATRDAWPWRQPLTARAIVVDRAYVAFAETLGRRETVSDVVARAGITGRHYAGLLAVARDLPVRRLRPRLV
Ga0134077_102991981 | 3300012972 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDEAYADLTTALGRRETLAEVLARGGIRGRDYAAFLAAAKSLPVRR
Ga0134077_103050762 | 3300012972 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLADVLARGGIRGRDYAAFLAAAQSLPVRRLRP
Ga0134110_100711782 | 3300012975 | Grasslands Soil | MARPWRTSHYVVAAALTAAGLVWATREAWPWRQPLTAPAIVVDAAYSEFSHSLGRRETLSDVLARAGVTGRDYAGFLAAAKALPVRRLRPGLVFEFRRLKTDSVASRV
Ga0134110_100921401 | 3300012975 | Grasslands Soil | MAWRTSHFATAGALVAGGLVWASRDAWPWRRPLTASPIVVDAAYADVTETLGRREILSDVLARGGVTGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLKTDRI
Ga0134110_102524772 | 3300012975 | Grasslands Soil | MVGRWRTSHYLAAGALTAAGLVWASREAWPWRQPLTAPAIVVDAAYSDFSDALGRRETVSDVLARAGVTGRDYAGFLAAAKSLPVRR
Ga0134076_100953462 | 3300012976 | Grasslands Soil | MAWRGSHYVVAGALAAGGLAWATRDAWPWRQPLTARAIVVDRAYVAFAETLGRRETVSDVVARAGITGRNYAGLLLAARDLP
Ga0134081_100408802 | 3300014150 | Grasslands Soil | MGGRWRTSHYLAAGALTAAGLVWASREAWPWRQPLTAPAIVVDAAYSDFSDALGRRETVSDVLARAGVTGRDYAGFLAAAKSLPVRRLRPGLV
Ga0134075_100255313 | 3300014154 | Grasslands Soil | MTLFSHWRASHYVAAGVLSAGGLVWATREAWPWRRPLTAPPIVVSEAYAEVTETLGRREVLSDVLGRAGITGRDYVAFLAAARHLPARRLRPGLAFLVRRLKT
Ga0134075_100965962 | 3300014154 | Grasslands Soil | MTWRASHSLVAGALAAGGLVWATRDAWPWRRPLTAPAIVVDAAYADFADTLGRREIPSDVLARAGITGRDYVGFLAAAKSLPVRRLRPGLVFHFRRLWT
Ga0134075_102513781 | 3300014154 | Grasslands Soil | MPSSWRASHYIVAGVLVAGGLVWATSDAWPWRRPLTAPAILVNAAYSDADDALGRRERLADVLDRAGVTGRDYASFLAAAKSLPVRRLRP
Ga0134078_101068481 | 3300014157 | Grasslands Soil | MARPWRTSHYVVAAALTAAGLVWATREAWPWRQPLTAPAIVVDAAYSEFSHSLGRRETLSDVLARAGVTGRDYAGFLAAAKALPVRRLRPGLVFEFRRLKTDSVASRVSV
Ga0137420_12163801 | 3300015054 | Vadose Zone Soil | MAWRTSHFVAAGALVAGGLAWATRDAWPWRRPLVASPIVVDAAYTDVTETLGRRETLSDVLARGGVTGRDYAAFLAAAKSLPVRRLRPGLKFDIRRLKTDR
Ga0134089_100983862 | 3300015358 | Grasslands Soil | MTWRASHSLVAGACAAGGLVWATRDAWPWRRPLTAPAIVVDAAYADFADTLGRRETPSDVLARAGITGRDYVGFLAAAKS
Ga0134089_103633411 | 3300015358 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLAEVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLKT
Ga0134085_105378392 | 3300015359 | Grasslands Soil | MSPRSRASQYVAAGLVTAGGLFWATRDAWPWRRPLTAQPIVITDAYVEFTETLGRRETLSDVLARAGITGRDYAAFLAAAQHLPVRRLRPGLAFQV
Ga0134074_11806912 | 3300017657 | Grasslands Soil | MAWRTSHFATAGALVAGGLVWASRDAWPWRRPLTASPIVVDAAYADVTETLGRREILSDVLARGGVTGRDYAAFLAAAKSLPVRRLRPGL
Ga0066655_103954461 | 3300018431 | Grasslands Soil | MGGRWRTSHYLAAGALTAAGLVWASREAWPWRQPLTAPAIVVDAAYSDFSDALGRRETVSDVLARAGVTGRDYAGFLAAAKSLPVRRL
Ga0066655_106797011 | 3300018431 | Grasslands Soil | MAWRTSHFATAGALVAGGLVWASRDAWPWRRPLTASPIVVDAAYADVTETLGRREILSDVLARGGVTGRDYAAFLAAAKSLPVRRLRPGLKFDLR
Ga0066655_107305911 | 3300018431 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLK
Ga0066667_115994801 | 3300018433 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTTLGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFDLR
Ga0066667_117293752 | 3300018433 | Grasslands Soil | MPSSWRASHSIVAGVLVAGGLVWATGVAWPWRRPLTAPAILVNAAYSDVDDALGRRERLADVLARAGVTGRDYASFLAAAKSLPVRRLRPGLAFQVRRLKTDSVAR
Ga0066662_112710681 | 3300018468 | Grasslands Soil | MSRLSHWRASHYVAAGVLSAGGLVWATREAWPWRRPLTAPPIVVSEAYAEFTETLGRREMLSDVLARAGLTGRDYGAFLAAARHLPVRRLRPGLAFQVR
Ga0247674_10482972 | 3300024275 | Soil | MNRRLSHYVVAGALVAGGLVWAASDAWPWRRPLTARPLMVDAAYVDFTETLGKRETLGDVLARGGVRGRDYVAFLAAAKSLPVRRLRPGLAFHFRRLKADSVTTSVTVRL
Ga0209237_10055258 | 3300026297 | Grasslands Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLAEVLARGGIRGRDYTAFLAAAKSLPVRRLRPGLKFDLRRLKT
Ga0209237_12437261 | 3300026297 | Grasslands Soil | MPASWRASHYAVAGALVVGGLAWATRDAWPWRRPLTASPIVVDAAYADLTATLGRRETLSDVLARGGVTGRDYAEFLAAAKSLPVRRL
Ga0209236_12642592 | 3300026298 | Grasslands Soil | MAWRTSHFATAGALVAGGLLWASRDAWPWRRPLTASPIVVDAAYADVTETLGRRETLSDVLARGGVTGRDYAAFLAAAKSLPVRR
Ga0209239_10104035 | 3300026310 | Grasslands Soil | MPASWRVSHYAVAGALAVAGLAWATRDAWPWRRPLTASPIVVDGAYADVTETLGRRQTLSDVLARGGVTGRDYARFLAAAKSLPVRRLRPGLKFDLRRLKTDS
Ga0209239_10406331 | 3300026310 | Grasslands Soil | MSPRSRASQYVAAGLVTAGGLFWATRDAWPWRRPLTAQPIVITDAYVEFTETLGRRETLSDVLARAGITGRDYAAFLAAAQHLPVRRL
Ga0209239_12402732 | 3300026310 | Grasslands Soil | MGGRWRTSHYLVAGALTAAGLAWASREAWPWRQPLTAPAIVVDAAYSDFRDALGRRETVSDVLARAGVTGRDYAGFLAAAKSLPVR
Ga0209155_10590731 | 3300026316 | Soil | MTLFSHWRASHYVAAGVLSAGGLVWATREAWPWRRPLTAPPIVVSEAYAEVTETLGRREVLSDVLGRAGITGRDYVAFLAAA
Ga0209471_12111831 | 3300026318 | Soil | MPSSWRASHSIVAGMLVAGGLVWATGVAWPWRRPLTAPAILVYAAYRDVDDALGRRERLADVLARAGVTGRDYANFLAAAKSLP
Ga0209057_11984981 | 3300026342 | Soil | MAWRTSHFVAAGALVAGGLVWASRDAWPWRRPLIASPIVVDAAYADLTATLGRRETLSDVLARGGVTGRDYAAFLAAAQSLPVRRLRPGLKFQLRRLKTESVASRV
Ga0209690_11957112 | 3300026524 | Soil | MAWRRKSHVVAAGLAAAALLWAGRVAWSLRRPLTAPPILVAGAYTDFDDSLGRRETMSDVLARAGITGQDYARFLRAATKLPVRRLQPGLVFHFRRLRTEPTVRQVMVRPAYDRRLWL
Ga0209156_101786891 | 3300026547 | Soil | MGWRRNHYIAAGVLSAGGLLWASHDAWPWRQPLTAAPIVVNDAYAEVTETLGRREILADVLARAGITGRDYAAFLAAAKH
Ga0209474_102178871 | 3300026550 | Soil | MTLFSQWRASHYVAAGVLSAGGLVWATREAWPWRRPLTAPPIVVSEAYAEVTQTLGRRETLSGVLARAGITGRDYAAFLGAARHLPARRLR
Ga0209648_100352635 | 3300026551 | Grasslands Soil | MAWRASHYAVAGVLAVGGLAWASRAAWPWRRPLTAPAIVVTAAYADGTDALGRRETLSDVLARAGVTGRHYARFLAAARHLPVRRLRPGLVFQVRRLKTDSV
Ga0209074_100326442 | 3300027787 | Agricultural Soil | MNRRLSHYVVAGALVAGGLVWAASDAWPWRRPLTARPLMVDAAYVDFTETLGKRETLGDVLARGGVRGRDYVAFLAAAKSLPVRRLRPGLAFHFRRLKADN
Ga0209074_104764641 | 3300027787 | Agricultural Soil | MNWRLSHYVVAGALVAGGLVWAATDAWPWRRPLTARPLMVDAAYVDFIETLGKRETLGDVLARGGVRGRDYVAFLAAAKSLPVRRLRP
Ga0209488_107343061 | 3300027903 | Vadose Zone Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTALGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLKFDLRRLKTDSIARRV
Ga0307282_102222281 | 3300028784 | Soil | MAWRASHYVVAGALAAGGLSWAARDAWPWRRPLTAPTIVVDAAYTDFTETLGRREMLADVLARGHVTGRDYAAFLAAAHGLPVRR
Ga0307501_100974161 | 3300031152 | Soil | MPAWRASHYAVAGALVVGGLAWAARDAWPWRRPLTAPPIVVNGAYADLVETLGRRETVSDVLARGRVTGRDYAAFLAAAKSLPVRRLRPGLKF
Ga0307469_101183911 | 3300031720 | Hardwood Forest Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTARAIVVDGAYADLTTALGRRETLADVLARGGIRGRDNATFLAAAKSLPVRR
Ga0307473_101062832 | 3300031820 | Hardwood Forest Soil | MAWRTSHFVAAGALVAGGLAWATRDAWPWRRPLVASPIVVSAAYSDVTETLGRRETLSDVLARGGVTGRDYAAFLAAAKSLPVRRLRPGLKF
Ga0307473_111467842 | 3300031820 | Hardwood Forest Soil | MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTARAIVVDGAYADLTTALGRRETLADVLARGGIRGRDYATFLAAAKSLPVRRLRPGLKFDLRRLKTDSIARR
Ga0307479_110419782 | 3300031962 | Hardwood Forest Soil | MGGRWRTSHYVAAGALAAAGLVWASREAWPWHQPLTAPAIVVNAAYSDFSDVLGRRETVSDVLGRAGVTGRDYAGFLAAAKSLPVRRLRPGLVFAFRRL
Ga0307470_111639471 | 3300032174 | Hardwood Forest Soil | MNWRLSHYVVAGALVAGGLVWAATDAWPWRRPLTARPLMVDAAYVDFTETLGKRETLGDVLARGGVRGRDYVAFLTAAKSLPVRR
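
Many family members differ from the representative at only a few positions or are truncated at the C-terminus. The Python sketch below is a rough, ungapped comparison over the shared prefix, not a substitute for the family's multiple sequence alignment; `prefix_identity` is a hypothetical helper, and both sequences are copied from this record.

```python
# Representative sequence of family F102995, copied from the Overview section.
REPRESENTATIVE = (
    "MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPAIVVDAAYADLTTT"
    "LGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPGLK"
)

# One family member (sample taxon 3300005180), copied from the table.
MEMBER = (
    "MAWRTSHFVAAGALVAGGLAWASRDAWPWRRPLTAPTIVVDAAYADLTTT"
    "LGRRETLADVLARGGIRGRDYAAFLAAAKSLPVRRLRPG"
)


def prefix_identity(a: str, b: str) -> float:
    """Fraction of matching residues over the shared, ungapped prefix."""
    shared = min(len(a), len(b))
    if shared == 0:
        return 0.0
    # zip stops at the shorter sequence, so only the shared prefix is compared.
    matches = sum(x == y for x, y in zip(a, b))
    return matches / shared


if __name__ == "__main__":
    print(f"{prefix_identity(REPRESENTATIVE, MEMBER):.1%}")
```

This only makes sense for sequences that start at the same position; members with a different N-terminus would need a proper alignment first.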


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"