NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F103450

Metagenome / Metatranscriptome Family F103450

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103450
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 70 residues
Representative Sequence RERQVGLEIGLNFKVILDDLGAGRNTWWGYTLHIIFDNFRVPFTSVGFRYDLNHGRWTGPDNGNGYAAR
Number of Associated Samples 72
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 50.00 %
% of genes near scaffold ends (potentially truncated) 0.99 %
% of genes from short scaffolds (< 2000 bps) 0.99 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction Yes
3D model pTM-score0.27

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.020 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil
(23.762 % of family members)
Environment Ontology (ENVO) Unclassified
(40.594 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(53.465 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 19.59%    β-sheet: 18.56%    Coil/Unstructured: 61.86%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.27
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF12680SnoaL_2 7.00
PF02627CMD 5.00
PF12974Phosphonate-bd 5.00
PF13180PDZ_2 4.00
PF04185Phosphoesterase 4.00
PF05239PRC 2.00
PF12706Lactamase_B_2 2.00
PF00248Aldo_ket_red 2.00
PF00296Bac_luciferase 2.00
PF03706LPG_synthase_TM 2.00
PF00691OmpA 2.00
PF10043DUF2279 2.00
PF02416TatA_B_E 1.00
PF13231PMT_2 1.00
PF01515PTA_PTB 1.00
PF01507PAPS_reduct 1.00
PF13618Gluconate_2-dh3 1.00
PF04545Sigma70_r4 1.00
PF00941FAD_binding_5 1.00
PF00072Response_reg 1.00
PF00496SBP_bac_5 1.00
PF03129HGTP_anticodon 1.00
PF02775TPP_enzyme_C 1.00
PF13439Glyco_transf_4 1.00
PF02922CBM_48 1.00
PF07992Pyr_redox_2 1.00
PF04226Transgly_assoc 1.00
PF16822ALGX 1.00
PF01288HPPK 1.00
PF13692Glyco_trans_1_4 1.00
PF05872HerA_C 1.00
PF00291PALP 1.00
PF00254FKBP_C 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG0599Uncharacterized conserved protein YurZ, alkylhydroperoxidase/carboxymuconolactone decarboxylase familyGeneral function prediction only [R] 5.00
COG2128Alkylhydroperoxidase family enzyme, contains CxxC motifInorganic ion transport and metabolism [P] 5.00
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 4.00
COG0392Predicted membrane flippase AglD2/YbhN, UPF0104 familyCell wall/membrane/envelope biogenesis [M] 2.00
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 2.00
COG0124Histidyl-tRNA synthetaseTranslation, ribosomal structure and biogenesis [J] 1.00
COG0280Phosphotransacetylase (includes Pta, EutD and phosphobutyryltransferase)Energy production and conversion [C] 1.00
COG0423Glycyl-tRNA synthetase, class IITranslation, ribosomal structure and biogenesis [J] 1.00
COG0433Archaeal DNA helicase HerA or a related bacterial ATPase, contains HAS-barrel and ATPase domainsReplication, recombination and repair [L] 1.00
COG0441Threonyl-tRNA synthetaseTranslation, ribosomal structure and biogenesis [J] 1.00
COG0442Prolyl-tRNA synthetaseTranslation, ribosomal structure and biogenesis [J] 1.00
COG08017,8-dihydro-6-hydroxymethylpterin pyrophosphokinase (folate biosynthesis)Coenzyme transport and metabolism [H] 1.00
COG1826Twin-arginine protein secretion pathway components TatA and TatBIntracellular trafficking, secretion, and vesicular transport [U] 1.00
COG2261Uncharacterized membrane protein YeaQ/YmgE, transglycosylase-associated protein familyGeneral function prediction only [R] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A98.02 %
All OrganismsrootAll Organisms1.98 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300010047|Ga0126382_10309188All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1191Open in IMG/M
3300018433|Ga0066667_10068988All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2226Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil23.76%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere12.87%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil11.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil5.94%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.95%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands3.96%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil2.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil2.97%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil1.98%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.98%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.98%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.98%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere1.98%
Hot SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Hot Spring0.99%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.99%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.99%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.99%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.99%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere0.99%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.99%
Miscanthus RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere0.99%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.99%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000787Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300003911Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004633Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBioEnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005206Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006237Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)Host-AssociatedOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006969Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3Host-AssociatedOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010313Hot spring microbial communities from South Africa to study Microbial Dark Matter (Phase II) - Sagole hot spring metaGEnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300011435Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT660_2EnvironmentalOpen in IMG/M
3300012896Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S118-311C-2EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300016319Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00HEnvironmentalOpen in IMG/M
3300017959Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MGEnvironmentalOpen in IMG/M
3300018058Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300025908Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025940Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025957Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025962Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300027880Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes)Host-AssociatedOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300027950Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027952Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300030683Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Anb10 (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031546Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f23EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031748Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f22EnvironmentalOpen in IMG/M
3300031765Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22EnvironmentalOpen in IMG/M
3300031771Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300032003Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D1EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032094Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f25EnvironmentalOpen in IMG/M
3300032122Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D4EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032828Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
F24TB_1004899513300000550SoilKGYPHGLPELRERQVGLEIGLNLKVILDDLGVGRNTWWGYTLHVVFDNVRVPFTSVGVRYDLNHRKWIGPDNGNGYAAR*
F24TB_1047162913300000550SoilNLEEILNAVGVRRDTWWGYGLHVVFDNIRFPFTSVGFQHDLNHSRWYGPGNGNQYSTTP*
F14TC_10075423413300000559SoilDDLGVGRNTWWGYTLHVVFDNVRVPFTSVGVRYDLNHHKWIGPDNGNGYAAR*
F14TC_10075423513300000559SoilDDLGVGRNTWWGYTLHVVFDNVRVPFTSVGVRYDLNHRKWIGPDNGNGYAAR*
JGI11643J11755_1167082813300000787SoilGLNLAIIMYDLGVRRDTWWGYTLHVVFDNFRVPFTSVGFRYDLNSKRWRGPDNGNGFATP
JGI1027J12803_10844817923300000955SoilLTYGTKGYPHGLPELRERQVGFEIGLNLKVILDDLGVGRNTWWGYTLRTVFDNLRVPFTSVGVRYDLNHRKWIGPDNGNGYAARXXXXSKSVSTSR*
JGI1027J12803_10844817933300000955SoilVILDDLGVGRNTWWGYTLHVVFDNLRVPFTSVGVRYDLNHRKWIGPDNGNGYAAR*
JGI25405J52794_1004156513300003911Tabebuia Heterophylla RhizosphereLPELRERQVGFEIGLNLAVILNDLGVRRDTWWGYALHVVCDNVRVPFTSVGFRYDLNAKRWRGPDNGNRFAAP*
Ga0055437_1010561023300004009Natural And Restored WetlandsLNFEEILNSLGARRDTWWGYGLHVVFDTIRFPFTSVGFQYDLNHGRWYGPGNGNQYSRTP
Ga0062595_10174014023300004479SoilVTYGSKYYPSGIVSLRERQVGFEIGLNFEEILNAVGVRRDTWWGYAIHVVMDNTRFPFTSVGFQYDLNHGKWYGPGNGNQYSTP*
Ga0066395_1019526923300004633Tropical Forest SoilGVQRDTWWGYTLHMAFDNFRVPFTSVGFQYDINHGRWYGPGNGNEYATER*
Ga0062594_10002390213300005093SoilEILNSIGVHRDTWWGYATHVVMDNTRFPFTSVGFQYDLNHGKWYGPGNGNEYSTTP*
Ga0068995_1001122523300005206Natural And Restored WetlandsRQVGFEIGLNFEEILNSLGVRRDKWWGYGLHIFFDNIRVPFTAVGFQYGLNRGRWYGPGNGNQYSTNP*
Ga0066388_10113804223300005332Tropical Forest SoilLNFKVILEDLGVGRNTWWGYTLHVVFDNVRVPFTSVGVRYDLNHRKWIGPDNGNGYAAR*
Ga0066388_10518857833300005332Tropical Forest SoilILNDLGVQRDTWWGYGLHVVFDNFRVPFTSVGFQYDMNHGRLHGPGNGNEYATRP*
Ga0066388_10559451523300005332Tropical Forest SoilILNAFGVERNTWWGYSLHFVLDNFRIPFTSVGFQYDLNGGKWYGPGNGNQYTR*
Ga0066905_10015200433300005713Tropical Forest SoilVGRIGNLNALGLQRDTRWGYGLHVVFDNIRFPLTAVGFQYDLNHRRWYGPGNGNQYSTTP
Ga0066905_10204175313300005713Tropical Forest SoilYPHGLPELRERQVGLEIGLNFKVILEDLGVGRNTWWGYTLHVVFDNVRVPFTSVGVRYDLNHHKWIGPGQ*
Ga0066903_10070829713300005764Tropical Forest SoilEIGLNFEEILNSLGARRNTWWGYGLHVVFDNIRFPFTAVGFQYDLNHRRWYGPGNGNQYSTTP*
Ga0066903_10370345813300005764Tropical Forest SoilGLEIGLNFKVILEDLGVGRNTWWGYTLHVVFDNVRVPFTSVGVRYDLNHRKWIGPDNGNGYAAR*
Ga0066903_10383523623300005764Tropical Forest SoilLTYGTKGYPSGVPELRERQVGLEIGLNFEQILDDLRVTRTTWWGYGLHLIFDNLRFPYTSVGFQYDLNHDQWRGPNNGNSFRSP*
Ga0066903_10385520113300005764Tropical Forest SoilERQVGFEIGLNFEEILNSLGARRDTWWGYGLHVVFDNTRFPFTSVGFQYDLNHGKWHGPGNGNQYSTTP*
Ga0066903_10464269223300005764Tropical Forest SoilDLRERQVGFEVGLNSAEILNAFGVQRNTWWGYSLHFVLDNFRIPFTSVGFQYDLNGGKWYGPGNGNNYTR*
Ga0066903_10887628413300005764Tropical Forest SoilGVRRDTWWGYALHVVGDNLRVPFTAVGVRYDLNTHRWRGPDNGSSFATP*
Ga0097621_10145638313300006237Miscanthus RhizosphereGYPTGAPAQRERQVGFEIGLDFEVILDDLGVKRNTWWGYGLHIVFDNFRFPFTSVGFQYDLNHDKWVGPDNGNGFAAR*
Ga0075428_10127382323300006844Populus RhizosphereLVSVTYSTKGYPSGLPELRERQVGFEIGLNLAIILNDLGVRRDTWWGYALHVVCDNVRVPFTSVGFRYDLNSKRWRGPDNGHGFATP*
Ga0075428_10143319813300006844Populus RhizosphereYPGGAIEPRERQVGLEIGLNFKVILDDLGAGRNTWWGYTLHIIFDNFRVPFTSVGFRYDLNHGRWTGPDNGNGYAAR*
Ga0075421_10156837223300006845Populus RhizosphereGGAIEPRERQVGLEIGLNFKVILDDLGAGRNTWWGYTLHIIFDNFRVPFTSVGFRYDLNHGRWTGPDNGNGYAAR*
Ga0075425_10215730213300006854Populus RhizosphereVGPLRYLLLSLTYGSKGYPHGLPELRERQVGLEIGLNFKVILDDFGVGRNTWWGYTLHVVFDNLRVPFTAIGVRYDLNHH
Ga0075429_10114341913300006880Populus RhizosphereGYATDNPALHERQVGFEIGLNVEEVLYGIGVQRDTWWGYVLHFALDNIRVPFTSLGVQYDLNDGRWRGPGNGNSYSTR*
Ga0075426_1018698313300006903Populus RhizosphereGFEIGLDFEVMLNDLGVNRKTWWGYGLHIVFDNFRFPFTSVGFQYDMNHDKWIGPGNGNEFATQ*
Ga0075424_10048057613300006904Populus RhizosphereFEIGLDFEVILNDLGVNRKTWWGYGLHIVFDNFRFPFTSVGFRYDMNHDKWIGPDNGNGFAAR*
Ga0075419_1048361123300006969Populus RhizosphereDLGARRNTWWGYTLHIVFDNFRVPFTSVGVRYDLNSGKWTGPDNGNGFAR*
Ga0075419_1109285223300006969Populus RhizosphereILDDLGVGRNTWWGYTLHVVFDNVRVPFTSVGVRYDLNHHKWIGPDNGNGYAAR*
Ga0075419_1142035323300006969Populus RhizosphereVGIEIGINFSPILTDLGVQRNTWWGYTLHIIFDNFRVPFTAVGFRYDLNHGKWIGPDNGNGFATR
Ga0099828_1006662723300009089Vadose Zone SoilMRRRAIPPVLQTIGLNLEEIPNVAGARRDTWWGYALHVIFDNFRVPYTSVGFRYDLNHDRWRGPDYGNGFSR*
Ga0114129_1203582413300009147Populus RhizosphereRERQVGLEIGLNFKVILDDLGAGRNTWWGYTLHIIFDNFRVPFTSVGFRYDLNHGRWTGPDNGNGYAAR*
Ga0105249_1130998433300009553Switchgrass RhizosphereYPTGTPALRERQVGFEIGLDFEVILNDLGVQRNTWWGYGLHIVVDNFRFPFTSVGFQYDMNHDKWVGPGNGNGFAAR*
Ga0126374_1080416223300009792Tropical Forest SoilSLRERQVGFEIGIDFKTVLNDPGVGRKTWWGYTLHIALDNFRIPYTSVGYRYDLNHEKWTGPDNGNGFAAR*
Ga0126380_1118787623300010043Tropical Forest SoilYPTGTPSQRERQVGFEIGIDFPVILNDIGVTRSTWWGYGLHIVFDNFRFPFTAVGFRYDLNHGKWTGPDNGNEFAAR*
Ga0126384_1167698823300010046Tropical Forest SoilSITYGSKGYPSGVPQLRERQVGIEIGLNFEQILDDLRVTRSTWWGYGLHLVFDNIRFPYTSIGVQYDLNHDQWRGPNNGNSFRSP*
Ga0126382_1030918813300010047Tropical Forest SoilTYGTKGYPAGAPDARERQVGIEIGLNFRVILDDLGARRNTWWGYTLHIIFDNFRVPFTSVGFRYDLNHNKWIGPDNGNGYATR*
Ga0126382_1177654023300010047Tropical Forest SoilYPHGLPELRERQVGLEIGLNLKVILDDLGVGRNTWWGYTLHVVFDNLRVPFTAVGFRYDLNSRRWIGPDNGNGYATR*
Ga0126382_1245791013300010047Tropical Forest SoilLPELRERQVGFEIGLNLAVILNDLGVRRDTWWGYVLHVVCDNVRVPFTSVGFRYDLNAKRWRGPDNGNRFAAP*
Ga0116211_118537913300010313Hot SpringQVGFEIGLNLEEILNSLGARRDTWWGYGLHFVLDNFRVPFTSVGFFYDLNAGKWYGPGNGNTYATR*
Ga0126370_1177762413300010358Tropical Forest SoilTYGSKFYTSGEKDLRERQIGFEIGLNFEEILNSLGARRDSWWGYGLHVVFDNFRVPFTSVGFQYDLQHGRWYGPGNGNQYSTTP*
Ga0126370_1217889513300010358Tropical Forest SoilSGTVDLRERQVGFEIGLNFEEILNAVGVRRDTWWGYAMHLVMDNTRFPFTSVGFQYDLNHGKWYGPGNGNQYTSP*
Ga0126370_1248535913300010358Tropical Forest SoilPELRERQVGFEIGLNLAIIMDDLGVRRDTWWGYTLHVVFDNFRVPFTSVGFGYDLNSKRWRGPDNGNGFATP*
Ga0126372_1016210013300010360Tropical Forest SoilFEVILDDLHVTRQTWWGYGLHIVFDNFRFPFTAVGFRYDLNHDKWTGPDNGNRFAAR*
Ga0126372_1049844713300010360Tropical Forest SoilIGLNFEQILDDLRVTRSTWWGYGLHLVFDNIRFPYTSVGFQYDLNHDQWRGPNNGNSFMSQ*
Ga0126372_1209217713300010360Tropical Forest SoilMMDDLGVRRDTWWGYTLHVAFDNFRVPFTSVGFRYDLNSKRWRGPDNGNEFATP*
Ga0126372_1237145513300010360Tropical Forest SoilRYLLLSLTYGSKGYPHGLPELRERQVGLEIGLNLKVILDDLGVGRNTWWGYTLHVVFDNLRVPFTAVGFRYDLNSRRWIGPDNGNGYATR*
Ga0126378_1018300433300010361Tropical Forest SoilLNFEEILNSLGVRRDTWWGFGLHVVFDNFRVPFTSVGFQYDLQHGRWYGPGNGNQYSTTP
Ga0126377_1068938623300010362Tropical Forest SoilPDLRERQVGIEIGLNFQKILDDVGVRRSTWWGYALHTVFDNVRLPFTAVGYRYDLNHGEWRGPDNGNSFLSH*
Ga0126379_1081332513300010366Tropical Forest SoilTPSLRERQVGFEIGIDFKTVLNDLGVGRKTWWGYTLHIALDNFRIPYTSVGYRYDLNHEKWTGPDNGNGFAAR*
Ga0126383_1121264713300010398Tropical Forest SoilGVVDLRERQVGFEIGLNFEEILNAVGVRRDTWWGYAMHVVIDNTRFPFTSVGFQYDLNHGKWYGPGNGNQYSTSP*
Ga0126383_1129471133300010398Tropical Forest SoilLRRETWWGYGLHVIFDNTRFPFTAVGFQYDLNHRRWYGPSNGNQYSTTP*
Ga0126383_1264907023300010398Tropical Forest SoilPSGEVDLRERQVGFEIGLNFEEILNAVGVRRDTWWGYGLHVVFDNTRFPFTSVGFQYDLNHGKWYGPGNGNQYSTRP*
Ga0134122_1145987223300010400Terrestrial SoilFRYLLLSVTYGTKGYPSGAPELRERQVGIEIGLNFEQILDDLRVTRRTWWGYALHVVFDNVRVPFTAVGYRFDLNHHRWRGPNNGNTFSSP*
Ga0134122_1197210023300010400Terrestrial SoilKGYPSGTPSLRERQVGFEIGLDFEVILNDLGVKRNTWWGYGLHIVFDNFRFPFTSVGFQYDMNHDKWIGPGNGNGFAAQ*
Ga0137426_119450823300011435SoilEEILNALGVRRDTWWGYGLHVVGDNIRFPFTSIGFQYGLNHRRWYGPGNGNQYATTP*
Ga0157303_1032079513300012896SoilSGTPSLRERQVGFEIGLDFEVILNDLGVKRNTWWGYGLHIVFDNFRFPFTSVGFQYDMNHDKWIGPGNGNGFAAQ*
Ga0137404_1182272213300012929Vadose Zone SoilTPSLRERQIGLEIGLDFEVILNDVGVKRNTWWGYGLHIVFDNFRFPFTAVGFRYDLNHDKWVGPGNGNGFAAR*
Ga0126375_1018636533300012948Tropical Forest SoilGVVDLRERQVGFEIGLNFEEILNAVGVRRDTWWGYAMHVVMDNTRFPFTSVGFQYDMNHGKWYGPGNGNQYSTSP*
Ga0126375_1117610623300012948Tropical Forest SoilRQVGFEFGLNFEEILNALGLRRETWWGYGLHVIFDNTRFPFTAVGFQYDLNHRRWYGPSNGNQYSTTP*
Ga0126375_1187293813300012948Tropical Forest SoilLFSITYGTKGDPSGTPSLRERQVGFEIGIDFKTVLNDPGVGRKTWWGYTLHIALDNFRIPYTQLGYRYDLRHGRWHGPDIGNSYGWR*
Ga0132256_10166573913300015372Arabidopsis RhizospherePKGTIETRERQVGIEIGLNFPEILDDLGARRDRWWGYVAHMVLDNIRFPFTAGGFRFDLNHKKWHGLDSGDSFGF*
Ga0132256_10308451223300015372Arabidopsis RhizosphereRERQVGIELGLNFQEILDDLGVRRNRWWGYVAHGIFDNIRFPFTAGGFRYDLNHGKWHGPDSGNSFGC*
Ga0132257_10255975813300015373Arabidopsis RhizosphereTGLPELRERQVGIEIGLNLQQILNDVGVRRTTWWGYALHAVFDNVRIPFTSVGMRYDLNHGEWRGPDNGNSFLKP*
Ga0182033_1050446613300016319SoilTYNVKGYPTGTPSLRERQVGFEIGIDFPIILNDIGVTRKTWWGYGLHLVFDNFRFPFTAVGFRYDLNSGKWTGPDNGNGFAAR
Ga0187779_1063476633300017959Tropical PeatlandLNSLGVRRDTWWGYGLHVIFDNTRFPFTSVGFQYDVNHHRWYGPGNGNQFSTTP
Ga0187766_1113740013300018058Tropical PeatlandSVTYGSKFYPTGLPSLRERQVGFEIGLNFEEILNSLGVRRDTWWGYGLHVVFDNTRFPFTSVGFQYDLNHGKWHGPGNGNQYSTTP
Ga0066667_1006898823300018433Grasslands SoilLNDLGARRNTEWGYTLHIVFDNFRVPFTSVGYRYDLNHNKWIGPDNGNGFATR
Ga0126371_1232653713300021560Tropical Forest SoilPDLRERQVGFEIGLNFEEILNSLGARRDTWWGYGLHVVFDNTRFPFTSVGFQYDLNHGKWHGPGNGNQYSTTP
Ga0126371_1318823423300021560Tropical Forest SoilLNSLGARRDSWWGYGLHVVFDNFRVPFTSVGFQYDLQHGRWYGPGNGNQYSTTP
Ga0207643_1049636223300025908Miscanthus RhizosphereVRHLQVGIEIGLNFEQILDDLRVTRRTWWGYALHVVFDNVRVPFTAVGYRFDLNHNRWRGPNNGNTFSSP
Ga0207691_1131738213300025940Miscanthus RhizosphereRYLLLSVTYGTKGYPSGAPELRERQVGIEIGLNFEQILDDLRVTRRTWWGYALHVVFDNVRVPFTAVGYRFDLNHNRWRGPNNGNTFSSP
Ga0210089_102548423300025957Natural And Restored WetlandsMFSVTYGSKFYPSGPPELRERQVGFEIGLNFEEILNSLGVRRDKWWGYGLHIFFDNIRVPFTAVGFQYGLNRGRWYGPGNGNQYSTNP
Ga0210070_103398413300025962Natural And Restored WetlandsLGVRRDKWWGYGLHIFFDNIRVPFTAVGFQYGLNRGRWYGPGNGNQYSTNP
Ga0209481_1043004023300027880Populus RhizosphereSCTYGSKGYPHGLPELRERQVGLEIGLNVKLILDDLGVGRNTWWGYTLHVVFDNVRVPFTSVGVRYDLNHHKWIGPDNGNGYAAR
Ga0209382_1053905133300027909Populus RhizosphereDSRERQVGIEIGINFSPILTDLGVQRNTWWGYTLHIIFDNFRVPFTAVGFRYDLNHGKWIGPDNGNGFATR
Ga0209885_102415913300027950Groundwater SandEIGLNFKVILDDLGARRGTWWGYTLHVVFDNFRIPFTSVGYRYDLNHGRWTGPDNGNGFATR
Ga0209889_108127723300027952Groundwater SandSQEERQVGFEIGLNVEEILNSVGVRRNTWWGYTLHLVLDNVRIPFTAVGFRYDLNSGRWSGPNNGNTSSTR
Ga0247621_103666313300030683SoilPEFRERQVGFEIGLNFGIILSDLGVRRDTWWGYSLHVVFDNLRIPFTSVGVRYDLNHRRWIGPDNGNGYATR
(restricted) Ga0255310_1011475513300031197Sandy SoilHEDQERQVGFEIGLNLEEILRTVGVRRDSWWGYPLRLVGDNVRFPYLSVGFRYDLDHGKWRGPNNGNYP
Ga0318538_1079013213300031546SoilGVRRDTWWGYAMHVVMDNTRFPFTSVGFQYDLNHGKWYGPGNGNQYTSP
Ga0307468_10015333123300031740Hardwood Forest SoilGVGRNTWWGYTLHVVFDNVRVPFTSIGVRYDLNHHKWIGPDNGNGYAAR
Ga0318492_1012147313300031748SoilGTPSLRERQVGFEIGIDFPVILNDLGVNRSTWWGYGLHIVFDNFRFPFTAVGFRYDLNHGKWTGPDNGNEFAAR
Ga0318554_1001217343300031765SoilDFPVILNNLGVNRSTWWGYGLHIVFDNFRFPFTAVGFRYDLNHGKWTGPDNGNEFAAR
Ga0318546_1002852513300031771SoilYSVRGYPTGTPSLRERQVGFEIGIDFPVILNDLGVNRSTWWGYGLHIVFDNFRFPFTAVGFRYDLNHGKWTGPDNGNEFAAR
Ga0307473_1139782813300031820Hardwood Forest SoilVTYGTKGYPSGTPSLRERQVGFELGLDFEVILDDLGVKRDTWWGYGLHIVFDNFRFPFTAVGFRYDMNHDKWTGPDNGNGFAAR
Ga0306923_1030947133300031910SoilYNVKGYPTGTPSLRERQVGFEIGIDFPIILNDIGVTRKTWWGYGLHLVFDNFRFPFTAVGFRYDLNSGKWTGPDNGNGFAAR
Ga0306921_1113728113300031912SoilRQVGFEIGLNFEEILNSLGARRDTWWGYGLHVVFDNTRFPFTSVGFQYDLNHGRWYGPGNANQYSTTP
Ga0310897_1028635613300032003SoilRYLMVSATYGVKGYPGGAIEPRERQVGLEIGLNFKVILDDLGAGRNTWWGYTLHIIFDNFRVPFTSVGFRYDLNHGRWTGPDNGNGYAAR
Ga0310890_1021375213300032075SoilILDDLGAKRNTWWGYGLHIVFDNFRIPFTSVGFQYDMNHDKWIGPGNGNGFAAQ
Ga0318540_1038655123300032094SoilSLRERQVGFEIGIDFAVILNDIGVTRKTWWGYGLHIVFDNFRFPFTAVGFRYDLNHGKWTGPDNGNEFAAR
Ga0310895_1058873513300032122SoilYGVKGYPGGAIEPRERQVGLEIGLNFKVILDDLGAGRNTWWGYTLHIIFDNFRVPFTSVGFRYDLNHGRWTGPDNGNGYAAR
Ga0307470_1009506423300032174Hardwood Forest SoilDFEVILDDLGVKRNTWWGYGLHIVFDNFRFPFTSVGFQYDMNHDKWIGPGNGNRFAAQ
Ga0307470_1065029213300032174Hardwood Forest SoilEIGLDFEVILNDPGVKRNTWWGYGLHIVFDNFRFPFTSVGFQYDMNHDKWIGPGNGNGFAAQ
Ga0307472_10028686513300032205Hardwood Forest SoilGIPSLRERQMGFELGLDFEVILDDLGVKRDTWWGYGLHIVFDNFRFPFTAVGFRYDMNHDKWTGPDNGNGFAAR
Ga0335080_1094479013300032828SoilGFEIGLNFEEILTSLGVRRDTWWGYGLHVVFDNTRFPFTSVGFQYDLNHHRWYGPGNGNQYSTTP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.