NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F049992



Overview

Basic Information
Family ID F049992
Family Type Metagenome / Metatranscriptome
Number of Sequences 146
Average Sequence Length 38 residues
Representative Sequence MTKNKWLVHLAKVRKENPKIKDVAKLAKLAKKTYKK
Number of Associated Samples 53
Number of Associated Scaffolds 146

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 64.79 %
% of genes near scaffold ends (potentially truncated) 5.48 %
% of genes from short scaffolds (< 2000 bps) 68.49 %
Associated GOLD sequencing projects 49
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.49

Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Unclassified (82.192 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment (39.726 % of family members)
Environment Ontology (ENVO) Unclassified (50.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified (49.315 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 35.94%, β-sheet: 0.00%, Coil/Unstructured: 64.06%

Predicted 3D Structure

pTM-score: 0.49

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
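
The low-confidence flag above amounts to a threshold test on the pTM-score. A minimal Python sketch of that rule, assuming only the 0.7 cutoff quoted in the note; the function name `is_low_confidence` is hypothetical and is not part of NMPFamsDB:

```python
PTM_CUTOFF = 0.7  # cutoff quoted in the note above (pTM < 0.7 means low confidence)

def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm < cutoff

# Family F049992 reports a pTM-score of 0.49.
print(is_low_confidence(0.49))
```

The comparison is strict, so a model scoring exactly 0.7 would not be flagged under this reading of the note.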



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 146 Family Scaffolds
PF07460 | NUMOD3 | 5.48
PF13639 | zf-RING_2 | 0.68
PF13692 | Glyco_trans_1_4 | 0.68
PF01972 | SDH_sah | 0.68
PF00145 | DNA_methylase | 0.68
PF07043 | DUF1328 | 0.68
PF04480 | DUF559 | 0.68
PF01145 | Band_7 | 0.68
PF13392 | HNH_3 | 0.68
PF10102 | DUF2341 | 0.68
PF00583 | Acetyltransf_1 | 0.68
PF13489 | Methyltransf_23 | 0.68
PF12323 | HTH_OrfB_IS605 | 0.68
PF01844 | HNH | 0.68
PF16203 | ERCC3_RAD25_C | 0.68
PF02518 | HATPase_c | 0.68

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 146 Family Scaffolds
COG0616 | Periplasmic serine protease, ClpP class | Posttranslational modification, protein turnover, chaperones [O] | 1.37
COG0270 | DNA-cytosine methylase | Replication, recombination and repair [L] | 0.68
COG5487 | Uncharacterized membrane protein YtjA, UPF0391 family | Function unknown [S] | 0.68



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 82.19 %
All Organisms | root | All Organisms | 17.81 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300001533|MLSed_10188748 | Not Available | 824
3300001533|MLSed_10228566 | All Organisms → cellular organisms → Archaea → DPANN group → Nanoarchaeota → unclassified Nanoarchaeota → Nanoarchaeota archaeon | 709
3300003852|Ga0031655_10094311 | Not Available | 1214
3300004239|Ga0066650_10533710 | Not Available | 536
3300009091|Ga0102851_13105311 | Not Available | 533
3300009149|Ga0114918_10031655 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Nealsonbacteria → Candidatus Nealsonbacteria bacterium RBG_13_42_11 | 3766
3300009149|Ga0114918_10055871 | Not Available | 2622
3300009149|Ga0114918_10078710 | Not Available | 2104
3300009149|Ga0114918_10091558 | Not Available | 1909
3300009149|Ga0114918_10094976 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon GW2011_AR1 | 1866
3300009149|Ga0114918_10139282 | Not Available | 1459
3300009149|Ga0114918_10162943 | Not Available | 1321
3300009149|Ga0114918_10197898 | Not Available | 1169
3300009149|Ga0114918_10202718 | Not Available | 1151
3300009149|Ga0114918_10279738 | All Organisms → cellular organisms → Bacteria | 939
3300009149|Ga0114918_10486274 | Not Available | 662
3300009149|Ga0114918_10702148 | Not Available | 530
3300009169|Ga0105097_10019933 | All Organisms → cellular organisms → Archaea → DPANN group → Nanoarchaeota → unclassified Nanoarchaeota → Nanoarchaeota archaeon | 3532
3300009171|Ga0105101_10656054 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon | 523
3300009285|Ga0103680_10154997 | Not Available | 1236
3300009529|Ga0114919_10012363 | Not Available | 6628
3300009529|Ga0114919_10018344 | Not Available | 5349
3300009529|Ga0114919_10024146 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Ignavibacteriae → Ignavibacteria → unclassified Ignavibacteria → Ignavibacteria bacterium | 4606
3300009529|Ga0114919_10024806 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 4543
3300009529|Ga0114919_10042410 | Not Available | 3391
3300009529|Ga0114919_10112420 | Not Available | 1974
3300009529|Ga0114919_10147789 | Not Available | 1692
3300009529|Ga0114919_10201895 | Not Available | 1417
3300009529|Ga0114919_10253688 | Not Available | 1243
3300009666|Ga0116182_1009517 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 7502
(restricted) 3300013127|Ga0172365_10218813 | Not Available | 1158
(restricted) 3300013127|Ga0172365_10559162 | Not Available | 657
(restricted) 3300013128|Ga0172366_10037490 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Woesearchaeota → Candidatus Woesearchaeota archaeon | 3479
(restricted) 3300013128|Ga0172366_10585010 | Not Available | 663
(restricted) 3300013128|Ga0172366_10588314 | Not Available | 660
(restricted) 3300013138|Ga0172371_10124666 | Not Available | 2344
3300014204|Ga0172381_10027440 | Not Available | 5093
3300014204|Ga0172381_10238307 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon BMS3Abin17 | 1458
3300014205|Ga0172380_10594039 | Not Available | 810
3300014613|Ga0180008_1210492 | Not Available | 746
3300014839|Ga0182027_10208736 | Not Available | 2260
3300019217|Ga0179946_1073245 | Not Available | 739
3300020171|Ga0180732_1056731 | Not Available | 1276
(restricted) 3300023109|Ga0233432_10441740 | Not Available | 558
3300024262|Ga0210003_1024446 | Not Available | 3550
3300024262|Ga0210003_1062265 | All Organisms → Viruses → Predicted Viral | 1835
3300024262|Ga0210003_1080328 | Not Available | 1538
3300024262|Ga0210003_1090030 | Not Available | 1420
3300024262|Ga0210003_1097298 | Not Available | 1346
3300024262|Ga0210003_1171222 | Not Available | 913
3300024262|Ga0210003_1243416 | Not Available | 714
3300024433|Ga0209986_10023355 | All Organisms → cellular organisms → Archaea | 4113
3300024433|Ga0209986_10036265 | Not Available | 3079
3300024433|Ga0209986_10097443 | Not Available | 1605
3300024433|Ga0209986_10132743 | Not Available | 1310
3300024433|Ga0209986_10156269 | Not Available | 1175
3300024433|Ga0209986_10322761 | Not Available | 724
3300025164|Ga0209521_10292103 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → unclassified Actinobacteria → Actinobacteria bacterium RBG_13_35_12 | 931
3300027721|Ga0209492_1008340 | Not Available | 3429
(restricted) 3300027856|Ga0255054_10496287 | Not Available | 592
3300027885|Ga0209450_10222498 | All Organisms → cellular organisms → Bacteria | 1342
3300027896|Ga0209777_10002127 | Not Available | 26786
3300027896|Ga0209777_10036626 | Not Available | 4584
3300027896|Ga0209777_10065428 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon | 3215
3300027896|Ga0209777_10067833 | Not Available | 3143
3300027896|Ga0209777_10100775 | Not Available | 2462
3300027896|Ga0209777_10272991 | Not Available | 1320
3300027896|Ga0209777_10694141 | Not Available | 725
3300027896|Ga0209777_10955288 | Not Available | 590
(restricted) 3300028553|Ga0247839_1158256 | Not Available | 975
(restricted) 3300029268|Ga0247842_10012605 | Not Available | 7997
(restricted) 3300029268|Ga0247842_10097795 | Not Available | 1803
3300031227|Ga0307928_10144689 | Not Available | 1330
3300031539|Ga0307380_10176939 | All Organisms → Viruses → Predicted Viral | 2087
3300031539|Ga0307380_10459014 | Not Available | 1132
3300031539|Ga0307380_10556220 | Not Available | 997
3300031539|Ga0307380_10633452 | Not Available | 914
3300031539|Ga0307380_10704434 | Not Available | 850
3300031539|Ga0307380_10794660 | Not Available | 783
3300031539|Ga0307380_11118411 | Not Available | 618
3300031565|Ga0307379_10414151 | Not Available | 1287
3300031565|Ga0307379_10844803 | Not Available | 800
3300031565|Ga0307379_11111311 | Not Available | 663
3300031565|Ga0307379_11442853 | Not Available | 552
3300031565|Ga0307379_11462254 | Not Available | 547
3300031578|Ga0307376_10151979 | Not Available | 1601
3300031578|Ga0307376_10177579 | Not Available | 1463
3300031578|Ga0307376_10894350 | Not Available | 542
3300031578|Ga0307376_10916646 | Not Available | 533
3300031673|Ga0307377_10925017 | Not Available | 592
3300031673|Ga0307377_10948192 | Not Available | 582
3300031707|Ga0315291_10049706 | Not Available | 4737
3300031707|Ga0315291_10133401 | Not Available | 2626
3300031707|Ga0315291_10156096 | All Organisms → cellular organisms → Archaea | 2381
3300031707|Ga0315291_10543487 | Not Available | 1067
3300031707|Ga0315291_10608198 | Not Available | 990
3300031746|Ga0315293_10000271 | Not Available | 46400
3300031746|Ga0315293_10268852 | Not Available | 1378
3300031772|Ga0315288_10832200 | Not Available | 847
3300031772|Ga0315288_11220238 | Not Available | 646
3300031873|Ga0315297_10000303 | Not Available | 28173
3300031873|Ga0315297_10406325 | Not Available | 1142
3300031885|Ga0315285_10090074 | All Organisms → cellular organisms → Bacteria → Spirochaetes → unclassified Spirochaetota → Spirochaetota bacterium | 2710
3300031885|Ga0315285_10586976 | All Organisms → cellular organisms → Archaea | 742
3300031885|Ga0315285_10865466 | Not Available | 558
3300031952|Ga0315294_10119551 | Not Available | 2708
3300031952|Ga0315294_10188149 | Not Available | 2063
3300031952|Ga0315294_10452059 | Not Available | 1188
3300031952|Ga0315294_10526559 | Not Available | 1075
3300031952|Ga0315294_10572100 | Not Available | 1017
3300031952|Ga0315294_11518609 | Not Available | 523
3300031999|Ga0315274_10001110 | Not Available | 41674
3300031999|Ga0315274_10079755 | All Organisms → cellular organisms → Archaea | 4308
3300031999|Ga0315274_10157921 | All Organisms → cellular organisms → Archaea | 2868
3300031999|Ga0315274_10212518 | Not Available | 2384
3300031999|Ga0315274_10287798 | Not Available | 1971
3300031999|Ga0315274_10449586 | Not Available | 1476
3300031999|Ga0315274_10523259 | Not Available | 1334
3300031999|Ga0315274_10525757 | All Organisms → Viruses → Predicted Viral | 1330
3300031999|Ga0315274_10929936 | Not Available | 900
3300032046|Ga0315289_10442125 | Not Available | 1274
3300032046|Ga0315289_10513131 | Not Available | 1147
3300032046|Ga0315289_11167679 | Not Available | 625
3300032046|Ga0315289_11199488 | Not Available | 612
3300032046|Ga0315289_11319623 | Not Available | 568
3300032046|Ga0315289_11557257 | Not Available | 500
3300032053|Ga0315284_10013653 | All Organisms → cellular organisms → Archaea | 11574
3300032053|Ga0315284_10659897 | Not Available | 1236
3300032053|Ga0315284_11025997 | Not Available | 926
3300032053|Ga0315284_11313290 | Not Available | 784
3300032069|Ga0315282_10359561 | Not Available | 904
3300032118|Ga0315277_11329736 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Woesearchaeota → Candidatus Woesearchaeota archaeon | 628
3300032156|Ga0315295_10976215 | Not Available | 842
3300032173|Ga0315268_10734926 | Not Available | 985
3300032401|Ga0315275_10039936 | Not Available | 4978
3300032516|Ga0315273_10219515 | Not Available | 2595
3300032516|Ga0315273_10258959 | Not Available | 2369
3300032516|Ga0315273_10307626 | All Organisms → Viruses → Predicted Viral | 2151
3300032516|Ga0315273_10563837 | Not Available | 1516
3300032516|Ga0315273_11051039 | Not Available | 1038
3300033233|Ga0334722_11246698 | Not Available | 520
3300033433|Ga0326726_11286281 | Not Available | 712
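
Each scaffold identifier in the table above joins a sample Taxon OID and a scaffold name with `|`, optionally preceded by a `(restricted)` marker. A minimal Python sketch of splitting such an identifier follows; the names `ScaffoldRef` and `parse_scaffold_id` are hypothetical helpers for illustration and are not part of NMPFamsDB or IMG/M:

```python
from typing import NamedTuple

RESTRICTED_MARK = "(restricted)"

class ScaffoldRef(NamedTuple):
    taxon_oid: str    # sample identifier, e.g. "3300001533"
    scaffold: str     # scaffold name, e.g. "MLSed_10188748"
    restricted: bool  # True when the entry carries the "(restricted)" marker

def parse_scaffold_id(text: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' and record the restricted marker."""
    text = text.strip()
    restricted = text.startswith(RESTRICTED_MARK)
    if restricted:
        text = text[len(RESTRICTED_MARK):].strip()
    taxon_oid, scaffold = text.split("|", 1)
    return ScaffoldRef(taxon_oid, scaffold, restricted)

print(parse_scaffold_id("3300001533|MLSed_10188748"))
print(parse_scaffold_id("(restricted) 3300013127|Ga0172365_10218813"))
```

Keeping the restricted flag alongside the identifier makes it easy to filter out entries that fall under the JGI data usage policy before any downstream use.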

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 39.73%
Deep Subsurface | Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface | 23.29%
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 12.33%
Freshwater Lake Sediment | Environmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Lake Sediment | 6.85%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 2.74%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 2.05%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater | 2.05%
Landfill Leachate | Engineered → Solid Waste → Landfill → Unclassified → Unclassified → Landfill Leachate | 2.05%
Benthic | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Benthic | 1.37%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 1.37%
Anaerobic Digestor Sludge | Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge | 1.37%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.68%
Groundwater | Environmental → Aquatic → Freshwater → Drinking Water → Chlorinated → Groundwater | 0.68%
Saline Water | Environmental → Aquatic → Non-Marine Saline And Alkaline → Saline → Unclassified → Saline Water | 0.68%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.68%
Fen | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Fen | 0.68%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.68%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.68%
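
The habitat percentages above can be turned back into member counts. The Python sketch below assumes, without confirmation from the page, that each percentage is a share of the family's 146 sequences; under that assumption the rounded counts should add up to 146, which serves as a consistency check:

```python
FAMILY_SIZE = 146  # "Number of Sequences" from the Basic Information block

# Distribution column of the habitat table, in row order (percent of family members).
habitat_percent = [
    39.73, 23.29, 12.33, 6.85, 2.74,
    2.05, 2.05, 2.05,
    1.37, 1.37, 1.37,
    0.68, 0.68, 0.68, 0.68, 0.68, 0.68, 0.68,
]

# Assumption: every percentage uses FAMILY_SIZE as its denominator.
counts = [round(p * FAMILY_SIZE / 100) for p in habitat_percent]

print(counts)
print(sum(counts) == FAMILY_SIZE)
```

The same denominator should not be assumed for every figure on the page: the RBS motif percentage in the Quality Assessment block (64.79 %) does not correspond to a whole number of genes out of 146.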




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300001533 | Benthic freshwater microbial communities from British Columbia, Canada | Environmental
3300002223 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_plank lowO2_1.2 | Environmental
3300003852 | Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies -HBP12 HB | Environmental
3300004239 | Groundwater microbial communities from aquifer - Crystal Geyser CG23_combo_of_CG06-09_8/20/14_all | Environmental
3300009091 | Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 3 metaG (Illumina Assembly) | Environmental
3300009149 | Deep subsurface microbial communities from Baltic Sea to uncover new lineages of life (NeLLi) - Landsort_02402 metaG | Environmental
3300009169 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm May2015 | Environmental
3300009171 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm May2015 | Environmental
3300009285 | Microbial communities from groundwater in Rifle, Colorado, USA - 2A_0.1um | Environmental
3300009529 | Deep subsurface microbial communities from Black Sea to uncover new lineages of life (NeLLi) - Black_00105 metaG | Environmental
3300009666 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC077_MetaG | Engineered
3300013127 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 48cm | Environmental
3300013128 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 69cm | Environmental
3300013130 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment s2_kivu2a2 | Environmental
3300013138 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_12m | Environmental
3300014204 | Leachate microbial communities from a municipal landfill in Southern Ontario, Canada - Leachate well 64-88 metaG | Engineered
3300014205 | Leachate microbial communities from a municipal landfill in Southern Ontario, Canada - Leachate well 162 metaG | Engineered
3300014613 | Groundwater microbial communities from the Aspo Hard Rock Laboratory (HRL) deep subsurface site, Sweden - MM_PW_MetaG | Environmental
3300014839 | Permafrost microbial communities from Stordalen Mire, Sweden - 712E1D metaG (Illumina Assembly) | Environmental
3300019217 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from Japan - AD_JPNNA4_MetaT (Metagenome Metatranscriptome) | Engineered
3300020171 | Groundwater microbial communities from the Olkiluoto Island deep subsurface site, Finland - KR11_0.1 MetaG | Environmental
3300023109 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_122_August2016_10_MG | Environmental
3300024262 | Deep subsurface microbial communities from Baltic Sea to uncover new lineages of life (NeLLi) - Landsort_02402 metaG (SPAdes) | Environmental
3300024433 | Deep subsurface microbial communities from Black Sea to uncover new lineages of life (NeLLi) - Black_00105 metaG (SPAdes) | Environmental
3300025164 | Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 4 | Environmental
3300027721 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm May2015 (SPAdes) | Environmental
3300027856 (restricted) | Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_23 | Environmental
3300027885 | Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies - LWP11 LW (SPAdes) | Environmental
3300027896 | Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies -HBP12 HB (SPAdes) | Environmental
3300028553 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_16m | Environmental
3300029268 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_19m | Environmental
3300031227 | Saline water microbial communities from Ace Lake, Antarctica - #232 | Environmental
3300031539 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-3 | Environmental
3300031565 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-2 | Environmental
3300031578 | Soil microbial communities from Risofladan, Vaasa, Finland - TR-2 | Environmental
3300031673 | Soil microbial communities from Risofladan, Vaasa, Finland - TR-3 | Environmental
3300031707 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G12_20 | Environmental
3300031746 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G13_20 | Environmental
3300031772 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_20 | Environmental
3300031873 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G15_0 | Environmental
3300031885 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_36 | Environmental
3300031952 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G13_40 | Environmental
3300031999 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_20 | Environmental
3300032046 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_40 | Environmental
3300032053 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_16 | Environmental
3300032069 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G07_20 | Environmental
3300032118 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G05_15 | Environmental
3300032156 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G14_0 | Environmental
3300032173 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C1_top | Environmental
3300032401 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G03_0 | Environmental
3300032516 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_0 | Environmental
3300033233 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottom | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
MLSed_101887481 | 3300001533 | Benthic | MANKWLVHLAKVRKANPKIKDVAKLAKLAKKSYRK*
MLSed_102285661 | 3300001533 | Benthic | MVCKNPWLKHLAKVRKENPKIKNVAALAKLARKSYRKGGK*
C687J26845_101406644 | 3300002223 | Soil | MAGNPWLVHVAKVRKANPKLKFKAILVMAKKSYKKKSYKK*
Ga0031655_100943113 | 3300003852 | Freshwater Lake Sediment | MAKTINKWLQHLAKVRKANPKIKDVAKLAKLAKKTYKPAK*
Ga0066650_105337102 | 3300004239 | Groundwater | MTKQNSWLVHLAEVRKENPKVKDVGKLAKLAKKTYKPKK*
Ga0102851_131053112 | 3300009091 | Freshwater Wetlands | MTKNKWLEHLAEVRKENPKVKDVGKLAKLAKKSYKPNKK*
Ga0114918_100316553 | 3300009149 | Deep Subsurface | MAKKNPWMIHLAKVRKANPKIKDVGKIAKLAKKSYKK*
Ga0114918_100558715 | 3300009149 | Deep Subsurface | MVKTNPWLVHLAKCRKQNPKIKDIGLIAKLAKKTYKPTKK*
Ga0114918_100787105 | 3300009149 | Deep Subsurface | MKTKNKWLIHLAKVRKANPKVKKVSLIAKLAKKSYKK*
Ga0114918_100915583 | 3300009149 | Deep Subsurface | MVKTNPWLVHLAKCRKENPKIKNVSLIAKLAKKTYKPVKK*
Ga0114918_100949762 | 3300009149 | Deep Subsurface | MKVTTKKKCNPWLVHLKKVRKANPKVTDVGKLAKMAKKTYKPMK*
Ga0114918_101392821 | 3300009149 | Deep Subsurface | MAKSKNPWILHLSATRKANPKIKEFAKLAKLAKSTYKPKK*
Ga0114918_101629433 | 3300009149 | Deep Subsurface | MAKQNKWLVHLAKVRKENPKVKHVGKLAKLAKKTYKPCHN*
Ga0114918_101978983 | 3300009149 | Deep Subsurface | MTNKNPWLEHLAKVRKANPKEKNVGKLAKLAKSTYKKK*
Ga0114918_102027182 | 3300009149 | Deep Subsurface | MKTKNPWMIHLAKVRMANPKIKDVGALAKLAKKTYKPIKK*
Ga0114918_102797382 | 3300009149 | Deep Subsurface | MKTKNKWLIHLAKVRKANPKIKDVTLIAKLAKKSYTK*
Ga0114918_104862742 | 3300009149 | Deep Subsurface | MSSKTNPWLLHLSKVRKENPKIRDVGKIAMMAKKTYAPKK*
Ga0114918_107021481 | 3300009149 | Deep Subsurface | NYKEVKMKSKNPWIVHLAKIRKENPKIKDVGKLAKLAKQSYKPKK*
Ga0105097_100199337 | 3300009169 | Freshwater Sediment | MVKKHNKWLDHLAKVRKAHPKIKDVAELAKLAKKSYK*
Ga0105101_106560542 | 3300009171 | Freshwater Sediment | MVNAWMKHLAQVRKDNPKIKDVVKISKIAKATYKKGK*
Ga0103680_101549974 | 3300009285 | Groundwater | MNKWMIHLAKVRRANPKIKNVVELARLAKKTYRA*
Ga0114919_100123638 | 3300009529 | Deep Subsurface | MANKWIVHMAKVRKANPKIKDFKALANLAKKTYKK*
Ga0114919_1001834413 | 3300009529 | Deep Subsurface | MKSKNPWIAHLAKVRKENPKIKDVAKLAKLAKKSYSPKK*
Ga0114919_1002414611 | 3300009529 | Deep Subsurface | MKSKNPWMIHLAKVRKANPKVKNVGELAKLAKKSYKK*
Ga0114919_100248066 | 3300009529 | Deep Subsurface | MKTNKWLIHLAQFRKANPKMPVTDIMKNAKKTYKPKGGK*
Ga0114919_100424103 | 3300009529 | Deep Subsurface | MKSKNPWIVHLAKVRKENPKIKDVAALAKLAKKSYKPIK*
Ga0114919_101124203 | 3300009529 | Deep Subsurface | MVKQNPWLVHLAKCRKENPKIKDVGLIAKLAKKTYKPAKKC*
Ga0114919_101477892 | 3300009529 | Deep Subsurface | MKTKNPWMIHLAKVRIANPKIKDVGALAKLAKKTYKPIKK*
Ga0114919_102018953 | 3300009529 | Deep Subsurface | MATKKKNPWMVHLAKVRKDNPKIKDVVKLSKMAKKTYKCIK*
Ga0114919_102536883 | 3300009529 | Deep Subsurface | MKTQNKWLLHLAKVRKENPKIKDVSQIAKLAKKTYKGGK*
Ga0116182_10095173 | 3300009666 | Anaerobic Digestor Sludge | MKSKNPWLEHLAKTRKLNPKVKDVKKLAKLAKATYKKK*
(restricted) Ga0172365_102188134 | 3300013127 | Sediment | MKNKWMEHLAKTRKENPKIKDVAKIAKLAKASYKPIKK*
(restricted) Ga0172365_105591622 | 3300013127 | Sediment | MKTKNPWMVHLAAVRKANPKIKDVAKIAKLAKSTYKPKK*
(restricted) Ga0172366_100374904 | 3300013128 | Sediment | MVKKVNSWIKHLAKVRKENPKIKDVKALAKIAKSTYKIK*
(restricted) Ga0172366_105850103 | 3300013128 | Sediment | MKTKNPWLVHLAKIRKENPKIKDVVKLSAIAKKSYKPIK*
(restricted) Ga0172366_105883142 | 3300013128 | Sediment | MNAWIKHLLKVRKENPKIKDVKQLAKLAKKTYKK*
(restricted) Ga0172363_104835691 | 3300013130 | Sediment | MVTKNPWLTHLAKVRKQNPKIKDFKKLAVLAKKSYK*
(restricted) Ga0172371_101246663 | 3300013138 | Freshwater | MTQNKWLQHLAEVRKENPKVKDVSKLAKLAKKTYKPNKK*
Ga0172381_100274405 | 3300014204 | Landfill Leachate | MKKVVTKNPWLVHLSKVRKENPKIKDVKTLAKIAKKTYKK*
Ga0172381_102383072 | 3300014204 | Landfill Leachate | MTKNKNPWLEHLAVVRKENPKIKDVSKISAIAKKSYKK*
Ga0172380_105940394 | 3300014205 | Landfill Leachate | MPSSIIMTKNKNPWLEHLAVVRKENPKIKDVSKISAIAKKSYKK*
Ga0180008_12104923 | 3300014613 | Groundwater | MAKVNPWFAHLAKTRKANPKVKDVAKIARLAKKTYKK*
Ga0182027_102087366 | 3300014839 | Fen | MAKSNNPWINHLSKVRKDNPKVKDVKKLSQIAKKSYKPVKK*
Ga0179946_10732451 | 3300019217 | Anaerobic Digestor Sludge | KNPWLEHLAKTRKLNPKVKDVKKLAKLAKATYKKK
Ga0180732_10567313 | 3300020171 | Groundwater | MTKNKWLVHLAKVRKENPKIKDVAKLAKLAKKTYKK
(restricted) Ga0233432_104417401 | 3300023109 | Seawater | MNPWFKHLAEVRRANPNIKDVGEIAKLAKKTYQPSG
Ga0210003_10244467 | 3300024262 | Deep Subsurface | MVKKNPWLVHLAKCRKENPKIKNVSLIAKLAKKTYKPVKK
Ga0210003_10622653 | 3300024262 | Deep Subsurface | MTNKNPWLEHLAKVRKANPKEKNVGKLAKLAKSTYKKK
Ga0210003_10803285 | 3300024262 | Deep Subsurface | MKTKNKWLIHLAKVRKANPKVKKVSLIAKLAKKSYKK
Ga0210003_10900301 | 3300024262 | Deep Subsurface | TKEVNKYMAKSKNPWILHLSATRKANPKIKNFAKLAKLAKSTYKPKK
Ga0210003_10972982 | 3300024262 | Deep Subsurface | MKTKTTKVRKVNPWLKHLAKVRKANPKVKNVGDLCKLAKKSYKK
Ga0210003_11712222 | 3300024262 | Deep Subsurface | MAKQNKWLVHLAKVRKENPKVKHVGKLAKLAKKTYKPCHN
Ga0210003_12434162 | 3300024262 | Deep Subsurface | MKTKNKWLIHLAKVRKANPKIKDVTLIAKLAKKSYTK
Ga0209986_1002335510 | 3300024433 | Deep Subsurface | MATKKKNPWMVHLAKVRKDNPKIKDVVKLSKMAKKTYKCIK
Ga0209986_100362656 | 3300024433 | Deep Subsurface | MKSKNPWMIHLAKVRKANPKVKNVGELAKLAKKSYKK
Ga0209986_100974433 | 3300024433 | Deep Subsurface | MKTNKWLIHLAQFRKANPKMPVTDIMKNAKKTYKPKGGK
Ga0209986_101327433 | 3300024433 | Deep Subsurface | MKSKNPWIAHLAKVRKENPKIKDVAKLAKLAKKSYSPKK
Ga0209986_101562694 | 3300024433 | Deep Subsurface | MKTKNPWMIHLAKVRIANPKIKDVGALAKLAKKTYKPIKK
Ga0209986_103227613 | 3300024433 | Deep Subsurface | MVKQNPWLVHLAKCRKENPKIKDVGLIAKLAKKTYKPAKKC
Ga0209521_102921032 | 3300025164 | Soil | MANKWMQHLAKVRKANPKIKDVGAMSKLAKKTYKK
Ga0209492_10083408 | 3300027721 | Freshwater Sediment | MVKKHNKWLDHLAKVRKAHPKIKDVAELAKLAKKSYK
(restricted) Ga0255054_104962872 | 3300027856 | Seawater | MKTKNPWMIHLAKVRMANPKIKDVGALAKLAKKTYCPVKK
Ga0209450_102224982 | 3300027885 | Freshwater Lake Sediment | MAKSKNPWILHLSATRKANPKIKEFAKLAKLAKSTYKPKK
Ga0209777_1000212719 | 3300027896 | Freshwater Lake Sediment | MAKTINKWLQHLAKVRKANPKIKDVAKLAKLAKKTYKPAK
Ga0209777_100366264 | 3300027896 | Freshwater Lake Sediment | MKSNSWLIHLAKVRKANPEIKNVAQLARLAKKTYK
Ga0209777_100654281 | 3300027896 | Freshwater Lake Sediment | MKTKNAWLVHLAKVRKANPKIKDVGALAKLAKSTYKKK
Ga0209777_100678334 | 3300027896 | Freshwater Lake Sediment | MANKWLQHLAKVRKENPKIKDVSKLAKLAKKTYVPIKK
Ga0209777_101007756 | 3300027896 | Freshwater Lake Sediment | MKTKNPWMVHLAKVRKANPKIKDVAALAKLAKKTYKPGK
Ga0209777_102729915 | 3300027896 | Freshwater Lake Sediment | MANKWLVHLAKVRKANPKIKDVAALAKLAKKTYKPVK
Ga0209777_106941413 | 3300027896 | Freshwater Lake Sediment | MVKKKNPWMEHLAKVRKANPKIKDVKLIAKLAKKSYK
Ga0209777_109552881 | 3300027896 | Freshwater Lake Sediment | MAQKNPWMVHLAKIRKENPKVKNVGALAKLAKKTYKK
(restricted) Ga0247839_11582561 | 3300028553 | Freshwater | LWRLNMVNKWIEHLAKVRKANPTVKDVKKLAKMAKETYKK
(restricted) Ga0247842_100126059 | 3300029268 | Freshwater | MVNKWIEHLAKVRKANPTVKDVKKLAKMAKETYKK
(restricted) Ga0247842_100977954 | 3300029268 | Freshwater | MAKNPWLIHLAKVRKDNPKIKDVAALAKLAKKSYKK
Ga0307928_101446892 | 3300031227 | Saline Water | MANKWFIHLAKVRKANPKIKDIGTLAKLAKKTYKK
Ga0307380_101769393 | 3300031539 | Soil | MVKQKQLVKRVNPWMVHLAKVRKANPTIKDITLLAKLAKKSYKK
Ga0307380_104590141 | 3300031539 | Soil | MKSKNPWMVHLAKVRKANPKIKNVGELAKLAKKSYKK
Ga0307380_105562203 | 3300031539 | Soil | MKEKKTNPWLTHMAKTRKENPKVKDVGKLAKLAKATYKPKK
Ga0307380_106334523 | 3300031539 | Soil | MTANKWILHLSKVRKANPKIKDFAKLAKLAKSTYKPIK
Ga0307380_107044341 | 3300031539 | Soil | MATKNNWLVHMAKVRKENPKIKNVSILAKLAKKTYKPKTCKP
Ga0307380_107946602 | 3300031539 | Soil | MKSKNPWMIHLAKVRMDNPKIKDVGALAKLAKKTYKPIKK
Ga0307380_111184112 | 3300031539 | Soil | MKTKNKWLIHLAKVRKANPKVKNVGVLAKLAKKSYKK
Ga0307379_104141512 | 3300031565 | Soil | MKTKNKWLVHLAQVRKANPKIKDVGALAKLAKKSYSKK
Ga0307379_108448033 | 3300031565 | Soil | MVKQKQLVKRVNPWMVHLAKVRKANPNIKDITLLAKLAKKSYKK
Ga0307379_111113111 | 3300031565 | Soil | MSNKNPWLEHLAKVRKANPKEKNVGKLAKLAKSTYKKK
Ga0307379_114428532 | 3300031565 | Soil | MKENKWLVHLAKVRKANPKIKDIGKLAKIAKKTYKPIK
Ga0307379_114622541 | 3300031565 | Soil | MEMKKKNNPWLVHLAKVRKANPKVKDVGKLAKLAKMTYKPKK
Ga0307376_101519792 | 3300031578 | Soil | MKTKNPWMIHLAKVRKDNPKIKDITLLAKLAKKSYKPNK
Ga0307376_101775792 | 3300031578 | Soil | MKENKWLVHLAKVRKANPKIKDIGKLAKIARKTYKPIK
Ga0307376_108943503 | 3300031578 | Soil | MVKQKQLVKRVNPWMVHLAKVRKANPTIKDITLLAKLAKKTYKK
Ga0307376_109166462 | 3300031578 | Soil | MKTKNPWMIHLAKVRKDNPKIKDITLLAKMAKKSYKK
Ga0307377_109250172 | 3300031673 | Soil | MANKWLIHLAKVRKANPKIKDIGKMAKLAKKTYKPCK
Ga0307377_109481922 | 3300031673 | Soil | MVNKWLVHLAKVRKANPKVKDVAKLAKLAKKTYKK
Ga0315291_100497069 | 3300031707 | Sediment | MANKWLLHLAKVRKANPKIKDVAKIAKLAKKSYKK
Ga0315291_101334014 | 3300031707 | Sediment | MVVKKTNPWMKHLAQVRKANPKVKDVGKLAKLAKASYKKK
Ga0315291_101560962 | 3300031707 | Sediment | MGSSNKWLIHLAKVRKANPQIKDVVKLSKIAKKSYKK
Ga0315291_105434872 | 3300031707 | Sediment | MVKKKNPWLVHLAKCRKDNPKVKDVKQLAKLARKSYQAGK
Ga0315291_106081981 | 3300031707 | Sediment | MAKQNKWMVHLAKVRKENPKIKNVAMIAKIAKKSYLK
Ga0315293_1000027185 | 3300031746 | Sediment | MVVKNKWLEHLAKTRKANPKIKDVGKLAKLAKASYQK
Ga0315293_102688522 | 3300031746 | Sediment | MANKWIQHLAQVRKENPKIKNVKEIAKIAKASYSKQVKK
Ga0315288_106194762 | 3300031772 | Sediment | MAKNPWLAHMAWVRKQNPKIKDFKKIALIAKKSYKIKVK
Ga0315288_108322002 | 3300031772 | Sediment | MVVKKNPWMVHLAKVRKENPKIKDVAKLAKLAKRSYHK
Ga0315288_112202382 | 3300031772 | Sediment | MKSNPWLIHLAKTRKANPKIKSVVELSKIAKKTYKK
Ga0315297_1000030316 | 3300031873 | Sediment | MTKTNAWLVHLAKVRKANPKLSVVEIAKKASKTYHKK
Ga0315297_104063254 | 3300031873 | Sediment | MAKNKWLVHLAAVRKANPKIKDVGALAKLAKKTYKPIKK
Ga0315285_100900744 | 3300031885 | Sediment | MAKSKNPWMIHLAAVRKANPKVKDVSALAKLAKKTYKPIK
Ga0315285_105869761 | 3300031885 | Sediment | MAKQNKWLIHLAKVRKENPKIKDVGKLAKLAKSSYKK
Ga0315285_108654664 | 3300031885 | Sediment | MVKKINPWMVHLAKIRKANPKIKNVGELAKLAKRSYK
Ga0315294_101195512 | 3300031952 | Sediment | MAKSTNPWILHLKKIRAANPKIKDVAKLAKLAKATYKPKK
Ga0315294_101881492 | 3300031952 | Sediment | MAKKINPWMVHLSKVRKANPGMKVGKLAKLAKSTYKKK
Ga0315294_104520591 | 3300031952 | Sediment | MKQNKWLQHLAKCRRENPKIKDVAKIAKLAKKTYKK
Ga0315294_105265594 | 3300031952 | Sediment | MVKQSVKNPWLVHLAKVRKANPKIKDFAKLARLAKKSYK
Ga0315294_105721002 | 3300031952 | Sediment | MKPNNWLIHLAKIRKQNPKLKISALAKLAKSSYKK
Ga0315294_115186091 | 3300031952 | Sediment | MKTQNKWLLHLAKIRRENPKIKDVAQIAKLAKKSYSPKK
Ga0315294_115427401 | 3300031952 | Sediment | MTNQNPWIKHLSVVRKQNPTIKDVKKLAKIAKASYKPKK
Ga0315274_1000111020 | 3300031999 | Sediment | MVQNKWLLHLKKIRKENPKIKDFAKLAKLAKASYQK
Ga0315274_100797553 | 3300031999 | Sediment | MANAWLVHLAKVRKANPKVKDVAALAKLAKKSYKSKK
Ga0315274_101579211 | 3300031999 | Sediment | MGSNNPWITHLAKIRKANPKVKDIVKLSKIAKASYKPAIKKK
Ga0315274_102125187 | 3300031999 | Sediment | MVKKTNPWLVHLAKCRKDNPKIKDVGKLAKLAKKTYKVK
Ga0315274_102877981 | 3300031999 | Sediment | MAKKINPWMVHLAKVRKANPKIKNVSALAKLAKKTYK
Ga0315274_104495865 | 3300031999 | Sediment | MAKQNKWMVHLAKVRKENPKIKNVVELSKIAKKSYLK
Ga0315274_105232592 | 3300031999 | Sediment | MASTNPWIMHLSKTRAANPKIKDVKILAKMAKATYVGVKKK
Ga0315274_105257573 | 3300031999 | Sediment | MGNNAWLNHLAKTRKAHPQVKDVAKLAKLAKKTYKK
Ga0315274_109299362 | 3300031999 | Sediment | MKTKFLTKNPWLVHLAKCRKAHPEIKDVAKMAKLAKKTYKK
Ga0315289_104421253 | 3300032046 | Sediment | MTKNKWLLHLADVRKAHPKIKDVAKLAKLAKSTYKK
Ga0315289_105131312 | 3300032046 | Sediment | MVKQNKWLVHLAKTRKENPKVKDVVKLSKIAKKTYKCAK
Ga0315289_111676792 | 3300032046 | Sediment | MANAWLVHLAKTRKANPKVKDIVKLAKIAKASYKK
Ga0315289_111994883 | 3300032046 | Sediment | MAKSKNPWIVHLSKIRAQNPKIKDVKTLAKLAKKSYKPKK
Ga0315289_113196232 | 3300032046 | Sediment | MAKLNKWLIHLAKVRKANPKIKDVVKLSKIAKASYKK
Ga0315289_115572573 | 3300032046 | Sediment | MAKSTNPWILHLKKIRAANPKIKDVAKLAKLAKSTYKPIKK
Ga0315284_100136535 | 3300032053 | Sediment | MANKWLVHLAKVRKANPKIKDVAKLAQLAKKSYKPAK
Ga0315284_106598973 | 3300032053 | Sediment | YIIANELNGMAKQNPWLVHLAKCRKANPKVKDVAKLAKIAKKTYTPKK
Ga0315284_110259972 | 3300032053 | Sediment | MAKKTNPWMVHLSKVRKANPKIKDVGKLAKLAKSTYKKK
Ga0315284_113132904 | 3300032053 | Sediment | MIMATKNKWLVHLAQVRKANPKIKDFAALAKIAKKS
Ga0315282_103595613 | 3300032069 | Sediment | MANKWQQHLAKTRKANPKIKDVGKISKLAKKTYKK
Ga0315277_113297362 | 3300032118 | Sediment | MVTKINPWIAHLGKVRKANPTIKDVKILAKMAKKTYLK
Ga0315295_109762152 | 3300032156 | Sediment | MTNPWLVHLAKVRKQHPNIKDVKKISIIAKKTYKK
Ga0315268_107349263 | 3300032173 | Sediment | MAKKKNAWMVHLAATRKANPKLGVAAAAKKAKSSYKPAPKK
Ga0315275_100399363 | 3300032401 | Sediment | MVVKKTNPWMTHLAKVRKANPKIKDVGKLAKLAKKTYKA
Ga0315273_102195155 | 3300032516 | Sediment | MAKQNAWLIHLAKCRKLHPKIKDVGKIAKLAKATYKKK
Ga0315273_102589593 | 3300032516 | Sediment | MANNPWLVHLAKVRKANPKIKDFAKLAKIAKKTYK
Ga0315273_103076263 | 3300032516 | Sediment | MAKQNKWLVHLAKVRKENPKCKDVAKLAKLAKKTYKV
Ga0315273_105638373 | 3300032516 | Sediment | MAKTNPWMVHLAKVRKANPKIKDFAELAKIAKKSYSK
Ga0315273_110510394 | 3300032516 | Sediment | MAKQNKWMIHLAKVRKENPKIKSVALISKIAKKSYLK
Ga0334722_112466983 | 3300033233 | Sediment | MTKNKWLIHLAQTRKENPKIKDVSILAKLAKKSYKK
Ga0326726_112862813 | 3300033433 | Peat Soil | MTQNKWLQHLAETRKENPKVKDVAKIAALAKKTYKKK
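
Some sequences in the table above end with `*`. The page does not say what the character means; the Python sketch below assumes it is a trailing stop-codon marker that should not count as a residue. It strips the marker and measures length for two entries copied from the table. The protein IDs are as inferred from the table rows, and `clean_sequence` is a hypothetical helper:

```python
def clean_sequence(seq: str) -> str:
    """Drop surrounding whitespace and any trailing '*' (assumed stop marker)."""
    return seq.strip().rstrip("*")

# Two family members copied from the table above.
sequences = {
    "MLSed_101887481": "MANKWLVHLAKVRKANPKIKDVAKLAKLAKKSYRK*",
    "Ga0180732_10567313": "MTKNKWLVHLAKVRKENPKIKDVAKLAKLAKKTYKK",
}

lengths = {pid: len(clean_sequence(seq)) for pid, seq in sequences.items()}
mean_length = sum(lengths.values()) / len(lengths)

print(lengths)
print(mean_length)
```

Applied to all family members, the same loop would give an average to compare against the "Average Sequence Length" reported in the Basic Information block.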




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"