NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300024341

3300024341: Freshwater microbial communities from Lake Matano, South Sulawesi, Indonesia - Watercolumn_Matano_2014_107_MG



Overview

Basic Information
IMG/M Taxon OID: 3300024341
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0127566 | Gp0272224 | Ga0233421
Sample Name: Freshwater microbial communities from Lake Matano, South Sulawesi, Indonesia - Watercolumn_Matano_2014_107_MG
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Restricted

Note: The use of this dataset is restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of the sequences below requires obtaining a license from the dataset's corresponding author(s).


Dataset Contents
Total Genome Size: 740493820
Sequencing Scaffolds: 35
Novel Protein Genes: 36
Associated Families: 36

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 7
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
Not Available | 13
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → unclassified Desulfobacterales → Desulfobacterales bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Nomurabacteria → Candidatus Nomurabacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 1
All Organisms → cellular organisms → Bacteria → PVC group | 1
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales | 1
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Methane Metabolizing Microbial Communities From Different Methane-Rich Environments From Various Locations
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater → Methane Metabolizing Microbial Communities From Different Methane-Rich Environments From Various Locations

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater lake biome, freshwater lake, lake water
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Water (non-saline)

Location Information
Location: Indonesia: South Sulawesi
Coordinates: Lat. (°) -2.4667, Long. (°) 121.2833, Alt. (m) N/A, Depth (m) 107


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000748 | Metagenome / Metatranscriptome | 908 | Y
F000769 | Metagenome / Metatranscriptome | 898 | Y
F001057 | Metagenome / Metatranscriptome | 791 | Y
F002525 | Metagenome / Metatranscriptome | 552 | Y
F002824 | Metagenome | 527 | Y
F009271 | Metagenome / Metatranscriptome | 320 | Y
F016728 | Metagenome / Metatranscriptome | 245 | Y
F017088 | Metagenome | 242 | Y
F019160 | Metagenome | 231 | Y
F020376 | Metagenome | 224 | Y
F022232 | Metagenome / Metatranscriptome | 215 | Y
F023511 | Metagenome | 209 | Y
F024750 | Metagenome / Metatranscriptome | 204 | Y
F032705 | Metagenome | 179 | Y
F033261 | Metagenome / Metatranscriptome | 178 | Y
F037788 | Metagenome / Metatranscriptome | 167 | Y
F042227 | Metagenome | 158 | Y
F043839 | Metagenome / Metatranscriptome | 155 | Y
F049080 | Metagenome | 147 | Y
F050235 | Metagenome / Metatranscriptome | 145 | Y
F051242 | Metagenome | 144 | Y
F052683 | Metagenome / Metatranscriptome | 142 | Y
F053775 | Metagenome | 140 | Y
F054519 | Metagenome / Metatranscriptome | 139 | Y
F067652 | Metagenome / Metatranscriptome | 125 | Y
F068994 | Metagenome | 124 | Y
F075676 | Metagenome | 118 | N
F077448 | Metagenome | 117 | Y
F081864 | Metagenome | 114 | Y
F083717 | Metagenome / Metatranscriptome | 112 | N
F088111 | Metagenome / Metatranscriptome | 109 | Y
F091247 | Metagenome / Metatranscriptome | 107 | Y
F091390 | Metagenome | 107 | Y
F093357 | Metagenome | 106 | Y
F094907 | Metagenome | 105 | Y
F095696 | Metagenome | 105 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0233421_10003105 | All Organisms → cellular organisms → Bacteria | 5659
Ga0233421_10005064 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 4382
Ga0233421_10005490 | All Organisms → cellular organisms → Bacteria | 4212
Ga0233421_10030372 | All Organisms → cellular organisms → Bacteria | 1795
Ga0233421_10042165 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1529
Ga0233421_10043744 | Not Available | 1501
Ga0233421_10057696 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → unclassified Desulfobacterales → Desulfobacterales bacterium | 1310
Ga0233421_10064588 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 1239
Ga0233421_10101568 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 990
Ga0233421_10104888 | All Organisms → cellular organisms → Bacteria | 975
Ga0233421_10105271 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Nomurabacteria → Candidatus Nomurabacteria bacterium | 973
Ga0233421_10107660 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 962
Ga0233421_10116568 | All Organisms → cellular organisms → Bacteria | 925
Ga0233421_10145466 | All Organisms → cellular organisms → Bacteria → PVC group | 828
Ga0233421_10154063 | Not Available | 805
Ga0233421_10165891 | Not Available | 776
Ga0233421_10186883 | All Organisms → cellular organisms → Bacteria | 731
Ga0233421_10193572 | Not Available | 718
Ga0233421_10194092 | All Organisms → cellular organisms → Bacteria | 717
Ga0233421_10211440 | Not Available | 686
Ga0233421_10222804 | Not Available | 668
Ga0233421_10227539 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia | 661
Ga0233421_10229259 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 658
Ga0233421_10230327 | Not Available | 657
Ga0233421_10233760 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales | 652
Ga0233421_10245116 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon | 636
Ga0233421_10245336 | Not Available | 636
Ga0233421_10315386 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae | 559
Ga0233421_10321968 | Not Available | 553
Ga0233421_10341228 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 536
Ga0233421_10347851 | Not Available | 531
Ga0233421_10349022 | Not Available | 530
Ga0233421_10353893 | Not Available | 526
Ga0233421_10362411 | Not Available | 520
Ga0233421_10369951 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 515

Sequences

Note: The use of this dataset is restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of the sequences below requires obtaining a license from the dataset's corresponding author(s).

Scaffold ID | Protein ID | Family | Sequence
Ga0233421_10003105 | Ga0233421_100031051 | F016728 | ILTVGVTRVWAGVDSAWEQKKLEARKMLENAAESHTSGARFVGHGLRQSIY
Ga0233421_10005064 | Ga0233421_100050642 | F053775 | VRRAIATVVQVTGSCNAGYQAGDKIVVDLETACLDKEQSGHLCLWALSAILANLSRIRPGEQALASCPDPATGLGGNVIFSVVRE
Ga0233421_10005490 | Ga0233421_100054901 | F002525 | VWAGVDSAWEQKKLEARKMLENAAESHTSGARFVWHGL
Ga0233421_10030372 | Ga0233421_100303721 | F091247 | VEQLEEEQPLQEELAVLLNFPPLEKAKADIIRLTFLLLHS
Ga0233421_10042165 | Ga0233421_100421652 | F051242 | MNEDDRLKSDEQKPLELSIYKPEEQAYQKLHLCEYLRELYEALEEEDKHSAEKRKD
Ga0233421_10043744 | Ga0233421_100437442 | F000769 | EDRPRCPRGCPERLHRHGSYGRYCQPTGGERFGVPRFLCVGCGHTVSVLPPGRLTYRPLEVERLQGFFDAQAGIRTGLDPPPGPVEAGCLQRAWNRFLTRAKVLTDSFGQMLPAELPTGPQLWQGLRRAVGAAEQMLRFLAQYCKRSLLGDYACLRPAP
Ga0233421_10057696 | Ga0233421_100576961 | F081864 | MQADTTQFQARHGRQPRGFGRWGFLVGNETFFFVGQYSNSKKEACKVAQAHGEERVVVLPDKKLAPQGGNPERPEKTQVQINNIKERLMAQAETRTINKRTRTVIPVKSMKAAEVMVVFDIQSKRTAHAIVKRGYHIVDYMESRQCPGTMEDVEEAYRIAKWWFHKKLGGRVPYGLDADDMIQEAVTRLVELAGDPRMQEASYKFYVVRSTMAEYLRSNKKH
Ga0233421_10064588 | Ga0233421_100645882 | F037788 | MKQVKGLGGWIRSAIWLELIIPGYFFGKTFGTADAKYRFLDQFMYPTLFIKICGYFRDLPVGHYTGRFLGVMAGALIGVLWVLLMGVFYKWVLKDWAYSKAARVGSMAFILIALMTGFMSDMFNESIGMGLRIDEDAPTELVGKSLVQDYGSFVKGTVITEENAGLLAAYGKVRANNEASAHSWWVASLIFMYAWFVLYFICTHRFFKRNLVPRESYFKLQMINLGAIIIYLRMGFGIAPAEYHYIVTRFQHLFGGGM
Ga0233421_10101568 | Ga0233421_101015681 | F067652 | MNVGDPNEGGKSNRQARESDMPVVVKIIGKVKPEGAKGHYYR
Ga0233421_10104888 | Ga0233421_101048882 | F019160 | MKLHLYYENVERPLVLEVPDSEIDGFLREYEEALHDTSVETFQWKNSSFRIAGLLAIVQERHVTS
Ga0233421_10105271 | Ga0233421_101052711 | F002824 | MTKLTNTALKPGNGKPSAEEMTVIKTVLNSAENKTEEKKPVLQIEPEKPVIPAPEPRKEMTIMEKILRIENLQLVVEKRAKLVQTRSELDRFQISSNDFNCTMRLNDSDGNVFTTSFTPGIKKVIDFLK
Ga0233421_10107660 | Ga0233421_101076601 | F093357 | MKNFEQTWRIKRLLAEKGETISSLSRKLDRPVGSVANNIYGYRANAQLQAEIAGFLGQPVPELFEAEAKAIGR
Ga0233421_10116568 | Ga0233421_101165682 | F001057 | VTTIVQLDGSNRIVLPLDLRRAAGVPRGQKLKASATPGRIVLEMEPTAQGKIVKRGKLKLWTGAVPATPLAEAVEAARHYER
Ga0233421_10145466 | Ga0233421_101454662 | F043839 | MLVIAEQLDATLKALDARAALSLEKLVRDALELVEAQNGTRPCAPLPPDFFTRISQEFGPEPLERGPQGELEKREEW
Ga0233421_10154063 | Ga0233421_101540631 | F023511 | IFIYDNRVFDNEKVFFYRNSFSAYDTFRFLGKTELNLEYERATGNTIRDEKYSFFNAPSRQFSAKETESCKANSGWISITEKNCLRELLLSTEAYEQIGKELFRIIVKTAKVTPFLKDGEYLYNLEIEYERAYQNSFFSLHTPDSSANPVIPHEALTWDNMDVSFDNVEITFDQVTF
Ga0233421_10165891 | Ga0233421_101658911 | F009271 | LSGSVANGHQTMHLVSPAPSKIPYGGFSPVRLQTRLTPRPPSPTPHCQLIGRYCRYLRPRRWFRNRSCDQAAPRAADHDRESSGPWLPSRLCCPAGSSLTMATCAPLSATRRLMVYSARLRHQAASHRRSPICSASPFTPCRRPYSGGPGNCTRRCLRRRFCLRHIRTGSAATCPTIPEPVGRVTKLQHSLDATAWRCCLPCSGQDFYDRACVGRVTPAAHVGYDWMAHRHLPSPDFHRL
Ga0233421_10186883 | Ga0233421_101868831 | F033261 | LTDDHQTDARQLANEFGDHWGQEFGHRIGKHDLCLDIVPPGYRLTSRRDENGQLQREVEYDPTAFFLSSWLRCLVFNLMSLFVKRLGGKYAKMWAGTLLRKFIRRPATLYLIGNELHVVFDPFQGQDDLRPLLEELNAKRVALPWLNGLVLQFSIKEDEPLHPLAELEQRKRLFGDG
Ga0233421_10193572 | Ga0233421_101935721 | F054519 | MDADLVALREEIDARLAEVEHSRGTCQGHPALARAIATLLRCQRAQLNQRAANKVVAAKAGGIVGALLVGAVEAGKHLFGK
Ga0233421_10194092 | Ga0233421_101940922 | F024750 | SAWAREEWEPACYQLPAHRLQLAHGVSGGRLIMFIKAWTQGEGKKDQDILGAATNPQPLWPWFTGEGFERRFAEASRWRTHRETTESASGVEVI
Ga0233421_10211440 | Ga0233421_102114402 | F068994 | MDEDIVCKNNGEISLILPETDQKGCEALMNRLLNIIQTHTLFKSDEVLKSYAQTISFQSYSYPTQFEIPESLKKMISC
Ga0233421_10222804 | Ga0233421_102228041 | F032705 | TFRASADRYKELSAEARKVAAETEQADVREVEKIRRNNLAFSRILGDINEVMTRMREASSASYVRVVNQNLGSLTTTARPGDTPEQREALVRLESLKDAAVKVVAQISGGRRSEDEARVQTFTMLPAARAVWVYAGDVFYAWAGGIAIDAIPTIFVILLTLGRPEDEAERDEERRRAAARAAAQAPRPVARSGAWPGGGADDAGIVPGSRARET
Ga0233421_10227539 | Ga0233421_102275392 | F050235 | ADAEAVFRHAFEGEPLDPEVRCRVRQRAARITEEIRRTHGLIDDETFQKLLSDDDDENP
Ga0233421_10229259 | Ga0233421_102292591 | F088111 | YSFADQASSSFFAVLDPYFFNTDTLHPNGLGGHIDSTQMVWLKAQVAQTTATHKFLFIHTPYYYVSNDPEEPSAADTSYTILWAFLDANKFDLYACGHSHLYARRTIDNSVLPDPQTIPPTSPWQNNVVHLLNGTCGAGPSTGTIDPDIRTAWNVHNDALTYYFSVVDVCGNTVTVNSYRGYTGAYSVFDTFTITK
Ga0233421_10230327 | Ga0233421_102303271 | F000748 | MTRSPDSLVSSAAEQSAAEKRDDKAFNGRQEGEAKLRAATEVNAVSASTTPPVSRPTQLDLFPGRACVPKAKPATIRNPARLSPDRRGGVSGGGTQRQNIKLTGETLFGPAKESSSGREAHKGRTRKRGIEAGQGVGGGRSTEEARDNRVEGRAATFVQRTEQGKAAGLPPLGKAQPRRQPAARKAPVR
Ga0233421_10233760 | Ga0233421_102337602 | F017088 | MSHTHYSGDPRQFTARFPSTCHTCGKPIKKGETIIYWPNGKKAGHYDCDIQDYRQSIASFEDEDRYNNVYH
Ga0233421_10245116 | Ga0233421_102451161 | F091390 | SYQNKDTNSTTIRINQSTKEKLESLDFVRKDTFDEILLKLIGCYEKNKR
Ga0233421_10245336 | Ga0233421_102453362 | F075676 | MRMFKSQRGAELSEIVAGLLVVMVAGVAATSVMFNGLDTAAGKVATWLGGLTIP
Ga0233421_10315386 | Ga0233421_103153861 | F094907 | MFLISHTENTIEDTLQVEFRPARNGTGVICRLPNGKIAFPSRYHWQAPLPQPFEIWAVLPCGGTASVAYVKPIQRLAEAPFLDTRAGRFGNSLLWRLGSFLAALVSLARRPHVSALPAASQTQGD
Ga0233421_10320598 | Ga0233421_103205982 | F052683 | QMKTKKGVMNDVIGVFILVLVVSVIAGMTFLFVSQLKSQVETTATGGVNSTSYNAINKTEAAGATIVNYLPLLFLAIVFGAILAVVLKIILPYINLGNQMGSSF
Ga0233421_10321968 | Ga0233421_103219681 | F077448 | MHVMIELKIPGLGSEVTVTPEPLNSKETAVIGVIKAFDRSITPRFSNRDKDGFDSQREAESENNSKRPWVTVAPAESQLVVELEKVGHSQSLPASGQPLGDGLIVFGSLRMKEDAMTVQIHDIEGKEATIVLDIARTHKIGL
Ga0233421_10341228 | Ga0233421_103412281 | F049080 | MAKDLVAKLFLSKEEIVHIYHGGVLSVKIKGKDTDGLVEIFCREVSERKGTRIRFKTIDA
Ga0233421_10347851 | Ga0233421_103478511 | F022232 | NIKVTYLGADGDSNVGIGSRTNVLEYTAGSGSSGSYLNNFTSTGKTNILSGGNGSGFITNMVDVGGGTNRPARYYRIRVLAP
Ga0233421_10349022 | Ga0233421_103490222 | F020376 | FEGKSQEDLATLTQREDVAVLIGKELTGFLLEEHVALDPAAQPAEASATCCPKCGQPGTPVVQGGEELPERTVTTRVGPIRVRRQRWHCAKCRIIFFSAGRTAGSGDGRL
Ga0233421_10353893 | Ga0233421_103538932 | F095696 | NFGRQLYKLSRQQNISGESFAMAAQVLLDKWAARGCDPEILAKIRTGVFDIPAPKP
Ga0233421_10362411 | Ga0233421_103624111 | F083717 | MFAQARASRRPSQMGATPGTSGKHSTGNVPTPGRTYYQRGKGARYVPVKTGKTRRAGSKRSEDLAHSWRRNRNADSTVVEVLTGVSYAGRVQGGADSVRQQTRIMNARGWETVDEVGAAVEDNIGKVMRNTVARLYQQWFSRHGISSTVSGT
Ga0233421_10369951 | Ga0233421_103699511 | F042227 | PLGAVLSTWQWGFVRLTRSTERIGDLLLDKRWRAIVGLKPEHGGSPDRAAEILDGLSIDEWNEMMLEDFFIARRAGMLTDDGPYGKRCAVVDLNELFKSEKVHCARCQVREKTVVDAKGEKRTVLEYYHQAVALTWVSGKIPFVIGWEMLSPGEGELTAALRLLRRLLPRL
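
The rows of the Sequences table above can be split into their four fields with a short script. The following is a minimal Python sketch, not part of NMPFamsDB itself: the function names `parse_row` and `to_fasta` are hypothetical, and the row layout (scaffold ID, protein ID, family ID written as `F` plus six digits, then an uppercase amino-acid sequence) is an assumption inferred from the rows on this page. It accepts rows whether the columns run together or are separated by ` | `.

```python
import re

# Assumed row layout, inferred from the table on this page (not an official schema):
# scaffold ID, protein ID, family ID ("F" + six digits), amino-acid sequence.
ROW = re.compile(r"^(Ga\d+_\d+)(Ga\d+_\d+)(F\d{6})([A-Z]+)$")


def parse_row(line):
    """Split one table row into its four fields; return None if it does not match."""
    # Drop " | " column separators so fused and delimited rows parse the same way.
    compact = line.strip().replace(" | ", "")
    match = ROW.match(compact)
    if match is None:
        return None  # e.g. the header row
    scaffold, protein, family, sequence = match.groups()
    return {
        "scaffold": scaffold,
        "protein": protein,
        "family": family,
        "sequence": sequence,
    }


def to_fasta(record):
    """Format a parsed row as a FASTA entry keyed by protein ID."""
    header = f">{record['protein']} family={record['family']} scaffold={record['scaffold']}"
    return f"{header}\n{record['sequence']}\n"


if __name__ == "__main__":
    row = (
        "Ga0233421_10245336Ga0233421_102453362F075676"
        "MRMFKSQRGAELSEIVAGLLVVMVAGVAATSVMFNGLDTAAGKVATWLGGLTIP"
    )
    print(to_fasta(parse_row(row)), end="")
```

The split is unambiguous under these assumptions because each ID ends in digits and the next field starts with a letter (`G` or `F`), so the greedy digit runs stop at the field boundary.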




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice