NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300025317

3300025317: Groundwater microbial communities from Rifle, Colorado - Rifle CSP2_plank lowO2_0.1 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID3300025317 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0053054 | Gp0056897 | Ga0209541
Sample NameGroundwater microbial communities from Rifle, Colorado - Rifle CSP2_plank lowO2_0.1 (SPAdes)
Sequencing StatusPermanent Draft
Sequencing CenterDOE Joint Genome Institute (JGI)
Published?Y
Use PolicyOpen

Dataset Contents
Total Genome Size1974598719
Sequencing Scaffolds27
Novel Protein Genes27
Associated Families9

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Archaea17
All Organisms → cellular organisms → Bacteria3
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Woesearchaeota → Candidatus Woesearchaeota archaeon1
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon3
Not Available2
All Organisms → cellular organisms → Bacteria → Acidobacteria1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameSoil Microbial Communities From Rifle, Colorado, Usa
TypeEnvironmental
TaxonomyEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Soil Microbial Communities From Rifle, Colorado, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Free-living → Non-saline → Subsurface (non-saline)

Location Information
LocationRifle, Colorado, United States
CoordinatesLat. (o)39.53Long. (o)-107.78Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F001231Metagenome / Metatranscriptome741Y
F003477Metagenome / Metatranscriptome484Y
F004010Metagenome / Metatranscriptome457Y
F065251Metagenome / Metatranscriptome128Y
F070778Metagenome122Y
F080249Metagenome / Metatranscriptome115Y
F091390Metagenome107Y
F094578Metagenome / Metatranscriptome106Y
F103989Metagenome101N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0209541_10000512All Organisms → cellular organisms → Archaea76119Open in IMG/M
Ga0209541_10000588All Organisms → cellular organisms → Archaea71789Open in IMG/M
Ga0209541_10001647All Organisms → cellular organisms → Archaea43935Open in IMG/M
Ga0209541_10004534All Organisms → cellular organisms → Bacteria24611Open in IMG/M
Ga0209541_10013828All Organisms → cellular organisms → Archaea12272Open in IMG/M
Ga0209541_10015231All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Woesearchaeota → Candidatus Woesearchaeota archaeon11513Open in IMG/M
Ga0209541_10017219All Organisms → cellular organisms → Archaea10600Open in IMG/M
Ga0209541_10023509All Organisms → cellular organisms → Archaea8507Open in IMG/M
Ga0209541_10023767All Organisms → cellular organisms → Archaea8441Open in IMG/M
Ga0209541_10030881All Organisms → cellular organisms → Archaea6995Open in IMG/M
Ga0209541_10074660All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon3661Open in IMG/M
Ga0209541_10082384All Organisms → cellular organisms → Archaea3405Open in IMG/M
Ga0209541_10089492All Organisms → cellular organisms → Archaea3206Open in IMG/M
Ga0209541_10094191All Organisms → cellular organisms → Archaea3088Open in IMG/M
Ga0209541_10100981All Organisms → cellular organisms → Bacteria2931Open in IMG/M
Ga0209541_10112991All Organisms → cellular organisms → Archaea2697Open in IMG/M
Ga0209541_10151937All Organisms → cellular organisms → Bacteria2158Open in IMG/M
Ga0209541_10211876All Organisms → cellular organisms → Archaea1673Open in IMG/M
Ga0209541_10213506All Organisms → cellular organisms → Archaea1664Open in IMG/M
Ga0209541_10300307All Organisms → cellular organisms → Archaea1275Open in IMG/M
Ga0209541_10366876Not Available1088Open in IMG/M
Ga0209541_10403522All Organisms → cellular organisms → Archaea1009Open in IMG/M
Ga0209541_10403551All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon1009Open in IMG/M
Ga0209541_10577589All Organisms → cellular organisms → Bacteria → Acidobacteria755Open in IMG/M
Ga0209541_10604368All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon727Open in IMG/M
Ga0209541_10637285Not Available696Open in IMG/M
Ga0209541_10644622All Organisms → cellular organisms → Archaea689Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0209541_10000512Ga0209541_1000051254F094578MDNKEERPKWPLRRLKSMYFAMHCWKCKKFELSVEEYKQSCEENIQKIIDGLDIKEMEFREFNEIIKHSTFCKFEKRGFNNNCLICKKLGI
Ga0209541_10000588Ga0209541_1000058817F094578MEKEKPKWPLQRLKSMYFAMHCWKCKKFELPVEEYKKMCEDNIQKIIDGLEVREMKFKEFNNIIKNSLFCTFEKRDFSNDCSICKKFGVHGRL
Ga0209541_10001647Ga0209541_1000164750F094578MIEKEKPKWPLQRLKGMYFAMHCWKCKKFELPMEEYKQKCENNIQKIIDNLEIKEMKFKEFNEIIKNSTFCKFEKREFNNKCPICKKLRIKF
Ga0209541_10004534Ga0209541_1000453426F004010MPLGSNVSANMRELYKDNRKKGKARGANGKPRSRKQMIAIAINAANKATGRKRKK
Ga0209541_10013828Ga0209541_1001382819F091390MSTKSTTLRIDDSTKLKLESLDFVRKHTFNQILLELIEHYEKTKKRK
Ga0209541_10015231Ga0209541_100152312F091390MYTKSTTIRINQSTKEKLGNLDFVRKDTFDQILSKLIKYYEKK
Ga0209541_10017219Ga0209541_100172199F094578MEKEKPKWPFQRLKSMYFAMHCWKCRKCEDSPERYKQKCEDNIQKIIDNLEIKEMTFKEFNNIIKNTSFCKFEKRDFSNNCPICKNLGISNHF
Ga0209541_10023509Ga0209541_100235091F065251MYFRHFTKGSKKYYYIAKAVRKGKKITQKFVLYIGTADTLYEKLKTLKKN
Ga0209541_10023767Ga0209541_1002376716F094578MNNQKERLKWPLQRLKGMYFAMHCWKCKKFELPMEEYKQRCENNIQKIIDSLEIKEMKFKEFNDIIKSATFCKFDKRDFSNNCPICKKLGISRRGL
Ga0209541_10030881Ga0209541_100308815F094578MIEEEKPKWSLQRLKGMYFAVHCWKCKRFELPMEEYKQRCENNIQKIIDSLEIKEMKFKEFNEIIKNSTFCKFEKREFNNKCPICKKLGIKF
Ga0209541_10074660Ga0209541_100746602F094578MENKKERLKWSLKRLKGMYFAMHCWKCKKFELPMREYQKRCEENIQKIINNMEIKEMKFKDFNNIIKNTTFCKFEKRDFSNNCPICKKFGIKG
Ga0209541_10082384Ga0209541_100823841F003477MTYSMLKFDPLEPFKEKKLLDEVMKLLENFEYKLYGKIPIKFHRGKPTIYEFDLNLGIIHFTEQDFKGLSEIFAKINKKWGTQMTFCIYPSREKNRDMIINVRGSPKAPSEIG
Ga0209541_10089492Ga0209541_100894922F094578MDDKKERPKWPLQRLKSMYFAIHCWTCKNFELPMEEYKRRCEENIQKIIDNIGIKEMKFKEFNNIVKSSTFCQFEKRNFSNNCPI
Ga0209541_10094191Ga0209541_100941914F094578MNNQKERPKWFLQSLKVMYFAMHCWKCKKFELPKEEYKQRCEDNIQKIIDDLEIKEMEFKKFNEIIKNTTFCKFDKRDFSNSCPICKKLGISRRGL
Ga0209541_10100981Ga0209541_101009813F080249MADDKNKVKEDRNLISFKENYEVYYAVNQLKKQFPDETKSDIKEALFDAAKQVSPSEGREKIMRLTRKELNS
Ga0209541_10112991Ga0209541_101129913F094578MEQKERPKWLLKRLKSMYFAMHCWKCKIWGNSIEEYNKRCENNIQKIIDNLDVKEMKFKEFNNIIKNSIFCQFEKKDFSNNCPICKAWGISGR
Ga0209541_10151937Ga0209541_101519373F070778MREGWRPIGVAAILMALGACGYAGSDEIDAESAAILARVPVGTSFNSVPAAMGALVFSCTTSRRQFPDAKGEMRETEPHLVCERERSDWLICSRRTRAILIQLNGRLSNVLVNVGRFCA
Ga0209541_10211876Ga0209541_102118763F065251MYLRSFKKGKKKYYYIAKAVRVGKRVIQKSVLYLGNADN
Ga0209541_10213506Ga0209541_102135062F065251MYLRHFTKRNKKYYYIAKAVRKGARVIQKAILYIGTADTLYEKLKNLQKK
Ga0209541_10300307Ga0209541_103003072F091390MATKSIKSTTIRIDERTKNKLGNLDFVRKHTFDEILLELILFYEKNKKKI
Ga0209541_10366876Ga0209541_103668762F091390MSTKSTTIRIDQSTKDKLENLDFVKKHTFDEILLNLIEIYERKKK
Ga0209541_10403522Ga0209541_104035221F103989MNMEKSAIKESNNHQEFDGEINATDIFMRDLRRSYPEFAKALEEFMKTELQRLDEELK
Ga0209541_10403551Ga0209541_104035513F065251MYLRSFIKGRKKYYYIAKAVRKGAQVIQKSILYIGTADTLYEKLISLKKK
Ga0209541_10577589Ga0209541_105775891F001231MVLNFRIRNTVFSAGRAGQRSSVRPITLLLPSVEEAALICTVDDMNRLVPRRDVFLLSFTRTGPVIEGFENCILGEFMKSQIECLCENNEPPSSCEINGLETFLEMMEPGQVEELRDLVSICEFRKYEFSPEAAIRFLKESGFFDEYIIVGVKYGNGWRRSRCITTFEGRA
Ga0209541_10604368Ga0209541_106043682F094578MIKKERLKWPLQRLKRMYFAMHCWKCKKFELSMEEYKKRCEENIQKIIDILNIKEMEFREFNEIIKNSTFCKFEKRDFSNDCPICKKFGINYKF
Ga0209541_10637285Ga0209541_106372852F003477MPKFDPLEPFKEKKLLDEVMKLLENFDYKLYGKIPIKFHRGKPTVYEFDLNLGIIHFTEQDFKGLSEIFAKINKKWGTQMTFCIYPSREKNRDMIINVRGSPKAPSEIG
Ga0209541_10644622Ga0209541_106446221F094578MNGQIVNVGENNMTEKERPKWPLKRLKGMYFAMHCWKCKKFELPMEEYKRRCEDSIQKMIDNMEIKEMKFKEFNDIIKNRLFCEFEKRDFSNNCPICKKFGGLGGVRH

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.