NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300007159

3300007159: Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaT_CSR_2013 (Metagenome Metatranscriptome)



Overview

Basic Information
IMG/M Taxon OID: 3300007159
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114818 | Gp0116275 | Ga0075020
Sample Name: Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaT_CSR_2013 (Metagenome Metatranscriptome)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 17986644
Sequencing Scaffolds: 40
Novel Protein Genes: 47
Associated Families: 34

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 35
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Sulfopaludibacter → unclassified Candidatus Sulfopaludibacter → Candidatus Sulfopaludibacter sp. SbA4 | 1
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Sporadotrichida → Oxytrichidae | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Oligostraca → Ichthyostraca → Branchiura → Arguloida → Argulidae → Argulus → Argulus foliaceus | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Freshwater And Sediment Microbial Communities From Various Areas In North America, Analyzing Microbe Dynamics In Response To Fracking
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Sediment → Unclassified → Watersheds → Freshwater And Sediment Microbial Communities From Various Areas In North America, Analyzing Microbe Dynamics In Response To Fracking

Alternative Ecosystem Assignments
Environment Ontology (ENVO): aquatic biome, watershed, sediment
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Water (non-saline)

Location Information
Location: Pennsylvania, USA
Coordinates: Lat. (°) 41.1289 | Long. (°) -78.4195 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000203 | Metagenome / Metatranscriptome | 1619 | Y
F000344 | Metagenome / Metatranscriptome | 1257 | Y
F000734 | Metagenome / Metatranscriptome | 915 | Y
F000817 | Metagenome / Metatranscriptome | 879 | Y
F001296 | Metagenome / Metatranscriptome | 728 | Y
F002020 | Metagenome / Metatranscriptome | 603 | Y
F002557 | Metagenome / Metatranscriptome | 548 | Y
F003497 | Metagenome / Metatranscriptome | 483 | Y
F006338 | Metagenome / Metatranscriptome | 375 | Y
F008512 | Metagenome / Metatranscriptome | 332 | Y
F010305 | Metagenome / Metatranscriptome | 305 | Y
F014145 | Metagenome / Metatranscriptome | 265 | Y
F014264 | Metagenome / Metatranscriptome | 264 | Y
F017072 | Metagenome / Metatranscriptome | 243 | Y
F018868 | Metagenome / Metatranscriptome | 232 | Y
F020120 | Metagenome / Metatranscriptome | 226 | Y
F022652 | Metagenome / Metatranscriptome | 213 | Y
F023644 | Metagenome / Metatranscriptome | 209 | Y
F025923 | Metagenome / Metatranscriptome | 199 | Y
F027759 | Metagenome / Metatranscriptome | 193 | Y
F029031 | Metagenome / Metatranscriptome | 189 | Y
F036126 | Metagenome / Metatranscriptome | 170 | Y
F037738 | Metagenome / Metatranscriptome | 167 | Y
F037987 | Metagenome / Metatranscriptome | 167 | Y
F046349 | Metagenome / Metatranscriptome | 151 | Y
F052605 | Metagenome / Metatranscriptome | 142 | Y
F056352 | Metagenome / Metatranscriptome | 137 | Y
F063173 | Metagenome / Metatranscriptome | 130 | Y
F064734 | Metagenome / Metatranscriptome | 128 | Y
F065812 | Metagenome / Metatranscriptome | 127 | Y
F071600 | Metagenome / Metatranscriptome | 122 | Y
F081359 | Metagenome / Metatranscriptome | 114 | Y
F100490 | Metagenome / Metatranscriptome | 102 | N
F105212 | Metagenome / Metatranscriptome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0075020_101266 | Not Available | 553
Ga0075020_103340 | Not Available | 566
Ga0075020_104960 | Not Available | 565
Ga0075020_105047 | Not Available | 722
Ga0075020_108241 | Not Available | 626
Ga0075020_108926 | Not Available | 810
Ga0075020_108995 | Not Available | 554
Ga0075020_109602 | Not Available | 505
Ga0075020_109827 | Not Available | 699
Ga0075020_111649 | Not Available | 581
Ga0075020_112575 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 666
Ga0075020_113102 | Not Available | 566
Ga0075020_113807 | Not Available | 566
Ga0075020_115781 | Not Available | 548
Ga0075020_115911 | Not Available | 564
Ga0075020_115938 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Sulfopaludibacter → unclassified Candidatus Sulfopaludibacter → Candidatus Sulfopaludibacter sp. SbA4 | 572
Ga0075020_116725 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Sporadotrichida → Oxytrichidae | 620
Ga0075020_117072 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Oligostraca → Ichthyostraca → Branchiura → Arguloida → Argulidae → Argulus → Argulus foliaceus | 622
Ga0075020_118030 | Not Available | 555
Ga0075020_120618 | Not Available | 573
Ga0075020_120815 | Not Available | 538
Ga0075020_121517 | Not Available | 602
Ga0075020_122406 | Not Available | 687
Ga0075020_122474 | Not Available | 799
Ga0075020_124352 | Not Available | 679
Ga0075020_124432 | Not Available | 534
Ga0075020_124602 | Not Available | 520
Ga0075020_124929 | Not Available | 778
Ga0075020_125121 | Not Available | 541
Ga0075020_125200 | Not Available | 577
Ga0075020_125265 | Not Available | 581
Ga0075020_125302 | Not Available | 543
Ga0075020_125621 | Not Available | 538
Ga0075020_125649 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 590
Ga0075020_156305 | Not Available | 563
Ga0075020_160811 | Not Available | 800
Ga0075020_161269 | Not Available | 841
Ga0075020_163607 | Not Available | 539
Ga0075020_164875 | Not Available | 565
Ga0075020_165087 | Not Available | 567

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0075020_101266 | Ga0075020_1012661 | F001296 | MKVAERRQPLRESEKPLRATGPGQKAARGEPGSIDPQAGENSLRGAERRIVRRFCLHRSMPCMPVVSQAGNQVTW
Ga0075020_103340 | Ga0075020_1033401 | F037738 | QVIRFPALLTTGMHGTEHCNRKRRTFRLSAPLKLGFPRLRINASWLAAAYFLPTGTGNS*
Ga0075020_104960 | Ga0075020_1049601 | F014145 | PGSLPACTAWSTANKSAGRFASPLPDGRFPQPPDQCFLARRWLPPARNRPLVTAFRSPATVAASRRPPFRGRSSQPATSRPSKSASVPVRPFGSATASRFAPVAAVSLPGARCTSTTRFSLPRLRSPLPSRTFTSLGIKAFNRVCCLPVRLTNPPDLLSLPAARPD*
Ga0075020_105047 | Ga0075020_1050471 | F010305 | LAWFASWWRNHQPKRAEKPHSKFRRRKALDGPATRPRTPLAVENGVGKLTAPEKGAPNVSMGKRDWRTPI
Ga0075020_108241 | Ga0075020_1082411 | F022652 | TIDRTTRMLSVRSAAKCALSFVCRSPMLPFGLLRVNAANAVYGFRILPEPVSRCGLSLARNDAFTPFRGQCSRPAPSIPHRIPSRIRSISDSFAPFGFEADPGRYHRPTPVSRAGFRRSRVFQQSPLPFGRFRTLQIKAFDHHPTREARLAWLRSPFAPRRDFFRFRSSLLPVRCARLD*
Ga0075020_108926 | Ga0075020_1089262 | F000817 | RKRSVAFNRLQPEVSRIIPGDRGKAGSGWLTKPLLNQIARFGSGGRIHQFL*
Ga0075020_108995 | Ga0075020_1089951 | F001296 | MRFDPGMEAAERRHPLQASKKPLRATGSGQKAASGEPGSIDPEAEKHSIRGAKRRIARRFCPQCSMPCMPVE
Ga0075020_109602 | Ga0075020_1096021 | F071600 | LLLCISTVRTTLAGLNKIKTAQRKTLKGGQKERTHREGHGNLAGSVKNFRAPAMLKTTPLDCEIKSNLREKRR
Ga0075020_109827 | Ga0075020_1098271 | F063173 | VPRVWPLNSQVAFLPQFSGIVFNIAELSKGFHGSCLDRVSFPVRPLTLFRLSAFFVWRIPFFFGNARVVRPVLMHSSNHLPSGNCDSLEFETSSSYLTGFGTVSHRSSSLSPFFACTDSARTLPEELVNSASPCR
Ga0075020_109827 | Ga0075020_1098272 | F006338 | MLSGPATRPKTPLAVENSVGKLAAAEMWCQMPAREREPGELP
Ga0075020_111649 | Ga0075020_1116491 | F003497 | SDPSNSPLPDRHARSKHGSQRSGDAALLLPVTAFIRLRIGAPESIRLFYLLEASVSERPFARPHRLFSFENHRSEVKAPDLSLRRNSELFFQPVRPYAPTLDCVHHASGDVRRAKPVAVSRAQNSQTSIQLSLPFRTFVPPDRSAQSAAWSEKPTLVSGPFSLRSPKVSITF*
Ga0075020_112575 | Ga0075020_1125751 | F000344 | MRPRHPHAAESGVGKHTARESERAQACATGKERVAN
Ga0075020_112793 | Ga0075020_1127931 | F000203 | SDGDKWGMGVRQALFPMPTLGAPLSEAVSFPTPFSTASGVFGLVAGPSSALRTLDFE*
Ga0075020_113102 | Ga0075020_1131021 | F029031 | GQQNLVSGSDPGTLIRWQTNQTERWRGAKGNRAEHKGELLKRQIG*
Ga0075020_113807 | Ga0075020_1138071 | F064734 | DFQPCSPPACTVLSTALKSAGRFASLLPSGGFRQPRDQCFKARRSLFSPKRDRSLVTAFPSPATAALFRATIPGSEALACYFALSPQASLPVRPFCSATGSGSPRSRPLLRFWPVTTRLAASTCRYSGLHSPSGLLPPSGSKRSAGLAAGRSAFRIRPIPSRSPLPVLFQLRPRIIVPGSLRFRRLAV
Ga0075020_115781 | Ga0075020_1157811 | F003497 | SKHGSQRSGDTALLLPVTAFIRLRISEPELIRHFYLLEAFVSERPLARPQRLFSFENHRSEVKAPDLSLRRNSELFFQPVWPYAPTLCDVYHATGDVRRTKPVAVSRAQNSQTSIQLSLPFRTFVPPDRSAQSAARSEKPTLVSGPFSLRSPKASITF*
Ga0075020_115911 | Ga0075020_1159111 | F017072 | RHGSGGRIHQFLWHGRTCQTRMRDLAARDDEKPFRTESSNPGKFRSRMNRSHPEGGKLLLCISTVRTTLAGSNKTGTAQRKTPKGGPKERTHREEHGNSAGSVKNFRAPAMLKTTPLDCEIKSNLREKRRDPWHWANAPSKAVADPKPVVKTRMKIAGVS*
Ga0075020_115938 | Ga0075020_1159381 | F023644 | LLLGSPATAFGHYRIKASLRVAAFSPARDRMLVTTFRSPATTSAFTDSIPGSKFLACHFASQPADSTARSALLLHYRIRFAPVPAASLLLARCSFIDLLDLLRLLPPLLLGTFASLRIKAFNRICRLSARLPNPPDFLSLPAAVFYY*
Ga0075020_116725 | Ga0075020_1167251 | F065812 | MGDKLLPRLNNVENPIVNKYCEGKMKRALKRGLKDLKPLRRKRSKLITSE*
Ga0075020_117072 | Ga0075020_1170722 | F065812 | MGDKLLLRLNIYGKPIAYKYFEGKMKRTLKRGLKDLKALRRKR*
Ga0075020_118028 | Ga0075020_1180281 | F056352 | RPDERKAAQVPRIIPGDWGKVGPGWLAQPLSKIRPSGAVREAKFTSSSGGVRAPSVNAKKGLSGEGR*
Ga0075020_118030 | Ga0075020_1180301 | F020120 | SLPACTALSTADKSAGRFAAQLPKGRFLQPPDQCFLARRLPPSARIRSLVTAFCSPTTATASRLPPFRGQSSQPATSLPPKYLPCPFGPSAPLPRSPVCPGGGRFTASNPLHFHYSVWPAAPAISTPLRGFCPPRDQSVQPLLLPAGPPDESARFPLAPRRPSW*
Ga0075020_118583 | Ga0075020_1185831 | F037987 | GMGVRHTLFPVPAFGALLTEAAGFPTPFSTASGVLGLVAGPCGVLQRADFE*
Ga0075020_118687 | Ga0075020_1186871 | F036126 | VNSASPNLRSVQRNRDNQPVPIFPRSPGIIHETRDSDLTLDRRFV
Ga0075020_120618 | Ga0075020_1206182 | F029031 | FKGQQNLVSGSDPGTLIRWQTYQTECWRGAKGNRAEHKGELLKRQIG*
Ga0075020_120815 | Ga0075020_1208151 | F081359 | LERKTAVCREASIQGKRNQTLGSRTAEHSALLDEMLSTVHAGDELGWK
Ga0075020_121517 | Ga0075020_1215171 | F100490 | NIGQFTRLLRSLELGAVSHRTVLLLWDQKRLLRMRDRYGGPKKHLPTPGLML*
Ga0075020_122406 | Ga0075020_1224061 | F010305 | LKFSGRKALDGPATRPKTPLAVENGVGKLTAPEKGAPNVSMGKRDWRTPIP
Ga0075020_122474 | Ga0075020_1224741 | F027759 | SQIEAARFTDNPGRPGKSQVRLVDLTPQNRGASCGEAKFTSSSGGVRTPSNANEELGSEKRLETVSS*
Ga0075020_122474 | Ga0075020_1224742 | F000203 | DKWGMGVRHALFPVPAFGATLAGAAGFPTPFSTASGVSGLVAGPSGALQRTDFE*
Ga0075020_124352 | Ga0075020_1243521 | F052605 | KRAAESKIEVVRFRIIPGDWGKVRSGWLIWLLINRSASFGEAEFTGSSGGVRTPSNANEGLGGEGR*
Ga0075020_124432 | Ga0075020_1244321 | F000734 | VKTFRIVGDITNDTAGLKMKGNLRVKRRDPWHWANALPKAAADPAL
Ga0075020_124602 | Ga0075020_1246022 | F071600 | LLLCISTVRTALAGANKIETAQRKTPKGGQKERTHREDHGNSAGSVKNFQAPAMLKTTPLDCELKAT*
Ga0075020_124929 | Ga0075020_1249291 | F027759 | KRAAESKIEAVSFQDNPRRLGKSQVRLVDLAPLNRSASFGEAEFTSSSGDVRTSSNANEGLGSKPRLETAAS*
Ga0075020_124929 | Ga0075020_1249292 | F008512 | LLLCISTVRTALAGLSKIETAQRKTPKGGQKERTHREEHGNSAGSVKNFQAPAMLKTTPLDCEIKGNLREKRRDPWFGANAPPKAVADPEPVVKTRMKFAGVS*
Ga0075020_125121 | Ga0075020_1251211 | F001296 | MEVAERRQLLRASEKPLRATGSGQEAASGDPGSIDPGAGENSQRGAERRIVRRSCSQCSMPCMPVESQAGNQVTW
Ga0075020_125200 | Ga0075020_1252001 | F001296 | MEAAERGQPLRASEKPLRATGSGQEAASGEPGSIDPWAGENSQRGAERRIVRRSCSQCSMPCMPVESQAGNQVTW
Ga0075020_125265 | Ga0075020_1252651 | F001296 | MEVAERWQPLWASERPLRATGSGQKAASGEPGSIDPEAGENGLRGAERRNVRRFCLRCSMPCMPVVSQAGNQVTW
Ga0075020_125302 | Ga0075020_1253021 | F002020 | LTTGMHGIEHREQERRTFRLSAPRWPFSPASVSVLPGSPLAASCPEPVARNGFLLARNSCRLSATSIPGSKLPACYFASFQIASVPVRPFGSTTASRIAPVAAASLPGARCTSTTWFSWPRPRSPLPSGTFTSLGIKAFNRVCCLPVRLTNPPDFLSLPAARSNESLGCGSSFQVRYVSA
Ga0075020_125621 | Ga0075020_1256211 | F081359 | VCREASIQGKRNQTLGSRTAERSALLDEMLSTVHAGDELGWK
Ga0075020_125649 | Ga0075020_1256491 | F025923 | QSQRGGLKKIAKSTAGIIPGDRGKGDGSWCRFPLQTALAV*
Ga0075020_156305 | Ga0075020_1563051 | F022652 | SRVHGTIERTTRMLSVRSAAKCALSFACRSPLPPFSLPRVNAANALFGLRTLPEPESRYGLSLSRNDAFAPFRGQSSRPAPSIPHRTPSRVRSIPSSSAPFGFEADPGRSHRLTPVSRANFRRSRDFRPSPLPFRPFPAFQIKAFDWLHPRKLASPDARLSFAPRDALFRFRLGSLLKTPVSSCSAI
Ga0075020_160811 | Ga0075020_1608111 | F018868 | MRPRALLAAVSGVGEHTARESEAPNVCAGKERVANAH
Ga0075020_161269 | Ga0075020_1612691 | F105212 | KGSPGKGGSGQKFLIELEDKSNGSQDLRQGRVGVQDHKEQSDGIRTAKKC*
Ga0075020_163607 | Ga0075020_1636071 | F002557 | VLIHSSNRLPSGNCDSLRTETCLNYLIRLGAVSNRPLSLSPFFACTDGARTPPEELVNPASAPG
Ga0075020_164875 | Ga0075020_1648751 | F046349 | RFTASLSADMPGTELRKQVSRTDSRSLPATAFQRQRINATKSVCSYYLFGAGFSSRPFTRSERLSAHRTTIPRSKLLTCDFDALLFRLPAR*
Ga0075020_165087 | Ga0075020_1650871 | F014264 | GSGTMIRRAKRIRDYFWRGGERWKIGQLVRVSFSVWKRVEHYNPEGSRGPDGE*




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice