


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300022144 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0114818 / Gp0238668 / Ga0213855 |
| Sample Name | Metatranscriptome of freshwater sediment microbial communities from post-fracked creek in Pennsylvania, United States - G-2016_31 (Metagenome Metatranscriptome) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 56951643 |
| Sequencing Scaffolds | 12 |
| Novel Protein Genes | 19 |
| Associated Families | 15 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 5 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Oligohymenophorea | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Postciliodesmatophora → Heterotrichea → Heterotrichida | 2 |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Freshwater And Sediment Microbial Communities From Various Areas In North America, Analyzing Microbe Dynamics In Response To Fracking |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Watersheds → Freshwater And Sediment Microbial Communities From Various Areas In North America, Analyzing Microbe Dynamics In Response To Fracking |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater biome → bayou → sediment |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Water (non-saline) |
| Location Information | |
|---|---|
| Location | USA: Pennsylvania |
| Latitude (°) | 41.5097 |
| Longitude (°) | -76.4485 |
| Altitude (m) | N/A |
| Depth (m) | N/A |
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000212 | Metagenome / Metatranscriptome | 1580 | Y |
| F000251 | Metagenome / Metatranscriptome | 1457 | Y |
| F000344 | Metagenome / Metatranscriptome | 1257 | Y |
| F001602 | Metagenome / Metatranscriptome | 664 | Y |
| F001633 | Metagenome / Metatranscriptome | 660 | Y |
| F002706 | Metagenome / Metatranscriptome | 535 | Y |
| F020183 | Metagenome / Metatranscriptome | 225 | Y |
| F033681 | Metagenome / Metatranscriptome | 176 | Y |
| F046210 | Metagenome / Metatranscriptome | 151 | N |
| F056350 | Metagenome / Metatranscriptome | 137 | Y |
| F058120 | Metagenome / Metatranscriptome | 135 | Y |
| F065810 | Metagenome / Metatranscriptome | 127 | Y |
| F071274 | Metagenome / Metatranscriptome | 122 | Y |
| F082052 | Metagenome / Metatranscriptome | 113 | Y |
| F087278 | Metagenome / Metatranscriptome | 110 | Y |
| Scaffold | Taxonomy | Length |
|---|---|---|
| Ga0213855_1007451 | Not Available | 578 |
| Ga0213855_1010136 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 641 |
| Ga0213855_1021409 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Oligohymenophorea | 1223 |
| Ga0213855_1030310 | Not Available | 792 |
| Ga0213855_1035243 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria | 1348 |
| Ga0213855_1035560 | Not Available | 618 |
| Ga0213855_1039435 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Postciliodesmatophora → Heterotrichea → Heterotrichida | 776 |
| Ga0213855_1070926 | Not Available | 591 |
| Ga0213855_1074304 | All Organisms → Viruses → Predicted Viral | 3951 |
| Ga0213855_1118277 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Postciliodesmatophora → Heterotrichea → Heterotrichida | 834 |
| Ga0213855_1119790 | All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales | 3974 |
| Ga0213855_1121591 | Not Available | 779 |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0213855_1006385 | Ga0213855_10063851 | F000212 | LSLLVITLLTTYTYAASLKTSCHPQCSWKCDDPHCPAICDPVCEPPKCHTSCAEPKNAICDVKCEKPECEIKCPDKGCEMFDCPKCVTVCKAPHCVTHCQAPKPECEAVCEEPKCDWKCHKPNCPKPKCELVCENPNCVPKVECCPCNNGGARVATALPFFKESETNKQCCSCAK |
| Ga0213855_1007451 | Ga0213855_10074511 | F033681 | YVLLIVYITSAFCGLPTTPKAADLNDHFGTEPTLGIYGPKPPTIGPSLMRRGAAAGLPITPIRNFEREIFPPNVKSGDLTNTSYDASRILTPTIARPKAEIKAQFLHAAVVKTPVQLGTVVQENTVTTLNRLDGKIYTKKVVAEKPVLGVLHTTKNVWTPHSATVDLLSGKLLGGADKPKYHGLK |
| Ga0213855_1010136 | Ga0213855_10101362 | F001633 | LPDATLRGMRMSRSHGGTVLTVAGRDLSSEASAPSSDAPCRERLAGRGADTPAISIVSRLHLYHGDGASFWLAVGPALRV |
| Ga0213855_1021153 | Ga0213855_10211531 | F087278 | ILNWIRSPCETESPKTLPFKKIDQLLPKKPMQTPEDRAKDIDNTLTWLRDRGVNLVAEQAPKFQGVDTISIDHRTPEQKARDVDDILNWMRNPRENDSPKAEPFKRIDQLLPKKPGQSPEDRANDIDNTLTWLRNRGVDEPQFGPEEPFKNAKNVPVPDNRSPEEKLADLDDILNWVRNPKENDTPQTEPFKRIDQLLPKKPGQSPEDRAKDIDNALTWVRNRGVDQPE |
| Ga0213855_1021409 | Ga0213855_10214091 | F056350 | KLSLFSFVGYFVTLGVSFHDFMSLFGFFTFLIIANQLVSGTMLAFSLVPESMMVPIVRDEEDLEDLYTDDFFWLHERGVDLLFIFVFFHLLRKLYLVILDYEQEFA |
| Ga0213855_1030310 | Ga0213855_10303101 | F000344 | MRPKHPPAAESGVGKHTARESERAQACANGKERVAHAHPQIWPR |
| Ga0213855_1035243 | Ga0213855_10352431 | F065810 | LNSFQNKYIYYAIDDILETFQNSPVERDQLLGLIYSPILSLQNHFSINFFDIWIRQVYIEEIVKPNKFLTKESEGFIQFTSITIEFIYNVKAPVKKPEALW |
| Ga0213855_1035560 | Ga0213855_10355601 | F082052 | ECEIKCPDKGCEMFDCPKCVTVCKAPHCVTHCQAPKPECEAVCEEPKCDWKCHKPNCPKPKCELVCENPNCTPKVECCPCQGGASGVSPAFPFFKETETNQQCCGCGKFRTGSGAPSENPSIVGVNGSLTNTFPSTQIVDNGTHFLRLDKRAKVEMFKTGRPIGTGQYGGTNGNNRDWETARPSLNLDLVHKDRY |
| Ga0213855_1039435 | Ga0213855_10394351 | F001602 | IFFLGMLSIAHSSLLKSQCHPQCSWKCDDPHCPAICDPVCEPPKCHTSCAEPKNAICDVKCEKPECEIKCPDKGCEMFDCPKCVTVCKAPHCVTHCQAPKPECEAVCEEPKCDWKCHKPNCPKPKCELVCENPNCVPKVDCCPCGQGGSRVANPLPFFKESERNPTCCGCSKPLRLVSPDNANTSKSTQIHDYGTHVVQYDSVSTQRPFKNPRQIGDAFYDRPDNLFKSFDQTAAPRIVN |
| Ga0213855_1051248 | Ga0213855_10512483 | F071274 | VTNLWTNVEDLGDYAESDYAYDAVKTASFLMWALSGRKFSGVTTVTERYVSTQDPY |
| Ga0213855_1070926 | Ga0213855_10709261 | F046210 | IVANIQMKNLKIEAEKGFRRTLFEPELVDPNLEINFVNDAGVAVKGCNVNS |
| Ga0213855_1074304 | Ga0213855_10743047 | F058120 | MIPAGEFGETLLETIKSKDALKSFTQVQQFRESMRDTTVGADYVAWISEPVNLTRVHKALAEDLSVPPRAMAIKRVLMSRTQRAVLLTQAMEIAIKRVYNL |
| Ga0213855_1095729 | Ga0213855_10957291 | F000212 | KIYVLLVILLTSYASATFMNTQCHPQCSWKCDDPHCPAICDPVCEPPKCHTSCAEPKNAICDVKCEKPECEIKCPDKGCEMFDCPKCVTVCKAPHCVTHCQAPKPECEAVCEEPKCDWKCHKPNCPKPKCELVCENPNCVPKVECCPCGMGGARVANPLPFFKETESNKQCCSCNK |
| Ga0213855_1096598 | Ga0213855_10965981 | F000251 | GFGEVWVKTMSDPKAFKIIHNIDRAGCCWFPTLCFCAKGQKARSYMVISENFIHGNNSCPVGVCCCLRDIPYTMYYDRQPFAETMCERSCKSILGGGDFVTEHDLCYCCYVIPCMPVYDILCRPCWGGYVARVSSNRSKELCFLLETRCCGHTAVPIMNFVGDSVKAKDAMTIELAMFRKNGGGEWALP |
| Ga0213855_1107470 | Ga0213855_11074701 | F000212 | LTLIFVVLLSSYVQSSSLKSTCHPQCSWKCDDPHCPAICDPVCEPPKCHTSCAEPKNAICDVKCEKPECEIKCPDKGCEMFDCPKCVTVCKAPHCVTHCQAPKPECEAVCEEPKCDWKCHKPNCPKPKCELVCENPNCVPKVECCPCNNGGARVASPLPFFKETEMNKQCCSCAK |
| Ga0213855_1118277 | Ga0213855_11182771 | F002706 | FVIKAALLLIFTTNILASMLKTGCHPQCSWKCDDPHCPAICDPVCEPPKCHTSCAEPKNALCDVKCEKPECEIKCPDKGCEMFDCPKCVTVCKAPHCVTHCQAPKPECEAVCEEPKCDWKCHKPNCPKPKCELVCENPNCVPKVECCNCNNGYPRVNTQLPFFKETENNSACCGCGAPSEIPNIDGINQALLLKQPKTTFIDSGSHLIRHEKTYDVQLLKSGVPLNQGQYIRTTGKNPHFDRQQPLLNLNLAHRDILNH |
| Ga0213855_1119790 | Ga0213855_11197903 | F020183 | MSLTVNAKTYNADSFQQNAVGYIGTLKTVSVKDDLALRRTSPKSTDAFSGVGRTSAKLTRTLTLTGAKTLAGDCIAEMSVSVPVGYAGADVDAVLNDMGAFIASASFKAHVKSQQVSF |
| Ga0213855_1121517 | Ga0213855_11215171 | F000212 | MVYSKCHPQCAWKCDDPKCPAICEPVCEQPKCHTSCAEPRNAVCDVKCEQPHCTIKCPDKGCSALDCPKCVTVCSQPNCVTHCQALKPECEAVCEEPRCDWKCHKPQCPVPKCELVCENPNCSPQVNCCACNNAAVGVYNGMMFKETEKNPTCCSCNGNNAPQQ |
| Ga0213855_1121591 | Ga0213855_11215911 | F000344 | MRPRHPHAAESGVGKHTARESERVQACAAGKEHVTNAH |