


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300013293 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0118444 | Gp0134408 | Ga0120688 |
| Sample Name | Aquatic prokaryotic and eukaryotic communities from a canal in New York, USA: aquatic canal water -GCSS-13 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Weill Cornell Medical College |
| Published? | Y |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 101482816 |
| Sequencing Scaffolds | 13 |
| Novel Protein Genes | 14 |
| Associated Families | 13 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 4 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobulbaceae → Candidatus Electrothrix → Candidatus Electrothrix marina | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → unclassified Anaerolineaceae → Anaerolineaceae bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Urban Prokaryotic And Eukaryotic Communities From The Subway And Surrounding Areas In New York, Usa |
| Type | Engineered |
| Taxonomy | Engineered → Built Environment → Canal → Unclassified → Unclassified → Aquatic Canal → Urban Prokaryotic And Eukaryotic Communities From The Subway And Surrounding Areas In New York, Usa |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA:New York City | |||||||
| Coordinates | Lat. (o) | 40.67 | Long. (o) | -73.99 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F004928 | Metagenome / Metatranscriptome | 418 | Y |
| F005564 | Metagenome / Metatranscriptome | 396 | Y |
| F016971 | Metagenome / Metatranscriptome | 243 | Y |
| F021019 | Metagenome / Metatranscriptome | 221 | Y |
| F021441 | Metagenome / Metatranscriptome | 219 | Y |
| F036294 | Metagenome / Metatranscriptome | 170 | Y |
| F048675 | Metagenome | 148 | Y |
| F056961 | Metagenome | 137 | N |
| F072362 | Metagenome / Metatranscriptome | 121 | Y |
| F077315 | Metagenome / Metatranscriptome | 117 | Y |
| F080659 | Metagenome | 115 | Y |
| F088437 | Metagenome | 109 | Y |
| F094059 | Metagenome / Metatranscriptome | 106 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0120688_1000994 | Not Available | 1560 | Open in IMG/M |
| Ga0120688_1003018 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales | 1121 | Open in IMG/M |
| Ga0120688_1005982 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 888 | Open in IMG/M |
| Ga0120688_1007055 | Not Available | 838 | Open in IMG/M |
| Ga0120688_1008588 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobulbaceae → Candidatus Electrothrix → Candidatus Electrothrix marina | 783 | Open in IMG/M |
| Ga0120688_1011679 | All Organisms → cellular organisms → Bacteria | 705 | Open in IMG/M |
| Ga0120688_1012163 | Not Available | 695 | Open in IMG/M |
| Ga0120688_1016729 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → unclassified Anaerolineaceae → Anaerolineaceae bacterium | 624 | Open in IMG/M |
| Ga0120688_1022448 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 567 | Open in IMG/M |
| Ga0120688_1027140 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 534 | Open in IMG/M |
| Ga0120688_1028642 | Not Available | 525 | Open in IMG/M |
| Ga0120688_1030664 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 514 | Open in IMG/M |
| Ga0120688_1032186 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 506 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0120688_1000994 | Ga0120688_10009942 | F077315 | MLICKASTLFLSRFACWKVWALLLLLVLLVPAQAEEASKPDVLPQAQATLERLEQQLANARTATAQELKKLKKEIDTVRSSAKDCVQQAEPKIEILDSELAIFQPEAPKDTQKKTVEETQPAAEPEAPFSPAIARQLQDLQSRKASLEGRMAICKLMLLRSNELESDVDDYLSSLQTRQLLARGPTLVGVVQAN |
| Ga0120688_1003018 | Ga0120688_10030182 | F056961 | MNVEHRTLNFEHRIMYSTIYNKDKAKRLPHSTFDVERSMFDVQIVASEITTKPSYHVKIMYTDQEF* |
| Ga0120688_1005982 | Ga0120688_10059821 | F088437 | FITEDSYRMAGVSQGLGIDTQQSGTWIALEQSQAIQPRALWALVQWFGQLSPAQLNLALGVEPPQYILEDEKHVKECGQRAYMPMVYTPKDALLWWVDYLHSASEDALTTSLARFKALSERLEHIVGATVDGWEGAQNALRTVFPGIAIEECHFHALLNLGKHLATYKRQRKRAGNPVLPTEEAAIRNAFLNVLKAKTQKAYQDALDELPTAFNHPTLASRKQSLTEKQALFQAWTTDAKLAVVSTPLDQCMKFLKRKFQNMQTFHGDKSGLATVNAWAITRNCWRFLKGAIRAG |
| Ga0120688_1006074 | Ga0120688_10060741 | F021019 | MTEHKYIVKIFTKYLQTSFEIESEKEINNMEELNKPIIDFLGKSDIKWEKNDLQYHGTGNDFYITYEEVNNGSAKDGVVREENTVRV* |
| Ga0120688_1007055 | Ga0120688_10070552 | F021441 | VGTKKIMKKIKHNDLVPWFIQDHGTLPASYLKSCRSFFDSIKLQASSIKPQASIQKNLHKPGTRVKNRFNRKI* |
| Ga0120688_1008588 | Ga0120688_10085881 | F036294 | MDRGSSDEAIVGVKPVADEDAVTYLRVKLSESGSDAGAEGRNM* |
| Ga0120688_1011679 | Ga0120688_10116791 | F056961 | CILPFYNKDEAKRLPHSTFDVERSMFDVQIVASEITTKPPYHIEIMYTGQEF* |
| Ga0120688_1012163 | Ga0120688_10121631 | F005564 | MKPSKQQVVTGTDMHKIIIGLLVIYFLGDPGMETIAWLGSMKNYVITAVIALATAPFIVS |
| Ga0120688_1016729 | Ga0120688_10167291 | F080659 | MTGIPILDTILDLIMQYRLGIAMMGLAVIAIGLLARPIAPEWSAHNRSAVAMMVIGGIILVLLPTLAAAIVGP* |
| Ga0120688_1022448 | Ga0120688_10224481 | F004928 | MAQALHSVETQPDHTVEIIVHVTENLGEKKRNELVAALEEDSGITTAEFCPLRYHLILVRYDRDTYSSQDVLNRVKAQNLDARLIGPV* |
| Ga0120688_1027140 | Ga0120688_10271401 | F094059 | FLDSFMYPQSFISLCGYFRDLPVGHYTARFLGVFIGAILGVLFVMFIGVLYKWVLKDWAYSKATRLGALAFILIALMTGFMSDMFNESIGMALHIEEGMPEDLVGKFLVEDYGSFEKGTKITVQNASILGAYGTVKGNNQAGARRWWISSLIFMYAWFGLYFLCAHRLFKRNKVPRKD |
| Ga0120688_1028642 | Ga0120688_10286421 | F072362 | MHKLFAIGVLIYLVGDPGLSMLASFSEARELIVASALALIAVPWVVSQIDN* |
| Ga0120688_1030664 | Ga0120688_10306641 | F048675 | MEEHMDSNEVLFCEHYAAAQSLKSFWTQLASTYGCMMIVLTPNMLIIKPHWFAKWLLSLLRLDLCHEIPITRIRGVIEKGKWSSYGKVELHFQTVEGKDQNILLYLKKYHDFVDKIKGSIHR* |
| Ga0120688_1032186 | Ga0120688_10321861 | F016971 | KFAHGDSTMDNAVQQSFIETKNAVEVVVYIKEDLGAEQRDLVVSALEKTDGIIGAEFCQLRNHLLLAKYDRDIFSSQDVLKSFNSLKLDARLIGPI* |
| ⦗Top⦘ |