


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300007217 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0114292 / Gp0119833 / Ga0079268 |
| Sample Name | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - KN S10 NT17 metaT (Metagenome Metatranscriptome) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |

| Dataset Contents | |
|---|---|
| Total Genome Size | 100535059 |
| Sequencing Scaffolds | 10 |
| Novel Protein Genes | 11 |
| Associated Families | 9 |

| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 1 |
| Not Available | 4 |
| All Organisms → cellular organisms → Eukaryota → Sar | 1 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Diptera → Nematocera → Culicomorpha → Culicoidea → Culicidae → Anophelinae → Anopheles → Cellia → Pyretophorus → gambiae species complex → Anopheles gambiae | 1 |
| All Organisms → cellular organisms → Eukaryota | 2 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 1 |

| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Marine Microbial Communities From The Southern Atlantic Ocean Transect To Study Dissolved Organic Matter And Carbon Cycling |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine → Marine Microbial Communities From The Southern Atlantic Ocean Transect To Study Dissolved Organic Matter And Carbon Cycling |

| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → sea water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |

| Location Information | |
|---|---|
| Location | Southern Atlantic Ocean |
| Latitude (°) | -28.2362 |
| Longitude (°) | -38.4949 |
| Altitude (m) | N/A |
| Depth (m) | N/A |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000237 | Metagenome / Metatranscriptome | 1498 | Y |
| F000481 | Metagenome / Metatranscriptome | 1089 | Y |
| F001926 | Metagenome / Metatranscriptome | 616 | Y |
| F005202 | Metatranscriptome | 408 | N |
| F022892 | Metagenome / Metatranscriptome | 212 | Y |
| F041786 | Metagenome / Metatranscriptome | 159 | Y |
| F042355 | Metagenome / Metatranscriptome | 158 | Y |
| F090441 | Metatranscriptome | 108 | N |
| F098751 | Metatranscriptome | 103 | N |

| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0079268_1004121 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 617 | Open in IMG/M |
| Ga0079268_1012796 | Not Available | 775 | Open in IMG/M |
| Ga0079268_1017679 | Not Available | 622 | Open in IMG/M |
| Ga0079268_1020784 | All Organisms → cellular organisms → Eukaryota → Sar | 636 | Open in IMG/M |
| Ga0079268_1276399 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Diptera → Nematocera → Culicomorpha → Culicoidea → Culicidae → Anophelinae → Anopheles → Cellia → Pyretophorus → gambiae species complex → Anopheles gambiae | 533 | Open in IMG/M |
| Ga0079268_1284610 | All Organisms → cellular organisms → Eukaryota | 510 | Open in IMG/M |
| Ga0079268_1301824 | Not Available | 528 | Open in IMG/M |
| Ga0079268_1310690 | All Organisms → cellular organisms → Eukaryota | 563 | Open in IMG/M |
| Ga0079268_1315763 | Not Available | 552 | Open in IMG/M |
| Ga0079268_1324325 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 801 | Open in IMG/M |

| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0079268_1004121 | Ga0079268_10041211 | F001926 | VRHEDIPETNNPRIRFPVLEGDKLMKEVLNGGYNVHPVPPPNSEIKERSARRYERKRIKIERLLTLGYTTSGDP* |
| Ga0079268_1012796 | Ga0079268_10127961 | F090441 | WELATLSFPDSLALGALITRRVIVGCQFPDAASHRKNDSDTFRSRNGT* |
| Ga0079268_1017679 | Ga0079268_10176791 | F090441 | IESEDFIKGEVWELATLSFLDSLALGALITRRVIVGCQFPDAASYRKNDSDMFRSHSGT* |
| Ga0079268_1020784 | Ga0079268_10207842 | F001926 | VRHEDIPEINNPKIRFAVLEGDKLMKEVLNGGYNVHPVPPPKSEIKERITKRYERNRINIEKLLALGYTTSGDP* |
| Ga0079268_1276399 | Ga0079268_12763991 | F000481 | LIQKHEELIPTVQKTSLMVDLYWKCYAYGDELKPHIEFLDGIMLSSTRDIAPSCVENVDELIERQEKSLTQLDSKRGIVVDLIAKGKVILEHPDKPKFLEGNVKRIQDGWEDTKNKAQERLKLLNDTKEAWVGYANNNETIASEFEVAEEEIKKVKKRYALDDALSDLKKRQELYNK |
| Ga0079268_1284610 | Ga0079268_12846103 | F022892 | MNPIVNNIGVLKVILPPHKVAIQLKILIPVGTAIIIVAAVK* |
| Ga0079268_1292103 | Ga0079268_12921031 | F000237 | P*YYLYLVQLHVMFCHES*DTDSGESTIEDKSGTYVSWFYDAFLKEIQDA*F*SQFVFVYFAMHHFHASTVNYHFFER*NISELDEIRFYGVAPH*YFRPLMGLLTITPTHYEGLL*MGM*FGLLAALPLLNSFYNSGLRYVPVIPMQSSFVQTSAFILYMLSMYCAASMLPCGRYYYDPEGGYVGNP*VKFSYQYAYLYMA* |
| Ga0079268_1301824 | Ga0079268_13018241 | F098751 | LNLIDGFSHSQVEINLFHDTGIVLITGIFIFVAILGGSLVFNKLRDRKMLEDQALETI* |
| Ga0079268_1310690 | Ga0079268_13106901 | F005202 | QFKQMFNSYRSKRQASRRQGPRPAPRQTGDNLDLGDRLVEKLAEQKHHMEAKIGNMTCVMKELNVLDASNNIDVQAMKRDIQKYTMPSEWFKNKYEHLLDTCYEMATNLPADIAENSVVTGDSFGTVKLGEVKMFTKCCEKAKTKLCMHQDIKKKVESNFGPMEDILQETQLTEHEFFPLVMQLLHG |
| Ga0079268_1315763 | Ga0079268_13157632 | F042355 | MSSDLKKINKKNSGRKSNSQELKLVERLIKISRVSKVTKGG |
| Ga0079268_1324325 | Ga0079268_13243251 | F041786 | AEAFRHSLMTFGNKFSAGEVDDAFAEMKIDEGMIDAAHLKGLMVAK* |