


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300018498 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0117946 | Gp0217020 | Ga0193224 |
| Sample Name | Metatranscriptome of marine eukaryotic protist communities collected during Tara Oceans survey from station TARA_047 - TARA_G000000124 (ERX1782113-ERR1711919) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Canada's Michael Smith Genome Sciences Centre |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 1764614 |
| Sequencing Scaffolds | 25 |
| Novel Protein Genes | 29 |
| Associated Families | 24 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Eukaryota | 10 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta | 2 |
| Not Available | 3 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis | 3 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Siphonostomatoida → Caligidae → Lepeophtheirus → Lepeophtheirus salmonis | 2 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 2 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 2 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Acartiidae → Acartia → Acartia pacifica | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine → Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → deep chlorophyll maximum layer → sea water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Indian Ocean: TARA_047 | |||||||
| Coordinates | Lat. (o) | -2.0465 | Long. (o) | 72.1568 | Alt. (m) | N/A | Depth (m) | 55 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000073 | Metagenome / Metatranscriptome | 2639 | Y |
| F000147 | Metagenome / Metatranscriptome | 1917 | Y |
| F000717 | Metatranscriptome | 923 | Y |
| F001293 | Metagenome / Metatranscriptome | 729 | Y |
| F001636 | Metatranscriptome | 659 | Y |
| F002147 | Metatranscriptome | 589 | N |
| F002148 | Metatranscriptome | 589 | Y |
| F003495 | Metagenome / Metatranscriptome | 483 | Y |
| F004094 | Metagenome / Metatranscriptome | 453 | Y |
| F005379 | Metatranscriptome | 402 | Y |
| F005532 | Metatranscriptome | 397 | N |
| F007564 | Metatranscriptome | 348 | N |
| F010030 | Metatranscriptome | 309 | Y |
| F022575 | Metatranscriptome | 213 | Y |
| F023281 | Metagenome / Metatranscriptome | 210 | Y |
| F027856 | Metagenome / Metatranscriptome | 193 | Y |
| F029011 | Metatranscriptome | 189 | Y |
| F029316 | Metatranscriptome | 188 | N |
| F029641 | Metatranscriptome | 187 | N |
| F036075 | Metatranscriptome | 170 | Y |
| F041786 | Metagenome / Metatranscriptome | 159 | Y |
| F088178 | Metatranscriptome | 109 | N |
| F088204 | Metatranscriptome | 109 | Y |
| F100298 | Metatranscriptome | 102 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0193224_10001 | All Organisms → cellular organisms → Eukaryota | 2481 | Open in IMG/M |
| Ga0193224_10006 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 1523 | Open in IMG/M |
| Ga0193224_10007 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 1522 | Open in IMG/M |
| Ga0193224_10012 | All Organisms → cellular organisms → Eukaryota | 1426 | Open in IMG/M |
| Ga0193224_10160 | Not Available | 889 | Open in IMG/M |
| Ga0193224_10165 | All Organisms → cellular organisms → Eukaryota | 878 | Open in IMG/M |
| Ga0193224_10209 | All Organisms → cellular organisms → Eukaryota | 816 | Open in IMG/M |
| Ga0193224_10265 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis | 766 | Open in IMG/M |
| Ga0193224_10270 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Siphonostomatoida → Caligidae → Lepeophtheirus → Lepeophtheirus salmonis | 756 | Open in IMG/M |
| Ga0193224_10271 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis | 756 | Open in IMG/M |
| Ga0193224_10284 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 750 | Open in IMG/M |
| Ga0193224_10332 | Not Available | 720 | Open in IMG/M |
| Ga0193224_10338 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 717 | Open in IMG/M |
| Ga0193224_10370 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 702 | Open in IMG/M |
| Ga0193224_10491 | All Organisms → cellular organisms → Eukaryota | 636 | Open in IMG/M |
| Ga0193224_10547 | All Organisms → cellular organisms → Eukaryota | 609 | Open in IMG/M |
| Ga0193224_10654 | All Organisms → cellular organisms → Eukaryota | 569 | Open in IMG/M |
| Ga0193224_10700 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis | 554 | Open in IMG/M |
| Ga0193224_10706 | All Organisms → cellular organisms → Eukaryota | 550 | Open in IMG/M |
| Ga0193224_10709 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Acartiidae → Acartia → Acartia pacifica | 549 | Open in IMG/M |
| Ga0193224_10791 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 526 | Open in IMG/M |
| Ga0193224_10808 | All Organisms → cellular organisms → Eukaryota | 519 | Open in IMG/M |
| Ga0193224_10821 | Not Available | 517 | Open in IMG/M |
| Ga0193224_10861 | All Organisms → cellular organisms → Eukaryota | 507 | Open in IMG/M |
| Ga0193224_10882 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Siphonostomatoida → Caligidae → Lepeophtheirus → Lepeophtheirus salmonis | 502 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0193224_10001 | Ga0193224_100011 | F022575 | MQATATSNNNNKYNPIIRKIKFIDLNLGPXDPNKVNNKCPATILAARRIDKVIGRIIFLTVSIITIIGIRKAGVPVGTKCAKRLLYXKIIESNILPNHKGNAKVKVNDICLVLVKI |
| Ga0193224_10006 | Ga0193224_100061 | F010030 | VEYLHSVLLLQFKILLPEGVDTINHGLDELDLRVTKTMFVGNVIGVSSLTARFAAGTTGLQSKGLTPGLQAINTILGPAGQVNVDGGTHASAQVGGARVDVTELGGEQEVLARFSLDGVTDGLDAPGETLEDTLDVTSLLHGNDTELILLVDPDQESLVSVVEDTTALGPVTLHTGYLQVGVTRHEEEMVIDELLADLLIHTGQGVVGTGQVTFHFGECVLHQSLNIDTLLLGDTGGKTESLDAAANTDPAGVNRYISLNVASDLGGVHVGGVLEVGSKTMVLADEGVEDLSEVNIGVLITSIDTAMLVVELNCAGNSLGQGELRGLGNNFVELVPFVFSHVLGNQGVLGLDFGERSHGLCFNK |
| Ga0193224_10007 | Ga0193224_100072 | F010030 | VEYLHSVLLLQFKILLPEGVDTINHGLDELDLGVAKTVLVGNVIGVTSLTARLAAGTTGLEREGLAPLLQFVNGILGPAGKVNVDGGTHASAQVGGAGVDVAELGGEQEVLARLSLDGVTDGSNTTGKTLEDTLDVTTLLHGDDTELILLVDPDKEGLVGVVEDTAALGPVALHTSDLQVGVTGHEEEMVVDELLADLLIHTGQGVVGTSQVTGQFGEGVLHQSLNIDTLLLGDSGGKTETLDGATNADPAGVDGYIGLNVASDLGGVHVGGVLEVGSKTMVLADEGIEDISEVNIGVLITSVDAAVLVVEFNCASNGLGQGELRGLGDNLVELVPFLFSDVLGDQGVLGLDFGERSHGLSLINNLRKEDCTAQS |
| Ga0193224_10012 | Ga0193224_100121 | F001636 | HWRGEDGAHIWSSHHVTGAEGAWAGDVDVVAEYFQPEGPIMKGLICGQYPSHWIATKRSDKEVDVWGTVTLKVEGGGTYTAYVHEYMQFYEPCVTINRHYWKLEYLAAEYYPDHWYHKEAAVVDPNWTWGNVTDKTTFSWHLSGSLNNKAISADGYGYGHGYRQHHWGKGGKGFGEHGLPNLAWALAWEGHAGLHFFTRFPKGVVNPFIYSLPEGFTIERKWWGQDGDHWSTTHDVRYEGTNVNKRIQLVGSGFAPGSLMMWDGNKDKGAWIVKSYPQYALATPKGDRHIWYRGTFQFELNNGNFYSGYWEMDIHFRKPNPKMPNPYVVKFWPETWASKPHYWHFYEREEVEESTYDQIA |
| Ga0193224_10160 | Ga0193224_101601 | F029641 | LDSCISFFGDSFSDTLVSWERNQSLITFSQEEDVAGSGSENVSGSIFDVDNIERSWVSFSGLDGSHSTNVLTANDLAHVTGVEFDPVGNLVGGKVEFDGVADFAVWVWVSDGSAVVCDQEWNVVLFDEDFLDSAEFVRSFFGGDSVDDESAFGVIDESEVFVGLFDSDDIHVSSWVFHVGSHFSVNFDHSSLEDFLALVTGKGVVESVSDEDSQWHAFSKFVWTGVWSMSKDSSSFWEHPVVWSGQGFKMFLWSTSSHFR |
| Ga0193224_10165 | Ga0193224_101651 | F007564 | TWGVLVGQSDSYNMLVFLSLISAAMAAPQLYPYYGTYGMYGYPGYGYPMVHSPMVNYPASYYPAQRLIANYPTVGSPMAGTRNLIKFGNFLELNGLFEQVATATTTAPVTTVKGNFNIQQNGILDVFSGSEAKFSMYIMSSNDLTGKNIKVNLGTGASCLAASTGTLVELAMVNAPPSINGFYISGTTKGYNIDGMNSKTALMGSTNWLVLTESGTVIGCSKTALQ |
| Ga0193224_10196 | Ga0193224_101961 | F004094 | VGAQELGHAEGTDPVVAENLSHLLVGVEELLVLGVLEVVLLDVGPQLLDALGPGSLLLADDVSELGGELHGLGESGSLRHVESWLVFGAWSKIERKDEP |
| Ga0193224_10209 | Ga0193224_102091 | F029316 | VVTSVYSFAFGVDFNASLTVQFSQSVKVQFWLLDNLDFADVAVFDWVDWHSSFGDVRGDAVWEEFLDQLWNVTVGDLFGDDLGHLLSDLFDLLTLGVSGLFDLAVALFFGEGDDEDSEVVVVGGFAVNGALDHGLPFFDHTAHLVSCEGHAVEVQDAVFALDIFADEFEFSVSLAVIVKVGLVTVVNSTFESISGDLVTDGSCDQSVTDVSDFEDGWGFDGVPVLFGEWVDDLLLTS |
| Ga0193224_10265 | Ga0193224_102651 | F002147 | PDTYGAPAETYGAPDTSYSAPSYAADTGLDLTTIIIPLLALLGLFLLFPTYVTLTSVRRKREAIEDAGETNMVERMMDMYQAIIESEECLERVACDVGGLANDAGFDPTMAQLASVMVPNKYSKYMKQFASAKNCHKLKCGNLF |
| Ga0193224_10270 | Ga0193224_102701 | F004094 | VSAQELGHTESTNPVVAEDLGHLLVGVEELLVLRVLKVVLLDVSPQLLDALSPGGLLLADNVSELSGELHGLGESGSLRHVEFLRVVSVGFEKRLEG |
| Ga0193224_10271 | Ga0193224_102711 | F000717 | PQPHQQPELSFFARIVGDEGTARIVESISSWVNDKAKENPGCVENFVCEMYKTGETMSGLPYLLMSLTNAAVSFMVAEQFSDSIQMDAITRASKYGRTTGTCHLMECPLLNGKLREVTDWLAGFEELLGYVVNSVSTSLG |
| Ga0193224_10284 | Ga0193224_102841 | F003495 | MKEKYEKKVKNSKESSKLTKAPPKHTAALDKRSVLANVNSKGSGR |
| Ga0193224_10302 | Ga0193224_103021 | F004094 | VSADVLGHTESTDPVVAEDLGHLLVGVEELLVLGVLEVVLLDVGPQLLDALGPGSLLLADDVSELGGELHGLGESGSLRHVESWLVFGAWSKIERKDEP |
| Ga0193224_10332 | Ga0193224_103322 | F005379 | VPTVFLPGLLKGRLAQFPETPTGTYVETCNKVLMGYFGNLEVVSKTGMKRFLEQFEGYYANGGKCWRWDTPDCKKDWKYGPWGEDLFMQRTMDDAEVMKKSDFTLTDTGTCPGMRPKANKDDTEFVPSCVGADKFVAVHPFRNLTAWQTCYSTFSKKL |
| Ga0193224_10338 | Ga0193224_103381 | F001293 | MIXNSLICKLINVQACMKEKYEKKVKNSKESSKLTKAPPKHTAALDKRSVLANVNSKGSG |
| Ga0193224_10370 | Ga0193224_103701 | F041786 | AETFRAQLMTFGNKFSASEVDDAFAEFKIDDGQIDAAHLKGLMVSK |
| Ga0193224_10491 | Ga0193224_104911 | F088178 | INGCGVGGFSLGSGFGSFSLLVLVPFSSVGDSFGFQSGNNISVLPADFVAEFAQRSNFSAWKASDFFQSGGDLEFFLAIEWWWTTIEALESLVGGLASSGFVGEHTSNHSFDHHGWSSVVEWTVFWVGGGSMMHLFEHDQFVSVVVTGDEDTFASDDNDIVTSEEFLGDSGAKSTGEVVLRVNNDVLFKSFSHCRKNICKFDLRL |
| Ga0193224_10535 | Ga0193224_105351 | F000147 | KRKKAGTWINSNMFIQTWKMIKDEGAWSKWDWTVKVDCDAVFLPSRLRDYLGKVEVTDNGIYLENCKYVNYGFFGSLAVVSHDAAATYMANLDDCKASLNYMGSDKDTGNQPWGEDLFQQRCMDLHGVDKVAAYDINTDASCAAWRPEGQKKNGKWKPDCAVVQTPGIHHFKKPKDYFDCLKATQR |
| Ga0193224_10547 | Ga0193224_105471 | F088178 | MAELCDNLVTHFGQLGIDSSGVSFGNLGGVSLCLGLFLVVPFLSVGDSFSFKSSNDISVLPADFVAESSQRSDFSAWKRSDFFQGSGDLEFFLGVKWWWASIEALESLVGSLSSSGFVWKHTSDHSFDHHRWSSVVEWTVFWVGGGSVVHLLEHDELVSVVVTGDEDTFTSDNGDVVTSEKFFG |
| Ga0193224_10654 | Ga0193224_106542 | F036075 | VFDGFSGVSWSLEQQGVLSEWSLLSELVQRDDFSAGFQDSLSGGFSDSKGGEFDFWDLVASGVVGDGSDADDGLAFLGVGGKSGERQWWSVLSGHKKSFQHDLVEFAVGSSGQKSVQLDEHVQVEIIALGGFSD |
| Ga0193224_10700 | Ga0193224_107001 | F023281 | PDEFRELLDEYMDASERSLGQKQCEEFFQCPYSLKDAVKRNFSGNSL |
| Ga0193224_10706 | Ga0193224_107061 | F100298 | LSGLDDVTVTWIGDAQAADSKVFTASGTELDVVSRVVVDAGFAQHSVVLDFGSSESWGVRRQDDEFALGSSKLSESLSVSQAVFTGLHDQLKSRVDGLGGVSFFWHVERGFCSQL |
| Ga0193224_10709 | Ga0193224_107091 | F002148 | MGSQGATFAEAVRMVVKSGGNWKRPVNRMYTYNFQIGENYYLPMTSYLEGKNSPAVSADLPGPLCFSERISLHPLSGTSVEAYADQSVETIKKTRQVISSM |
| Ga0193224_10791 | Ga0193224_107911 | F005532 | DCPFGPGSCPVTLDNVVDVYFHDVDDIRSCQRECRQIEDCKFFTMKPESDDPHDHMKCFLFKTCDHLEACEQCITGPELPIIDDDHCNNDGNYTCETSLDNVVDVYYFDVEDTFSCKDQCNMLDDCNFWTFFDVPDAPQPHHKCFLFRECNIHEPCKTCETGPKGKYYPRR |
| Ga0193224_10808 | Ga0193224_108081 | F029011 | VSATGFGFLTFDWIAPPVSDTSVKSDFLHSFDILSKFVVQVVRGHLAVLTVFPVALPVQKPSWDFEVLWVGHDGLDLFHLGFGELTGSFGSVDFGFFQDQVGESSSHTSDSSEGVGDLD |
| Ga0193224_10821 | Ga0193224_108211 | F088204 | EVVSKNGMKRFLEQAETYYYNGGKCWRWDTDACKKQWKYGPWGEDLFMQKTMDDAEVGKKSDFTLTDTGTCPGMRPKADKKNTSYVPSCTDSGVSKFVAVHPMRSLDAWKSCYSAISR |
| Ga0193224_10830 | Ga0193224_108301 | F000073 | EGKGNLQWERDIPPAALANLENLAKTDFKANLGQIIHDIYQTAHQMFSDMPQGDHKKRAKYRFASKTLMRILPEANKKEVEGLIEQKAMTLDLYEILGTCTWVPPK |
| Ga0193224_10861 | Ga0193224_108611 | F029011 | VSTTGFGFLTFDWVAPPVSDTSMESDFLHPFNILSKFVVQVVTGHLAVLAVFPVSLPVQEPSWNLKVLWVGHNSLDLFHFGFSQLTGSFGSIDFGFFQDQVGESSAHTSDGGQSVGDFDVTVDVGVLHSQNMLEVFRVDE |
| Ga0193224_10882 | Ga0193224_108821 | F027856 | VNELNSWVAKDRGAETEQQFSLEKMESTLGELKNIFKEKERLVDNL |
| ⦗Top⦘ |