


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300018523 |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0117946 | Gp0216950 | Ga0193088 |
| Sample Name | Metatranscriptome of marine eukaryotic protist communities collected during Tara Oceans survey from station TARA_083 - TARA_N000001377 (ERX1782163-ERR1712191) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Canada's Michael Smith Genome Sciences Centre |
| Published? | N |
| Use Policy | Open |

| Dataset Contents | |
|---|---|
| Total Genome Size | 17985905 |
| Sequencing Scaffolds | 17 |
| Novel Protein Genes | 22 |
| Associated Families | 18 |

| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 5 |
| All Organisms → cellular organisms → Eukaryota | 8 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta | 1 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Tunicata → Appendicularia → Copelata → Oikopleuridae → Oikopleura → Oikopleura dioica | 2 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida | 1 |

| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine → Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |

| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → sea water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |

| Location Information | |
|---|---|
| Location | South Atlantic Ocean: TARA_083 |
| Latitude (°) | -54.3739 |
| Longitude (°) | -65.1342 |
| Altitude (m) | N/A |
| Depth (m) | 5 |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000021 | Metagenome / Metatranscriptome | 6082 | Y |
| F000048 | Metagenome / Metatranscriptome | 3365 | Y |
| F000052 | Metagenome / Metatranscriptome | 3223 | Y |
| F001255 | Metatranscriptome | 736 | N |
| F004127 | Metatranscriptome | 451 | Y |
| F005202 | Metatranscriptome | 408 | N |
| F005379 | Metatranscriptome | 402 | Y |
| F015056 | Metagenome / Metatranscriptome | 257 | Y |
| F018668 | Metagenome / Metatranscriptome | 233 | Y |
| F019431 | Metatranscriptome | 229 | Y |
| F024947 | Metatranscriptome | 203 | N |
| F026782 | Metatranscriptome | 196 | Y |
| F030134 | Metagenome / Metatranscriptome | 186 | Y |
| F040998 | Metagenome / Metatranscriptome | 160 | Y |
| F050126 | Metagenome / Metatranscriptome | 145 | Y |
| F065809 | Metatranscriptome | 127 | N |
| F073014 | Metatranscriptome | 120 | N |
| F100298 | Metatranscriptome | 102 | N |

| Scaffold | Taxonomy | Length (bp) |
|---|---|---|
| Ga0193088_102362 | Not Available | 1077 |
| Ga0193088_102543 | All Organisms → cellular organisms → Eukaryota | 1050 |
| Ga0193088_104644 | All Organisms → cellular organisms → Eukaryota | 853 |
| Ga0193088_105265 | All Organisms → cellular organisms → Eukaryota | 807 |
| Ga0193088_105780 | All Organisms → cellular organisms → Eukaryota | 774 |
| Ga0193088_107157 | Not Available | 698 |
| Ga0193088_107708 | Not Available | 673 |
| Ga0193088_107734 | Not Available | 672 |
| Ga0193088_109662 | All Organisms → cellular organisms → Eukaryota | 595 |
| Ga0193088_110228 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 576 |
| Ga0193088_110237 | All Organisms → cellular organisms → Eukaryota | 576 |
| Ga0193088_111056 | Not Available | 548 |
| Ga0193088_111158 | All Organisms → cellular organisms → Eukaryota | 545 |
| Ga0193088_111748 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Tunicata → Appendicularia → Copelata → Oikopleuridae → Oikopleura → Oikopleura dioica | 527 |
| Ga0193088_111844 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Tunicata → Appendicularia → Copelata → Oikopleuridae → Oikopleura → Oikopleura dioica | 524 |
| Ga0193088_111926 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida | 521 |
| Ga0193088_112252 | All Organisms → cellular organisms → Eukaryota | 511 |

| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0193088_102362 | Ga0193088_1023621 | F030134 | LIIKSINSFRTKINKYKKDKIQPVLQRFKLRSCTFLIVEQTNYEYCYNTQEKIIQHRGSRHIYR |
| Ga0193088_102543 | Ga0193088_1025431 | F005202 | MINQGNNMDKLKKMMAAMSMAMNQDNSEFSYVQPYSTNSGKSDSFMDKMKMMMMKMKMRSNYQNDDEQLFNEFFKSMGSSDRMDNYRTGSYRKNDPMSQFKQMFNSYRSKRQATDSLDLGDRLVEKLAQQKHQMEAKIGNMTCVLKETNVLDASNNLDIQTMKSDMQQYTMPSPWFGQKYEQILDTCYEMATNLPAEIADNSVVSGSFGTVKLGEVKMFNKCCEKAETKLCMNQDIKTKVESNFGPMEELLQQTQLTEAEFFPLVMQLLHGQEMDYMFGSM |
| Ga0193088_103418 | Ga0193088_1034181 | F000021 | ELLKMDTYKMNPEKDPLFRGEFEVVKELVATLKNGEAAKNVCDKVIDKNGTPKTGGTGIKQLRENIAESKLSYEIMDDAAQAFLKSKIMDNIHKYYYLIVFGGYMLEAAELARNAVPDDIKAEVTLKGGKCAIPANQLKLVKSFVQFVEENASLRGLIDSGKGNLQWERDIPQAALANLEALAAKDFKGNLGTIIHDIYQTAHVMFGDMPQGDHKKRAKYRFASKTLMRLLPANLKSEIETLIETKKMTLDLYEILGHCTWTAPKPT |
| Ga0193088_104063 | Ga0193088_1040632 | F000021 | IKQLRENIAESKLSYEIMDDAAQAFLKTKIMDNIHKYFYLICFAAYLREAAILAKDQASEDDKKNFSLTGGKVSTPADQLKLAKTFEAYMTEHSKLRTIVDEGKGKLQWERDIPAEALANLESLAASDFKGNLGKIIHDIYQTAHGLFKDLPQGDHKKRAKYRFASKTLMRVLPAALKTEVEGLIASKTMTLDLYEILGQCTWGQAKA |
| Ga0193088_104644 | Ga0193088_1046441 | F040998 | VLDVGLAGSVQESFDDNGVVHLVSDSLAGDFGWENAVFQDSLVDNFDTSTDWSFLLVATGAGGFGFLGHDFPGADDQNVFAGEFLFEFSDDFGLDFLPSGNLWGWDHQDQSVSSADIDFFDGADVKLHELFLGGGIGTVLDFENGLGDVVLDVGGSLVVFLLEFGGDGSKNHVRRVFGVSVSST |
| Ga0193088_105265 | Ga0193088_1052652 | F040998 | VLDVGLAGSVQESFDDNGVVHLVSDSLAGDFGWENAVFQDSLVDNFDTSTDWSFLLVATGAGGFGFLGHDFPGADDQNVFAGEFLFEFSDDFGLDFLPSGNLWGWDHQDQSVSSADIDFFDGADVKLHELFLGGGIGTVLDFENGLGDVVLDVGGSLVVFLLEFGGDGSKNHVRRV |
| Ga0193088_105780 | Ga0193088_1057801 | F073014 | MATDGYQNKEFSKNEPWYRQSSVGFNSNYNAKTTGQIYADQYASQYERNGKHSSDYRGANHFDSDFSQRSFWGAKTMNTPEGAVFHPRMSNQGITDKTSSNMGCHIKHQKFLMCALNLNQRAAVAQCRTEFRDYRDCQSDFTTGGNKSSKTYYQDLLKSVSRAQGDYWLDLTSRDYAKFASYSFEHRPNVGVNGNFQPGFMNSEKMAFDKYLGK |
| Ga0193088_107157 | Ga0193088_1071572 | F005379 | VKADVPTVFLPGLLKDRLAQFPETPTGTYVETCNKVLMGYFGNLEVTSKTGMKRFLEQFEGYYANGGKCWRWDTADCKSAWKYGPWGEDLFMQRTMDDAEVMKKSDFTLTDTGTCPGMRPKADKESTDFVPSCDAAYKFVAVHPFRNLTAWQTCYDTFSKRQ |
| Ga0193088_107206 | Ga0193088_1072061 | F000052 | EENFRKNCFINYEKIAFNETVEVCRTPLVKDCDVQGPEICRTEYESECWTKQEVHDVEDDVVECETIQDEKCEDETSGYTTFTKCSKWPREVCNVQKKAVKKYTPITGCTKEPRELCAPAGCGFKEGAEECYEKTQTVVQDAPKEECSLEPQRTCAHVTKLVPKLEPTEECVDVPKEVCTRSKSNPRKVKKPVVKKWCYVPSEESGLA |
| Ga0193088_107656 | Ga0193088_1076561 | F065809 | VHGRTSWNLLGGYIEFDMDVCGAQVGVNNNFYTISPQGGPSSGYCDIQTNDSPICMELDIIENNGNCLGQTTWHVWGNKDGGCDQNGCYGQYHFDDGCQFHMRTEFGEDGSMTQYKNGQVIEVNGGPSEDEKNQIKQNMEDTGAAIASTQWTGWVPDDGSCGGSDSNGAQFAVTNVVVHAPQGIKFGPPPPTCAEPSPFSSAASV |
| Ga0193088_107708 | Ga0193088_1077081 | F015056 | VHTWLKLDLSDVQDLALAADEFDDSLVDSHLVSIPGFGTFTTWSFSGGDPHSPGWHWSWSFDLDKTVVASAVDGLCSGADFSARLVDGFWALGRDGDSDVGFFDWGGGDFVVFLEIGHGNSIKVICKFDLRL |
| Ga0193088_107734 | Ga0193088_1077341 | F015056 | VHTWFQLDLGDVQDLALAADEFDDSLVDSHLVSIPGFGTFTTWSFSGGDPHSPGWHWSWSFDLDKTVVASAVDGLCSGADFSARLVDGFWALGRDGDSDVGFFDWGGGDFVVFLEIGHGNSIKVICKFDLRL |
| Ga0193088_108488 | Ga0193088_1084881 | F000048 | ALNEKVAAFAKSCENYFKVLMSADAAAKKMTTHDEADKEVATLKGRFDKVKAVSMEWTAKVDTLLKEWQLLDNTVTELNSWVAKDKSAEGENQFSLEKMESTLGELKNIFKQKEKLVDGL |
| Ga0193088_109662 | Ga0193088_1096621 | F100298 | LGGLDDITVTSIGNAEAADSKVFTASGTEFDVVSGVVVDTGFAQHSVVLDFGSSESWGVRAQDDEFTFGSSELSEGLSVAEAVFTGLHDQLKSGVDGFGGVSFLRHIECR |
| Ga0193088_110228 | Ga0193088_1102281 | F018668 | HQTIKMAFFKSLLIASVAAVAFAAPQGEGKNTEVGVKSQSSAPKCGNGQKIACCNSGDDLIGLNCLSIPILAIPIQQACGSNVAACCQTGDAEGNLINLEANCIAIPL |
| Ga0193088_110237 | Ga0193088_1102371 | F019431 | PPPPSDKPKPLWFVPDGPATKENCNTEGEAVLAGSKPVFESPTSEIFVPEGKGTDICIFFAPPQPQEWQLGKGVSYNSKSCSEGCCFYTPASSKRVQEIEAEVEKNLPTWFRTPAGGCDAVPSNTITSRLILNEDSNDSGNEAALCVKAKDKDTFELLTGETTSCKGKCCVFNCKSCKAPES |
| Ga0193088_111056 | Ga0193088_1110561 | F026782 | MIFKFERKIKTKLPHKNNNKCLHEIFAMIRTERLIGRAILLTNSIKTKTGTKIGGDLGGVKFISINITLKLKVKIKIENQNLKELINIKIVPVIGTLKGNNPTKFKTTKINLVYCKPNLKVSFFVYKI |
| Ga0193088_111158 | Ga0193088_1111581 | F001255 | SLPYVTYHAPTCKTEFEQVKGQSCVTKPVTECNEVEVPSHKIITEDVCQNVTTAECTHSPVTTLPAPAGVYYGKREAEADPQYLVGAIPAVVPYHNCVNKVTQVCYPVQKVEPTTITEQSCLLKGEVECTEVVVATIPRVTCSHAEHTKPADEAVETE |
| Ga0193088_111748 | Ga0193088_1117481 | F050126 | MQLKVRKIELCECLAVGDSEEDVLHSLGSLGENVGEVLLGEGAVTAVLSDEGSEGGSDLVVELAEVLLVEDEGLIFFVDILGAWEGGVDLLNISEDLVNPLRGSGDVLVGGGGVVVGKIDEGIGVASDLLKSLDGVLDLSGGGNSDEGGGELHF |
| Ga0193088_111844 | Ga0193088_1118441 | F050126 | RIFLLFLSMQLKVRKIELCECLAVGDSEEDVLHSLGSLGENVGEVLLGEGAVTAVLSDEGSEGGSDLVVELAEVLLVEDEGLIFFVDILGAWEAGVDLLNISEDLVNPCRGSGDVLVGGGGVVVGKIKKGIGVASDLLKSLDGVLDLSGGGNNDEGGGELHFFIIFFKFQTSPC |
| Ga0193088_111926 | Ga0193088_1119261 | F004127 | VLFCEMRFLLAALVLVAGAWAQDTYNCPDGWMLEEDRSGCRCFMLSGGESVTRDDADILCAFHEAWVAELDHPGINYWLKSQLLAITEPGDWNQFWLGAKTTGRHDANHNNGEWLWQHRNATVDWFDWGNGQPNDWHNENCLVLREYHSFLFPNQRDYYWNDYFCDD |
| Ga0193088_112252 | Ga0193088_1122521 | F024947 | FFAGWKKPFPTQDVQSINITKIQRDNLDNPQGPFCDPFEGSIHKSDFVPTKGTGTKVDPFIIPARNEFRMMWENVDMYGDFRPFKCWWLYEEGDGFTENSMMRSPFSGRYFKLAYDPTDYYGVDQVDMHSHAASQY |