Basic Information | |
---|---|
IMG/M Taxon OID | 3300018523 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0117946 | Gp0216950 | Ga0193088 |
Sample Name | Metatranscriptome of marine eukaryotic protist communities collected during Tara Oceans survey from station TARA_083 - TARA_N000001377 (ERX1782163-ERR1712191) |
Sequencing Status | Permanent Draft |
Sequencing Center | Canada's Michael Smith Genome Sciences Centre |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 17985905 |
Sequencing Scaffolds | 17 |
Novel Protein Genes | 22 |
Associated Families | 18 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
Not Available | 5 |
All Organisms → cellular organisms → Eukaryota | 8 |
All Organisms → cellular organisms → Eukaryota → Opisthokonta | 1 |
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Tunicata → Appendicularia → Copelata → Oikopleuridae → Oikopleura → Oikopleura dioica | 2 |
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine → Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | marine biome → marine water body → sea water |
Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
Location Information | |
---|---|
Location | South Atlantic Ocean: TARA_083 |
Latitude (°) | -54.3739 |
Longitude (°) | -65.1342 |
Altitude (m) | N/A |
Depth (m) | 5 |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000021 | Metagenome / Metatranscriptome | 6082 | Y |
F000048 | Metagenome / Metatranscriptome | 3365 | Y |
F000052 | Metagenome / Metatranscriptome | 3223 | Y |
F001255 | Metatranscriptome | 736 | N |
F004127 | Metatranscriptome | 451 | Y |
F005202 | Metatranscriptome | 408 | N |
F005379 | Metatranscriptome | 402 | Y |
F015056 | Metagenome / Metatranscriptome | 257 | Y |
F018668 | Metagenome / Metatranscriptome | 233 | Y |
F019431 | Metatranscriptome | 229 | Y |
F024947 | Metatranscriptome | 203 | N |
F026782 | Metatranscriptome | 196 | Y |
F030134 | Metagenome / Metatranscriptome | 186 | Y |
F040998 | Metagenome / Metatranscriptome | 160 | Y |
F050126 | Metagenome / Metatranscriptome | 145 | Y |
F065809 | Metatranscriptome | 127 | N |
F073014 | Metatranscriptome | 120 | N |
F100298 | Metatranscriptome | 102 | N |
Scaffold | Taxonomy | Length |
---|---|---|
Ga0193088_102362 | Not Available | 1077 |
Ga0193088_102543 | All Organisms → cellular organisms → Eukaryota | 1050 |
Ga0193088_104644 | All Organisms → cellular organisms → Eukaryota | 853 |
Ga0193088_105265 | All Organisms → cellular organisms → Eukaryota | 807 |
Ga0193088_105780 | All Organisms → cellular organisms → Eukaryota | 774 |
Ga0193088_107157 | Not Available | 698 |
Ga0193088_107708 | Not Available | 673 |
Ga0193088_107734 | Not Available | 672 |
Ga0193088_109662 | All Organisms → cellular organisms → Eukaryota | 595 |
Ga0193088_110228 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 576 |
Ga0193088_110237 | All Organisms → cellular organisms → Eukaryota | 576 |
Ga0193088_111056 | Not Available | 548 |
Ga0193088_111158 | All Organisms → cellular organisms → Eukaryota | 545 |
Ga0193088_111748 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Tunicata → Appendicularia → Copelata → Oikopleuridae → Oikopleura → Oikopleura dioica | 527 |
Ga0193088_111844 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Tunicata → Appendicularia → Copelata → Oikopleuridae → Oikopleura → Oikopleura dioica | 524 |
Ga0193088_111926 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida | 521 |
Ga0193088_112252 | All Organisms → cellular organisms → Eukaryota | 511 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0193088_102362 | Ga0193088_1023621 | F030134 | LIIKSINSFRTKINKYKKDKIQPVLQRFKLRSCTFLIVEQTNYEYCYNTQEKIIQHRGSRHIYR |
Ga0193088_102543 | Ga0193088_1025431 | F005202 | MINQGNNMDKLKKMMAAMSMAMNQDNSEFSYVQPYSTNSGKSDSFMDKMKMMMMKMKMRSNYQNDDEQLFNEFFKSMGSSDRMDNYRTGSYRKNDPMSQFKQMFNSYRSKRQATDSLDLGDRLVEKLAQQKHQMEAKIGNMTCVLKETNVLDASNNLDIQTMKSDMQQYTMPSPWFGQKYEQILDTCYEMATNLPAEIADNSVVSGSFGTVKLGEVKMFNKCCEKAETKLCMNQDIKTKVESNFGPMEELLQQTQLTEAEFFPLVMQLLHGQEMDYMFGSM |
Ga0193088_103418 | Ga0193088_1034181 | F000021 | ELLKMDTYKMNPEKDPLFRGEFEVVKELVATLKNGEAAKNVCDKVIDKNGTPKTGGTGIKQLRENIAESKLSYEIMDDAAQAFLKSKIMDNIHKYYYLIVFGGYMLEAAELARNAVPDDIKAEVTLKGGKCAIPANQLKLVKSFVQFVEENASLRGLIDSGKGNLQWERDIPQAALANLEALAAKDFKGNLGTIIHDIYQTAHVMFGDMPQGDHKKRAKYRFASKTLMRLLPANLKSEIETLIETKKMTLDLYEILGHCTWTAPKPT |
Ga0193088_104063 | Ga0193088_1040632 | F000021 | IKQLRENIAESKLSYEIMDDAAQAFLKTKIMDNIHKYFYLICFAAYLREAAILAKDQASEDDKKNFSLTGGKVSTPADQLKLAKTFEAYMTEHSKLRTIVDEGKGKLQWERDIPAEALANLESLAASDFKGNLGKIIHDIYQTAHGLFKDLPQGDHKKRAKYRFASKTLMRVLPAALKTEVEGLIASKTMTLDLYEILGQCTWGQAKA |
Ga0193088_104644 | Ga0193088_1046441 | F040998 | VLDVGLAGSVQESFDDNGVVHLVSDSLAGDFGWENAVFQDSLVDNFDTSTDWSFLLVATGAGGFGFLGHDFPGADDQNVFAGEFLFEFSDDFGLDFLPSGNLWGWDHQDQSVSSADIDFFDGADVKLHELFLGGGIGTVLDFENGLGDVVLDVGGSLVVFLLEFGGDGSKNHVRRVFGVSVSST |
Ga0193088_105265 | Ga0193088_1052652 | F040998 | VLDVGLAGSVQESFDDNGVVHLVSDSLAGDFGWENAVFQDSLVDNFDTSTDWSFLLVATGAGGFGFLGHDFPGADDQNVFAGEFLFEFSDDFGLDFLPSGNLWGWDHQDQSVSSADIDFFDGADVKLHELFLGGGIGTVLDFENGLGDVVLDVGGSLVVFLLEFGGDGSKNHVRRV |
Ga0193088_105780 | Ga0193088_1057801 | F073014 | MATDGYQNKEFSKNEPWYRQSSVGFNSNYNAKTTGQIYADQYASQYERNGKHSSDYRGANHFDSDFSQRSFWGAKTMNTPEGAVFHPRMSNQGITDKTSSNMGCHIKHQKFLMCALNLNQRAAVAQCRTEFRDYRDCQSDFTTGGNKSSKTYYQDLLKSVSRAQGDYWLDLTSRDYAKFASYSFEHRPNVGVNGNFQPGFMNSEKMAFDKYLGK |
Ga0193088_107157 | Ga0193088_1071572 | F005379 | VKADVPTVFLPGLLKDRLAQFPETPTGTYVETCNKVLMGYFGNLEVTSKTGMKRFLEQFEGYYANGGKCWRWDTADCKSAWKYGPWGEDLFMQRTMDDAEVMKKSDFTLTDTGTCPGMRPKADKESTDFVPSCDAAYKFVAVHPFRNLTAWQTCYDTFSKRQ |
Ga0193088_107206 | Ga0193088_1072061 | F000052 | EENFRKNCFINYEKIAFNETVEVCRTPLVKDCDVQGPEICRTEYESECWTKQEVHDVEDDVVECETIQDEKCEDETSGYTTFTKCSKWPREVCNVQKKAVKKYTPITGCTKEPRELCAPAGCGFKEGAEECYEKTQTVVQDAPKEECSLEPQRTCAHVTKLVPKLEPTEECVDVPKEVCTRSKSNPRKVKKPVVKKWCYVPSEESGLA |
Ga0193088_107656 | Ga0193088_1076561 | F065809 | VHGRTSWNLLGGYIEFDMDVCGAQVGVNNNFYTISPQGGPSSGYCDIQTNDSPICMELDIIENNGNCLGQTTWHVWGNKDGGCDQNGCYGQYHFDDGCQFHMRTEFGEDGSMTQYKNGQVIEVNGGPSEDEKNQIKQNMEDTGAAIASTQWTGWVPDDGSCGGSDSNGAQFAVTNVVVHAPQGIKFGPPPPTCAEPSPFSSAASV |
Ga0193088_107708 | Ga0193088_1077081 | F015056 | VHTWLKLDLSDVQDLALAADEFDDSLVDSHLVSIPGFGTFTTWSFSGGDPHSPGWHWSWSFDLDKTVVASAVDGLCSGADFSARLVDGFWALGRDGDSDVGFFDWGGGDFVVFLEIGHGNSIKVICKFDLRL |
Ga0193088_107734 | Ga0193088_1077341 | F015056 | VHTWFQLDLGDVQDLALAADEFDDSLVDSHLVSIPGFGTFTTWSFSGGDPHSPGWHWSWSFDLDKTVVASAVDGLCSGADFSARLVDGFWALGRDGDSDVGFFDWGGGDFVVFLEIGHGNSIKVICKFDLRL |
Ga0193088_108488 | Ga0193088_1084881 | F000048 | ALNEKVAAFAKSCENYFKVLMSADAAAKKMTTHDEADKEVATLKGRFDKVKAVSMEWTAKVDTLLKEWQLLDNTVTELNSWVAKDKSAEGENQFSLEKMESTLGELKNIFKQKEKLVDGL |
Ga0193088_109662 | Ga0193088_1096621 | F100298 | LGGLDDITVTSIGNAEAADSKVFTASGTEFDVVSGVVVDTGFAQHSVVLDFGSSESWGVRAQDDEFTFGSSELSEGLSVAEAVFTGLHDQLKSGVDGFGGVSFLRHIECR |
Ga0193088_110228 | Ga0193088_1102281 | F018668 | HQTIKMAFFKSLLIASVAAVAFAAPQGEGKNTEVGVKSQSSAPKCGNGQKIACCNSGDDLIGLNCLSIPILAIPIQQACGSNVAACCQTGDAEGNLINLEANCIAIPL |
Ga0193088_110237 | Ga0193088_1102371 | F019431 | PPPPSDKPKPLWFVPDGPATKENCNTEGEAVLAGSKPVFESPTSEIFVPEGKGTDICIFFAPPQPQEWQLGKGVSYNSKSCSEGCCFYTPASSKRVQEIEAEVEKNLPTWFRTPAGGCDAVPSNTITSRLILNEDSNDSGNEAALCVKAKDKDTFELLTGETTSCKGKCCVFNCKSCKAPES |
Ga0193088_111056 | Ga0193088_1110561 | F026782 | MIFKFERKIKTKLPHKNNNKCLHEIFAMIRTERLIGRAILLTNSIKTKTGTKIGGDLGGVKFISINITLKLKVKIKIENQNLKELINIKIVPVIGTLKGNNPTKFKTTKINLVYCKPNLKVSFFVYKI |
Ga0193088_111158 | Ga0193088_1111581 | F001255 | SLPYVTYHAPTCKTEFEQVKGQSCVTKPVTECNEVEVPSHKIITEDVCQNVTTAECTHSPVTTLPAPAGVYYGKREAEADPQYLVGAIPAVVPYHNCVNKVTQVCYPVQKVEPTTITEQSCLLKGEVECTEVVVATIPRVTCSHAEHTKPADEAVETE |
Ga0193088_111748 | Ga0193088_1117481 | F050126 | MQLKVRKIELCECLAVGDSEEDVLHSLGSLGENVGEVLLGEGAVTAVLSDEGSEGGSDLVVELAEVLLVEDEGLIFFVDILGAWEGGVDLLNISEDLVNPLRGSGDVLVGGGGVVVGKIDEGIGVASDLLKSLDGVLDLSGGGNSDEGGGELHF |
Ga0193088_111844 | Ga0193088_1118441 | F050126 | RIFLLFLSMQLKVRKIELCECLAVGDSEEDVLHSLGSLGENVGEVLLGEGAVTAVLSDEGSEGGSDLVVELAEVLLVEDEGLIFFVDILGAWEAGVDLLNISEDLVNPCRGSGDVLVGGGGVVVGKIKKGIGVASDLLKSLDGVLDLSGGGNNDEGGGELHFFIIFFKFQTSPC |
Ga0193088_111926 | Ga0193088_1119261 | F004127 | VLFCEMRFLLAALVLVAGAWAQDTYNCPDGWMLEEDRSGCRCFMLSGGESVTRDDADILCAFHEAWVAELDHPGINYWLKSQLLAITEPGDWNQFWLGAKTTGRHDANHNNGEWLWQHRNATVDWFDWGNGQPNDWHNENCLVLREYHSFLFPNQRDYYWNDYFCDD |
Ga0193088_112252 | Ga0193088_1122521 | F024947 | FFAGWKKPFPTQDVQSINITKIQRDNLDNPQGPFCDPFEGSIHKSDFVPTKGTGTKVDPFIIPARNEFRMMWENVDMYGDFRPFKCWWLYEEGDGFTENSMMRSPFSGRYFKLAYDPTDYYGVDQVDMHSHAASQY |