


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300025243 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0053074 | Gp0053865 | Ga0208335 |
| Sample Name | Marine microbial communities from the Deep Atlantic Ocean - MP0759 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 121405408 |
| Sequencing Scaffolds | 35 |
| Novel Protein Genes | 38 |
| Associated Families | 34 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1 |
| All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 2 |
| Not Available | 19 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria | 2 |
| All Organisms → cellular organisms → Bacteria → FCB group → Candidatus Marinimicrobia → Candidatus Marinimicrobia bacterium | 3 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1 |
| All Organisms → cellular organisms → Eukaryota | 3 |
| All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → unclassified Nitrososphaerales → Nitrososphaerales archaeon | 1 |
| All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Deep Ocean Microbial Communities From The Global Malaspina Expedition |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean → Deep Ocean Microbial Communities From The Global Malaspina Expedition |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → sea water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | West of Cape Town, South Africa, South Atlantic Ocean | |||||||
| Coordinates | Lat. (o) | -32.81 | Long. (o) | 12.77 | Alt. (m) | N/A | Depth (m) | 3901.64 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F001048 | Metagenome / Metatranscriptome | 793 | Y |
| F001625 | Metagenome / Metatranscriptome | 661 | Y |
| F002030 | Metagenome | 601 | Y |
| F002753 | Metagenome / Metatranscriptome | 532 | Y |
| F003122 | Metagenome / Metatranscriptome | 506 | Y |
| F005608 | Metagenome / Metatranscriptome | 395 | Y |
| F006551 | Metagenome / Metatranscriptome | 370 | Y |
| F007773 | Metagenome | 345 | Y |
| F008648 | Metagenome | 330 | Y |
| F008912 | Metagenome / Metatranscriptome | 326 | Y |
| F016284 | Metagenome / Metatranscriptome | 248 | Y |
| F018739 | Metagenome / Metatranscriptome | 233 | Y |
| F019388 | Metagenome / Metatranscriptome | 230 | N |
| F023136 | Metagenome / Metatranscriptome | 211 | Y |
| F024888 | Metagenome | 204 | Y |
| F031665 | Metagenome / Metatranscriptome | 182 | Y |
| F034700 | Metagenome | 174 | Y |
| F035337 | Metagenome | 172 | N |
| F041248 | Metagenome / Metatranscriptome | 160 | Y |
| F042383 | Metagenome / Metatranscriptome | 158 | Y |
| F045695 | Metagenome / Metatranscriptome | 152 | Y |
| F047117 | Metagenome | 150 | Y |
| F050430 | Metagenome / Metatranscriptome | 145 | N |
| F055778 | Metagenome / Metatranscriptome | 138 | Y |
| F063768 | Metagenome / Metatranscriptome | 129 | N |
| F066848 | Metagenome / Metatranscriptome | 126 | N |
| F080654 | Metagenome / Metatranscriptome | 115 | N |
| F081452 | Metagenome / Metatranscriptome | 114 | N |
| F082562 | Metagenome | 113 | N |
| F085813 | Metagenome | 111 | N |
| F089050 | Metagenome / Metatranscriptome | 109 | Y |
| F089571 | Metagenome | 109 | N |
| F094378 | Metagenome / Metatranscriptome | 106 | Y |
| F099485 | Metagenome / Metatranscriptome | 103 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0208335_1000053 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 52122 | Open in IMG/M |
| Ga0208335_1000733 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 10803 | Open in IMG/M |
| Ga0208335_1000959 | Not Available | 8845 | Open in IMG/M |
| Ga0208335_1001831 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5389 | Open in IMG/M |
| Ga0208335_1003867 | All Organisms → cellular organisms → Bacteria | 3211 | Open in IMG/M |
| Ga0208335_1006108 | Not Available | 2385 | Open in IMG/M |
| Ga0208335_1006133 | All Organisms → cellular organisms → Bacteria | 2378 | Open in IMG/M |
| Ga0208335_1006821 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 2223 | Open in IMG/M |
| Ga0208335_1009311 | All Organisms → cellular organisms → Bacteria → FCB group → Candidatus Marinimicrobia → Candidatus Marinimicrobia bacterium | 1804 | Open in IMG/M |
| Ga0208335_1010063 | Not Available | 1712 | Open in IMG/M |
| Ga0208335_1010803 | All Organisms → cellular organisms → Bacteria → FCB group → Candidatus Marinimicrobia → Candidatus Marinimicrobia bacterium | 1633 | Open in IMG/M |
| Ga0208335_1015357 | Not Available | 1287 | Open in IMG/M |
| Ga0208335_1016977 | Not Available | 1203 | Open in IMG/M |
| Ga0208335_1018587 | Not Available | 1132 | Open in IMG/M |
| Ga0208335_1021407 | Not Available | 1024 | Open in IMG/M |
| Ga0208335_1026283 | Not Available | 890 | Open in IMG/M |
| Ga0208335_1026347 | All Organisms → cellular organisms → Bacteria → FCB group → Candidatus Marinimicrobia → Candidatus Marinimicrobia bacterium | 888 | Open in IMG/M |
| Ga0208335_1028505 | Not Available | 840 | Open in IMG/M |
| Ga0208335_1028993 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 830 | Open in IMG/M |
| Ga0208335_1030315 | Not Available | 806 | Open in IMG/M |
| Ga0208335_1031811 | Not Available | 779 | Open in IMG/M |
| Ga0208335_1038350 | Not Available | 680 | Open in IMG/M |
| Ga0208335_1039163 | Not Available | 670 | Open in IMG/M |
| Ga0208335_1040164 | Not Available | 658 | Open in IMG/M |
| Ga0208335_1042757 | All Organisms → cellular organisms → Eukaryota | 629 | Open in IMG/M |
| Ga0208335_1044173 | Not Available | 614 | Open in IMG/M |
| Ga0208335_1045186 | Not Available | 604 | Open in IMG/M |
| Ga0208335_1046307 | Not Available | 594 | Open in IMG/M |
| Ga0208335_1046407 | All Organisms → cellular organisms → Eukaryota | 593 | Open in IMG/M |
| Ga0208335_1047202 | All Organisms → cellular organisms → Eukaryota | 586 | Open in IMG/M |
| Ga0208335_1051092 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → unclassified Nitrososphaerales → Nitrososphaerales archaeon | 551 | Open in IMG/M |
| Ga0208335_1052440 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 540 | Open in IMG/M |
| Ga0208335_1053528 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium | 532 | Open in IMG/M |
| Ga0208335_1056227 | Not Available | 512 | Open in IMG/M |
| Ga0208335_1056905 | Not Available | 508 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0208335_1000053 | Ga0208335_100005318 | F063768 | MDLHATPDPLTTAEGVAEFVVLAELSTALSERLGGWSTDGADPDSSATMASLAARLDEHSTWWTGRIPESVLLEGERTAASGSGQLAEVLGLLDVAASDRRAAVEPVLDRLVAYLVALAERLSPLGDAPAIRTIRLVLADLEDRPR |
| Ga0208335_1000733 | Ga0208335_100073314 | F099485 | MKQQIIHNWQHSPPKTQVARTPVKAVSKPLRQSQTITNEDLMGLSWIMFEESD |
| Ga0208335_1000959 | Ga0208335_10009594 | F085813 | MDQFEIIWILVTVIGALNYGFNAYQKGFRRLLKWDVISLIILDTLMFVPLSLIVGL |
| Ga0208335_1001831 | Ga0208335_10018315 | F066848 | MRFSTRSIHSLRHPQDDVYRAVSTEVGPGFAMGSSDSVLRTVLAGARQIAADQRLPVDPVAKGAGSEPELNLVA |
| Ga0208335_1003867 | Ga0208335_10038671 | F085813 | TLNYGLNAYQKGFRRLLKWDVIFLIVLNTLMFVPLSLIVGL |
| Ga0208335_1006108 | Ga0208335_10061081 | F042383 | MKGIKMETTYNNFSNPDDFYEGPSDEELKKIEEDLERYSD |
| Ga0208335_1006133 | Ga0208335_10061331 | F082562 | MKIIILGILISLSTNLFAQTNDSLRNYNLDIELSIIHTILGCFGGTIGIEKNHSGFGVMGFGTKLNHMTKHYLMENAEELAVYNWGIELYSDYYFKQNHRGLFIGTFVSLNGFRFTDIPNPQTILVLYAVPRIGYRKYLPKKKNAFYFQPSLTLHSKVWDNQEQFLYREIDTKRIFLLSQLTIGMKI |
| Ga0208335_1006821 | Ga0208335_10068212 | F055778 | MLSNVDLPEPELPTIRTISPRSIESDTLSRAFIFNSPSP |
| Ga0208335_1009311 | Ga0208335_10093112 | F089571 | MFIIYLDKENGRKMLKNIIVIVFTVLFLVVPASSQHAQQKYAITDKDKKVILNDDGTWEYIEEGKQNNNYKRVPPKTYDYEFKPQTWEERLEQAAHEYWRIRAHAELEQLGIPVTRYDPTTSDVPLYLKLEGGRIRTDFDSFTPLARGEVIDISGLWDPDKGDNPTLKIQGKLFLQITMKLDDQKKLFSQYNQRNYIPISEDFQRILKFRDSLDVISLGMHDYNRDNTHIWPGIPHSFLYAPPGSPYDPDNPPVIRGDGFPYIQRGHLTLARRE |
| Ga0208335_1010063 | Ga0208335_10100634 | F005608 | MKKATIEILEEGELIFGSPTAGKYFVRRYEDETEMGGGFFKTKKEAETHAREYKKSE |
| Ga0208335_1010803 | Ga0208335_10108031 | F082562 | MNKLILSILITLSTVLFAQASDSSNNYFFNIELSLIHPILGGFGGTVGIEKNHFSYGLNSFGTKLNHMTKHYLLENAEELAVYNWGVELYSDYYFKQNHTGLFLGLILSLNGFRFNDIPNPQTILVLYSAPRIGYRLCLPKKLKSFYFQYSLTTHFKVWDDKKKFLYREIDTKSIFLLSQLTLGVKI |
| Ga0208335_1015357 | Ga0208335_10153571 | F008912 | MKNLVVTLGITGCGKSRWLKDKSPVIETDDLRIELLDDISDSTTQEGLIFGTAAKKISKLFDTHDTVYFGATLVDSKYRIPFLQSIQDVCKHKLVIDVVIFPGIPELSKERIARDLKSGVQRADSIKFIDEQYEQYLHTMSIFNKEKNFYRSIKRSAE |
| Ga0208335_1016977 | Ga0208335_10169772 | F094378 | SNHLGNFQAKGPYIITLERRLSLTEGVNLSGNKTEGVY |
| Ga0208335_1018587 | Ga0208335_10185872 | F035337 | MEYNLPVEITWNEEDQATIDIICDACEKRYIILTNDISNLEQCSFCGHYLEVSTEDIHEFQEDSWD |
| Ga0208335_1018587 | Ga0208335_10185873 | F047117 | KDITIEEGQEFANVKQQQINRKQMRITASFHMKHGLYKKPEFKFFQSIGMFNFFQGFKSDDLDLGVYHLYWDRDAENTIEYYDPKTGERTNNIKGDWRERWYDPGEEIKPVEMDNPYRPDPAKMAKWWMETQGSIQAKIMMKQQEKAETEDEENRPWHHIPKPQWEN |
| Ga0208335_1021407 | Ga0208335_10214073 | F002030 | DMIGFLIFGLVVLSTVCLWLLIEERKSPKFLIWFIPVLLVLVTSTYVTYTSILGFPKVGIPERGMYLRHYIDEPNWIYLWVLSKKNVPMSYQLVYTREKHNALEGVRGKAEEGNFMILGKDEVLGYGNGGDDHRGSSPGSGYTIGGDISFYKWDFAENMPPKNTQE |
| Ga0208335_1026283 | Ga0208335_10262831 | F024888 | MNLDKIDLKNIHKIVGEMVGAIQKEEMKGKTPDEFAQEKGFSDWLEYAAWSRHTGGDYQMMEMMLKSKWKEQDPEEFARQKKIEGDQSTREHSYIGILDEEKIKKKNNGHKYYKPVHKSEFSAEELEVIYDERGVGGEPVEYRW |
| Ga0208335_1026283 | Ga0208335_10262832 | F001048 | MEYFSNWSIMKIKSTKQFWWRFNHLKRNGGEITVTDRTTEMDVKDFNRIESLVHKRVRWELGSEGYSIWSECGYKDIPTLAKAYRIS |
| Ga0208335_1026347 | Ga0208335_10263471 | F023136 | SDKNAIIKKDRNTIPKDLILDFKFKICFVETIKEANIQNCVKKTIGITNSGVTAKNLTKPGAWA |
| Ga0208335_1028505 | Ga0208335_10285052 | F008648 | MKSFKGYLIERGTSLSDLLFLPRIGYYDQLMIPISSSMFKRIWPDTLRATVFHTTDSDGVKSISKLEGKKGQISAFFSMFSRYMEVGVATQGGVHAVLEMDADVLLSASGDVMSHLDQSGRRWTSISDLKETSRWTKFSAVEKDLEKMFDPLVKKYLKKGEFQENATVWELWRMA |
| Ga0208335_1028993 | Ga0208335_10289931 | F041248 | MRSKQHGKRLRQEGAIERIETSILVHEEQLKSNKNDDEKKLLKKKIERAQATVENTKGNM |
| Ga0208335_1030315 | Ga0208335_10303152 | F081452 | TLADAHLLSRLFMIPVLSLAQVAREDGVISDASRIFKDPTNFNIIISDKVA |
| Ga0208335_1031811 | Ga0208335_10318112 | F018739 | MTTQNYEPTLDLVKERMAKIDGLIEKFRDLPDVQQKLQGAKDALQESEEEIMNYFDLTDIVKRDLN |
| Ga0208335_1038350 | Ga0208335_10383501 | F080654 | NSRLLKKKPGIQDSKKINAEILETVVADIISEETAESEPVGKTNLEERQDLWNKIQKFDPEASAMDYYDNGEWDLEKRRSDVKVLLEKKRFGRYI |
| Ga0208335_1039163 | Ga0208335_10391632 | F003122 | MIELLIFGLVLLSVICLWLLIEGRKSPKFLIWFIPLLLILVSSTYMTYTAIQGYPKFETPKRGLYLKHYIDEPNWIYIWVIDKEQIPISYKIVYSREVHNALEGVKGKSEEGKFMVLGGEETEQAGMQGEEDGEHAGGHTIGGDISFYEWDYESSMPQKNPKE |
| Ga0208335_1040164 | Ga0208335_10401642 | F050430 | MDNKLKENLMVGTKIVGMLFFLIVLITTLVWPGGVDTLANLFFTKE |
| Ga0208335_1042757 | Ga0208335_10427571 | F002753 | NTVSDRFTPAAFDIESVKLTVSDRFTPAVFEIESVKLTDSDRFTPAVFDIESVNNTVSDRFTPVDFATESVKLTDSDRFTPSVFETESVKLTDSDRFTPSVFETESVKLTVSDRFTPAAFEIESVKLTVSDRFTPAVFDIESVNNTVSDRFTPAVFEIESVNDTDSDRFTGAVFEIESVKLTVSDRFTPAVFEIESVNNTVSDRFTPAV |
| Ga0208335_1044173 | Ga0208335_10441731 | F089050 | MVELNHDVVSFNSDTLVKGGHGVLVSVYVTKVGSGSNKVEFKNGT |
| Ga0208335_1044173 | Ga0208335_10441732 | F001625 | MALTISTSDWTNANVRKTLSWQAALVSKLRVYAIKVTFGSGDNYATNGVSADLKEGRISTLVAVIPTYSNISREVVYDKANEKIKILEVGGSTGSAMTELANSSSATNSKIFEFLVIGY |
| Ga0208335_1045186 | Ga0208335_10451862 | F031665 | MEEQITQWVAPLVGLGITAIIVLWFKEAVGDVVASIRWKMKPGFEPGDVVYLDGEEATIVSIGFRETIFELNNGRGKVWRYIYNKRIPYHKLEKIISGMGDDK |
| Ga0208335_1046307 | Ga0208335_10463071 | F019388 | MQSPNEQNPYLVETKNGKILKFNKIDADNEAATKQLDGDDVEVFHDGKLQYRLHGIEQGKLF |
| Ga0208335_1046407 | Ga0208335_10464071 | F002753 | AALDIESVNDTDSDRFTPAALDIESVNDTDSDRFTPTVFEIESVNNTVSDRFTPAVFDVSSVNDIDSDRFTPAVFEIESVKLTDSERFTPAAFAIESVKLTDSERFTPAAFAIESVKLTVSVRFTPADFEIESANNTVSDRFTSAALDVSSVNNTVSDRFTPAALDVSSVNNTVSDRFTPADFEIESANNTVSDRFT |
| Ga0208335_1047202 | Ga0208335_10472021 | F002753 | DTDSVRFTPAAFDIESVKLTVSDRFTPAAFDIESVKLAVSDRFTPAAFAIESVKITVSDRFTPAAFEIESVKLTVSDRFTPAVFEIESVKLTVSDRFTPAVFEIESVNDTDSDRFTPAVFEIESVKLTVSDRFTPAVFEIESVKLTVSDRFTPADFAIESVKLTVSDRFTPADFAIESVKLTVSDRFTPAVFEIE |
| Ga0208335_1051092 | Ga0208335_10510921 | F016284 | MTSKTLKNMRKELLDKYSGKITGYEHLDDGTGDYARIVPKDFEKESEETSEKT |
| Ga0208335_1052440 | Ga0208335_10524402 | F007773 | VSKNNIINLSDYRKEKEKETPQNPPQYIQDFEVGGYYIYPELGVMLHCMLITDSSHTHKNELMYIMEDQFGNIFSVPIDDPDSMMGWRLLEKEV |
| Ga0208335_1053528 | Ga0208335_10535282 | F034700 | MAEEEPIVDFGFSAVTADEYERDNTDGENTGSGGSASPGALASMDAKIEQIMAALSSKSDAPPDDFGFSQEDK |
| Ga0208335_1056227 | Ga0208335_10562272 | F045695 | MNSKREAFLRIWNKEAPKIRDTIKQKLLVRGIEPTQLNCYEYWMGYIKPWKNTRGDISHLNPYRNEGFMRSIGRTKYGTPYNMARRNKNKKR |
| Ga0208335_1056905 | Ga0208335_10569051 | F006551 | FESLDRIVSTSAEEGTLGGSGSGHYDPWAANATIDRDSGTAFDCTVESASGTIGTNGVLTDDTLRAFLRKIRIAAGKDPNVFLGSHEVYSEIQGLYMPSVRIPNPYGEALVQIDVNGIQSFKGTGVGIHVDSIYGIPFIPSKDAPSSSSDSSEVGRLFAFDTSDAEGYG |
| ⦗Top⦘ |