


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300008550 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0117987 | Gp0126513 | Ga0103924 |
| Sample Name | Planktonic microbial communities from coastal waters of California, USA - Canon-21 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | University of Hawaii |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 4899064 |
| Sequencing Scaffolds | 20 |
| Novel Protein Genes | 24 |
| Associated Families | 21 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 2 |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus → Streptococcus pneumoniae | 1 |
| Not Available | 12 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Dependentiae → unclassified Candidatus Dependentiae → Candidatus Dependentiae bacterium ADurb.Bin331 | 2 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Sporadotrichida → Oxytrichidae | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Planktonic Microbial Communities From Coastal Waters Of California, Usa |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Marine → Coastal → Unclassified → Coastal Water → Planktonic Microbial Communities From Coastal Waters Of California, Usa |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → coastal water body → coastal sea water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Pacific Ocean | |||||||
| Coordinates | Lat. (o) | N/A | Long. (o) | N/A | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000055 | Metagenome / Metatranscriptome | 3096 | Y |
| F000075 | Metagenome / Metatranscriptome | 2622 | Y |
| F000237 | Metagenome / Metatranscriptome | 1498 | Y |
| F010914 | Metagenome / Metatranscriptome | 297 | Y |
| F019484 | Metagenome / Metatranscriptome | 229 | Y |
| F027881 | Metagenome / Metatranscriptome | 193 | Y |
| F042865 | Metagenome / Metatranscriptome | 157 | N |
| F043390 | Metagenome / Metatranscriptome | 156 | N |
| F045749 | Metagenome / Metatranscriptome | 152 | Y |
| F050360 | Metagenome / Metatranscriptome | 145 | Y |
| F051119 | Metagenome / Metatranscriptome | 144 | N |
| F054846 | Metagenome / Metatranscriptome | 139 | N |
| F058934 | Metagenome / Metatranscriptome | 134 | N |
| F058997 | Metagenome / Metatranscriptome | 134 | N |
| F063325 | Metatranscriptome | 129 | N |
| F074867 | Metagenome / Metatranscriptome | 119 | N |
| F077283 | Metagenome / Metatranscriptome | 117 | Y |
| F078696 | Metagenome / Metatranscriptome | 116 | N |
| F087089 | Metagenome / Metatranscriptome | 110 | N |
| F090440 | Metagenome / Metatranscriptome | 108 | N |
| F096902 | Metagenome / Metatranscriptome | 104 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0103924_10023 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2967 | Open in IMG/M |
| Ga0103924_10173 | All Organisms → Viruses → Predicted Viral | 1553 | Open in IMG/M |
| Ga0103924_10236 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus → Streptococcus pneumoniae | 1396 | Open in IMG/M |
| Ga0103924_10239 | Not Available | 1389 | Open in IMG/M |
| Ga0103924_10282 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Dependentiae → unclassified Candidatus Dependentiae → Candidatus Dependentiae bacterium ADurb.Bin331 | 1312 | Open in IMG/M |
| Ga0103924_10365 | Not Available | 1201 | Open in IMG/M |
| Ga0103924_10496 | Not Available | 1093 | Open in IMG/M |
| Ga0103924_10564 | Not Available | 1052 | Open in IMG/M |
| Ga0103924_10765 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 953 | Open in IMG/M |
| Ga0103924_10935 | Not Available | 890 | Open in IMG/M |
| Ga0103924_11533 | Not Available | 753 | Open in IMG/M |
| Ga0103924_11828 | Not Available | 712 | Open in IMG/M |
| Ga0103924_12981 | Not Available | 599 | Open in IMG/M |
| Ga0103924_13196 | Not Available | 584 | Open in IMG/M |
| Ga0103924_13234 | Not Available | 581 | Open in IMG/M |
| Ga0103924_13477 | Not Available | 567 | Open in IMG/M |
| Ga0103924_13695 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 555 | Open in IMG/M |
| Ga0103924_14119 | Not Available | 533 | Open in IMG/M |
| Ga0103924_14443 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Dependentiae → unclassified Candidatus Dependentiae → Candidatus Dependentiae bacterium ADurb.Bin331 | 520 | Open in IMG/M |
| Ga0103924_14476 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Sporadotrichida → Oxytrichidae | 518 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0103924_10023 | Ga0103924_100235 | F074867 | MSSRGITDIILNGEAFELHPTFSNLDKLETVLNKGAIGFLRQDLSSGAFKTGDVVSIIQVCAVPANGRKFPNWWNRDGVGEAVISAGLVGITTSVTHFLAKAFTAGTETDIKTVGSESDEKK* |
| Ga0103924_10023 | Ga0103924_100236 | F078696 | MKLWSSAVTYLNVQPSEAWNLTPFEFWALWDTHLEKMEISTGKAYTRPMTVDEFNELSDFLDELHGDN* |
| Ga0103924_10173 | Ga0103924_101732 | F058997 | MIIYKGQQMTVREACQLMGIDCDDFMVWCKKFALQNYGYALNYYKRTLKFKK* |
| Ga0103924_10236 | Ga0103924_102361 | F050360 | MSNAEFGAYQDMLRQCEESMFGGDVQDDDEQWHDVKRLLPPTSHMIWAACPNVVLCAYQTFLLYLDSDGQWRDNTGCLFSRKVNFWQYADVPECNISC* |
| Ga0103924_10239 | Ga0103924_102391 | F027881 | LTVARSERVRGVLTHANCTKIKFNLSSVRVSNSLMTFIKFV* |
| Ga0103924_10282 | Ga0103924_102823 | F051119 | MTIKEAFEQFDALRVANALGLEYGVVCKWRDREQIPVFWRTRFVNLMNHHGVSISLHDLAGWIK* |
| Ga0103924_10365 | Ga0103924_103652 | F087089 | LIKGIFGDTVLKKFDRVLVDAKRDRLGSTLGGVGGSQTYGRGATDKIMSQSIWKLDGAGASLGALLGTMLGGVGTMGGALGGAATAHAVKVLNMKQNALTSKAMLDPKYAAELLKKQIAPTQYKRGALESLKKNIAPSTIPAQDKNQ* |
| Ga0103924_10496 | Ga0103924_104961 | F010914 | MIYNIMTEQDGKFVATGETVECEFEETQEVVDELQVEHGRCCALEAVSE* |
| Ga0103924_10564 | Ga0103924_105642 | F019484 | CSFFYFLTILGGSLKKIAKKITTNVPISKKVYTNASII* |
| Ga0103924_10765 | Ga0103924_107651 | F043390 | MMSYQVKTEDLTKVISLTLTAEQLETIAGALEMYCIGLAEHNDPHLKYAADAQEAIIEVLESNFSVEP* |
| Ga0103924_10935 | Ga0103924_109352 | F050360 | MTNSYDNWFQDMLRDCENGLFGGDVHDESKDDVSDWTDVKTYLPEFSRMVWAAVPNVIICAHQTFLLYLDSDGQWRDNTGCLFGRRVNFWQYADVPECDISC* |
| Ga0103924_10966 | Ga0103924_109661 | F000237 | ESDG*LLGGYAFFFFHYIIFLGISLSANHLSDLTLTIGANIF*SLFNNAYKTYYIIFTNKHLNTDQLTRFMIFHYFTPFYYIYLVKMHVLFCHES*DTDSGETTFEDKSY*TLYVFVYFFLHHFNGATVNYFFFER*NISELDEVRFYGVAPH*YFRPLMGILVISPTHYEGLM*MGLFLGLLAFLPLVYNLYNTFSKYVATIPMQNSILQTTTFIIFMLSLYCANSMLPCGRYYYEPEGGYVGNP*VKFSYQYMYLYLG*IVHHLDLIDHYIFKFTQVLLRKLQSNKLKTTQ |
| Ga0103924_11533 | Ga0103924_115331 | F051119 | MKISEAFEKLDALRVANALGLEYDTVCKWRDRESIPAYWRVKFVNLMNHHNVAISLHDLAGVIK* |
| Ga0103924_11828 | Ga0103924_118281 | F054846 | MSNELNLNDLGLGDNSQSAINEKMPRRGAEGRGQSRESRKSLSEHDTARKPERVPMYAQRTMIDTTLIPEGFHGHWVSNNPAGRIDMLLRAGYDFVTKDQNVYSSHVTENGVDSRVSKSGSDGVTLYLMIIPLELYEADQEAKAEKAKEQTATIFGKQRNDPDFFSRDENGRDTLASRG |
| Ga0103924_12635 | Ga0103924_126351 | F000075 | LAVVSANQLESMNEDDLLVSLESNLNSALSSEARGDADAAVAKTAAIKNIQKALTARILKRLDDGQPLVEVARKMKAIEGMQPQINDMERRLGIMQSVEPVLENAIKTLQKVVDVRGMGKK* |
| Ga0103924_12981 | Ga0103924_129812 | F058997 | MIIYKGREMTVREACALMGIDCDDFMAWCKKFALQNYSYAMNYYKRTLKFKKG* |
| Ga0103924_13196 | Ga0103924_131962 | F096902 | MSGGSLDYVCYKVGDVADTIDARAKTSLQKAFAAHLRDVSTALHDLEWVFSGDYGDGDEVAALRKVVNKEMELNAATEQANIALKELQSVLGISV* |
| Ga0103924_13234 | Ga0103924_132342 | F042865 | ADDFNKRIWWQFMQSVVSDLPLNGNVAPENTVRANKSCLYIETTGGTAVLWFNPNGNGSATGWIVK* |
| Ga0103924_13455 | Ga0103924_134551 | F000055 | LGAVSAVEVGKAGTSGDWFVDKTPAPTATGTASSGYFGADEDDVMNNIFNKDGCRKACAEILLVTKQVSEAKMEAYMAEFFPRTWAKFDLNNTGEIDITESHTFMRALLGRLNQFVLAPGSLTDIKV* |
| Ga0103924_13477 | Ga0103924_134771 | F058934 | RPVMRQFSYPNSQFTNILGGASIGWKLNVYNTGTTDFASIYSNLAQTTPTANPVIADADGFLASFYWTGTVDVVLTDENNNLIDSANGIQDLVSTINAVVVAGNISLPYGAASGSGDTITATLPITSDFSDGGMLIVRANAANTGAANTPNLQVNSYASRRIKKIGGVALIANDIVSGMNMILVYDLA |
| Ga0103924_13695 | Ga0103924_136951 | F045749 | LLLTNYKDALPITNDDRRFCVMYGRIQNESELFDYFGGRDEAGEYFENLFRESELHAGAIKTFILNHTISEDFKPSGRAPDTVSRRLMIQASVSPEQCTVEDLINKHECGVVNGRILDVTWFKSLCEGEGDVLPQSRTLAHILNDMGYQQITGRRIKIKKTRENHYIWFKAKKGVDEESVKNEV |
| Ga0103924_14119 | Ga0103924_141191 | F090440 | SSLGGNEISGLKLATDLRLKRVYHCRITDVGEDDDLRIIIRKSIGG* |
| Ga0103924_14443 | Ga0103924_144432 | F077283 | MNNAEQVAYQSMLRDCENSMFGGDVQDDVETPVHVWIDAKRESPIAAQMCWVAVPNVVTCRHQVILCYIDSDEQWRDAGGVLMFRRVSFWQYADVPECEVLC* |
| Ga0103924_14476 | Ga0103924_144762 | F063325 | MSELLANSMSSWTAGESSEWNNCSMSKDVIHVTDCSLQTQAFASASSLICRLEMSSQIVNSA |
| ⦗Top⦘ |