NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300008550

3300008550: Planktonic microbial communities from coastal waters of California, USA - Canon-21



Overview

Basic Information
IMG/M Taxon OID3300008550 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0117987 | Gp0126513 | Ga0103924
Sample NamePlanktonic microbial communities from coastal waters of California, USA - Canon-21
Sequencing StatusPermanent Draft
Sequencing CenterUniversity of Hawaii
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size4899064
Sequencing Scaffolds20
Novel Protein Genes24
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria2
All Organisms → Viruses → Predicted Viral1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus → Streptococcus pneumoniae1
Not Available12
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Dependentiae → unclassified Candidatus Dependentiae → Candidatus Dependentiae bacterium ADurb.Bin3312
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage1
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Sporadotrichida → Oxytrichidae1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NamePlanktonic Microbial Communities From Coastal Waters Of California, Usa
TypeEnvironmental
TaxonomyEnvironmental → Aquatic → Marine → Coastal → Unclassified → Coastal Water → Planktonic Microbial Communities From Coastal Waters Of California, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO)marine biomecoastal water bodycoastal sea water
Earth Microbiome Project Ontology (EMPO)Free-living → Saline → Water (saline)

Location Information
LocationPacific Ocean
CoordinatesLat. (o)N/ALong. (o)N/AAlt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000055Metagenome / Metatranscriptome3096Y
F000075Metagenome / Metatranscriptome2622Y
F000237Metagenome / Metatranscriptome1498Y
F010914Metagenome / Metatranscriptome297Y
F019484Metagenome / Metatranscriptome229Y
F027881Metagenome / Metatranscriptome193Y
F042865Metagenome / Metatranscriptome157N
F043390Metagenome / Metatranscriptome156N
F045749Metagenome / Metatranscriptome152Y
F050360Metagenome / Metatranscriptome145Y
F051119Metagenome / Metatranscriptome144N
F054846Metagenome / Metatranscriptome139N
F058934Metagenome / Metatranscriptome134N
F058997Metagenome / Metatranscriptome134N
F063325Metatranscriptome129N
F074867Metagenome / Metatranscriptome119N
F077283Metagenome / Metatranscriptome117Y
F078696Metagenome / Metatranscriptome116N
F087089Metagenome / Metatranscriptome110N
F090440Metagenome / Metatranscriptome108N
F096902Metagenome / Metatranscriptome104Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0103924_10023All Organisms → cellular organisms → Bacteria → Proteobacteria2967Open in IMG/M
Ga0103924_10173All Organisms → Viruses → Predicted Viral1553Open in IMG/M
Ga0103924_10236All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus → Streptococcus pneumoniae1396Open in IMG/M
Ga0103924_10239Not Available1389Open in IMG/M
Ga0103924_10282All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Dependentiae → unclassified Candidatus Dependentiae → Candidatus Dependentiae bacterium ADurb.Bin3311312Open in IMG/M
Ga0103924_10365Not Available1201Open in IMG/M
Ga0103924_10496Not Available1093Open in IMG/M
Ga0103924_10564Not Available1052Open in IMG/M
Ga0103924_10765All Organisms → cellular organisms → Bacteria → Proteobacteria953Open in IMG/M
Ga0103924_10935Not Available890Open in IMG/M
Ga0103924_11533Not Available753Open in IMG/M
Ga0103924_11828Not Available712Open in IMG/M
Ga0103924_12981Not Available599Open in IMG/M
Ga0103924_13196Not Available584Open in IMG/M
Ga0103924_13234Not Available581Open in IMG/M
Ga0103924_13477Not Available567Open in IMG/M
Ga0103924_13695All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage555Open in IMG/M
Ga0103924_14119Not Available533Open in IMG/M
Ga0103924_14443All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Dependentiae → unclassified Candidatus Dependentiae → Candidatus Dependentiae bacterium ADurb.Bin331520Open in IMG/M
Ga0103924_14476All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Sporadotrichida → Oxytrichidae518Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0103924_10023Ga0103924_100235F074867MSSRGITDIILNGEAFELHPTFSNLDKLETVLNKGAIGFLRQDLSSGAFKTGDVVSIIQVCAVPANGRKFPNWWNRDGVGEAVISAGLVGITTSVTHFLAKAFTAGTETDIKTVGSESDEKK*
Ga0103924_10023Ga0103924_100236F078696MKLWSSAVTYLNVQPSEAWNLTPFEFWALWDTHLEKMEISTGKAYTRPMTVDEFNELSDFLDELHGDN*
Ga0103924_10173Ga0103924_101732F058997MIIYKGQQMTVREACQLMGIDCDDFMVWCKKFALQNYGYALNYYKRTLKFKK*
Ga0103924_10236Ga0103924_102361F050360MSNAEFGAYQDMLRQCEESMFGGDVQDDDEQWHDVKRLLPPTSHMIWAACPNVVLCAYQTFLLYLDSDGQWRDNTGCLFSRKVNFWQYADVPECNISC*
Ga0103924_10239Ga0103924_102391F027881LTVARSERVRGVLTHANCTKIKFNLSSVRVSNSLMTFIKFV*
Ga0103924_10282Ga0103924_102823F051119MTIKEAFEQFDALRVANALGLEYGVVCKWRDREQIPVFWRTRFVNLMNHHGVSISLHDLAGWIK*
Ga0103924_10365Ga0103924_103652F087089LIKGIFGDTVLKKFDRVLVDAKRDRLGSTLGGVGGSQTYGRGATDKIMSQSIWKLDGAGASLGALLGTMLGGVGTMGGALGGAATAHAVKVLNMKQNALTSKAMLDPKYAAELLKKQIAPTQYKRGALESLKKNIAPSTIPAQDKNQ*
Ga0103924_10496Ga0103924_104961F010914MIYNIMTEQDGKFVATGETVECEFEETQEVVDELQVEHGRCCALEAVSE*
Ga0103924_10564Ga0103924_105642F019484CSFFYFLTILGGSLKKIAKKITTNVPISKKVYTNASII*
Ga0103924_10765Ga0103924_107651F043390MMSYQVKTEDLTKVISLTLTAEQLETIAGALEMYCIGLAEHNDPHLKYAADAQEAIIEVLESNFSVEP*
Ga0103924_10935Ga0103924_109352F050360MTNSYDNWFQDMLRDCENGLFGGDVHDESKDDVSDWTDVKTYLPEFSRMVWAAVPNVIICAHQTFLLYLDSDGQWRDNTGCLFGRRVNFWQYADVPECDISC*
Ga0103924_10966Ga0103924_109661F000237ESDG*LLGGYAFFFFHYIIFLGISLSANHLSDLTLTIGANIF*SLFNNAYKTYYIIFTNKHLNTDQLTRFMIFHYFTPFYYIYLVKMHVLFCHES*DTDSGETTFEDKSY*TLYVFVYFFLHHFNGATVNYFFFER*NISELDEVRFYGVAPH*YFRPLMGILVISPTHYEGLM*MGLFLGLLAFLPLVYNLYNTFSKYVATIPMQNSILQTTTFIIFMLSLYCANSMLPCGRYYYEPEGGYVGNP*VKFSYQYMYLYLG*IVHHLDLIDHYIFKFTQVLLRKLQSNKLKTTQ
Ga0103924_11533Ga0103924_115331F051119MKISEAFEKLDALRVANALGLEYDTVCKWRDRESIPAYWRVKFVNLMNHHNVAISLHDLAGVIK*
Ga0103924_11828Ga0103924_118281F054846MSNELNLNDLGLGDNSQSAINEKMPRRGAEGRGQSRESRKSLSEHDTARKPERVPMYAQRTMIDTTLIPEGFHGHWVSNNPAGRIDMLLRAGYDFVTKDQNVYSSHVTENGVDSRVSKSGSDGVTLYLMIIPLELYEADQEAKAEKAKEQTATIFGKQRNDPDFFSRDENGRDTLASRG
Ga0103924_12635Ga0103924_126351F000075LAVVSANQLESMNEDDLLVSLESNLNSALSSEARGDADAAVAKTAAIKNIQKALTARILKRLDDGQPLVEVARKMKAIEGMQPQINDMERRLGIMQSVEPVLENAIKTLQKVVDVRGMGKK*
Ga0103924_12981Ga0103924_129812F058997MIIYKGREMTVREACALMGIDCDDFMAWCKKFALQNYSYAMNYYKRTLKFKKG*
Ga0103924_13196Ga0103924_131962F096902MSGGSLDYVCYKVGDVADTIDARAKTSLQKAFAAHLRDVSTALHDLEWVFSGDYGDGDEVAALRKVVNKEMELNAATEQANIALKELQSVLGISV*
Ga0103924_13234Ga0103924_132342F042865ADDFNKRIWWQFMQSVVSDLPLNGNVAPENTVRANKSCLYIETTGGTAVLWFNPNGNGSATGWIVK*
Ga0103924_13455Ga0103924_134551F000055LGAVSAVEVGKAGTSGDWFVDKTPAPTATGTASSGYFGADEDDVMNNIFNKDGCRKACAEILLVTKQVSEAKMEAYMAEFFPRTWAKFDLNNTGEIDITESHTFMRALLGRLNQFVLAPGSLTDIKV*
Ga0103924_13477Ga0103924_134771F058934RPVMRQFSYPNSQFTNILGGASIGWKLNVYNTGTTDFASIYSNLAQTTPTANPVIADADGFLASFYWTGTVDVVLTDENNNLIDSANGIQDLVSTINAVVVAGNISLPYGAASGSGDTITATLPITSDFSDGGMLIVRANAANTGAANTPNLQVNSYASRRIKKIGGVALIANDIVSGMNMILVYDLA
Ga0103924_13695Ga0103924_136951F045749LLLTNYKDALPITNDDRRFCVMYGRIQNESELFDYFGGRDEAGEYFENLFRESELHAGAIKTFILNHTISEDFKPSGRAPDTVSRRLMIQASVSPEQCTVEDLINKHECGVVNGRILDVTWFKSLCEGEGDVLPQSRTLAHILNDMGYQQITGRRIKIKKTRENHYIWFKAKKGVDEESVKNEV
Ga0103924_14119Ga0103924_141191F090440SSLGGNEISGLKLATDLRLKRVYHCRITDVGEDDDLRIIIRKSIGG*
Ga0103924_14443Ga0103924_144432F077283MNNAEQVAYQSMLRDCENSMFGGDVQDDVETPVHVWIDAKRESPIAAQMCWVAVPNVVTCRHQVILCYIDSDEQWRDAGGVLMFRRVSFWQYADVPECEVLC*
Ga0103924_14476Ga0103924_144762F063325MSELLANSMSSWTAGESSEWNNCSMSKDVIHVTDCSLQTQAFASASSLICRLEMSSQIVNSA

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.