NMPFamsDB

A database of Novel Metagenome Protein Families

Scaffold Ga0114340_1000202


Overview

Basic Information
Taxon OID: 3300008107
Scaffold ID: Ga0114340_1000202
Source Dataset Name: Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE2, Sample E2014-0046-3-NA
Source Dataset Category: Metagenome
Source Dataset Use Policy: Open
Sequencing Center: University of Michigan
Sequencing Status: Permanent Draft

Scaffold Components
Scaffold Length (bps): 44042
Total Scaffold Genes: 59
Total Scaffold Genes with Ribosome Binding Sites (RBS): 42 (71.19%)
Novel Protein Genes: 10
Novel Protein Genes with Ribosome Binding Sites (RBS): 9 (90.00%)
Associated Families: 10
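
The RBS percentages listed above follow from the gene counts. A minimal Python sketch, assuming each percentage is simply the share of genes with an RBS, rounded to two decimals; the helper name `rbs_percent` is hypothetical and not part of NMPFamsDB:

```python
def rbs_percent(with_rbs: int, total: int) -> str:
    """Format the share of genes that have an RBS, to two decimals."""
    if total <= 0:
        raise ValueError("total must be positive")
    return f"{100 * with_rbs / total:.2f}%"


# Counts taken from the Scaffold Components fields of this record.
print(rbs_percent(42, 59))  # all scaffold genes
print(rbs_percent(9, 10))   # novel protein genes
```

Both calls reproduce the values shown in the record (71.19% and 90.00%).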

Taxonomy
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage (Source: UniRef50)

Ecosystem & Geography

Source Dataset Ecosystem
Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater, Plankton → Harmful Algal Blooms In Lake Erie

Source Dataset Sampling Location
Location Name: Lake Erie, USA
Coordinates: Lat. (°) 41.7635, Long. (°) -83.3309, Alt. (m) not given, Depth (m) 4.9

Associated Families

Family   Category                        Number of Sequences  3D Structure?
F000447  Metagenome / Metatranscriptome  1128                 Y
F001106  Metagenome / Metatranscriptome  776                  Y
F002489  Metagenome / Metatranscriptome  554                  Y
F003379  Metagenome / Metatranscriptome  490                  Y
F003643  Metagenome / Metatranscriptome  475                  N
F006841  Metagenome / Metatranscriptome  363                  Y
F009961  Metagenome / Metatranscriptome  310                  Y
F012456  Metagenome / Metatranscriptome  280                  Y
F051930  Metagenome / Metatranscriptome  143                  Y
F099281  Metagenome / Metatranscriptome  103                  N

Sequences

Protein ID           Family   RBS     Sequence
Ga0114340_100020213  F003379  AGGA    MKKLSALVMLIATAILSGVALSKFLNWAGQIEIFDFDLDEDIDNEEF*
Ga0114340_10002022   F000447  GAGG    MGYIEIFRMDQNGAGWVDLSEATPDEMFNIELGLLNEGALFTSPEAE*
Ga0114340_100020221  F002489  GAGG    MQEDQNNNNDSTNLEDNLPMVSYIMLHRIYDLLTLIASKVAGSEDTSKMVEYHQAGYLLGPMPSYNPGEQNE*
Ga0114340_100020222  F051930  N/A     MSKEGLVEKFLCPVDQSLLFSNQDIEENIFLYCLSCKYKKNIGFNTYKDIVKLVEGQLKNGRLSK*
Ga0114340_100020225  F003643  AGGAGG  MKQDLNNDGKVTMQEKILAALASYGRHFLGAAIALYMTGNTSPRDLLMGGFAATAPVILKALNPNEKSFGFTTK*
Ga0114340_100020234  F099281  GGA     MATVDKNFKVKNGLNVAGAATFDAAVNVDNLVLNSTPLAFDSSTGRLKLQIDGVWKEIALLQDAAEDLTAITFMDIGLAIDYNGQPVYTVFANGVNTTATKFADGGAYSTDTYSMTFDSGAIA*
Ga0114340_100020253  F001106  AGGAG   MSFDTLKVSDLKVIATDFAVDTEGLKNKKDIIAALAEEGVTWSVYQSTVEAVEKDTEEIEILPKFDPKSQPENTILVRMTRDNMRYDIHGKTFTKNHPFVAMSEEDAQKIFDKEEGFRLATPKEVQDFYN*
Ga0114340_100020256  F012456  AGG     MAGSVPGIIKDSTVAQISAFLYYEAAVMSKLTTHSAFKNLFKTTIFNQIEKDFGLYIDSQARVKPRSLHHVYEWNKAGSPTSRLFTLSRIDTEGLSFRINYDFKISKSSVPSKNKKQKKGYVFANKASVMESGMPVIIRPRSAERLVFEIDGATVFMPKGTSVVVKRPGGTQATNQFSLSYGRFFGGQLVNNSIKSSGLQQIFGSRIKKAMGVPMNIKKVQYSFSAGKIRTQADAALHASFGGSL*
Ga0114340_100020257  F009961  GGAGG   MTVDYKIDAMFELRKFLWNELKEAKIFEASDYYSDSSDTEIIPIIPVQQSPEIDQFLNGKKHIVYDKIGMSFEDIWLIACEKVLFTIYSTDITEVYEIRNLIMDLFRRMDESARDANLSRSTGKIIFHSIHVVETSPIEPSMELQGYMSADIILEVKYSRTVGQDGRFD
Ga0114340_100020259  F006841  AGGAGG  MATTVYDVEEIQLQNGATVKLKPLTIKELRKFMKVIARTQEVTTEDETLSILIEACAVALEKQLPELVKDIDAFEDTLDVPTINRILEICGGIKMDDPNQLAAAVLAGQN*
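
To reuse the sequences listed above in other tools, each row can be written as a FASTA record. A minimal Python sketch, assuming the trailing `*` marks the stop codon and should be dropped; the function name `to_fasta` and the header layout are illustrative choices, not an NMPFamsDB export format:

```python
def to_fasta(protein_id: str, family: str, rbs: str, sequence: str,
             width: int = 60) -> str:
    """Build one FASTA record, wrapping residues at `width` characters."""
    header = f">{protein_id} family={family} rbs={rbs}"
    residues = sequence.rstrip("*")  # assumption: '*' is a stop marker
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return "\n".join([header, *lines])


# First row of the Sequences table in this record.
record = to_fasta(
    "Ga0114340_100020213",
    "F003379",
    "AGGA",
    "MKKLSALVMLIATAILSGVALSKFLNWAGQIEIFDFDLDEDIDNEEF*",
)
print(record)
```

Rows whose RBS is listed as N/A can pass that string through unchanged, since the header field is free text in this sketch.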




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice