NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026883

3300026883: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T4_10-June-14 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026883
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114663 | Gp0115670 | Ga0209895
Sample Name: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T4_10-June-14 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 61414965
Sequencing Scaffolds: 9
Novel Protein Genes: 11
Associated Families: 11

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiales genera incertae sedis → Methylibium | 1
All Organisms → Viruses → Predicted Viral | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira oceanica | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1
Not Available | 3
All Organisms → cellular organisms → Eukaryota | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Groundwater Microbial Communities From The Columbia River, Washington, Usa
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Sand → Unclassified → Sand → Groundwater Microbial Communities From The Columbia River, Washington, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater river biome, microcosm, sand
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: USA: Columbia River, Washington
Coordinates: Lat. (°) 46.372, Long. (°) -119.272, Alt. (m) N/A, Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F005780 | Metagenome / Metatranscriptome | 390 | Y
F024650 | Metagenome / Metatranscriptome | 205 | Y
F044595 | Metagenome | 154 | Y
F050934 | Metagenome | 144 | N
F052686 | Metagenome / Metatranscriptome | 142 | Y
F053809 | Metagenome / Metatranscriptome | 140 | N
F068469 | Metagenome / Metatranscriptome | 124 | Y
F076893 | Metagenome / Metatranscriptome | 117 | Y
F076944 | Metagenome / Metatranscriptome | 117 | Y
F092936 | Metagenome / Metatranscriptome | 107 | N
F104469 | Metagenome / Metatranscriptome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0209895_1000412 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiales genera incertae sedis → Methylibium | 4542
Ga0209895_1007010 | All Organisms → Viruses → Predicted Viral | 1043
Ga0209895_1007603 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 988
Ga0209895_1008787 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira oceanica | 899
Ga0209895_1010293 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 813
Ga0209895_1011872 | Not Available | 741
Ga0209895_1011974 | Not Available | 736
Ga0209895_1014986 | All Organisms → cellular organisms → Eukaryota | 638
Ga0209895_1021091 | Not Available | 515

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0209895_1000412 | Ga0209895_10004121 | F044595 | VKSAHIVTANRPMNGLVAFVRAAAAMESFRGCASALHLTLPASFQLDRNEDRMHP
Ga0209895_1007010 | Ga0209895_10070101 | F092936 | LVIIAAIKIQSRIPLAPGAFFTVENPAIVQKWMANGTLPLIFTAQSIKSVTHDKFRLYEIPPVPQSKSQHGFILSGVDITLLNIQVTNINCGGAMCDGLNMYQNSVTADRCPCYSVLDREGKVCLVLSLKVSDPKNNLQFCVHNHTSKSLTQLFMKRIPKGAVAATITGNQKHMGNLSAKVMDMLALGNDYNGFIISGWIKRGTIADSGVVQPPTGSKWDKPQQVDSGGLTYHLTKIAYASPPSDRLLGEYQFNAGSLV
Ga0209895_1007603 | Ga0209895_10076031 | F052686 | MADKPGSKPTSSAPKPVAPKGSEAGTMGLDDRGNVTWEWKDQGDLLADDTLGAAERVRALVDPRLKVTDDDDPGNPIKSNPKGLKSGYNPYNSGALGKQSWKKKRNL
Ga0209895_1008787 | Ga0209895_10087871 | F050934 | IQAAIFQKHIQATHPNVTSNEMPPEHTLIIEGDITSSRSNTTRQRIDRHLRHRIITTCGDANVMMGSKHIDPALCIYIGAYLICIDNKHLTDKVPRGNGTLCRVLGMKLNENAQSYKCKNYYGKKVWTVNAADVEWVECEHVNKTSFLTQLESQIKELKCQLDLTPNDHKIERKKIKSKLDDLNNKLAKEMTGRKFKLEPEQFSPEVTVKHYQTSSKKVAFRCKMKQIPANSNDATTGHKLQGMSKDAIIVSSWPTGGLSAMFKNWEYVVLSRVRTLSGLYLVKPIDMDKSFQPSPQLA
Ga0209895_1010293 | Ga0209895_10102931 | F005780 | MAQETVSIAWCDNGMVDGKFMQGVTDVMLKSGINFTTTLRSQGNQIARQREKIIRYWYENNTSEWLLWVDSDVVISPEKFRLLWDNKDVKERPIVTGVYFTTDTPEEPLMIPMPTIFNFAEAQDGVVGIKRVHPMP
Ga0209895_1011872 | Ga0209895_10118721 | F053809 | KSHIRPYIRNTLLLINDQLPLLHLHVNYLSQLQLINRRRLHDFDVLSGKRHTRGAGAIRVAEIGAAYAEDRIRAHLGALIVDNNDLLAEWFRLKTYTDGHTPLICSNRPALQMNTMPFDTDGVCTIQGDGIPHPADMRSTIQIIERILLPHAQSAARCCTITDYRNHNAAYMCVKGFVRETKRSLDDVRGRLYYLQYDEIDLWRSVQNFEYNIEDCSTIIRELFANTQPTQPFPQVVTPTTNDVAH
Ga0209895_1011974 | Ga0209895_10119742 | F024650 | TFSLAQNNNNTYMAASWACRTLVSKFARMVTTQLDGALSADYSDLMMHYQQLADTLEYQGKTSGAALGVLAGGLTKSSVEAVRADTNRIEGSFRRDQFKNPPSYNTPEYE
Ga0209895_1013391 | Ga0209895_10133911 | F104469 | FVDTTKAPAGDSDYACFTTSQKCITSLMYLLDDMECPDYAFQSIMDWARNCFEAGFDFNPKSKTRLGNLKWMYDSLHNAKQMLPNVVSIQLPDPLPDTKSMDVICYDFVPQLLSILQNKEMMSANNLVLDPNNPLAMYKPHDSRLGEALSGSVCRDMYHRLVSNPSKQLLCPLICYTDGTQVDSLSRFSVEPFLFTPAVLSHAARCKADAWRPFGYVQHSKSNLRSD
Ga0209895_1014986 | Ga0209895_10149861 | F076893 | GYKYPVWYSKKAITNIICLKNLIKCYRVTYDSEVDTTFVVHCSASGLPDLLFEMHPCGLHVCYPKKMGQFGFVQTVQDNMKLFSKRQLAGAQRARELYERLLYPSTSDFRAIVCAGGVPGSDVTLDDVKAAEVIWGRSVLKMKGNMTRKNGKRMTQSIVKVPTELIKLHKNVELAIDCFFVNKHIFFTTISTKICFTTITHLTKRNKEDVWV
Ga0209895_1021091 | Ga0209895_10210911 | F068469 | VKGNDKPAILAALISSNDKVWGEFYMHPLKTNMRLATAAAAQARGGILSQEEQAQLQYADMLIDVSK
Ga0209895_1021091 | Ga0209895_10210912 | F076944 | VEDDNDVDDQDLATCDFKKERCDYLHEVSVIFWDEFISNDRILMEAVLEEFKTRWELPRYYIFVCAGDFAQVCI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice