NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300026717

3300026717: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G09A2-12 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID3300026717 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0095510 | Gp0072085 | Ga0207526
Sample NameSoil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G09A2-12 (SPAdes)
Sequencing StatusPermanent Draft
Sequencing CenterDOE Joint Genome Institute (JGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size17658403
Sequencing Scaffolds19
Novel Protein Genes20
Associated Families19

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → Caryophyllales → Amaranthaceae → Bosea1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root14621
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae2
Not Available6
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Chondrichthyes → Elasmobranchii → Selachii → Galeomorphii → Galeoidea → Orectolobiformes → Hemiscylliidae → Chiloscyllium → Chiloscyllium punctatum1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales3
All Organisms → cellular organisms → Archaea1
All Organisms → cellular organisms → Bacteria → Proteobacteria1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2
All Organisms → cellular organisms → Bacteria1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameSoil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
TypeEnvironmental
TaxonomyEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO)terrestrial biomeagricultural fieldagricultural soil
Earth Microbiome Project Ontology (EMPO)Free-living → Non-saline → Soil (non-saline)

Location Information
LocationWisconsin, United States
CoordinatesLat. (o)43.3Long. (o)-89.38Alt. (m)N/ADepth (m)0
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000268Metagenome / Metatranscriptome1411Y
F016471Metagenome / Metatranscriptome247Y
F017759Metagenome239N
F019338Metagenome / Metatranscriptome230Y
F021340Metagenome219Y
F022740Metagenome / Metatranscriptome213Y
F028161Metagenome / Metatranscriptome192N
F028554Metagenome / Metatranscriptome191N
F029703Metagenome187N
F034172Metagenome / Metatranscriptome175Y
F040185Metagenome162N
F040398Metagenome / Metatranscriptome162Y
F041300Metagenome / Metatranscriptome160Y
F046248Metagenome151Y
F048216Metagenome / Metatranscriptome148Y
F050525Metagenome / Metatranscriptome145N
F054151Metagenome / Metatranscriptome140N
F071393Metagenome122N
F087939Metagenome / Metatranscriptome110Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0207526_100016All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → Caryophyllales → Amaranthaceae → Bosea2013Open in IMG/M
Ga0207526_100033All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root14621838Open in IMG/M
Ga0207526_100121All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae1403Open in IMG/M
Ga0207526_100269Not Available1164Open in IMG/M
Ga0207526_100893All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Chondrichthyes → Elasmobranchii → Selachii → Galeomorphii → Galeoidea → Orectolobiformes → Hemiscylliidae → Chiloscyllium → Chiloscyllium punctatum835Open in IMG/M
Ga0207526_101263All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales746Open in IMG/M
Ga0207526_101370Not Available721Open in IMG/M
Ga0207526_101373All Organisms → cellular organisms → Archaea720Open in IMG/M
Ga0207526_101413Not Available714Open in IMG/M
Ga0207526_101513Not Available698Open in IMG/M
Ga0207526_101611All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales681Open in IMG/M
Ga0207526_101640All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales676Open in IMG/M
Ga0207526_101987All Organisms → cellular organisms → Bacteria → Proteobacteria633Open in IMG/M
Ga0207526_102332All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae599Open in IMG/M
Ga0207526_102769All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium564Open in IMG/M
Ga0207526_102990Not Available549Open in IMG/M
Ga0207526_103116All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium540Open in IMG/M
Ga0207526_103194All Organisms → cellular organisms → Bacteria536Open in IMG/M
Ga0207526_103208Not Available535Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0207526_100016Ga0207526_1000162F034172IVLLGGASYAADLRLEHKRTHARGAGTFDQRMRVVEQKPYCGDCEAPIGRTHSANVARLRFINWPFWQERCAVGACGVYYPVMRSCAFWGLGCT
Ga0207526_100033Ga0207526_1000331F022740QILAARSNELGGLVSFISVGGQPVNTTRNGTIVAAFTFDDIVWTDIQQKTFAAATAQIRQIRPGSTPVLAATGTITPLADTEIKKLGWKIVQLKPER
Ga0207526_100121Ga0207526_1001211F028554VLQDQVNFLKGQMRKAKQVRNRALSAGEGRRAIIVTAALTGLFLLAAFVTAGSFLSTDPQAMSSVAKVTPLPRTEGGEAASRVASIVMETDKKGRCEERQFDNRTGKMVSANYVNCDARLEPERDSTPSENIN
Ga0207526_100179Ga0207526_1001792F017759LFGVVVGLLNSECPQQNRAYESKYGADSQHIELQGKVHGSASLVDARRLARNQRRPKVPAIGPVLFAGEIAYRICAMDNRLIVRLKKSEEPKILTVQNSRGTEFFMANLRQIPSILRAMIPPAWSPRNLAVNKTHDDFGCAPNMRDRKKSKNGG
Ga0207526_100269Ga0207526_1002692F040185RGVPTPVVVVTDTIGAADSALPLSMKVTSYVPDTNIVLKGLAVGTTLTSGASVGDREWRINIEDLQNAYVIPPQGFVGPMAFVAELRDIDGHPLLRAPGQFTWTAVDPSSATAGKEPAEEEPPVTAVASAADAGNQQLIGQFVGQKEEVVLPKPRPIKHASLGGKATKPKKQIARAHGYKERMPRRDLGADTRWASNELPPHSLFSEPDRRRERRAIVDGIFRSLFYDGDANECEPATLERGTQKKSGDDCQWRR
Ga0207526_100893Ga0207526_1008931F019338MRRTEAVLTAGAVACLAFVSIVYPVFAQDVDPRCKDIFDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLAC
Ga0207526_101263Ga0207526_1012631F040398GITDAAVVSNEFEAVMPSNIKVLAKGSSAVPNFLRLCVATSGKVLSERRDDLVKFVAAEMDAYKFALANRAETIKVSHEMTHAKPDDKRAEFITDEAIKNKQIDPVLSIPLDRLDWMQNLFVKAGVIKQTVPIESIVDKSVNADAAKIAGK
Ga0207526_101370Ga0207526_1013702F041300LPFFSAVDYRRYAAECVRLAQQVADPDDKVRLLDMAETFRELADKNDARHSG
Ga0207526_101373Ga0207526_1013731F016471MTEADKFGAVKDATVAIGLANKDTKDVISSFGTGFFIGGEYIVSSAHIFSQCIKYNAQNKDKNKGMEGIYSAFNITTNGNQLELNTYHIIKAIRLPPVKEVMGFTGSVDLDIGIGKLDRHSDNFLHIKEPTQLKLYDEIVICGYPSGRISLVLYRNKHEDGMGIHLRPIIQFGRI
Ga0207526_101413Ga0207526_1014131F087939GLAEDIAKSPLPTGKGFAESLDRPGLGIEVDEDRVSRHRVQIAARSVA
Ga0207526_101513Ga0207526_1015131F028554MRKAKQVRNRALSAGEGRRAIIVMAALTGLFFLAAFVTAGSLLSTDPRPMSSLSKVTQLPPTEGGEAASRVASIVVETDKKGRCEERRFDNRTGKMVSANYVNCDARLEPERDTTPSENINRERIRAILGA
Ga0207526_101611Ga0207526_1016112F046248MKQKYALRRALPILVAAVLLTAIFVDVVAAGVKAVSRRSRASYPASGGAITISVPNAMKPFPAELLPQ
Ga0207526_101640Ga0207526_1016402F021340VAIVTLFFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQEKPATFSWGK
Ga0207526_101987Ga0207526_1019872F028161MKAVRVVTLSLAAALAVVAIQAPHAQDNKNVREDDYVRKVPLEDFKVPIVPIIPPGSSLDLRPGRTPDSADRVYNSTPFARDPTTPSIGLSIKSPFDDRK
Ga0207526_102332Ga0207526_1023321F050525FGIGIVGLMALHSPTGRTPSSPPVAAAEPLAKPAKRPLDDKKTAHRNQTHKKVHVTRKQHEAPSIDAGRNAYGYAEELRRIDPNRFLFFGR
Ga0207526_102769Ga0207526_1027691F048216IIEKVQQSGREFYAAVFYAFERRWLQERGGDYTAVRWRTINQVTPFGLIRLPVRVVRERGAQKGGYLSLSKALLKPKATRLLSPWVEKGVLEAATCSNYRPAAAELWRWVRVKVSAWLIWKCVQFHGARLCEQLERQWWPDRALPRKADVVVTEIDSTYLKAQRRGRAARGHPTAHFAIHLGLHYSG
Ga0207526_102990Ga0207526_1029901F071393EAVNQYKMGTAAAEDKIILLESEKAVLQAQLDVALEESKTLADRVHAAEAASDRREATVALSIKQIEFLNTELTAAAAERFRLVATMQGEQRRQRSVFSQQKSILEDKLQEKEALAATQGTKIKQLEGVRDELDKRVRVIEALLASEREVAERKTRRPTEILGAAG
Ga0207526_103116Ga0207526_1031161F029703MSTDEPFRTDYEFLKGVDYVFVSLDRNLSGEECHELAEKYFETHKGM
Ga0207526_103194Ga0207526_1031941F000268MLMRVVAVMLLLSAGIAAEAMSYSFVSKASGRLGGPIRFEFYHDSTTRPKTDIQSFTVSMRTADDRWKAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLIPGRVYAGFATDGHGGTSGVTFGF
Ga0207526_103208Ga0207526_1032081F054151MRLAECKFPSGNSPGMRKADQYTLADHFRALADGLSVRAVTERAPAQRAELQRLAECYAELAKQQSPADHFARGVGPR

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.