NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027390

3300027390: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A1-11 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027390
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0055663 | Ga0207435
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A1-11 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 21829642
Sequencing Scaffolds: 22
Novel Protein Genes: 26
Associated Families: 26

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2
Not Available | 11
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → Pseudolabrys taiwanensis | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.2958 | Long. (°) -89.3799 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F003059 | Metagenome / Metatranscriptome | 510 | Y
F004397 | Metagenome / Metatranscriptome | 440 | Y
F005277 | Metagenome / Metatranscriptome | 406 | Y
F010961 | Metagenome / Metatranscriptome | 297 | Y
F020731 | Metagenome / Metatranscriptome | 222 | Y
F020927 | Metagenome / Metatranscriptome | 221 | N
F021340 | Metagenome | 219 | Y
F022446 | Metagenome / Metatranscriptome | 214 | Y
F024822 | Metagenome / Metatranscriptome | 204 | N
F029148 | Metagenome | 189 | Y
F030545 | Metagenome | 185 | Y
F032172 | Metagenome / Metatranscriptome | 180 | Y
F050525 | Metagenome / Metatranscriptome | 145 | N
F055629 | Metagenome / Metatranscriptome | 138 | Y
F057709 | Metagenome | 136 | Y
F063749 | Metagenome / Metatranscriptome | 129 | Y
F063910 | Metagenome / Metatranscriptome | 129 | N
F068052 | Metagenome | 125 | Y
F068533 | Metagenome | 124 | N
F078970 | Metagenome / Metatranscriptome | 116 | Y
F083399 | Metagenome | 113 | N
F084687 | Metagenome / Metatranscriptome | 112 | Y
F089374 | Metagenome / Metatranscriptome | 109 | Y
F090709 | Metagenome | 108 | Y
F097867 | Metagenome | 104 | Y
F098176 | Metagenome | 104 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207435_100249 | All Organisms → cellular organisms → Bacteria | 2532
Ga0207435_100386 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2166
Ga0207435_101177 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1299
Ga0207435_101446 | Not Available | 1165
Ga0207435_101460 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1156
Ga0207435_101573 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → Pseudolabrys taiwanensis | 1110
Ga0207435_101698 | Not Available | 1051
Ga0207435_102080 | All Organisms → cellular organisms → Bacteria | 922
Ga0207435_102274 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 859
Ga0207435_102599 | Not Available | 781
Ga0207435_102785 | Not Available | 745
Ga0207435_103272 | Not Available | 665
Ga0207435_103622 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 624
Ga0207435_103707 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 617
Ga0207435_104194 | Not Available | 573
Ga0207435_104270 | Not Available | 568
Ga0207435_104466 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 552
Ga0207435_104683 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 537
Ga0207435_104693 | Not Available | 537
Ga0207435_105277 | Not Available | 507
Ga0207435_105356 | Not Available | 503
Ga0207435_105412 | Not Available | 500

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207435_100249 | Ga0207435_1002491 | F024822 | VREPFIRIAGAILCALALSGCVDSSGPLLSDAQPVLGEQLRLQFYSLRKGTADEPEQATYKWDRGAYQRTGGGMTDISSFSVHPLARDIFVVQSAAAKRPGMFEYAVARRLVDGVYQVIAIDEADAGRVTRARFCKRASDSSCRIQTRNQLYAFA
Ga0207435_100386 | Ga0207435_1003862 | F021340 | VAIVTLFFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQEKPATFTWGK
Ga0207435_101141 | Ga0207435_1011411 | F063749 | HAAGQTLLLLERGDRPMKSLLVLVLALAATTAAAKPSPFEPARSANPIALARTSAEFNTGPAGATRERNYFRVPEADAHSGVFLCRFEPSMFAKVRLTQSCK
Ga0207435_101177 | Ga0207435_1011773 | F090709 | TINRHAIDPSVTLPIDKLNWMQNELVKAGNLKAPFDLTTMTAPDIRAEAAKRATK
Ga0207435_101446 | Ga0207435_1014463 | F029148 | ADVAMTDDPLDWKDIEIKYRGKPVKGRYNVSDNLVTVVAWSGTKTARLGVLPAERLAKMLLRELADDTEKL
Ga0207435_101460 | Ga0207435_1014602 | F063910 | MSERYDLELISRRRAFSFLGSAAVALSVAVPATLLIATDAEARVGNPGSAVSVAGANRRDRRQDRRYKKSPTTPTTTGQGEKK
Ga0207435_101573 | Ga0207435_1015733 | F010961 | MQRVTAMMAYVVSEGRLCALKPAEWSFLLVGVTLCGIATL
Ga0207435_101698 | Ga0207435_1016981 | F020927 | MWAAFVRQFAAYAITRRGKKLFALIGVLALCFGAALLIDMQFYVSASFAALLAGFAAVTYVVQHVKLKRAEHQRLLRKAEVARQRAIAAQARLERIDTAKSTLRGAATGARRLVTDNVSIVANEALLMANETADT
Ga0207435_102080 | Ga0207435_1020801 | F032172 | LEGEEPVQIIRGRFGPSGGIVPELDKDGQVVPTGHFNNRLGFHALMQAVGVDERVMTLIELFANPKLNAEITRQLAAGQRSVSISGEDAAKTGEF
Ga0207435_102236 | Ga0207435_1022361 | F005277 | DLHQTLQLTRMLAVPAAITLGAPVTMRFVILAAVMTLLAGFTQAELEKAKNSKEFFKDGYWKCLATEIVRVAPTNMPVQEFSVFVKRACSKERNDFFASLSNYVAMLHPDAARDTVISATNIAVLDAQKDAVTALVDLRSGKR
Ga0207435_102274 | Ga0207435_1022742 | F068052 | TPAAVTDLESILAPQGPHVVSVLLQDRDSVFLARDLVDAFKRIGWKAKRDTSVNDVPDGLTVWPEDDVARAICNALTMATGALVAVREDQHLKDQGTYAIGVGYKLI
Ga0207435_102599 | Ga0207435_1025991 | F004397 | RKTQSMSLSGGKAAMSEFDLKVALIIFVTKFIDPFAAVPALVAGYFCRTWWQVVIAAAAVGIFVEMILVLFEPTPGVHQGRLLMAVLAAGVWSNLAFAFKTWRAKRA
Ga0207435_102785 | Ga0207435_1027852 | F083399 | MVEESAVSGDARPWGFFATFVLGAIALLAGQLAGMAALVGWYDFDLRNVPVLSQHGGAIILFIFVSAPVQVAILALAAGYKGNIADYLGYKLPRRGEVVLCLAILAAMIAIGDAMSWLAGRSVVDRFQTDIYHAANSVGQLPLLLAA
Ga0207435_103207 | Ga0207435_1032071 | F084687 | MKLVRYRSASSEKPGLILDGEIFDLSGSFAALNPRAPTLDDIEAIAAVP
Ga0207435_103272 | Ga0207435_1032721 | F030545 | ILIAGLLLMAKDALGVRFWDLIPADIPILAGGIAIGLTGIGLVSLAAYGIVRAVGWAIDKSV
Ga0207435_103622 | Ga0207435_1036222 | F050525 | GVSAIFGIGIVGLMALHSPTGRTPSSPPVAAAEPLAKPAKRPLDDKKTAHRNQKHKKEHVTRKQPHEAPFTDAGRNAYGYAEEPRRIDPNRFLFFGR
Ga0207435_103707 | Ga0207435_1037072 | F097867 | LIPPGTSHRNVGDMATIRIILYTRKPVRLADEFIHRAKRAGQPIV
Ga0207435_104194 | Ga0207435_1041941 | F020731 | MPVLAFFAVAGLALIALLFVADAALEKDGSPVIVTSQRSGLPESSHRPDKIPVLTMAPAPDPDMTSKIVRDAQPKPVAQDPMKIHPAARAARAEAMPQTPSVTQPMN
Ga0207435_104270 | Ga0207435_1042701 | F089374 | MTNILDSARASDEGPRLIVRKASHAPIWSVWAVLEGTPSEEIFEGSSEEDASSWIN
Ga0207435_104466 | Ga0207435_1044661 | F003059 | GAVIVTAAVPAAAQVRDAVYRGTLVCDKLPFSAGKGREAIEVTIAGGTVRYSHVVRLRDAAEPVPEQGKGSLNGQDIELQGSWKSGNRQYEAKYSGAFVRRHADLKGTQTWTDGGKSFTRACTGTIKRPFRVFLPGEKK
Ga0207435_104683 | Ga0207435_1046832 | F078970 | SDLATKLSSTKSVPEALDAYTKCVSQRMQMAADDGRRLAEEAQQLTQKFAQSLGNGRPGMTT
Ga0207435_104693 | Ga0207435_1046932 | F057709 | MPEFDLKVALIIFVTKVIDPFAALPALVAGYFCRTWWQVVISAAVVGIFVEMVLVLFEPTPGIHQGRLLMAVLAA
Ga0207435_104841 | Ga0207435_1048411 | F055629 | WFFAAVAPNGGQEQREPCPVTGYACEGDLSYLCEEYGCARKGGLSPRSEENF
Ga0207435_105277 | Ga0207435_1052772 | F022446 | MSAALPSPPAPSPRGRNPLPVEALQNQLGGLLREREELHAAPSPFALERNRREIVRIQWELSYALIERYASV
Ga0207435_105356 | Ga0207435_1053561 | F098176 | ANGLSGDGIICGMSIVGTSEAYWLTGGGFTGWHQIHHEDENIALSTAAQLPPQCALVASYRNTSETLVPRGRKRNRMYLGLLRADLLENDGTIIAGDATAVRSAMNELNTALEGITDADPIFAPQGLAIVSPTAGECYEVNEVGCGEAVDTHRSRRQKEPENMIWQA
Ga0207435_105412 | Ga0207435_1054121 | F068533 | LPSAVLHCTHSRANEHLRRSNVRNIARWTAPAAIALFLASVQFARSDQGLTGDVRTTFIEAATRSCLKTQLDAPTNKDVPVSALYDYCKCNASGMADKTSNDEVKTLEATGSEEKYRTAMQTRMESSAKTCLDEIRKSLPK
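
The rows of the Sequences table can be converted to FASTA records for use in downstream tools. The following is a minimal Python sketch, not part of NMPFamsDB. It assumes a row layout inferred from this page only: a scaffold ID, a protein ID, a family ID made of "F" plus six digits, and an uppercase amino-acid sequence, either run together or separated by " | ". The names `parse_row`, `to_fasta`, and `ProteinRow` are hypothetical.

```python
import re
from typing import NamedTuple, Optional

# Assumed row layout (inferred from this page, not a documented format):
# scaffold ID, protein ID, family ID ("F" + six digits), amino-acid sequence.
# Fields may be run together or separated by " | ".
ROW = re.compile(
    r"^(Ga\d+_\d+)\s*\|?\s*"  # scaffold ID, e.g. Ga0207435_101573
    r"(Ga\d+_\d+)\s*\|?\s*"   # protein ID, e.g. Ga0207435_1015733
    r"(F\d{6})\s*\|?\s*"      # family ID, e.g. F010961
    r"([A-Z]+)$"              # protein sequence (one-letter codes)
)


class ProteinRow(NamedTuple):
    scaffold_id: str
    protein_id: str
    family: str
    sequence: str


def parse_row(line: str) -> Optional[ProteinRow]:
    """Return the four fields of one table row, or None if it does not match."""
    match = ROW.match(line.strip())
    return ProteinRow(*match.groups()) if match else None


def to_fasta(row: ProteinRow, width: int = 60) -> str:
    """Format one row as a FASTA record with lines wrapped at `width` residues."""
    header = f">{row.protein_id} scaffold={row.scaffold_id} family={row.family}"
    body = [row.sequence[i:i + width] for i in range(0, len(row.sequence), width)]
    return "\n".join([header, *body])


if __name__ == "__main__":
    example = ("Ga0207435_101573 | Ga0207435_1015733 | F010961 | "
               "MQRVTAMMAYVVSEGRLCALKPAEWSFLLVGVTLCGIATL")
    row = parse_row(example)
    if row is not None:
        print(to_fasta(row))
```

The header row and any other non-matching line yield `None`, so the table can be passed through line by line without special-casing.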




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice