NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026773

3300026773: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A5-11 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026773
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0072071 | Ga0207566
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A5-11 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 23673299
Sequencing Scaffolds: 32
Novel Protein Genes: 35
Associated Families: 34

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
Not Available | 14
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1
All Organisms → cellular organisms → Bacteria | 3
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1
All Organisms → cellular organisms → Archaea | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Chthoniobacter → Chthoniobacter flavus | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Flavobacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Hoeflea → unclassified Hoeflea → Hoeflea sp. WL0058 | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome, agricultural field, agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000466 | Metagenome / Metatranscriptome | 1105 | Y
F001436 | Metagenome / Metatranscriptome | 695 | Y
F005549 | Metagenome / Metatranscriptome | 397 | Y
F007710 | Metagenome / Metatranscriptome | 346 | Y
F013626 | Metagenome / Metatranscriptome | 269 | Y
F015492 | Metagenome / Metatranscriptome | 254 | Y
F016471 | Metagenome / Metatranscriptome | 247 | Y
F018801 | Metagenome / Metatranscriptome | 233 | Y
F020078 | Metagenome / Metatranscriptome | 226 | Y
F020927 | Metagenome / Metatranscriptome | 221 | N
F021654 | Metagenome / Metatranscriptome | 218 | Y
F024412 | Metagenome / Metatranscriptome | 206 | Y
F025757 | Metagenome | 200 | N
F030581 | Metagenome / Metatranscriptome | 185 | N
F031960 | Metagenome / Metatranscriptome | 181 | Y
F034172 | Metagenome / Metatranscriptome | 175 | Y
F037212 | Metagenome / Metatranscriptome | 168 | Y
F038444 | Metagenome / Metatranscriptome | 166 | Y
F039759 | Metagenome | 163 | Y
F040358 | Metagenome / Metatranscriptome | 162 | N
F044132 | Metagenome / Metatranscriptome | 155 | Y
F045182 | Metagenome / Metatranscriptome | 153 | N
F051112 | Metagenome / Metatranscriptome | 144 | Y
F051990 | Metagenome / Metatranscriptome | 143 | Y
F053375 | Metagenome | 141 | N
F053943 | Metagenome | 140 | Y
F057488 | Metagenome | 136 | N
F084203 | Metagenome / Metatranscriptome | 112 | N
F084687 | Metagenome / Metatranscriptome | 112 | Y
F085355 | Metagenome / Metatranscriptome | 111 | Y
F090518 | Metagenome / Metatranscriptome | 108 | N
F094081 | Metagenome / Metatranscriptome | 106 | Y
F095440 | Metagenome / Metatranscriptome | 105 | Y
F103539 | Metagenome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207566_100049 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1833
Ga0207566_100198 | Not Available | 1373
Ga0207566_100220 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1332
Ga0207566_100340 | All Organisms → cellular organisms → Bacteria | 1192
Ga0207566_100423 | All Organisms → cellular organisms → Bacteria | 1128
Ga0207566_100585 | Not Available | 1018
Ga0207566_101040 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 856
Ga0207566_101124 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 833
Ga0207566_101260 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 803
Ga0207566_101578 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 743
Ga0207566_101615 | Not Available | 738
Ga0207566_101749 | Not Available | 717
Ga0207566_102112 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 670
Ga0207566_102662 | All Organisms → cellular organisms → Bacteria | 624
Ga0207566_102814 | Not Available | 613
Ga0207566_102907 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 608
Ga0207566_102940 | Not Available | 606
Ga0207566_103239 | All Organisms → cellular organisms → Archaea | 587
Ga0207566_103246 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 586
Ga0207566_103323 | Not Available | 581
Ga0207566_103479 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Chthoniobacter → Chthoniobacter flavus | 573
Ga0207566_103775 | Not Available | 559
Ga0207566_103783 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Flavobacterium | 558
Ga0207566_103933 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 551
Ga0207566_104056 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 546
Ga0207566_104168 | Not Available | 541
Ga0207566_104192 | Not Available | 540
Ga0207566_104930 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Hoeflea → unclassified Hoeflea → Hoeflea sp. WL0058 | 512
Ga0207566_104953 | Not Available | 512
Ga0207566_104960 | Not Available | 511
Ga0207566_105198 | Not Available | 504
Ga0207566_105232 | Not Available | 503

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207566_100049 | Ga0207566_1000491 | F025757 | MSYSTSISLFWLEMAVLIGCVVLSIRMRSRAAMWVALAIVAHCAVWLAIHDEEILIRLVASALVYLGLLKFSPNAARVWLCAGGALAGAFLLGTTALSLLMSFPGRWSGFGLSIGATLVFLASGLLIGFWVVHRWTEPAQRRESQPGQKA
Ga0207566_100198 | Ga0207566_1001982 | F034172 | MRTIFVFLVAGLIVLLGGASYAADVRLDHKRAHARGAGTFDQRLRVVEQVPYCGNCEAPFGRTHSANVVQLRFINWPFWQERCAVGACGVYYPVMRSCAFWGLGCT
Ga0207566_100220 | Ga0207566_1002201 | F084203 | MRARIRRALWMFGALAFVAMPASAQESTEVAPLTTEDSALLANALVFDPGALATAPKKPLRLPGYRNNEYDITRTQKVDGSTTVVVKQPLQTEWSNSVGADLAPSRPATYPLPLPTEHNNGLPAG
Ga0207566_100340 | Ga0207566_1003401 | F018801 | ARAGGNVTMGLTILSIPFIALGVFLKPYALENSRCVGFAGLGPYCFEQASSMPEVIKYGSVAVGFALLYAGRLQIKRQRDGK
Ga0207566_100423 | Ga0207566_1004232 | F001436 | MNHEQDVATLIELLKMAAERWPRSEAEQVCQSELFREDHSLLEMWPEACRRTGVGRREFPPGVIKLWKQRMGRAN
Ga0207566_100585 | Ga0207566_1005851 | F039759 | LLDTFLPGHRSQDVTLWEGAVFALSSLLFVAASMAPFYGIRRVQNAFQSTFATQVQQVCPAQPPEAMIACWAQFYPWSRVAIDLGAPIVIAVICLVLANRLRHFGRQHFINRLAELKVLPAASTLFLRAFRDDQVRIRRASRNLFSSVFDLGRVPATLDELMLERLDGRGDLIAIGNPQDRKGAARQSPWGAQRLYVDDAHWQETVTMLARDADRIVLCIDASDGVRWEIAHVLQSGHAGKTLFFLNPSTDVQTRKRQLQEDFGVSAADLASIDVDRVLALRTTSTDQLILMVCDKPERDAYLVAARLAFEGRAGNLTMSAKGR
Ga0207566_101040 | Ga0207566_1010402 | F051990 | MRLRIRTVLCGTAIFSAVSFFTITFAETTPFNKAQVADRIRKVENGVDDFEKYLTSRGENAKDQAGSAKSSGAAKRGQGANSANKDA
Ga0207566_101124 | Ga0207566_1011242 | F000466 | VLTYMENWKSWKTAPSQGVVKKDWVPTGRIDFATRLNGTLEESDQPSEFKLLVEERRIVESITGNENLEIQWRLATLNEAKAVVAQYHKYLAENALIRSVSDETVSLPPPKKVQKIQETTAA
Ga0207566_101260 | Ga0207566_1012601 | F090518 | MTKIELEQIDDSILNFNVSDEALESAGDNAVAANYT
Ga0207566_101578 | Ga0207566_1015782 | F037212 | MTNNEIGAADISEPADAIAWRHFWLANVVVESDGRPLVRATRPALQPRKDSLSRRLQRLRHMRAILD
Ga0207566_101615 | Ga0207566_1016151 | F057488 | ADALLVIRDEEGRGMRALLLGTLLAIGLIPGATAQLAVGPVPIMSSINGIPITVSVTSWITVNSVGDETTVDARIFADLIDLQKKFSDVVDSFKRSARNCNRSADGQNPVVSFKSGSLWPRNDQLIMFVRGDIDIWSCSVGPPQSAIRWEKTKVSFLTLKLPVRRTWRNVKRNMDGTQPFHGTLLVSLAEKDGANVALRNTEPNLRLDGEPTFATNANLSLAKTDMNEKVSKMLRSAIDLTKLKDV
Ga0207566_101749 | Ga0207566_1017491 | F001436 | MKRDKDIVALIELLKLAAEQWPHANCEISQTDLFHRNQSLLEMWPEACRRTGVGTREFPPGVIKLWKQSLGRSN
Ga0207566_101911 | Ga0207566_1019112 | F051112 | MPLCSHIVLFESGSTRDASRKIGDAMKNRTSSRAFTLAAIAFAAGFVASAAHAFTVHDSNGQVGGQGYLELDKPAAAPDRMAPVSRFGNENGQTTIKQGNGTFQFGGQRSFD
Ga0207566_102112 | Ga0207566_1021121 | F095440 | MRDAQYLRAQAELCLEVARQISDLKTAENLKAEAARYHAEAAAVEAAEQPGPDGGRPTA
Ga0207566_102662 | Ga0207566_1026621 | F013626 | LGNPAYRMVTVVHFARRHMAIGRRVATAFIRHRVREAAKRLQARYDAQKIARDAKSDIFVVTDFDGTIASQLGQSAGATAFCVFVFGQTGELLAQWHSVPSADELAAAVKK
Ga0207566_102814 | Ga0207566_1028142 | F040358 | MRIAIFALAILAGSGAAEARQVEVVSTSPRHIEIAAWCTAGSNCQQEASDVAQGYCHGPDYPRRALYVRSGLVERGFFSERVIFVYKCNRRSIFCE
Ga0207566_102907 | Ga0207566_1029072 | F024412 | EGRNFKLAFAPSDKPMMDILGTPANRKFVETLLHEISGKDWTLKLTVNEELASRHAAVTEHSPQDFKDEPLIQEAIELFNARVSQER
Ga0207566_102940 | Ga0207566_1029401 | F103539 | SIEGELLNLARLCHSQARLTQDRAVKQALRKLGDHYESEAKKLQGQLSAHLQQSD
Ga0207566_103239 | Ga0207566_1032391 | F016471 | KKSSKSCSKHKLWTIRLLNCITVCCLMTEADKFGAVKDATVAIGLANKDTKDVISSFGTGFFIGGEYIVSSAHIFSQCIKYNAQNKDMNNGMEGIYSAFNITTNGDQLELNTYHIIKAIRLPPVKEVTGFTGSVDLDIGIGKLDRHSDNFLHIKEPTQLKLYDEIVICGYPSGRISLVLYRNKHEDGMGIHLRPI
Ga0207566_103246 | Ga0207566_1032462 | F021654 | FPDSGISPWVSAGGGFGYFKENSTLEFGGTNPGDTGSATAIFQIGVGLDVKLFSRLSLRGEARDFYAGVPQLNVDIGKSRQHNIFVGAGVVYHF
Ga0207566_103323 | Ga0207566_1033232 | F030581 | MKTQRESRRRVSRFTKMVAKFLHIPDRAAVYVIWAAWIAVVIIIGIIVMLWR
Ga0207566_103479 | Ga0207566_1034793 | F007710 | MTTQRAAAVFDFANYESTTISKKALCQLQDREAQERGARDLQKPAP
Ga0207566_103616 | Ga0207566_1036162 | F094081 | MRDDQGKILKKEFVERFDKAKGRLQQSVPEIQHLLKTSNVVEAARKIIDPAQSIFKQFADDIQLKDLFAKAEALVANLTVAKSVSRDTPPTETSPSETPASKLPSKKASLKKAPSKGVRSRKPLKKTPL
Ga0207566_103775 | Ga0207566_1037751 | F020927 | MWAAFVRQFAAYAITRRGKKLFALIGVLALCFGAALLIDMQFYVSASFAALLAGFAAVTYVVQHVKLKRAEHQRLLRKAEAAHQRALAAQARLERIDTAKSALRGAVTGAGRLVTDNVSIVANEALLMANETA
Ga0207566_103783 | Ga0207566_1037832 | F053943 | VVAPLLHNKEPVKDAAVNVELPQLFTTVTTGADGIACGAATPLPEGLVQPFNVCVTEYAPAVVTLIDEVVAPLLHNNEPVKDVAVNIELPQLSTTDTVGADGIDFGAATPLPEGLVQPLTVCVTEYVPAVVT
Ga0207566_103933 | Ga0207566_1039331 | F015492 | GARAQVFDFGQIEEFESLGSGTQKGGSPPKTIIDDGARHTVLFTILESNTEAKIYWKSKDGSQTTIMRGQGLRAFQTVGEFRIEATGDDSRSFRYGYVLFRLKSEKSAQEDKI
Ga0207566_104056 | Ga0207566_1040562 | F085355 | MDSLKAALTSVKDLLDWLPDLVVALLILAIAVLFALALHRWARKLVRRAIAGRYPF
Ga0207566_104168 | Ga0207566_1041681 | F031960 | GSEPLQTLTADPDAPDADSFSNKDYEFYALKAFITLTRKSHGVVVKTYNACRAE
Ga0207566_104192 | Ga0207566_1041922 | F038444 | MTDLHVEVQGDYIIITLPGTKFMVTYYKAGDPPQLMAKSDWTDDADAPIALGAFRARAWMAASDKARQLGWIE
Ga0207566_104930 | Ga0207566_1049302 | F045182 | ISWYHAVPLHIIPASRDMLHSAKYQDNPVIQKRMDVLKFLDSVWTKGVPLYYWDGRELNPYIGLYHNENLAGWMLAMRNIKGMKSDQIVDEAAAQVRKKMKRVG
Ga0207566_104953 | Ga0207566_1049531 | F020078 | QVDVLFKEFGPTIATAMLQAEKREMRTAIIIRWQSGAIALTAIVLAFLVPGVLLCKLVPLVSIASAIAITTLIAGAGLYLAGRLIEKRTPRSERVDHYLQASIFVIAAGLLWLHVIFQTGAWQDRSIEPGTALAIATGCGIAGALLLIRRTRRLSESK
Ga0207566_104960 | Ga0207566_1049602 | F053375 | MKVTGVSAATILVLAIASPSHAQGIEVFGGYSANADYVQNRPAILILDQKVSPFFSHGSGPNGFEASFKHDVRNGLGIKID
Ga0207566_105198 | Ga0207566_1051981 | F005549 | VYCVAGVLLPRGRGAAEWSVMLKFLRKCTAIPAVKYSIIAITSFLWLVGFADQLPDVEQTVKYVGISLLMLAVAAMA
Ga0207566_105232 | Ga0207566_1052321 | F044132 | DRMAPVNRFGNENGQTTMKQGNSTLQFGGQQSFGQRYNTDNIFNPYARDGR
Ga0207566_105289 | Ga0207566_1052892 | F084687 | MKLVRYRSASSEKPGLILDGEIFDLSGSFAALNPRAPTLD

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice