


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026740 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0095510 / Gp0072080 / Ga0207439 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A1w-12 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |

| Dataset Contents | |
|---|---|
| Total Genome Size (bp) | 20743468 |
| Sequencing Scaffolds | 26 |
| Novel Protein Genes | 28 |
| Associated Families | 27 |

| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Archaea | 6 |
| Not Available | 12 |
| All Organisms → cellular organisms → Bacteria | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1 |

| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |

| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |

| Location Information | |
|---|---|
| Location | Wisconsin, United States |
| Latitude (°) | 43.3 |
| Longitude (°) | -89.38 |
| Altitude (m) | N/A |
| Depth (m) | 0 |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000569 | Metagenome / Metatranscriptome | 1018 | Y |
| F001213 | Metagenome / Metatranscriptome | 746 | Y |
| F002616 | Metagenome / Metatranscriptome | 543 | Y |
| F004317 | Metagenome / Metatranscriptome | 443 | Y |
| F016622 | Metagenome / Metatranscriptome | 246 | Y |
| F017892 | Metagenome / Metatranscriptome | 238 | Y |
| F019548 | Metagenome / Metatranscriptome | 229 | N |
| F022731 | Metagenome / Metatranscriptome | 213 | Y |
| F023901 | Metagenome / Metatranscriptome | 208 | N |
| F026602 | Metagenome / Metatranscriptome | 197 | N |
| F026949 | Metagenome | 196 | Y |
| F038480 | Metagenome | 166 | Y |
| F040716 | Metagenome | 161 | Y |
| F043582 | Metagenome / Metatranscriptome | 156 | Y |
| F047460 | Metagenome / Metatranscriptome | 149 | Y |
| F057496 | Metagenome / Metatranscriptome | 136 | N |
| F057773 | Metagenome / Metatranscriptome | 136 | Y |
| F058528 | Metagenome / Metatranscriptome | 135 | N |
| F068880 | Metagenome / Metatranscriptome | 124 | Y |
| F070698 | Metagenome / Metatranscriptome | 123 | N |
| F071393 | Metagenome | 122 | N |
| F078970 | Metagenome / Metatranscriptome | 116 | Y |
| F083156 | Metagenome | 113 | Y |
| F088429 | Metagenome | 109 | N |
| F089166 | Metagenome / Metatranscriptome | 109 | Y |
| F089980 | Metagenome | 108 | N |
| F094081 | Metagenome / Metatranscriptome | 106 | Y |

| Scaffold | Taxonomy | Length (bp) |
|---|---|---|
| Ga0207439_100092 | All Organisms → cellular organisms → Archaea | 1653 |
| Ga0207439_100213 | Not Available | 1353 |
| Ga0207439_100333 | All Organisms → cellular organisms → Archaea | 1199 |
| Ga0207439_100542 | All Organisms → cellular organisms → Bacteria | 1038 |
| Ga0207439_101042 | Not Available | 838 |
| Ga0207439_101341 | All Organisms → cellular organisms → Bacteria | 775 |
| Ga0207439_101903 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 686 |
| Ga0207439_101944 | Not Available | 681 |
| Ga0207439_102038 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 667 |
| Ga0207439_102099 | All Organisms → cellular organisms → Archaea | 659 |
| Ga0207439_102254 | Not Available | 641 |
| Ga0207439_102262 | Not Available | 640 |
| Ga0207439_102576 | Not Available | 612 |
| Ga0207439_102765 | All Organisms → cellular organisms → Archaea | 594 |
| Ga0207439_102832 | Not Available | 589 |
| Ga0207439_103188 | All Organisms → cellular organisms → Archaea | 564 |
| Ga0207439_103296 | Not Available | 558 |
| Ga0207439_103344 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 555 |
| Ga0207439_103418 | Not Available | 551 |
| Ga0207439_103590 | Not Available | 542 |
| Ga0207439_103934 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 525 |
| Ga0207439_104158 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 514 |
| Ga0207439_104254 | Not Available | 510 |
| Ga0207439_104316 | All Organisms → cellular organisms → Archaea | 508 |
| Ga0207439_104362 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 506 |
| Ga0207439_104445 | Not Available | 502 |

| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207439_100092 | Ga0207439_1000922 | F023901 | MKDNCSQNEIKTSEQSQSCEGGVPLDPKKFGGYEFAINTKGWDAKREPTHNCHDQHKSGLIPTNLENDKAVRLRQTVKDESGKVHQIGEIDYMDGNGFHKVMDIFDSSPNPWMVDRNLYETKSYFWIRNNGSGYITVRDVSLEILS |
| Ga0207439_100213 | Ga0207439_1002132 | F019548 | MNPSKKEIYKILERFTSQKGGILIILHNSFSSDSKPPQEQTSVVRDDERKSAIFEINFGTVAGLSMLCECEKALADQVMSLDFLDSGIEGDVIWFGGMLDKSGSEFIGSTYDDGLKSAPVEQSELVHRVNQAIDKCLEYMLNSVELDKKVYVDASDRMSGYVKLTRIGEHIREKHPDLFRDNTKSE |
| Ga0207439_100333 | Ga0207439_1003331 | F026602 | MISLTLASMHVFAGRIYAIEQPSKMEGINEEECNKIFKCKIISEDVLKYPDIVNPFAKNEEIAKTLNANDAQIMTEHTCQKLMDVDIVKKKDQKIGEQTPKYLVCLP |
| Ga0207439_100542 | Ga0207439_1005421 | F083156 | MSDGGRERASLGVEVWNSSQKWSVQRSAVRSIAWLDVL |
| Ga0207439_101042 | Ga0207439_1010421 | F001213 | QQEEVNVIVAGSGVYRIEGEDIPVSVGSFLRFDPGTTRQPIAGPEGMTMIGVGARRGSYEPRGPF |
| Ga0207439_101341 | Ga0207439_1013411 | F070698 | QTNIFDRRHHTLMVILNISEIWNNGHNTTFRDSHLEDDNSEILSNGNAPYVIKGNGDCMIIVHADALGASKNFGDGH |
| Ga0207439_101511 | Ga0207439_1015112 | F040716 | MTHVRIAANAILVFSFMAVTGFPALAAPAKCNAELRKCNSHCNLVYESGRANRTCRNRCKDNLYVCKARPS |
| Ga0207439_101903 | Ga0207439_1019032 | F043582 | PQREKKMPLLFYFPLIIWMGVLEAMQDEMRVAATAKARR |
| Ga0207439_101944 | Ga0207439_1019442 | F047460 | MFLLHFVMFGCIYDHFVTALNSAQMGQSGAINAKVRATKSRL |
| Ga0207439_102038 | Ga0207439_1020381 | F078970 | ASRVWIDRVQSEISLWSDLATKLSSTKSVPEALDAYTKCVSQRMQMAADDGRRLAEEAQQLTQKFAQSLGNGRPGMTT |
| Ga0207439_102099 | Ga0207439_1020991 | F068880 | KYKGVLESADRLTFGHSNYGYRYFLNLSSPSKLLDSSSGLIPKNEFLSKIPEGYDVPFKGMLILTQKHNELYAIIFLNPKEKFDSMLNQIQPTLDSIQLSG |
| Ga0207439_102254 | Ga0207439_1022541 | F038480 | NMEGACRVVADRGSSQMKLGRKSGKIVRRDTMKAILVSIVLYFAATAAHSMPISVLNANGLSATIPISDQCGDRCGSSRSYVKDRRSGVGGYSGGYVLVRDPLIQRRPFCPFGSYVACVVSGTYCVDLCH |
| Ga0207439_102262 | Ga0207439_1022621 | F057773 | LILEPAVASHETDDGLGKVDIEIVADDIPSDVGGGAAQQAAEKARKILFGPRIADHAFDLAGGNIEGRNQGLSAVAAVLELTPLDLARRHRQSRRDALERLDAGHLVDRDGAMGVIGGGRGFVDRADVCALGIEGGIGLRGQPVADAMRLEVSLFFKKRPTERCEMFGMSPRRMASSAISRWLQWLMGRSLSDGFSHVIATRAQICSGVNVA |
| Ga0207439_102576 | Ga0207439_1025762 | F058528 | NDIPDGLSVWPQDDVARAICNALMMATGALVTIREDQHLKDQGTYAIGIGHKLN |
| Ga0207439_102765 | Ga0207439_1027652 | F088429 | TPEPQPETNASLNTPEPQPETNASLNTPEPQPETNASEIVTPRSIDLNITVGKDPIARSENQMVTVVALDPTTGKVLDRVFIKLEIKDPVGILVKNYTGTVGNLTRTFKIGENAIGTFIISATASQAGVQSTKSLPFQVQ |
| Ga0207439_102832 | Ga0207439_1028321 | F002616 | MQDTPMPNSFPNWTSRIVIAGRIAEYRRPESRHESTMPGDYHLPFGRVEAQAKAARWLWQSYII |
| Ga0207439_103188 | Ga0207439_1031881 | F089980 | MDLESATELIVMNWLGIFSLAATIFIAIVTINYRNKQHQIKGLLDAFKILNTREHRTSRRKVYELYIEYEKNKDVGIFDNVPEVVDVRADFDVIGTLVKSRNIDEKLFLIEYGPLAYRCWKYLKNHIEAERKKRNFDPFMMNFESLAGKADNFWLKRGYDLSKTLLYQPEQ |
| Ga0207439_103296 | Ga0207439_1032961 | F089166 | ACELGYWAMVDALEAERNAAKKNNAGDHPAFSCEPEAEPSPAPRVQRTADEPAP |
| Ga0207439_103344 | Ga0207439_1033441 | F038480 | VLVSIGLCLAATAVHSMPLSLLNANVAQPVIAVSDQCGDRCGSSRSYVRDRRTVMAGYSGGYVLVRDPLIQRRPYCPFGSYVACIMSGTYCIDLCH |
| Ga0207439_103418 | Ga0207439_1034181 | F004317 | MGKVFRALRPCLSPYRDERQMNLSYMVLLDMWNMFEQKFLSFIGG |
| Ga0207439_103590 | Ga0207439_1035901 | F000569 | MDLHIKKVWLPGAASCLLFFGFYWVLIWLPFDKNRFQFLAIPYLVLPFAGALAAYSSRRMKGSVLERILSALFPAFAFVVLFAVRIVYGLFFEGQPYTLPHFLAGFSVTLVFIVVGGLLLVLGAWPFCRPHLREQ |
| Ga0207439_103909 | Ga0207439_1039091 | F094081 | MADDHRKTLKEEFTDRLEKAKGRLQQSFPEIQQSIKTSGAVEAARKIIDPAQSIFKQFADDIQLKDLIAKAEALVANANLTLTKAASKDAAPTEARPIGAPSNEPSAKKAGVKKTPSR |
| Ga0207439_103934 | Ga0207439_1039341 | F016622 | VLREVAEYPNCFGPLGPGAERIETDRYTLCLGPGSTWNTVQRQRFALEELDEVLEEVRAHLRDRNRTQTQWEVGSSAPAGLVDALLERGIGFDKDPYAVALVLTSEPPPPREGLVARQIETYAEYLDANAVQWEAFGTPPEQI |
| Ga0207439_104158 | Ga0207439_1041581 | F026949 | MKKLFALILSVAALAPFTAQSQRPPGSLAGFLSGQGLVGVKLERRYGNHLFVL |
| Ga0207439_104254 | Ga0207439_1042542 | F022731 | MGTEATIDRVKKELLRAFDNTRAELDRIEILAAGLAAFNAPIPGYEPMFRHLPQLNRNAHELAADEPRA |
| Ga0207439_104316 | Ga0207439_1043162 | F057496 | MAGVYRNHTKVAEDHSSAYCKDEYVFGGKGEPLRLSHLIAFNVIEDAEHISGILTVEFDYYSDDDSIKYRDMHYSNPRLIKHIEGNPTVMKNID |
| Ga0207439_104362 | Ga0207439_1043622 | F017892 | YTKQSTQGPTTVVEGFFEGSLEDAYDEYKKELEAAGFKILFDEIEEHDSEVSWEGEGRSGQVALREECGSDDKIYVHITNRPASE |
| Ga0207439_104445 | Ga0207439_1044451 | F071393 | QANELQDAHDEIDRLSQTVSALQEAVTQYQAGAAAAEDEIVLLESEKAALQAQLDGAFEESKTLADRVLAAEAAAKRREENIASSLKQIDFLNAELMAASSERFKVVAAMQGEQRRQRSAFNQQKSMLEIRLQEKEALAATQAATIKQLEGVRDELDKRFRVIEAL |