Basic Information | |
---|---|
IMG/M Taxon OID | 3300026749 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0091547 | Ga0207452 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G07K1-12 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 21600752 bp |
Sequencing Scaffolds | 8 |
Novel Protein Genes | 10 |
Associated Families | 10 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2 |
All Organisms → cellular organisms → Archaea | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1 |
Not Available | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (BCSTs) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (BCSTs) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | |
---|---|
Location | Wisconsin, United States |
Latitude (°) | 43.3 |
Longitude (°) | -89.38 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F003059 | Metagenome / Metatranscriptome | 510 | Y |
F023865 | Metagenome / Metatranscriptome | 208 | Y |
F025522 | Metagenome / Metatranscriptome | 201 | Y |
F027009 | Metagenome / Metatranscriptome | 196 | Y |
F045778 | Metagenome / Metatranscriptome | 152 | Y |
F054450 | Metagenome / Metatranscriptome | 140 | Y |
F055942 | Metagenome / Metatranscriptome | 138 | Y |
F061030 | Metagenome / Metatranscriptome | 132 | Y |
F068970 | Metagenome | 124 | N |
F103537 | Metagenome / Metatranscriptome | 101 | N |
Scaffold | Taxonomy | Length (bp) |
---|---|---|
Ga0207452_100056 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota | 1274 |
Ga0207452_101542 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 619 |
Ga0207452_101578 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 615 |
Ga0207452_101972 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 582 |
Ga0207452_102170 | All Organisms → cellular organisms → Archaea | 569 |
Ga0207452_103016 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 522 |
Ga0207452_103282 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 511 |
Ga0207452_103526 | Not Available | 500 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207452_100056 | Ga0207452_1000562 | F027009 | MVASKMEADIIAEYFKKSKQNLQSEHDSMIQDLKQDISSYKKKALSDV |
Ga0207452_101542 | Ga0207452_1015421 | F061030 | IPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGRK |
Ga0207452_101578 | Ga0207452_1015781 | F055942 | PVDARYLAGQGGVRLVIACNHKSAILLELLDHGIGDQDFDARHQRAILERGHRNAVNVPEIVRLYRPDVISKATGDREAKDCRK |
Ga0207452_101972 | Ga0207452_1019721 | F023865 | DVDPGATPAGLAQVRFLDQLYNGQIERAYASLHPAYQRVVPRSRFVECTRRGALGGLDSIEVLDVYDDPVQIPGGGKAGAKAVRVRLTSSDGQATTFVNHEVKVGPRWRWVLNDAALKAFQAGKCPST |
Ga0207452_101972 | Ga0207452_1019722 | F025522 | MSTAQTYEVKCPHCKKSFKAELLGGGERTGFKCPHCRLFVPYERAATEQAR |
Ga0207452_102039 | Ga0207452_1020392 | F103537 | VFAFAAIEYEFLLTVSGQIKHELLKMNEAKFIDIVQWRYTEVAQQAAKFVTLNKAFINCFKIIHKLSQQIFNIILATKFSR |
Ga0207452_102170 | Ga0207452_1021702 | F054450 | FFNGNNPHSNKHRPTPTLRPVNISKEQLRDFRKLLSDPRTSQKIKKRFMLEKASGYCSVW |
Ga0207452_103016 | Ga0207452_1030161 | F003059 | AIVMAAMPAAAQVRDAVYRGTLICDKLPFSAGKGREAIEVTIAGGTVRYSHVVRLRDAAEPVSEQGKGSLNGQDIELQGSWKAGNRQYEAKYSGAFVRRHADLKGTQTWTDGGKSFTRACTGTIKRPFRVFLPGEKK |
Ga0207452_103282 | Ga0207452_1032822 | F045778 | MPLARVVTFEGVDRARIDQLREQIETGEQPEGLNATEIIVLHDGGADKSLTVVFFDNEDDYVRGNEILDSMPRDDTPGERTSVSKYDVAIRMAD |
Ga0207452_103526 | Ga0207452_1035261 | F068970 | NIPAFGKYFEDIKVYSDVYTGGGNSSKPINQFDPVNKEIYVGQTIKWSNPTAGAPYPHIVVFVSNQSAELESKISNITKPLHSSNGQSVVGNLNKLLIEDNNHNNKSNQTFSARSIVLPSIINTSSLAVKYLNVNGSGIYSGAGYNFTGNERYVSSGLIWAGGVIP |