Basic Information | |
---|---|
IMG/M Taxon OID | 3300027027 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0114663 | Gp0127646 | Ga0209844 |
Sample Name | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_0_10 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 83878151 |
Sequencing Scaffolds | 27 |
Novel Protein Genes | 31 |
Associated Families | 29 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Archaea | 5 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei | 1 |
All Organisms → cellular organisms → Bacteria | 3 |
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1 |
Not Available | 12 |
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → unclassified Nitrosopumilales → Nitrosopumilales archaeon | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Eisenbacteria → Candidatus Eisenbacteria bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → unclassified Cyanobacteria → Cyanobacteria bacterium 13_1_40CM_2_61_4 | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Groundwater Microbial Communities From The Columbia River, Washington, Usa |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand → Groundwater Microbial Communities From The Columbia River, Washington, Usa |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | freshwater river biome → river bed → sediment |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Columbia River, Washington | |||||||
Coordinates | Lat. (o) | 46.372 | Long. (o) | -119.272 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F002385 | Metagenome / Metatranscriptome | 565 | Y |
F002599 | Metagenome / Metatranscriptome | 544 | Y |
F002896 | Metagenome / Metatranscriptome | 522 | N |
F006787 | Metagenome / Metatranscriptome | 364 | Y |
F006888 | Metagenome / Metatranscriptome | 362 | Y |
F007370 | Metagenome / Metatranscriptome | 352 | Y |
F007831 | Metagenome / Metatranscriptome | 344 | Y |
F008846 | Metagenome / Metatranscriptome | 327 | Y |
F012470 | Metagenome / Metatranscriptome | 280 | Y |
F018428 | Metagenome / Metatranscriptome | 235 | Y |
F019802 | Metagenome / Metatranscriptome | 227 | Y |
F021408 | Metagenome / Metatranscriptome | 219 | Y |
F021640 | Metagenome | 218 | Y |
F022264 | Metagenome / Metatranscriptome | 215 | Y |
F023298 | Metagenome / Metatranscriptome | 210 | Y |
F032183 | Metagenome / Metatranscriptome | 180 | Y |
F035426 | Metagenome / Metatranscriptome | 172 | N |
F036895 | Metagenome / Metatranscriptome | 169 | Y |
F040350 | Metagenome | 162 | Y |
F044994 | Metagenome / Metatranscriptome | 153 | N |
F046963 | Metagenome | 150 | N |
F053366 | Metagenome / Metatranscriptome | 141 | Y |
F055103 | Metagenome / Metatranscriptome | 139 | Y |
F070180 | Metagenome | 123 | N |
F079760 | Metagenome | 115 | N |
F081935 | Metagenome / Metatranscriptome | 114 | N |
F082936 | Metagenome | 113 | Y |
F087961 | Metagenome | 110 | Y |
F095107 | Metagenome | 105 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0209844_1000432 | All Organisms → cellular organisms → Archaea | 2378 | Open in IMG/M |
Ga0209844_1000737 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei | 1984 | Open in IMG/M |
Ga0209844_1000808 | All Organisms → cellular organisms → Bacteria | 1911 | Open in IMG/M |
Ga0209844_1001203 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1667 | Open in IMG/M |
Ga0209844_1002688 | All Organisms → cellular organisms → Archaea | 1219 | Open in IMG/M |
Ga0209844_1004313 | Not Available | 1002 | Open in IMG/M |
Ga0209844_1004339 | All Organisms → cellular organisms → Archaea | 999 | Open in IMG/M |
Ga0209844_1006696 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → unclassified Nitrosopumilales → Nitrosopumilales archaeon | 832 | Open in IMG/M |
Ga0209844_1007986 | All Organisms → cellular organisms → Bacteria | 771 | Open in IMG/M |
Ga0209844_1008030 | Not Available | 770 | Open in IMG/M |
Ga0209844_1009818 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Eisenbacteria → Candidatus Eisenbacteria bacterium | 707 | Open in IMG/M |
Ga0209844_1010291 | All Organisms → cellular organisms → Archaea | 693 | Open in IMG/M |
Ga0209844_1010828 | All Organisms → cellular organisms → Archaea | 679 | Open in IMG/M |
Ga0209844_1011330 | Not Available | 666 | Open in IMG/M |
Ga0209844_1013394 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium | 622 | Open in IMG/M |
Ga0209844_1015217 | Not Available | 590 | Open in IMG/M |
Ga0209844_1015335 | Not Available | 588 | Open in IMG/M |
Ga0209844_1015386 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 588 | Open in IMG/M |
Ga0209844_1016255 | All Organisms → cellular organisms → Bacteria | 574 | Open in IMG/M |
Ga0209844_1017357 | Not Available | 559 | Open in IMG/M |
Ga0209844_1020085 | Not Available | 527 | Open in IMG/M |
Ga0209844_1020136 | Not Available | 527 | Open in IMG/M |
Ga0209844_1020360 | Not Available | 525 | Open in IMG/M |
Ga0209844_1020400 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → unclassified Cyanobacteria → Cyanobacteria bacterium 13_1_40CM_2_61_4 | 524 | Open in IMG/M |
Ga0209844_1021757 | Not Available | 511 | Open in IMG/M |
Ga0209844_1021857 | Not Available | 510 | Open in IMG/M |
Ga0209844_1022522 | Not Available | 504 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0209844_1000432 | Ga0209844_10004321 | F079760 | MLEKGNNQVEIMYLSTSSGKIRFRYGSREGNWYDFKGDARFKADYFLEKGWKRKKSQTIINE |
Ga0209844_1000432 | Ga0209844_10004323 | F087961 | LISNSHSLKLEETAKGVRISVHVYTNDKETAIHEAIATYLETKQKCEKEKIQIAPMEVISKYNEMY |
Ga0209844_1000737 | Ga0209844_10007371 | F036895 | MLTVRIQAIDGTGTCTLSDSQPCRLLTLLRGRYSASQLLRTHPPPSRRRSISRLSRLYDLPCSGDFAPVRGGLLQLLGMSLSPCCRFHPAEVQVPHRSDFGTPCSLRPSDAGSALGSIHYRGHIRVHCRYGPVTRDLPKGDLVDRLQDLGFPSPCYPSYGAPDSCPGRSTSC |
Ga0209844_1000808 | Ga0209844_10008081 | F007831 | MMLLVAAMFLLPPVLVTLYLYMSGVFTNRSSFQSAFYILLALFALTIGIGTHLSYQGYGLPAATPVTLSEVLPEP |
Ga0209844_1001203 | Ga0209844_10012031 | F002385 | LNQQAQELSQALQKIMRRFGRQCRGQGKVFVTLVRETERHLLALGTPIETWSQQARTCLHHDSGRSAAQRERLLRDLEATSAAHRHITTQSQRLTQGKKLAQCKIVNAYDPTIAPILKGKSNCPAQFGRKTGIVSEPASGFIFANRVPAGNPSDPSYVLPMLDKVQDAIDLVVSPKRLRVISLGGDLGINDAQLRQALHARGILTVGIPTSVEPINPTPSQQEVRDILNASGLNRIRTPHQVHLACASGYSRPVVEGHIATLMTRGADQVRYKGLEGAVIQMGMAVMAHNGAVLVRVGQQRLSKRGQKFRRLLG |
Ga0209844_1002688 | Ga0209844_10026882 | F081935 | LRFKRNYKYEIIKRKAIKIERLPAPIPEEQRQVKIMPYMKDTSTFWNIRKETYSERKRTCIICGQTATYTAYFQIEGAKLKEKYCSDCVEKWVYLDLGLLHGKGRLQDSYQAHIG |
Ga0209844_1003069 | Ga0209844_10030692 | F095107 | KMSTKYGEPFFHSINGVMLPSFRFSEPGRYTISVEIAAQLFIPIGPVFANFSAAVSPAADGNLEIKLST |
Ga0209844_1003723 | Ga0209844_10037231 | F035426 | MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRMTDLATQIHLSSSQLERQFKHYTAISPKAYARIVRFGSLQASLLVNPSI |
Ga0209844_1004313 | Ga0209844_10043132 | F007370 | YVVFAVYERATMASNTPDDSRVVSVRLPTPLLQRLDRLLDWHTTHRRRPTTRNAALRAALGDWLDQQEQLAGLLDSQVLRQQFHAAYTSLRPSATGAPISRLRRVLQWPRERFDTVLETLRAAQVIEVEARTEPVGNDPATHDSYQVHGQYYDRLRWRP |
Ga0209844_1004339 | Ga0209844_10043392 | F046963 | MTRTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTAPELIQVRKILFERALEMKKAIAYVRAQESKS |
Ga0209844_1006696 | Ga0209844_10066961 | F070180 | HIFSLLNGLPGVIILSGSAIVVMKLLFYTKLERSEIHKFSSLLLGIFCFLIGETLYFYQQYFLQITIPYPSVAEVPYLLGSLFFSYFLFLCLFSLINRKGFNPLPIILGSSVSIFPIFLILSSAYDLEINKSTELEFIVNALYYTFDALMMVPALVILLNLKKNDPFIFHWISITVALILLVIGDVGYTYFSIISESLIEEFEWLWSIVYALGYLFLGIGIYWFDRIKNTLEDKKINIFLEKDEMDRLKNSSKNELIGDMGTEYSEHIIGYENFVDK |
Ga0209844_1007986 | Ga0209844_10079862 | F018428 | MRRFGRQCRGQGKVFVSLVRQTETQLLTTGAPVVALAQTARAQIQTATELTEDQRARWDTTLTLALVTHQQIATQSRRLTHGKPLTRCKIVKAYAPTIAPICKGKSNCPTQFGRKPGIIAEPATGFVFAAQLPVGNPTDVSYVGPLVDKVQTALTHVTTRPTPAIHS |
Ga0209844_1008030 | Ga0209844_10080302 | F021408 | MKKTSIAIATICLLWAAVATVPTTEAATTFAGAIVGIDQTQRTITYQTEDGRTWTLPVTDSNILKQEQIAKGDRVRVEVEVSDDLSQRITKITKIPDQPRTKPTQSLNDVRP |
Ga0209844_1009818 | Ga0209844_10098182 | F023298 | MMNLVRNFVHDENGEDLIEYGLLAAFVAAVALVTIIADPLGIKGSLVGAFNKAKAALDQS |
Ga0209844_1010291 | Ga0209844_10102911 | F021640 | HNLMSPKFGFFKKTSNYFQDKKNDSDSVQSVTDKELLEIIENEKKVFEKDLSSNLEPIRNSVLDCLDRLRKGADELEEQEIKVENPQFESLINTSKKILITSIKKESLIQSSEIKNYEDAVKFKNNLELLINRFGQVGDSHNRILNEFMRKQVNKFKNEFDNLSSLLKKVTKLLSTKENEINKCIACKEDLILFKEKLSERNGKQERLSELMKERQTIDKNIDMGNKEYE |
Ga0209844_1010828 | Ga0209844_10108281 | F002896 | DMSKMDQDTILHFYMILENSLLESDISKINEADIDAWSQSFKKVVRESKEKSGKGVFVPFLMWRLGEISPVEASKYLVNRKQDECRVSYDHNNVEYIIWVMALMFMSWSVTNLKRKMQNGHCQNIDHPHGDINPRLCQEGTKFHQELYNECVKTFKDLLIHSNSERDR |
Ga0209844_1011330 | Ga0209844_10113302 | F002599 | MTPTWAQRQEALRRDCIVSPDVFNPMVDRLRDFALPYQQALETEAQARRRDLQAARTPAD |
Ga0209844_1011428 | Ga0209844_10114281 | F040350 | MLNRWHRLSLQVKMTMIIVSIVGVSAVTTEWLEVRAIQHTVEDNVRDAALAVGRSVDQNVISLAQLSNREARTKELEKILANLPGLLNIVLYEFPVEPGGSPQPITSAGPTELLLLTHPRQEKARELIQRVREQRRPLIDYADRSNTHRV |
Ga0209844_1013394 | Ga0209844_10133941 | F018428 | MRRFGRQCRGQSQVFVSVVRQTETQLLTTGSPVGGLARAAQAQVQSAPQLTEEQRERLTTQLQVALAAHQQIVTQSRRLTNGKPLTQCKIVNAYDPTIAPICKGKSNGPTQFGRKPGIIGEPASGFIFAFALPVGNPADLSYVVPLVDKVQTAIAYVAGRPPLAIHSLAGDLALNDTKLRETLHGRGILTVGIPHTVAPLSPSPTPE |
Ga0209844_1015217 | Ga0209844_10152171 | F006888 | MEDTAVDAFLTSSFGTFVMTLWHGVLSAPSWHNFTYLTYGWALASGRQTITAYLWGSGAAQVKHFSRYYAFLGGALYHRRYQLWARVIRFGASLVPADAVIEVRLDDATMKKTGRHIQGAAHYRNGAGTARQEYRTLWGINLVWAIMRIPLQRWPGHHLSLPIGLELYLKEALANKLKVPYRSRS |
Ga0209844_1015335 | Ga0209844_10153352 | F019802 | MGIVDGGELRGSRQSALKLMVLNLKLVDSMFGSQSTVAPPLPAAVDFVASLPEAATVFATTNRPLFPRLPRRLDSTEILTFLSRHHPYPIDERKA |
Ga0209844_1015386 | Ga0209844_10153862 | F006787 | MIDRQFLGARALAETLHGMAPPVLARIRDTRSDERRPRLGLNHLRINDNHRSII |
Ga0209844_1016255 | Ga0209844_10162551 | F022264 | MADRGIEAVRAHLAKLPPSDSLTTAERRAQYERAEKAFPTPPDVKVERVSAPAIPAEWLRPPSAVP |
Ga0209844_1017357 | Ga0209844_10173571 | F082936 | QAYVHGFRLGEVPIHFKNRAREASKLSAEEIYTALVNFALLRFRYGLRPRRRPEPRA |
Ga0209844_1020085 | Ga0209844_10200851 | F012470 | MSKYQDLSKTFAESVQLQAKYVSECCEFSNFFFSQMAEYLEWPKDQMTFAPDESSQTEEPICSHHSVHLREDGNIHFFALFTIKRFDNIKHRYQLIFPFKVKKLEDSFLLTITGLVEEVLLSTEDTQEMKNVYENLFAAMKS |
Ga0209844_1020136 | Ga0209844_10201361 | F008846 | MTDQMIVRRLALDLRNLADKTLAVRGAVEDYRHDLVRTIDDDSCDDDELLALQRQVQELWGAMDRAEAKLRSGSRRMSPLLWLE |
Ga0209844_1020360 | Ga0209844_10203601 | F053366 | AGCSERKNRQTPDELRAEIAALEKELVELRPKLDGLILKDLRIQGMPKTPVRVGVPTALATLLIERVVSGFVDHVTLELKNLKVNKKGTVKKVVTLGQYELHVLIKRVSGRLKTGKPSVTFGGNKVALSMPVTVASGSGNANISFKWDGKGMSDAVCGDLAVNQDVTGGVKPASY |
Ga0209844_1020400 | Ga0209844_10204001 | F044994 | FATRHHRTPDEMLRLCLAVYEEALYQRAHAQMLADGILVELPAPPLLPGETEDDDDFEPVEIPGKPLSEMILEDRR |
Ga0209844_1021757 | Ga0209844_10217571 | F032183 | MLATILFAVMILLQTGALSLGLTVGRRFITAAAGSNGKYSRPNSVYHRERH |
Ga0209844_1021857 | Ga0209844_10218571 | F007370 | ADWLLAGKIPSWQRPFFALRFAHVLHYVVFVAYEGEPMPHPEPEDSRVVSVRLPTTLVQRLDRVLDWHMTHRRRPTTRNAALREALGDWLDQHEQLAGLLDPESLRQQFRAAYDSLRPSPDGVPIPRLRRLLPWPRERFNTVLEALRAAQAIDLEPLSAQVGDTQATHD |
Ga0209844_1022522 | Ga0209844_10225222 | F055103 | MFAEQTNLDSGLRGFWLVLDGHLGQARLANRMFYWLRRSSPRWLAQNDPEFRVLMKYPG |
⦗Top⦘ |