


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300004265 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0111384 | Gp0097054 | Ga0051981 |
| Sample Name | Groundwater microbial communities from aquifer in Utah, USA - Crystal Geyser 4/10/14 3 um filter |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | Y |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 813327234 |
| Sequencing Scaffolds | 31 |
| Novel Protein Genes | 35 |
| Associated Families | 30 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Predicted Viral | 1 |
| Not Available | 14 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 6 |
| All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota | 1 |
| All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 2 |
| All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 2 |
| All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium RBG_13_41_22 | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Paenibacillus → Paenibacillus pasadenensis | 1 |
| All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → unclassified Nitrososphaeria → Nitrososphaeria archaeon | 1 |
| All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Leptospirillum → Leptospirillum sp. Group II → Leptospirillum sp. Group II 'C75' | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Utah | |||||||
| Coordinates | Lat. (o) | 38.9383 | Long. (o) | -110.1342 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F002184 | Metagenome | 585 | Y |
| F002652 | Metagenome | 539 | Y |
| F003801 | Metagenome / Metatranscriptome | 467 | Y |
| F004273 | Metagenome | 445 | Y |
| F004515 | Metagenome | 434 | Y |
| F010662 | Metagenome | 300 | Y |
| F011639 | Metagenome | 288 | Y |
| F012552 | Metagenome / Metatranscriptome | 279 | Y |
| F014811 | Metagenome | 260 | Y |
| F019952 | Metagenome / Metatranscriptome | 226 | Y |
| F030345 | Metagenome | 185 | Y |
| F043771 | Metagenome | 155 | Y |
| F053063 | Metagenome | 141 | Y |
| F058693 | Metagenome / Metatranscriptome | 134 | Y |
| F060597 | Metagenome / Metatranscriptome | 132 | Y |
| F060838 | Metagenome / Metatranscriptome | 132 | Y |
| F069687 | Metagenome | 123 | Y |
| F071959 | Metagenome / Metatranscriptome | 121 | Y |
| F073189 | Metagenome | 120 | Y |
| F078228 | Metagenome | 116 | Y |
| F079604 | Metagenome / Metatranscriptome | 115 | N |
| F080880 | Metagenome / Metatranscriptome | 114 | Y |
| F080881 | Metagenome | 114 | N |
| F086649 | Metagenome / Metatranscriptome | 110 | Y |
| F093503 | Metagenome | 106 | Y |
| F094905 | Metagenome | 105 | N |
| F094906 | Metagenome | 105 | N |
| F096626 | Metagenome / Metatranscriptome | 104 | N |
| F098657 | Metagenome | 103 | N |
| F102531 | Metagenome | 101 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0051981_10046109 | All Organisms → Viruses → Predicted Viral | 1874 | Open in IMG/M |
| Ga0051981_10069159 | Not Available | 1460 | Open in IMG/M |
| Ga0051981_10096640 | All Organisms → cellular organisms → Bacteria | 1185 | Open in IMG/M |
| Ga0051981_10131612 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 974 | Open in IMG/M |
| Ga0051981_10133469 | Not Available | 965 | Open in IMG/M |
| Ga0051981_10150744 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 892 | Open in IMG/M |
| Ga0051981_10176083 | Not Available | 805 | Open in IMG/M |
| Ga0051981_10180623 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota | 792 | Open in IMG/M |
| Ga0051981_10194724 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 755 | Open in IMG/M |
| Ga0051981_10196858 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 749 | Open in IMG/M |
| Ga0051981_10212307 | Not Available | 713 | Open in IMG/M |
| Ga0051981_10215628 | Not Available | 706 | Open in IMG/M |
| Ga0051981_10225835 | Not Available | 685 | Open in IMG/M |
| Ga0051981_10231292 | Not Available | 674 | Open in IMG/M |
| Ga0051981_10240652 | Not Available | 657 | Open in IMG/M |
| Ga0051981_10252527 | Not Available | 637 | Open in IMG/M |
| Ga0051981_10257283 | Not Available | 629 | Open in IMG/M |
| Ga0051981_10270247 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 609 | Open in IMG/M |
| Ga0051981_10275200 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium RBG_13_41_22 | 602 | Open in IMG/M |
| Ga0051981_10276593 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 600 | Open in IMG/M |
| Ga0051981_10280626 | Not Available | 594 | Open in IMG/M |
| Ga0051981_10299590 | Not Available | 568 | Open in IMG/M |
| Ga0051981_10301867 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Paenibacillus → Paenibacillus pasadenensis | 566 | Open in IMG/M |
| Ga0051981_10304309 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 563 | Open in IMG/M |
| Ga0051981_10306440 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 560 | Open in IMG/M |
| Ga0051981_10319644 | Not Available | 545 | Open in IMG/M |
| Ga0051981_10322438 | Not Available | 542 | Open in IMG/M |
| Ga0051981_10325695 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → unclassified Nitrososphaeria → Nitrososphaeria archaeon | 538 | Open in IMG/M |
| Ga0051981_10332400 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Leptospirillum → Leptospirillum sp. Group II → Leptospirillum sp. Group II 'C75' | 531 | Open in IMG/M |
| Ga0051981_10361389 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 502 | Open in IMG/M |
| Ga0051981_10361402 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 502 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0051981_10046109 | Ga0051981_100461094 | F093503 | KPMKKYKVKIKHTDKEEIIKADSELEARVKFCEQNNLNYRHLAGKLEITLNNKPLQNNL* |
| Ga0051981_10069159 | Ga0051981_100691593 | F079604 | MTKARRGITLIQNRKKFKLDTFFQDIHKSPVAFMEICHYQGKQVQKFKQTKRLLRLISECDKSIAVLPRGASKSFSFAIIALWYFYTHENFRVAIFSRSHRQSKAVLEICSDIIDSSPLLKTSRQSFQIDQKQRLKSHINSEIIAHPFDASTVLGEHPDIVLADECAFFGDDSFFRKVILPMQSGVRTIYKIPKISLVSTIDQDEGFFYDVWKNPEKYGYTKLKMTWQQCDGYTKEDMQKKKI |
| Ga0051981_10092047 | Ga0051981_100920471 | F094905 | MNALYGYIAVFVVIVFLGAGWAVEHDKRITYQAKVEQAGADALAQTEKINAKHREEMQNAEQNTIIATNSI |
| Ga0051981_10096640 | Ga0051981_100966402 | F078228 | MSLSEEMMKAIKEEKKKRKLGSIQETIRSILAEYFAKKS* |
| Ga0051981_10131612 | Ga0051981_101316122 | F102531 | LSFASSILNLPNADNPAGWTLGTYGGQTNVIQPQKTVAGINQEIDKLVANGVDKLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK* |
| Ga0051981_10133469 | Ga0051981_101334691 | F043771 | LRGWLWRGDSHLIAGSSPAVIAGIAGRKAVMDAVRLTKCECGCGCGEDATTSDGGVALCAACAEYAVDPESGEVVCSRDPRAEEITECCGAGGQTRSYWRIRPPVAPAVASDGEWACYWNTVGDGSRVVSRHATEAEAARAVVTRRQWVVAGGHPHYLYRYEVRQWDGEAWVAPYEAE* |
| Ga0051981_10150744 | Ga0051981_101507443 | F002184 | MKYIAKYSRVWDDDVKPIQKPLQPTLYTCEELCSILNKDNYS |
| Ga0051981_10176083 | Ga0051981_101760831 | F071959 | MASRNYLVMSNDMTLSDKKEYRLNALSAGLERCGLRGIGDIKADIPGLAGIPDANKVARVKLIHNYLITGQWPRSIDQRELTTGTDLVVAPAVDSLLTAPMAAVGNIVSCFQGVAAPQLVQGKLMVCYAVSVESSAVPMPVSRLIFRRGAAGNVQAQFDMEPMGIRWEVDAFFSEVVVIDPQDVFAIQVRCRNATAVPEIVHIHNFLFESAGLVVA* |
| Ga0051981_10180623 | Ga0051981_101806231 | F010662 | MNTHKLLMSIAEAFMIYGLIIVGYGILAIHVTKTWFGDWHVDRYLPWLTLDMFMMISFALSFFGFIVWRYLKYIEETK* |
| Ga0051981_10194724 | Ga0051981_101947241 | F019952 | MAQTVKSMRISSMGRVILLEKPRALRRIFYGIKVLTDLTMGHRSHISFDDPTFLSYYILDGQVQQLEAKGEGICQGDIWARNVSPAELIFDMTEILV* |
| Ga0051981_10196858 | Ga0051981_101968581 | F080880 | LRVEKVGSCEDCVIKAKMNEQTKLREALDLTSLGRKTLDGLLEVAVGDKTIQDFIKEQKAESKNIIEILRGTIEWKTPSNFKEFTIILEEIRNLTQMFDIASHSEIENTVILRPKAFKRLPEVVAFQTAVMLEGVGAPFEIRMMGEDIAVKMIRQEIYPLRKKEFGESLDQQIEKRLATSRPGLFKNSLMLVGPGFMNWAEKHLEEPVTDLGSIIEDVRIALGVDELPREPKEFVMGLLSACVKMNWFK |
| Ga0051981_10212307 | Ga0051981_102123072 | F060838 | MPTFQYVQPTDEQKQTMQTFRDKYESLAKEIGKLPTSRGLSLAFTKLEESSFWLNKAITQNS* |
| Ga0051981_10215628 | Ga0051981_102156281 | F069687 | MEELARKIEQILNQFAQEETKNRLSQFSMIALKEIILNEIKSYKPKIKE |
| Ga0051981_10220085 | Ga0051981_102200851 | F094906 | ENVQSAQYSVAVSNGDSKVYDVEMDIFGSLIRLHWKYFVASPPDNFSARYRINGGSWWSWSITRSGQQYHASDTGGWQTGDQLEINIYNPASGSETNIQDSILYGNDCF* |
| Ga0051981_10225835 | Ga0051981_102258351 | F080881 | MSKNSKSSVALSELAVLPQFVAPVVEIAPEVLSMIVMDIDPALIEAAEIKEARLDAVKAKKDDAKRLLAQAHEIAKTLNPLCDEQDKVKEECRTLKDVLKDALLATRVALANHPEVLENKAQIVKAAPNLMAEFNEKFNAAVIKQTQIAQADNIARMKALDASFAELDKKTAQLSKQYDEYKALSTRLFAEADQLRVEYGFA* |
| Ga0051981_10231292 | Ga0051981_102312921 | F086649 | NLTSESTDQQVQDAISAEIELCMNQPPPPGAENQQKYCAGKAYGMAREKTGKELNLGK* |
| Ga0051981_10240652 | Ga0051981_102406521 | F096626 | LASSATKRMTSAMLLGMWRNSSKYVMDVAPKLTTMTGSQLLRYKANIERHEKRLTALCSRNPELALSASDLDKIMINLADDDADTAFGAYLADRTEEVRSKLTEDSEAL* |
| Ga0051981_10252527 | Ga0051981_102525271 | F004273 | MKNDEKAVIAGAGVMLAYFLLKKKEKKVDIPNLSISPPVFYA* |
| Ga0051981_10257283 | Ga0051981_102572831 | F030345 | MELKNERILIQTIEKRDRENESPWYAITGNQGRRFSCFEEEVAKKLQINKVNLCKVRYFGKYANVMSVEGYEDKPEGADANSEIRKERNIESLRILKCVALKCAAQCFDGQSASAGEVIAKSNAFLKWLFSIEDKDAI* |
| Ga0051981_10270247 | Ga0051981_102702472 | F004515 | MTQTIRSVTVLPGQQRVLLEKPRVLYRIFFSIRALADQTVWYQSKISFDDPLFHSYYVLDGPGKYFEAYGEGIFQGDIWVRNASPVN |
| Ga0051981_10275200 | Ga0051981_102752001 | F053063 | MTKDEISARIELLKKENDEDNTKLINLNQIRESVIQGMLVRNGRILELEEILKN* |
| Ga0051981_10276593 | Ga0051981_102765932 | F004515 | MVQTVYSVTVFAGKKRIILEKPRVLRRVFFSIRALADQKVWLKSMLSFGDPLFRSFYVLNGPAKYFEAKGEGIFQGDVWIRNASDQ |
| Ga0051981_10280626 | Ga0051981_102806262 | F002652 | MKKAKASVGAIPTQVNENRNKTFEKFFEKMRRVKALEIDIQKRENDFFWEYIQKPISTQEERDDRDILKRAAHLGFAGEWRAVFQERNEILEISTDIRELAGAQGKKNDAEKFAKWKEILEKIEKTNKNTV* |
| Ga0051981_10299590 | Ga0051981_102995902 | F014811 | MWIYDYKTKKEYSTSSPRSKFYNYVIIEIEPIPGKPCIILVKKGMEEGRKPAQVLEQLDIFKNLSVPSAT* |
| Ga0051981_10301867 | Ga0051981_103018671 | F003801 | RWKEIDDGKEEVDWLMANLHASIEREGNQQLEKKKNTKESVYIFSVPIDEIKKHILNRSVVKIEQVITMTNKIREDIYEFDQQIGIASPETDVEQEKKTKKIDDFQEEIRKRWEDVWRIKEEIHTTTDIIYEISHIRRKYIGVAEEEQMSWADILQEIKHKMKNSFCF* |
| Ga0051981_10304309 | Ga0051981_103043092 | F058693 | LLNKPLVLNRIFFTVKVLTDPTIDYKLYISLGDPGFSSYFTLDGQVPYFEAKGEGIFQGDIWAQNVSAADLLVSMSEILV* |
| Ga0051981_10306440 | Ga0051981_103064401 | F073189 | MLSTIDILSIIEEQEIYDVKYLATKLQIPLEQLKEILTNMRKHTLIEYDPRTGKVTLPTWLINIEKKMEKIKPTTGEIILPKYQEIKIQDITIGNFTKNDLELKLRFKAKQKEIAI |
| Ga0051981_10319644 | Ga0051981_103196442 | F058693 | YSGWKVLLFEKPLVLKRIFYSVKVIVDPTTDYRSYISLGDPRFHSYYTLGGSVSFFEAKGEGIFQGDVWAQNVSSSELLFVVTEILV* |
| Ga0051981_10319767 | Ga0051981_103197671 | F098657 | TREVITLIARGRELKAGLGITEDAQMAQFHTTVLTLLHGSAKEAYQVAVSHTAETRRQTEAEEATVEFFADLDKPTALVRFNRKRMLYNETLEIINDDFETGLNAICDQLLPFQALIKVKRYLRRHCHKPANMTIRQYVSCIRKINEHELPLLVPMSPDNQLPFDEVKELLFYGIPNRYR |
| Ga0051981_10322438 | Ga0051981_103224381 | F012552 | MKKKGASVGAIPPQVEEIMKKTFEKLFQKIKYLKDMEVDVQKNENDFFWEYVQKPISTREERD |
| Ga0051981_10325695 | Ga0051981_103256951 | F011639 | VWIKKKYVRPRLSPDELWLIYRLIDNQYWLLRKHPHPFRSLEEVSRLRRKILRLVNRCRKPKHQISLRQRLY* |
| Ga0051981_10328505 | Ga0051981_103285051 | F060597 | MVQTVKSQTVYTGWKVILLEKPLASCRVFFSIKTLADPAAWRRSMISFDDPSFVSHYVFDGPLQQLEAKGEGIFQGTIWAWNVSPVDILFTVTEILL* |
| Ga0051981_10332400 | Ga0051981_103324001 | F080881 | EIKAKKDDAKRLIVLCHEIAKTLNPLCDEWDRVKEERALLKGVLKDALLASRIALAGHPKVRENKAQIIKAAPNLMAEFNKRFDSAVIKQTRIVQADNIARMKALDQSFAELDKKTAQLSKQYDDCKALSTRLFAEADQLKVEYGFA* |
| Ga0051981_10361389 | Ga0051981_103613891 | F004515 | MVQTLLSKTILPGQKLKLLEKPHVLNRVFFSIRALADDTVWYKSLVSFGDPLFHSFFVLNGPGKYFEARGEGIFQGDVWVRNLSNQNLEYTATEILI* |
| Ga0051981_10361402 | Ga0051981_103614021 | F004515 | MVQTVISKTILAGQKRIFLEKPIALKRVFFSITALADPGMWYKSEVSFDDPLFLSFFVLNGPARHFEARGDGIFQGNVWVSNLSDYDLEYTATEILV* |
| ⦗Top⦘ |