


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300013051 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0127392 / Gp0191755 / Ga0164274 |
| Sample Name | Enriched backyard soil microbial communities from Emeryville, California, USA - RNA 3rd pass 30_C BE-Lig BY (Metagenome Metatranscriptome) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |

| Dataset Contents | |
|---|---|
| Total Genome Size | 36112040 |
| Sequencing Scaffolds | 19 |
| Novel Protein Genes | 20 |
| Associated Families | 15 |

| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Oligohymenophorea | 3 |
| Not Available | 9 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora | 2 |
| All Organisms → cellular organisms → Eukaryota → Amoebozoa → Evosea → Variosea → Cavosteliida → Cavosteliaceae → Planoprotostelium → Planoprotostelium fungivorum | 1 |
| All Organisms → cellular organisms → Eukaryota | 3 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rickettsiales → unclassified Rickettsiales → Rickettsiales bacterium | 1 |

| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Lignin-Adapted Enriched Soil Microbial Communities From Emeryville, California, USA |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil → Lignin-Adapted Enriched Soil Microbial Communities From Emeryville, California, USA |

| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |

| Location Information | |
|---|---|
| Location | USA: Emeryville, California |
| Latitude (°) | 37.83 |
| Longitude (°) | -122.29 |
| Altitude (m) | N/A |
| Depth (m) | N/A |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F003009 | Metagenome / Metatranscriptome | 513 | Y |
| F003289 | Metagenome / Metatranscriptome | 495 | Y |
| F004148 | Metagenome / Metatranscriptome | 450 | Y |
| F004248 | Metagenome / Metatranscriptome | 446 | Y |
| F005470 | Metagenome / Metatranscriptome | 399 | Y |
| F028375 | Metagenome / Metatranscriptome | 191 | Y |
| F044934 | Metagenome / Metatranscriptome | 153 | Y |
| F047414 | Metagenome / Metatranscriptome | 149 | Y |
| F049342 | Metagenome / Metatranscriptome | 146 | Y |
| F057919 | Metagenome / Metatranscriptome | 135 | Y |
| F065812 | Metagenome / Metatranscriptome | 127 | Y |
| F069541 | Metagenome / Metatranscriptome | 123 | Y |
| F071833 | Metagenome / Metatranscriptome | 121 | Y |
| F071840 | Metagenome / Metatranscriptome | 121 | Y |
| F104185 | Metagenome / Metatranscriptome | 100 | Y |

| Scaffold | Taxonomy | Length |
|---|---|---|
| Ga0164274_100588 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Oligohymenophorea | 4304 |
| Ga0164274_100802 | Not Available | 872 |
| Ga0164274_103623 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Oligohymenophorea | 1597 |
| Ga0164274_104138 | Not Available | 807 |
| Ga0164274_104953 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora | 2922 |
| Ga0164274_112589 | Not Available | 723 |
| Ga0164274_114818 | Not Available | 1195 |
| Ga0164274_116379 | All Organisms → cellular organisms → Eukaryota → Amoebozoa → Evosea → Variosea → Cavosteliida → Cavosteliaceae → Planoprotostelium → Planoprotostelium fungivorum | 681 |
| Ga0164274_117930 | All Organisms → cellular organisms → Eukaryota | 842 |
| Ga0164274_128463 | All Organisms → cellular organisms → Eukaryota | 709 |
| Ga0164274_129361 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Oligohymenophorea | 741 |
| Ga0164274_140145 | Not Available | 646 |
| Ga0164274_140795 | Not Available | 735 |
| Ga0164274_145809 | All Organisms → cellular organisms → Eukaryota | 728 |
| Ga0164274_146064 | Not Available | 651 |
| Ga0164274_150605 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rickettsiales → unclassified Rickettsiales → Rickettsiales bacterium | 511 |
| Ga0164274_150808 | Not Available | 701 |
| Ga0164274_153770 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora | 1297 |
| Ga0164274_156777 | Not Available | 532 |

| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0164274_100588 | Ga0164274_1005881 | F003289 | M*GNIVTEVALQTNFGVGFNNMQSDVLIHLTQ*QY*W*F*FSFL*AFYYLVILRIIRFRTLKFRPRLATTYRPHGK*GDLIICLIPIS*CINIITNSSFILRMIE*QAETGLLTVRIRGKQWYWIYKFELKTFTDILTVPKNIGRNK*QISTPGDLQVADDYLHILQL |
| Ga0164274_100588 | Ga0164274_1005887 | F003009 | YQRTYFNVNIGNLVKYFSILTVAFHDVHSLFGFFILLVVFSQLISGTMLSFSLVPESMMVPLVRDEEDLEDLYTDDFF* |
| Ga0164274_100802 | Ga0164274_1008021 | F047414 | PFMKKESTAGLTLDTRMSPFFGKDLSPRMHRDSLMLSPLFSSNNQHGSFFSGFTPRYFSNVPHMQDNSKTFGPDDFLLRPSPTHMEIDVNARFEKAVENMKMELRNNPVQHLGDHGMDQQGLNLDIDLIDDSYLHAPLTKSPHLQFVGSQPTSKCSFKKFGEWTLSPNASFLPRKKF* |
| Ga0164274_103623 | Ga0164274_1036232 | F003009 | LVWFQRNYFNLSILNLVKYFSTLTVAFHDIHSLFGFFIILVVMSQLVSGTMLSFSLVPEAMMVPIVRDEEDIEDLYTDDFFLNTRARC* |
| Ga0164274_104138 | Ga0164274_1041381 | F071833 | QDPQQNINLVQILIKRKLIEPFEKLSRESVECYNLQKGTKEANGEYATRVNKCLDSWQKHFESVEDHTNKYLSNLRAKEALHFSKLFHCSNAINEKDIEVCRREENQRFANELKETFSQL |
| Ga0164274_104953 | Ga0164274_1049537 | F071840 | MNDVYGLYTSYYILNSFEFLMVGLLLLFASIVCVNLSKFNRNIKLNNYYELLTLYDFFNDFVNFLFMRKQNLNNQTIATVSTRIFKKKINK* |
| Ga0164274_112589 | Ga0164274_1125891 | F028375 | MNLAQECKVWLNTYPKFNWRHTFDFLLSNELTKAVEFSDYGVACFVKGLQAELVHKDYTEAFSWYENGAIQLDSLCLFRLHEIYIGDTNFKVEYNEEQAMIHLLYSALLSQFEVFDQKVSFWQKFDSFWKKEALKTTYLQKLILDPPAYYLVPTGPLFSKLFAFYNNKNSFLDVLPEIKELSIDTLKNKFFPIVNALFDFLAYTYNSGFSKLDLEKYVENILDMLTNDILFDNF |
| Ga0164274_114818 | Ga0164274_1148181 | F049342 | WAKLEACWKKDAAHKDYLMELLNNPPADYLQSTGPLFAKLFTFYNNKETFIDLLPELKALSIDVTKNKFFAILNAIFDFMAYTYNSGFSKTDLEKYVETILDMLTNDILFENFFQNYVAHLRIIRAKKKFAFLFQRRLETDCFWVWSFSFLASMKNHYLGLLLSFEETFLEGGLILKWKNTESWVNNFIAFCYEKGIGTRKNLAKAAQLYKKDIDQMPRVLYSRYRKVLVVKEKRAHGLQLSQEEENINVDEQAEDLKLKIEERLEDTTRMDCYLFYVYGKIYEKIDEDNDRAIEWYQKGVDVDTDSCLKNHLLCNEAWRLKCKKRLLKLQARKGLQVSIVNKNRED* |
| Ga0164274_116379 | Ga0164274_1163792 | F004148 | GGNNVKGLFWMETVFPSPDGRSIIPPEKLQKEYEKGTLRPPDPNAEKEVDGDMDEKVDKVIRDVFNNYDPKGTGQLPKKVMERFFKDSLDVYALRKGFKKGSEVLAPGIKMGQAMQQSLAKITANPQFCTFKEFEDFLNCYDLEEALGSFIGVQEIAIQDRVEFVDTSGLKADAAKPKAVVYRDYSALEN* |
| Ga0164274_117930 | Ga0164274_1179301 | F065812 | MGDKLLLKLNNEYEDKENKYCEGKVKRTLKRGLKDLKSLRRKQ* |
| Ga0164274_128463 | Ga0164274_1284631 | F005470 | VQMSSIKLRKEGQLITEFPPEMVARSRLLTKLVEEFNSTEVDLEPAPGKDFSPAIINKVKEFLEKFDKGLTKMPKKPLLIFVTYNDWLDNNFDEKLREWLEEFLKPKSFYDLVELFNAAFYLQIDDLREICAARIAHSIILERKAPEDFLRDFGIVTQYQDFFTPEEEAKFIEKEFINKNDFEGVAAEDEEELNKE* |
| Ga0164274_129361 | Ga0164274_1293611 | F003289 | M*GNIVTEVALQTNFGVGFNNTKSDVLIHLTQ*QY*W*F*F*FLFAFYYLIILRIVRFRTLKFRPRLATTFRPHGK*GDLIICLIPIS*CANIITNSSLILRMIEWQAETGLLTIRIRGKQWY*IYKFELKTFTDILTVPKNIGNNKWIVSTPGDLQVSDDYLHILQLRSQNK*VHDF*NDLIQKFSKKKDFNLISPQEQLKYDFYETFNKIFLYKMYRSSTLNLQNFNLAFD |
| Ga0164274_140145 | Ga0164274_1401451 | F069541 | NNKNPNFVVSQKAPDKIMSVSNEIKNWLETYPRLNWRLTFDFLLAPENSKAVEFSDYGVACFVKGFQKEFIDRDYNEALNWYETGAMQYDSLCLFKLHEIYIGDTHFKVPYNERQALCHLIYSALLSQFEIFDTKVSFWQKFEAFWKKESSKTAYIQELLISPSADYFVTTGGLFSKLFTFYATKNNFPDILPELQNLSIDILKTKFFSIINAMF |
| Ga0164274_140795 | Ga0164274_1407951 | F028375 | LKAIEFCDNGIACFVKGLQYEHDKDYHEALNCYENGAMQLDSLCLFRLHEIYSGDTNFGVEYNERQAMCHLAYSALLSQFETFDHKVTFWAKLEAFWKKDAAHKDYLMELLNNPPADYLQSTGPLFAKLFTFYNNKETFIDLLPELKALSIDVTKNKFFAILNAIFDFMAYTYNSGFSKTDLEKYVETILDMLTNDILFENFFQNYVAHLRIIRAKKKFAFLFQRRLETDCFWVWSFSFLASMKN |
| Ga0164274_145809 | Ga0164274_1458091 | F005470 | FNLLNTDMSVKVRKEGQVITEFPTDMVPRSRLLTKLVEEFNSTEVDLEPAPGKDFSPATINKVKEFLEKFEKGLHKMPKKPLLIFVTYNDWLDNNFDEKTREWLEEFLKPKSFYDLVELFNAAFYLQIDDLREICAARIAHSIILERKAPEDFLRDFGVVTQYQDFFTPEEELKFIEKEFINKNDYEGVAAEDDEELNKE* |
| Ga0164274_146064 | Ga0164274_1460641 | F071833 | MNLSADYTQDPQQNINLVQILIKRKLIEPFEKLSKENVECYNLERGSLEGKPQYFERVNKCLDSWQRHFERVENSTNQYLSKLREKEASHFSKLFHCSNAINDPEIQACRREENERFANELKETFSQL* |
| Ga0164274_150605 | Ga0164274_1506051 | F057919 | MFIHYLVFFILLVVFSQLISGTMLSFSLVPESMMVPLVRDEEDLE |
| Ga0164274_150808 | Ga0164274_1508081 | F004248 | MTNIVSYGQYLDKKQIYDIVPYIDIKPEDLGTSDTYHEDKILQKFMSYKEDDRILIYKAALQLSIVGYGNKNYGFVRINDKDIIMLEDIFKRYNIKYMEKINAKYNDDDLSVRRLLRLFRFQIQDFIRTHNRPSFLWLKYAEKINKDFMYICFPGGEHLIETKEEAEFFLNTYGNLDNIINSKFRQRLQRIFIARNILQ |
| Ga0164274_153770 | Ga0164274_1537702 | F044934 | MSTTISFTNLLITRRTLSMPGLRNRRVLLPFITISLFLTMRMLALVTPVLGAAMIMLLLDRH* |
| Ga0164274_156777 | Ga0164274_1567771 | F104185 | ENVIKVWKYDEEKKKVEQYKTIKAKGSYPDCIVSNEDESQLLFTSRDSFLESYDFATEKTTQISLNPHIKKTNALVFLENMGKVSVSDYTSGNICFLN* |