


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300031484 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0132857 | Gp0330684 | Ga0314822 |
| Sample Name | Metatranscriptome of soil surface biofilm microbial communities from soil inoculated with nitrogen-fixing consortium DG1, State College, Pennsylvania, United States - MICR_R1 (Metagenome Metatranscriptome) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 23322151 |
| Sequencing Scaffolds | 18 |
| Novel Protein Genes | 21 |
| Associated Families | 19 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Amabiliviricetes → Wolframvirales → Narnaviridae → Narnavirus | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
| All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 2 |
| Not Available | 9 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 2 |
| All Organisms → cellular organisms → Eukaryota → Haptista → Haptophyta → Prymnesiophyceae → Coccolithales → Coccolithaceae → Coccolithus → Coccolithus braarudii | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Surface Biofilm Microbial Communities From Soil Inoculated With Nitrogen-fixing Consortium Dg1, State College, Pennsylvania, United States |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil → Soil Surface Biofilm Microbial Communities From Soil Inoculated With Nitrogen-fixing Consortium Dg1, State College, Pennsylvania, United States |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → land → biofilm material |
| Earth Microbiome Project Ontology (EMPO) | Unclassified |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Pennsylvania | |||||||
| Coordinates | Lat. (o) | 40.7997 | Long. (o) | -77.8629 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000203 | Metagenome / Metatranscriptome | 1619 | Y |
| F000344 | Metagenome / Metatranscriptome | 1257 | Y |
| F001633 | Metagenome / Metatranscriptome | 660 | Y |
| F001679 | Metagenome / Metatranscriptome | 653 | Y |
| F002242 | Metagenome / Metatranscriptome | 578 | Y |
| F005048 | Metagenome / Metatranscriptome | 413 | Y |
| F008500 | Metagenome / Metatranscriptome | 332 | Y |
| F011252 | Metagenome / Metatranscriptome | 293 | Y |
| F013934 | Metagenome / Metatranscriptome | 267 | Y |
| F015442 | Metagenome / Metatranscriptome | 254 | Y |
| F016001 | Metagenome / Metatranscriptome | 250 | Y |
| F026607 | Metagenome / Metatranscriptome | 197 | Y |
| F030619 | Metagenome / Metatranscriptome | 184 | Y |
| F030637 | Metagenome / Metatranscriptome | 184 | Y |
| F048822 | Metagenome / Metatranscriptome | 147 | Y |
| F053640 | Metagenome / Metatranscriptome | 141 | Y |
| F059087 | Metagenome / Metatranscriptome | 134 | Y |
| F099956 | Metagenome / Metatranscriptome | 103 | Y |
| F105419 | Metagenome / Metatranscriptome | 100 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0314822_100504 | All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Amabiliviricetes → Wolframvirales → Narnaviridae → Narnavirus | 2151 | Open in IMG/M |
| Ga0314822_100692 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1873 | Open in IMG/M |
| Ga0314822_102643 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1041 | Open in IMG/M |
| Ga0314822_103630 | Not Available | 899 | Open in IMG/M |
| Ga0314822_104510 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 816 | Open in IMG/M |
| Ga0314822_104619 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 807 | Open in IMG/M |
| Ga0314822_104799 | All Organisms → cellular organisms → Eukaryota → Haptista → Haptophyta → Prymnesiophyceae → Coccolithales → Coccolithaceae → Coccolithus → Coccolithus braarudii | 793 | Open in IMG/M |
| Ga0314822_105561 | Not Available | 746 | Open in IMG/M |
| Ga0314822_105591 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 745 | Open in IMG/M |
| Ga0314822_106752 | Not Available | 686 | Open in IMG/M |
| Ga0314822_106968 | Not Available | 677 | Open in IMG/M |
| Ga0314822_108621 | Not Available | 616 | Open in IMG/M |
| Ga0314822_110869 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 558 | Open in IMG/M |
| Ga0314822_110956 | Not Available | 556 | Open in IMG/M |
| Ga0314822_111150 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 552 | Open in IMG/M |
| Ga0314822_111749 | Not Available | 540 | Open in IMG/M |
| Ga0314822_112401 | Not Available | 527 | Open in IMG/M |
| Ga0314822_113475 | Not Available | 509 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0314822_100504 | Ga0314822_1005041 | F030619 | NTYKRLTLLVKSLPTEQSNTPPRKGVRAHRHRGGDSSPLQQVWTACWSGFVLSGWDSLRVAWFLHSWVVSSVPARGAGWTIGELKKLCHNVRGHALHSKRFKNVPCSIRKDVVDTLCRLAVREPENGFAFSRISRALPEPPVREAVRHLQHAAEMASAAFPTSDAALESLRSFVGVSRFHEHDTKEPRAPRRLPSSSSSCFQWPATRGGIDGYLEHLGHEAEARGATQAEFHQFAGDSLGAFCLRKARVVLRPCQGVAEDLREAYRCAGVLVLREERKLFSMKAEALRAPGYKVRVVGVPDCLTYVEGSWVRESKRLLPYGHWRVDPESREVPGDLQHRHGATFRSLDLSKATDGLSHRAVEVVIEALASRGAIRQCDLPMAKRSLGLGGKATWSFPDPIGEVPFLRGSPMGTPLSFVVLSWVSCWAVARFERSLVHGDDAVGRHRIGSDALDVYADRVSSVGAQLNKGKTFRADHSWTACEVLALPRSWNQERMTLFVAPSIPPPMLKAPVEADPRLENLWLRRMERIMKSRFPWVKCDPRLHLPVEVGGLGYTGRGLAVGRALRSRLGALVSRGPDPVVGAALIGKKPFRETGLFPRPLVRIPRPAAHWRAVKNVDELLAGGEGEESVPLESFETFKCELVESEIRLIEGDKFKRKRVSGRPDRTKGQAVFRRLTVKPARPLSRFHGLASLKRWALSCRQVRVTVDQDVASEIR |
| Ga0314822_100692 | Ga0314822_1006922 | F059087 | MGVHISFTHPPSAIRSPMGAVAFAIGLKLLVPTPGKNGSWAVKDTIE |
| Ga0314822_102643 | Ga0314822_1026432 | F105419 | MRKFIALSALLVAAAASNGCISTTMYGCEITETLAPGASEGEVVMKHGAPDNIVYLGGQYFNPQTGERGEVDKYLYEYRIGGGTTLLGQVFASDEFHNIAYLIEGGRVMGGGYVGEGKGSIILGMGGILSTPLGVIDLRMGGFLHPKARAGYGGDGNP |
| Ga0314822_103630 | Ga0314822_1036301 | F030637 | VTSFPLRRFEDSMRCPVLCRQNAGTSHEVGCSPEFYERRPCGPIGLSAATSTLRFLARPGGSTSCVVPRSLAKTGSSSPELGLLFRVRTASNLPHARMRRAPSLGFRSQSRHQLRRSTCERGSQPRPMFRPRCFAHPRRFALSTALWACFIPLPRPGFTFQGFVPAAWPARLVDESFPHVVGRRHLPSSFLGGSSSGSLAFRALIRAAIRSNRRGV |
| Ga0314822_104510 | Ga0314822_1045101 | F001633 | WAFVTRSFPAAHAWTRSLFAGGVLPDATLRGMRMFRSHGGTVLTVAGRELSSEASAPGSDAPCRERRAGRGVDTPATFIFRVGTFYRGVGISLWLIVGPALRV |
| Ga0314822_104619 | Ga0314822_1046191 | F000344 | MRPKHLHAAESGVGKHIARESERVQACAAGKERVTNAH |
| Ga0314822_104799 | Ga0314822_1047991 | F002242 | ECSVRCPKDMCEKDSCPKCETVCAPAQCHTTCVAPKANCAPLCEETSCSWSCAKPTTCPRPKCELQCSKPACDVKDKQKCCKCGAKGAKRAIAAAPRFEEVHSDAEMMPSFMEVVASFKHAAENGVEECCPCKAK |
| Ga0314822_104905 | Ga0314822_1049051 | F000203 | VRHSLFPVPALGANLAAGFPTLFSTASGVSGLVAGPSSVLQRLDFE |
| Ga0314822_105561 | Ga0314822_1055611 | F016001 | VFSRRLDPIPSWACLLQVFALDAVGTPSRSLTLMILMATLSSHCRHRPSAFRHRAWLASLEAAYLLEVSHLPAMLSCPNISDEVRRSAST |
| Ga0314822_105591 | Ga0314822_1055911 | F053640 | EQSKRGGLKYSAESTTGIIPGDRGKAGGNWCRPPLMPAKAVMRHISPVPLAGVVSGQSTHELGTEPQAAIRNRVEWSQATQGVSTCASTQLPQRLRLLLRRPERHRVSRRDDPAKRPHSPHEWGAQGTYGGGERTDLGKVREPPHRGGVKHTSPSCKRQRSLRGKRSDP |
| Ga0314822_105591 | Ga0314822_1055912 | F001633 | VLPDATLRGMRMFRSHGGTVLTVAGRDLSSEASAPGSDAPCRERRAGRGADTPATFIFARRHPYHGDGTGLWPVVGPALRV |
| Ga0314822_106752 | Ga0314822_1067521 | F013934 | DKANIERIQLLLRRHEQVVEEKRTHPEEFLRQNEQFQEAFGRIARATIVPVLEEVKDILVGKVESASIFHRRTAGGLRVKLDRWEDFERSLLFFGDASTQSVRVTHEGVGFSLLSQKLSLSQVTAELVEEEAMKFLKRLFGQEQLRRPMHVPDPADRRRPMSSSPASPVRLDYELVRV |
| Ga0314822_106968 | Ga0314822_1069681 | F005048 | VGLKFEHTTPIAEDCLQDVVVETFARAKADDGLIALVVEPRHVERQALVRSLRELGRRAIGVATALDAVQLLGREGEHVDTVFIEAESSSLPSLELVEFLAHNHPHVRRVLVGEPSEIAASWVAQATGEVHALLETPCDQDAVHRVLHRLQFTPHDGALS |
| Ga0314822_108621 | Ga0314822_1086211 | F026607 | MMKRFVSTTVALALALSASVALSATYVTGPLPSEFGGGFIPPNPAILKNVQKANKEGAKLAASVEKCYSKGAANYSKGKATGVSTCLNDPAKGVLPKFNAKIAGIAAKAPGLPPCHDFASDGALIAGLVKGFNGLTYCQSPSGAFVDGTASF |
| Ga0314822_108673 | Ga0314822_1086732 | F099956 | MTDRERHPGPASTFHLASGVKLEPLGDGSAVLYSKELDQSLSLNHTAALLCSYA |
| Ga0314822_110869 | Ga0314822_1108692 | F001679 | MSWRQKAMKGVEDCEKLGGTVKQVMIPGFPNQRILNS |
| Ga0314822_110956 | Ga0314822_1109561 | F015442 | LRASERPLREIGSGQVVRAGGLGSVNPLSLKGGSGESTRNRSGNLFAKLGAGHAGGER |
| Ga0314822_111150 | Ga0314822_1111501 | F011252 | VSEYTFIEPNAAFPLPATFNFDEYIETERELWALFPETDGRRVNFVQIGLTAMIYAEVPDTGAELGFDETYILMPCEDFAMRPIRMRACRRLTLSEYRMGHLTAIKLVGKVNEAGEVMREIGCQILGDEVLSEAQAR |
| Ga0314822_111749 | Ga0314822_1117491 | F008500 | NNFRAVTSVSCSNCHAAGFIPVIDEVRDIALANAREIGLNRDEVEQLESIYVTPQEFARQVTDDSQNFYQRALQTASLPVQGGDPISATFLRFDQDVRIEDAAGDLGLTPDELADTLNLLNPVLSVLDRGVLDRDDFTDVYVDSLCILSTNLENQPDVAVCDAAAALLEQ |
| Ga0314822_112401 | Ga0314822_1124011 | F005048 | DFVVETFASAKADDGLIALVVEPRHVERQGLVRTLRELGRRAIGVATALDAVQLLGREGEHVDTVFIEAESSSLPSLELVEFLAHNHPHVRRVLVGEPSEIAASWVAQATGEVHALLETPCDQEAVHRVLHRLQFTPHDGALS |
| Ga0314822_113475 | Ga0314822_1134751 | F048822 | VLAAAVAGIVSGTAVAQGQWREHGYANQGFGIAWPAEPNIQEVEKFEAAPGKMVPATIYSLDYNKSLLKVTVIEGRDANLNEDAVIRHQVAKVVQGGRVTFDFPHRIYRIYGRQMSVARPNDSVTQAVFFFANERLYLVESTRMRGGEDIDLIKFQLSLTFDRNVRNRT |
| ⦗Top⦘ |