NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300029655

3300029655: Metatranscriptome of soil microbial communities from Anza Borrego desert, Southern California, United States - S1+v_5-13C (Metagenome Metatranscriptome)



Overview

Basic Information
IMG/M Taxon OID: 3300029655
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0128792 | Gp0224285 | Ga0206091
Sample Name: Metatranscriptome of soil microbial communities from Anza Borrego desert, Southern California, United States - S1+v_5-13C (Metagenome Metatranscriptome)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 32681261
Sequencing Scaffolds: 16
Novel Protein Genes: 17
Associated Families: 15

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 10
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 1
All Organisms → cellular organisms → Eukaryota | 1
All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea → Flabellinia → Vannellidae → Vannella → Vannella robusta | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria | 1
All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Systems Level Insights Into Methane Cycling In Arid And Semi-Arid Ecosystems
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Desert → Soil → Systems Level Insights Into Methane Cycling In Arid And Semi-Arid Ecosystems

Alternative Ecosystem Assignments
Environment Ontology (ENVO): desert biome | desert | soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: USA: California
Coordinates: Lat. (°): 33.3049 | Long. (°): -116.2547 | Alt. (m): N/A | Depth (m): N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000344 | Metagenome / Metatranscriptome | 1257 | Y
F000400 | Metagenome / Metatranscriptome | 1181 | Y
F004178 | Metagenome / Metatranscriptome | 449 | Y
F004947 | Metagenome / Metatranscriptome | 417 | Y
F011252 | Metagenome / Metatranscriptome | 293 | Y
F015326 | Metagenome / Metatranscriptome | 255 | Y
F018507 | Metagenome / Metatranscriptome | 234 | Y
F018997 | Metagenome / Metatranscriptome | 232 | Y
F021903 | Metagenome / Metatranscriptome | 216 | Y
F033285 | Metagenome / Metatranscriptome | 177 | Y
F034481 | Metagenome / Metatranscriptome | 174 | Y
F052341 | Metagenome / Metatranscriptome | 142 | Y
F063225 | Metagenome / Metatranscriptome | 129 | Y
F073141 | Metagenome / Metatranscriptome | 120 | Y
F102215 | Metagenome / Metatranscriptome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0206091_102736 | Not Available | 1376
Ga0206091_104053 | All Organisms → cellular organisms → Bacteria | 1126
Ga0206091_105915 | Not Available | 927
Ga0206091_106835 | Not Available | 861
Ga0206091_107615 | Not Available | 814
Ga0206091_107728 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 808
Ga0206091_107743 | Not Available | 807
Ga0206091_109330 | Not Available | 732
Ga0206091_109655 | Not Available | 720
Ga0206091_112590 | Not Available | 631
Ga0206091_113174 | Not Available | 616
Ga0206091_113207 | Not Available | 616
Ga0206091_113749 | All Organisms → cellular organisms → Eukaryota | 604
Ga0206091_117355 | All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea → Flabellinia → Vannellidae → Vannella → Vannella robusta | 537
Ga0206091_118140 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 525
Ga0206091_118766 | All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea | 516

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0206091_102736 | Ga0206091_1027361 | F063225 | SSATTTITVCLSPAPFCGDGVCAGSETALPTAQARCFDCGRLRGVISLAVGQKDQLVNITVSVWADPVNPALYLRNGTAVPAPHFTTISDSKGFFSINNLSFDAGLSTRAAGQAKRRYYFSLSGSFTDRTTGTDLQTELLTLWWNYELTNTQWSGTGPLTNTNNSPSFYFYMTPPFVTADALIRIILSWGNLISNPANSLPDLDLVVAGPVDASSISTFGTGIVNFQNKDLHSTFKVLPYAKLVTDSAQGYGPEVMDFYGQPGALALGFSSSYAPGAAANAYEVWVDRPNSSPNQDSSFTFLFDTNSFIVVYQNDGTTGGNKQVLFDARTNVPYAYGFYNTDPWNKVPAEATLWHIIDLSQSAGGVVFNGFPGENSTDTTATPGAKYGYDGSFFETTKSIPCGHTASRGASPAYCPATLTYPQSKKK
Ga0206091_104053 | Ga0206091_1040532 | F000400 | MRDPEIIENELMEMSAILDDTIKFERIVAWCATHPDEVPFALHQLLHRRPKDQSQSPSA
Ga0206091_104684 | Ga0206091_1046842 | F018997 | MKNKKSKSLELTLQETKLMEQLRKQPELMERFQTILEISSSADGPIKTADEIEGLLIEEMRRLGNTTMGTWAAGAEERLVKQFKEEHPSARARKKKR
Ga0206091_105915 | Ga0206091_1059151 | F034481 | KLLSILSVVCFLVVLAAANHGENEAAQQVVNPADWHGTWTSLNRYGGQTYTCPVGNTLYGVYSNAGFFVGTITDRVVEGVWFEGGRGDRNYYQGSFRITISDDNQEFDGVFNRLTTGNEIRWHESRLGAPYPSNPTLEQCFAPDNTAENVLGSYYRSTETGALAGSTFICKDYWEQIYGSFHSPEGYLAGWSVDDATGFHGYRYDSTGESGAYILRALSRDRVKGFYWRGRLAIQNYPTSKYEAFDRSSFTATLNQCEQVGPGFVERLHGPDYVPYYLVNSSSIVSFSYLTFLIIALTAILF
Ga0206091_106835 | Ga0206091_1068351 | F073141 | MATISTNDQKEDNKSESVNISDYLSRLYDITQITDEELNTWNEAYSYKGFDRKKVIMELAKKVPDVKTSQQIILVCGLLGPQRASQVKLLNGRTIASYGIPASGQKGTNNLSCQRITAATADLCAYLLKRLNVPKRLNIACPAWLQFPSAGSITMPQDLRVMHIEFHRRFSTAISGKFNEQIYDQMTVNSYYDRRLNLFENLNPDIMAEAPSHLVPIPAPTFNPNRGDVGPTKTTDSQRFKKQ
Ga0206091_107615 | Ga0206091_1076151 | F000344 | MRPIHPHAAESGVGKHTARESESAQACAAGKERVANAHPHKL
Ga0206091_107728 | Ga0206091_1077282 | F004178 | MRPKHPHAAESGVGEHTARESEAPNVCRRERARGERP
Ga0206091_107743 | Ga0206091_1077431 | F004178 | MRPRHPHAADSGVGEHTARESEAPNVCRRERARGER
Ga0206091_109330 | Ga0206091_1093301 | F021903 | VVAGPVDQSSITTFGTGIVNFQNKDQHSSLRVLPYAKLVTDSAQGYGPEVMDFYGQPGQFALGFSSTYAPGAAANAYEIWIDRPNSSPNQDSSFTFLFDTNAFIVIYQNDNVANKQVLFDARTGVPYAYGFYNTDQWNKVPDEATLWHVIDLSQAAGGVLYNGFPGENSTDQTAVAGAKYGYDAKFFESTKSIPCGHTASRGASPAYCPATLTYPQSKKK
Ga0206091_109655 | Ga0206091_1096551 | F052341 | QLLRMQIMFGGKNARVMVVMMISILSFFLIQESAAIDCPKDNSQDRIYPGCNCFFEWNDANRVGNWTFEENWMQLNEPGWVAFVSIAGDNTIHLDEARRINEIYIGPNRWDTTRVVLDEDLTVEYDDVPVINSVRGYRQPTGQIRLVIQGKGFGFVSEDIVVTATQVIDNFPDNNVENQITQVYNCEYATLVYRDAQIECNIFTASLYPYSLSVTVAANGHLSEPALISTYLQ
Ga0206091_112590 | Ga0206091_1125901 | F021903 | VVAGPVDQSSITTFGTGIVNFQNKDQHSSLRVLPYAKLVTDSAQGYGPEVMDFYGQPGQLALGFSSTYAPGAAANAYEIWVDRPNSSPNQDSSFTFLFDTNAFIVIYQNDNVENKQVLFDARTGVPYAYGFYNTDQWNKVPDEATLWHIIDLSQAAGGVLYNGFPGENSTDQTAVAGAKFGYDAKFFESTKSIPCGHTASRGASPAYCPA
Ga0206091_113174 | Ga0206091_1131741 | F102215 | LIKRLTRPMRVAIVILFCAIVAAAFAGVTERDDHISANFWDKYQEPAAHQIKTIVRRSVDQLVNARQTAAVNFTTWCFEYQQYIDQFKWGLAATDPETNTEYFHGIVEDSSTDPAVVVGYIYGYSLDNQRLVQMAVDYNDNGYMRLFSFQFNPRNERYSDAFTVFSNADTTLTPQFLGSPEYTSLTKCA
Ga0206091_113207 | Ga0206091_1132071 | F004947 | KSIIVIIFAIAVIALAQPAPPKEPSDFSAYGFVIEWHNDFHRRFKGDIFEDFTNHRQRIDAERHASYGGVITFFRFFDKEREYEYFGQTDMCWQAHFNNTQHPVFEWLEHAHYAGNCHGRHSQGNVKGHLFREHGSESLRREICVADNSTYTPLWIEHHHGRSGRILEFLQFKSGAPDPAVFTLPGSCTKKQEEILA
Ga0206091_113749 | Ga0206091_1137491 | F033285 | TNKLSLFVLFAVFVCIATAELTRCPKIHASIENSKVSASWEGFVPFEVISFEWGVSSMKNGELEANICKESPFFHGKSDVLDWTYVRKSTQVQSQYLELKAGENYQVVLRITSRGGKQYWITSSPLSAPEKEPLEETKTVSRNERNGACPTCTQNQPCANCPIDVANRKRAYEETVAERLNKIYGPALFLKPADSVYTAV
Ga0206091_117355 | Ga0206091_1173551 | F018507 | TRMNFSAVLILFLGLVAFVYTACVSPPSDLQVCVLPSTARVPQEYANTNADRQLQSYFFFLQSIKAVPTDECSLAYIDFACSQAYPRCAGDAASGLGLPVNTCYFQCSNFVEKCRGQLIGVERPDCGTFSVSPDCTAVSVKLPQDTNGASVFSASLALFSVLALVLVL
Ga0206091_118140 | Ga0206091_1181401 | F011252 | HGGQVRSRSCRSGRNFLSQYSFIEPNAAFPLPPTFNFDEYIETEQELWALFPETDGRRVNFVQIGLTAMIYAEAPDRGAELGPDETYILMPCADYTMRPIRMRACRRLTLSEYRMGHLTAIKLVGKVNEAGDVMREIGCQILGEEVIQEAQARLADYGDDQ
Ga0206091_118766 | Ga0206091_1187661 | F015326 | NLQMEKCYCDWGEHGGDWCMCLGFFSGNMECCEPCCGKPCNTNDGVYCCLSWFPGCCFAGPKCLAASQNQECAFVNHGVPFLLLLIFIIPVIGILGYVVLWTIETAIRFNLRKQHGIGDTSKWDICDCCGIWFIVPGPCFACQEMRSVPKDYWDWYKAFNDKKFPAETQLE
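
Rows of the Sequences table can be split back into their four fields for downstream use. The following is a minimal Python sketch, not part of NMPFamsDB. It assumes that scaffold and protein IDs look like `Ga<digits>_<digits>`, that family IDs are `F` followed by six digits (as in the Associated Families table), and that fields may or may not be separated by ` | `. The names `ROW`, `parse_row` and `to_fasta` are hypothetical helpers introduced only for illustration.

```python
import re

# Assumed row layout: scaffold ID, protein ID, family ID, amino-acid sequence.
# Optional " | " separators between fields are tolerated.
ROW = re.compile(
    r"^(?P<scaffold>Ga\d+_\d+)\s*\|?\s*"
    r"(?P<protein>Ga\d+_\d+)\s*\|?\s*"
    r"(?P<family>F\d{6})\s*\|?\s*"
    r"(?P<sequence>[A-Z]+)$"
)


def parse_row(line):
    """Return a dict of the four fields, or None if the line does not match."""
    match = ROW.match(line.strip())
    return match.groupdict() if match else None


def to_fasta(record, width=60):
    """Format one parsed record as a FASTA entry wrapped at `width` residues."""
    header = (
        f">{record['protein']} "
        f"scaffold={record['scaffold']} family={record['family']}"
    )
    seq = record["sequence"]
    chunks = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([header, *chunks])


# One row taken from the table above (family F000344).
row = (
    "Ga0206091_107615 | Ga0206091_1076151 | F000344 | "
    "MRPIHPHAAESGVGKHTARESESAQACAAGKERVANAHPHKL"
)
record = parse_row(row)
print(to_fasta(record))
```

The FASTA header layout used here is an arbitrary choice for the sketch; adapt it to whatever convention the downstream tool expects.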

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"