NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026803

3300026803: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G09A1-12 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026803
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0072076 | Ga0207549
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G09A1-12 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 28610106
Sequencing Scaffolds: 23
Novel Protein Genes: 26
Associated Families: 26

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1
All Organisms → cellular organisms → Archaea | 4
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. ARR65 | 1
Not Available | 7
All Organisms → cellular organisms → Bacteria | 3
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → unclassified Xanthomonadales → Xanthomonadales bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F002896 | Metagenome / Metatranscriptome | 522 | N
F003605 | Metagenome / Metatranscriptome | 477 | Y
F005549 | Metagenome / Metatranscriptome | 397 | Y
F005950 | Metagenome / Metatranscriptome | 385 | Y
F006025 | Metagenome / Metatranscriptome | 383 | Y
F008973 | Metagenome | 325 | Y
F013143 | Metagenome / Metatranscriptome | 274 | Y
F013352 | Metagenome / Metatranscriptome | 272 | Y
F017530 | Metagenome / Metatranscriptome | 240 | Y
F020942 | Metagenome / Metatranscriptome | 221 | N
F021340 | Metagenome | 219 | Y
F025530 | Metagenome | 201 | Y
F025757 | Metagenome | 200 | N
F028610 | Metagenome / Metatranscriptome | 191 | Y
F030263 | Metagenome / Metatranscriptome | 186 | Y
F037212 | Metagenome / Metatranscriptome | 168 | Y
F040716 | Metagenome | 161 | Y
F049164 | Metagenome / Metatranscriptome | 147 | Y
F054151 | Metagenome / Metatranscriptome | 140 | N
F057709 | Metagenome | 136 | Y
F060880 | Metagenome / Metatranscriptome | 132 | N
F070698 | Metagenome / Metatranscriptome | 123 | N
F078898 | Metagenome | 116 | N
F089138 | Metagenome | 109 | N
F099660 | Metagenome / Metatranscriptome | 103 | N
F103511 | Metagenome | 101 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207549_100048 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 5133
Ga0207549_100220 | All Organisms → cellular organisms → Archaea | 2338
Ga0207549_100237 | All Organisms → cellular organisms → Archaea | 2264
Ga0207549_100244 | All Organisms → cellular organisms → Archaea | 2241
Ga0207549_100338 | All Organisms → cellular organisms → Archaea | 1855
Ga0207549_100640 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. ARR65 | 1383
Ga0207549_100670 | Not Available | 1350
Ga0207549_101179 | All Organisms → cellular organisms → Bacteria | 1070
Ga0207549_101247 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1050
Ga0207549_102096 | All Organisms → cellular organisms → Bacteria | 859
Ga0207549_102219 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 844
Ga0207549_102354 | Not Available | 827
Ga0207549_102873 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 771
Ga0207549_103981 | Not Available | 683
Ga0207549_104099 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 675
Ga0207549_104382 | Not Available | 657
Ga0207549_105031 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 624
Ga0207549_105142 | Not Available | 618
Ga0207549_105713 | All Organisms → cellular organisms → Bacteria | 594
Ga0207549_106624 | Not Available | 559
Ga0207549_106744 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 556
Ga0207549_107559 | Not Available | 533
Ga0207549_108350 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → unclassified Xanthomonadales → Xanthomonadales bacterium | 513

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207549_100048 | Ga0207549_1000481 | F003605 | IGDAAAAEGYPLVPDSGEEGKVKWGARELNRTRDFIANLKVLLPSSKAASRSAAGISSGTAEPTGGTDGDIYFKILP
Ga0207549_100220 | Ga0207549_1002202 | F002896 | MDKMNEDTILHFYRILENSLLESDISKINEEDIDAWSQSFKKVVRESREKSGKGVFVPFLMWKLGEISPVEARKYLVNRKQDECRVSYDHNNVEYILWVMALMFMSWSVTNLKRKTQNGHCQNIDHPHGDKNPRLCKEGTIFHQELYNECVKTFKDLLIHSDAQHH
Ga0207549_100237 | Ga0207549_1002374 | F070698 | MVILNISEIWNNGHNTTFRDSHLEDDNSEILSNGNTPYVIKGNGDCMIIVHADALGVSKNFGDGH
Ga0207549_100244 | Ga0207549_1002442 | F005950 | MKMTNGKNRIKVKIHRTSDYDDKYTGIRDFPDEKAMLQYGLSKVHKVIIRKYTEDDEFIARAQKTRGIKFNYEMELYD
Ga0207549_100338 | Ga0207549_1003382 | F013352 | MKYTLPGKLLTNSIDKYFSEYLAKNWHSASQFSRLPCLSIAIAGFLFSLMASSTNAQIEVTVDENVSTPISNSTLSEDAEPRPDILYSALNKDTIVGEVLNNFSYPIELVRITATVYDKNGIIVATGDKYVNDYLIKPGSRSGFDIFLDETLPSKSKYALTTSFEKSEDDKPEVLQLSVGKNSKSSNTFRVLGEVMNQGKNDANAVKVSAIFYDEKHKVMDTDYVFTNPDIISPNKKAPFEFSFYVDNPEKIKSMAFNVQSDEFSLITDNGQNNTISQQ
Ga0207549_100640 | Ga0207549_1006403 | F025530 | MSDMRISTSTTMQIGEAARDNIAAGIWFAVLAGSLFLYAQSILMTTGLMLELTAAYSTFVLCGKSARSPFVHAIPYAFALAGAVFLCLAPDFHNAIEASLVFLGVTALMHGSVVYSALKNPRETEDP
Ga0207549_100670 | Ga0207549_1006702 | F025757 | MSYSASLSLFWLEMAVLIGCVALSIGMRSRPAMWVALGIVAHCAMWLAMHDEEILIRLVASTLVYLGLLKFSPNAARVWLCAGGALAGAFLLGTMALSLLMSFPGRWSVFGLSMGATLLFLTSGLLIGFWVVYRWTDPTQLGDTQPRQEA
Ga0207549_101179 | Ga0207549_1011791 | F006025 | VELLLSTHDTGYERPDGPTIAKVLASLDGGRNVVATLGTSDSSYLQATGGVQTGFGLDLQEGSLERRFRTRDRALPLAWVTEVFHRYARGDLAWRDTVEWEQDRIMPARPSWTNSWAAYIVLLVVVAAILTHRRR
Ga0207549_101247 | Ga0207549_1012471 | F008973 | GLLLAASFASAAYGFGRMAAPERTDVISLAIFGAIGLLSQAGLLLAPWAVTRGTTPRTIAALLMGPSGVFLSIFAYEGFTRYAAGAPIWVVAWAAYVCGVFVYAAVYVALARGRLGRRPG
Ga0207549_102096 | Ga0207549_1020961 | F060880 | MHTSADFRKTQRRRAELRSQSEAAVATIWLVFYVLGIAVAVSSPIVSRALEFAAH
Ga0207549_102219 | Ga0207549_1022191 | F017530 | GYPPSEVAEQVVAGIREGRFYIVPAQPDVKGNIAIRAQDLLELRNPTLRRG
Ga0207549_102354 | Ga0207549_1023542 | F030263 | MRTSHHHSVSSAGRVSAWLVAALGLAAYVGLIYGLTVFPLQFELPVPQWATFAVPAVAYGVLVLLFVRRPTIIRWVVGTALLTGLHVALLMARGPLSVMLDPALAGRPLPWMLPPPLPELVGVFLLLVPLRDLLRARPRLA
Ga0207549_102873 | Ga0207549_1028732 | F057709 | MSEFDLKVALIIFVTKFIDPFAAVPALVAGYFCRTWWQVVIAAAAVGIFVEMILVLFE
Ga0207549_103981 | Ga0207549_1039811 | F021340 | MRHSKLQTFQAHRTIARANVARSQFWHVAIVTLFFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQEKPATFSWG
Ga0207549_104066 | Ga0207549_1040661 | F103511 | MDLSFERRAWTLVYVCLGIVEGGTAAVMVRALFVGAAPGMLVDLVLAMVSAAPAWSNLASLAF
Ga0207549_104099 | Ga0207549_1040991 | F054151 | DHFRALAEGLSLRAVSERAPAQRAELQRLAECYAELAKQQSPADHFARGVGPR
Ga0207549_104315 | Ga0207549_1043151 | F099660 | MRLRIRTVVCGTAIFSAVSFFTITFAETTPLNKAQVGDRIRKVE
Ga0207549_104382 | Ga0207549_1043822 | F078898 | MDLSKRQLLQEALAKAEAHHAYIAVSTPSGSRILLHPDFTCYETYVEGTGPDGQLLTLTYEQIASVDVE
Ga0207549_105031 | Ga0207549_1050312 | F005549 | PRGRGAAEWSVMLKFLRKLTAIPAVKYSIIAVTSFLWLVGFADQLPDIEQAAKYVGISLLMLAVAAMA
Ga0207549_105142 | Ga0207549_1051421 | F037212 | ADWRRAMTNNEIGAADISEPADAIAWRHFWLANVVVESDGRPLVRATRPALQPRKDSLSRRLQRLRHMRAILDAPRH
Ga0207549_105713 | Ga0207549_1057131 | F028610 | QHHAVDPTLPLPAEKIQWIQEQMVKAGKLKAPLDLKVVTAPEYRERALKVLGH
Ga0207549_106624 | Ga0207549_1066241 | F013143 | MMSRFLLGAIVGGVAVYVWGDEIRRLVNTKGQTARLAAADTLQSVQSAAEQLLDSAKDHLTSTMEAGQ
Ga0207549_106744 | Ga0207549_1067441 | F020942 | LGWEPIARFDYLVLDLARFAPDPEVHIRKVDALGDRAHAAWRFGEVFLHNFVPRYLVSELFQPYPRGPYMGSLTAFGPGGNAWLSLWDDRARRGLDPGSVRAVKAYDVTLKGRGGFRAFSAIAATLHEAGLDHLLMPIPHDVNARALIDPFVSDLVEFNFCVKRLNGAAPVPSGPIYFDIRH
Ga0207549_107211 | Ga0207549_1072111 | F040716 | MRHFPMAAAAILALSVLAVTGHSGLAATPKCNADLRKCNSHCNLVYESGRANRTCRNRCKDNLYVCKARPS
Ga0207549_107559 | Ga0207549_1075593 | F049164 | KDVAITWATRNELLKLLQRAPGTLHVVLYFENVGATRPVDLDRDGKAHVFRALTHWQDHPAIGKPFPDDAQSLWTALADELEPA
Ga0207549_108350 | Ga0207549_1083501 | F089138 | MLKEGMAKAEIVTVLRRYAYDINRISALMPDGGTAKDAQSRLKQLKDAIHSDYKHRNAITRSAQLTPLEQSNLASAILDVFSALQSIGVNTTPGREWRNALFGADMYIQRYLNELR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice