NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026812

3300026812: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A3-10 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026812
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0055861 | Ga0207518
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A3-10 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 29692333
Sequencing Scaffolds: 22
Novel Protein Genes: 25
Associated Families: 25

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 3
Not Available | 8
All Organisms → cellular organisms → Bacteria → Proteobacteria | 2
All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 3

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.2958 | Long. (°) -89.3799 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F001033 | Metagenome / Metatranscriptome | 799 | Y
F001436 | Metagenome / Metatranscriptome | 695 | Y
F003758 | Metagenome / Metatranscriptome | 470 | Y
F004886 | Metagenome / Metatranscriptome | 420 | Y
F014308 | Metagenome / Metatranscriptome | 264 | Y
F015418 | Metagenome / Metatranscriptome | 255 | Y
F015863 | Metagenome / Metatranscriptome | 251 | Y
F018801 | Metagenome / Metatranscriptome | 233 | Y
F020952 | Metagenome / Metatranscriptome | 221 | Y
F024822 | Metagenome / Metatranscriptome | 204 | N
F037759 | Metagenome / Metatranscriptome | 167 | N
F040358 | Metagenome / Metatranscriptome | 162 | N
F041232 | Metagenome | 160 | N
F050714 | Metagenome | 145 | N
F053375 | Metagenome | 141 | N
F053376 | Metagenome / Metatranscriptome | 141 | Y
F058266 | Metagenome | 135 | N
F060887 | Metagenome / Metatranscriptome | 132 | Y
F073679 | Metagenome / Metatranscriptome | 120 | Y
F082749 | Metagenome / Metatranscriptome | 113 | Y
F085279 | Metagenome / Metatranscriptome | 111 | Y
F087349 | Metagenome / Metatranscriptome | 110 | Y
F090709 | Metagenome | 108 | Y
F094081 | Metagenome / Metatranscriptome | 106 | Y
F103330 | Metagenome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207518_100041 | All Organisms → cellular organisms → Bacteria | 1729
Ga0207518_100246 | Not Available | 1184
Ga0207518_100724 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 886
Ga0207518_100819 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 855
Ga0207518_101156 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 776
Ga0207518_101207 | Not Available | 767
Ga0207518_101308 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 748
Ga0207518_101402 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 733
Ga0207518_101406 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 732
Ga0207518_101603 | Not Available | 708
Ga0207518_101822 | All Organisms → cellular organisms → Bacteria | 683
Ga0207518_101850 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 680
Ga0207518_102392 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 634
Ga0207518_102635 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 617
Ga0207518_103009 | Not Available | 597
Ga0207518_103288 | Not Available | 583
Ga0207518_103508 | All Organisms → cellular organisms → Bacteria | 574
Ga0207518_103951 | Not Available | 555
Ga0207518_104221 | Not Available | 545
Ga0207518_104469 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 537
Ga0207518_104485 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 537
Ga0207518_104650 | Not Available | 532

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207518_100041 | Ga0207518_1000411 | F018801 | MGLTILSIPFIALGVFLKPYALENSRCIGFAGLGPYCFEQASSMPEVIKYGSVAVGFALL
Ga0207518_100246 | Ga0207518_1002462 | F090709 | NRHAIDPSVTLPIDKLNWMQNELVKAGNLKAPFDLTTMTAPDIRAEAAKRATK
Ga0207518_100724 | Ga0207518_1007242 | F040358 | MRIAIFALAILAGSGAAEARQVEVVSTSPRHIEIAAWCTAGSNCQQEASDVAQGYCHDPYPRRALYVRSGFVERGFFGGRVIFVYRCNRRSINCEAGSCN
Ga0207518_100819 | Ga0207518_1008191 | F085279 | AYGLGVGQVLDFLEVPAASVVERRELPGVDPRLVKAVGLESNRHFILLDVDAVFAPIIGS
Ga0207518_101156 | Ga0207518_1011561 | F053375 | MHVTPLSAATILAFASPLHAQGIEVFGGYSVNADYVQNRPAILVADQKVSPFFSHGSGPTGFEASFKHDVRNGLGIKVDVS
Ga0207518_101207 | Ga0207518_1012072 | F053376 | MLTLRTLLTSGGEPYQYEVIGLPNGSGAMLARLSGVWKVLKIERGKGHAWNGSFASAEDALGSLDGLVKGALLVPVRS
Ga0207518_101308 | Ga0207518_1013082 | F024822 | VRQLFIRVAGVILCALALSGCVDSAGPLLSEAQPVLGERLRLQFYSLSKGTADEPEQATYKWDRGTYQRTGGGMTDIGSFSVHPLARDIFVVQSASAKRPGMFEYAIARRLVDGVYQVIAVDEADAGQVTRARFCKRASDSSCRIEKRNQL
Ga0207518_101402 | Ga0207518_1014022 | F014308 | LIFPPELEVAFMSGSTIGIQLIMPKTGTSPKATPEQQASEREAATQPVVKAPPPPGMGKIVDKVA
Ga0207518_101406 | Ga0207518_1014062 | F020952 | MPIIEKTIRIAAIALAFLFIGVSLLGIFGAWFVDGKATDVALKGFGLI
Ga0207518_101603 | Ga0207518_1016031 | F015418 | MSIESRRTFTKGLLASALVPGTSAFGQPNDPASIAIIDTPQNAAKVAAKLAAQNVKVVVRFFAR
Ga0207518_101822 | Ga0207518_1018221 | F050714 | MMRVLKPLQDKATTGAGKRLVLPEPRRVRFLIRGEGSI
Ga0207518_101850 | Ga0207518_1018501 | F037759 | LLSGAARAQSPELGAPSIGILPPSDILASVSYLGLDPSGEPVRRGAYYMLHAFDRAGIELLVVVDAQFGDVLFMAPALNTSLTPPYTRAARIIQVAPESGDQQKR
Ga0207518_102346 | Ga0207518_1023462 | F087349 | LEYGMSWSETMKGNATVASLEHVANDESVFRPTQATSWDPFEVWLTRIKQPRDRAAKSALADTSSGKGRV
Ga0207518_102392 | Ga0207518_1023921 | F073679 | MASEGREYFLARAAEERDAATSAPNPSLARVHDDLAAMYDRMAREAE
Ga0207518_102635 | Ga0207518_1026352 | F103330 | MESDEVFHLTAKISSNSNVADKHPILVNFKSRAALLAAGARIDRANIDLQTVEGGI
Ga0207518_103009 | Ga0207518_1030092 | F058266 | MNNDRLLVVFVGLYLIVIVALLFKDAPRPVAPEVKTPVAKVETAPIAKEDKPDAARPQADDGGKPGATPNCEKELRRTADLLRFFANRIQGGEDAQSVV
Ga0207518_103288 | Ga0207518_1032881 | F015863 | MRRFIPLLILLGLVFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVEHPAGFHGLDLGTRASLDEVKRFYTEQLTAAGFEVSDLGLMGLNPMTAAYLGVDGMLSAKRHATDDAIDVQ
Ga0207518_103508 | Ga0207518_1035082 | F001436 | MNHEQDVATLIELLKMAAERWPRSEADQASQSELFHEDHSLLEMWPEACRRTGVGSREFPPGVIKLWK
Ga0207518_103951 | Ga0207518_1039512 | F003758 | VLRAFENAEDEILMNTLKGAPRLDVPSLFRPQFMAEVQKEHPEYFADLPPLK
Ga0207518_104221 | Ga0207518_1042211 | F060887 | MNAAAWICLALPLGATVAIALAGTLISRRLAGYLATASVL
Ga0207518_104469 | Ga0207518_1044691 | F004886 | RLEVLLEIEKGPRSRSWFAGKAPQDLERPAAIKRLLDTGLIEPAPAPQHYRVTADGWEFLQALRARVGGSSGLDWTRVDEIDFSKL
Ga0207518_104485 | Ga0207518_1044851 | F001033 | EKRELLMCGNSLIARLTIAVFAFQMLGVTSVVHAESPDSTAGTSTAGTRKLFINPSSTSVALGKASLIVSPLTHRDGNYVGNYQLKVRPYFFKSEKGSLLLTASDDAVRKLQAGTAINFTGKAVTHKDGRTHIVLGRATPSSGNRGGVTFSIITDDAKIVFNTSYHFGTQPGT
Ga0207518_104650 | Ga0207518_1046502 | F041232 | RLLLVAASLLLTYGIALVTFMVRWIRFWMRGEYFLPYLFAFVTTLALLLFWWTSRRPHAPLPILSTIVYSVVAGYVAGLIAMVLYPFFQSDGLQHMIEALRFPTIEAAIAFFWFPIRLLTWLFGGITGVTMLVLSRRWRRMTC
Ga0207518_105025 | Ga0207518_1050251 | F094081 | MADDHRKTLKEEFTDRLEKAKGRLQQSFPEIQQSIKTSGAVEAARKIIDPAQSIFKQFADDIQLKDLFAKAEALVANLTVAKSVSRD
Ga0207518_105215 | Ga0207518_1052151 | F082749 | MDGESVDAAGKLGRKRLINHAMTLDAGLSLKGVRHDIDPVVSLPARPVPGMALMLVRFINHFEVLRRESLGQLFCDEIGGSHIARLGERSLPVNGY




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice