NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105807

Metagenome / Metatranscriptome Family F105807

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105807
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 47 residues
Representative Sequence ARAAGITQITALASSDNPAALALLRRTANVTDVRFEGPELSIRAAIA
Number of Associated Samples 93
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.00 %
% of genes from short scaffolds (< 2000 bps) 1.00 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction Yes
3D model pTM-score0.70

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(12.000 % of family members)
Environment Ontology (ENVO) Unclassified
(30.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(47.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 13.33%    β-sheet: 29.33%    Coil/Unstructured: 57.33%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.70
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF00202Aminotran_3 69.00
PF00027cNMP_binding 3.00
PF00682HMGL-like 2.00
PF05402PqqD 1.00
PF06224HTH_42 1.00
PF02417Chromate_transp 1.00
PF09836DUF2063 1.00
PF13378MR_MLE_C 1.00
PF00211Guanylate_cyc 1.00
PF02423OCD_Mu_crystall 1.00
PF00890FAD_binding_2 1.00
PF03992ABM 1.00
PF13439Glyco_transf_4 1.00
PF13411MerR_1 1.00
PF00528BPD_transp_1 1.00
PF00753Lactamase_B 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG2059Chromate transport protein ChrAInorganic ion transport and metabolism [P] 1.00
COG2114Adenylate cyclase, class 3Signal transduction mechanisms [T] 1.00
COG2423Ornithine cyclodeaminase/archaeal alanine dehydrogenase, mu-crystallin familyAmino acid transport and metabolism [E] 1.00
COG3214DNA glycosylase YcaQ, repair of DNA interstrand crosslinksReplication, recombination and repair [L] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.00 %
All OrganismsrootAll Organisms1.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300032828|Ga0335080_10249267All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1940Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil12.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil10.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil7.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere7.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil5.00%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost4.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.00%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere4.00%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil3.00%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland3.00%
RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere3.00%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds2.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere2.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil2.00%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil2.00%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere2.00%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere2.00%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment1.00%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands1.00%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil1.00%
Glacier Forefield SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil1.00%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.00%
PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland1.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.00%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Clay → Unclassified → Soil1.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.00%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere1.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere1.00%
SoilEngineered → Lab Enrichment → Unclassified → Unclassified → Unclassified → Soil1.00%
Boreal Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil1.00%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2228664022Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005339Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaGHost-AssociatedOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005538Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1EnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006354Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010039Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot56EnvironmentalOpen in IMG/M
3300010229Soil microbial communities from Bangor area, North Wales, UK, treated with sorgoleone, replicate 1EngineeredOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010375Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaGHost-AssociatedOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010877Boreal forest soil eukaryotic communities from Alaska, USA - W3-2 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300012010Permafrost microbial communities from Nunavut, Canada - A7_35cm_12MEnvironmentalOpen in IMG/M
3300012011Permafrost microbial communities from Nunavut, Canada - A30_65cm_6MEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012951Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MGEnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012984Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MGEnvironmentalOpen in IMG/M
3300013102Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C4-5 metaGHost-AssociatedOpen in IMG/M
3300013104Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C3-5 metaGHost-AssociatedOpen in IMG/M
3300014058Permafrost microbial communities from Nunavut, Canada - A3_65cm_0.25MEnvironmentalOpen in IMG/M
3300015197Arctic soil microbial communities from a glacier forefield, Russell Glacier, Kangerlussuaq, Greenland (Sample G6B, Proglacial plain, adjacent to northern proglacial tributary)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017944Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_10_20_MGEnvironmentalOpen in IMG/M
3300017966Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MGEnvironmentalOpen in IMG/M
3300018089Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300019888Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c2EnvironmentalOpen in IMG/M
3300020078Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-5 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300020082Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-4 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024254Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK02EnvironmentalOpen in IMG/M
3300024286Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK28EnvironmentalOpen in IMG/M
3300025893Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025920Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025921Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025944Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028654Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-12-22 metaGHost-AssociatedOpen in IMG/M
3300028802Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 17_SEnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031238Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-19-26 metaGHost-AssociatedOpen in IMG/M
3300031672Soil microbial communities from Risofladan, Vaasa, Finland - OX-2EnvironmentalOpen in IMG/M
3300031711Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-1-26 metaGHost-AssociatedOpen in IMG/M
3300032008Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f18EnvironmentalOpen in IMG/M
3300032052Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f19EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032828Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4EnvironmentalOpen in IMG/M
3300032893Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1EnvironmentalOpen in IMG/M
3300033158Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.1EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300033805Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_50_10EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
INPgaii200_113677722228664022SoilGITEITALASSDNRAAVALIQRIASVRDVTLEGPELSIRAAIA
Ga0063455_10149329523300004153SoilDARLAGITEITALTSPDNPAAIALVRRIAKVVDVCFEGPELSIRAAIA*
Ga0062595_10105960823300004479SoilCDARAAGIAEITALASSDNPAALALLRRTAKVLDVSLEGPELSIRAAIA*
Ga0062591_10164247333300004643SoilGITEITALVSSDNPSAVAVLRRIAGALDVRFEGRELSIRAAIA*
Ga0066388_10788387723300005332Tropical Forest SoilITEITAQASGENRAALALIRRSARVLDIRLEGPEQTIRAAIA*
Ga0070660_10081842923300005339Corn RhizosphereAGITEITALAASDNRAAVALLRRISHVLDVRLEGPELSIRAAIA*
Ga0070709_1072053013300005434Corn, Switchgrass And Miscanthus RhizosphereALASELLVDARAAGITQITALASSDNPAALALLRRTTNVTDIRLEGPELSIRAAIA*
Ga0070681_1062495023300005458Corn RhizosphereRAAGITQVTALASSDNPAALALLRRTTNVTDIRLEGPELSIRAAIA*
Ga0070731_1079108323300005538Surface SoilGITEITAQAATDNRAALALIRRTACITDIRLDGPEQSIRAAIA*
Ga0070704_10075166523300005549Corn, Switchgrass And Miscanthus RhizosphereGELVAEARAEGVKEITALVSADNPAAVSLLRRIANVLSVRFEGSDLSIRAAIA*
Ga0066707_1090705623300005556SoilARAAGITQITALASSDNPAALALLRRTANVTDVRFEGPELSIRAAIA*
Ga0066670_1085397913300005560SoilMLLDDARAAGITEISALASGDNRAALALIRRSASVLEVRLEGPEQAIRAAIA*
Ga0066699_1103817923300005561SoilMLLDDARAAGITEISALASGDNRAALALIRRSACILEVRLEGPEQSIRAAIA*
Ga0066693_1010134113300005566SoilLAAELLADARAAGITQITALAASDNPAALALLRRTANVTDVRFEGPELSIRAAIA*
Ga0066703_1025496413300005568SoilTALAASDNPAALALLLRTANVTDVRFEGAELSIRAAIA*
Ga0066705_1097655513300005569SoilASGDNRAALALIRRSACILEVRLEGPEQAIRAAIA*
Ga0066708_1048852623300005576SoilMLLDDARAAGITEISALASGDNRAALALIRRSACILEVRLEGPEQAIRAAIA*
Ga0066903_10156282523300005764Tropical Forest SoilTALVSADNPVAVALLRRIASVLDIRHSGPELSIRAAIA*
Ga0070717_1073024623300006028Corn, Switchgrass And Miscanthus RhizosphereHELIADARAAGVTEITALVSGDNRAAVALLRRVAGALEIRFEGPALSIRARLPEAA*
Ga0075018_1044504523300006172WatershedsTEITALASSDNPATLALLRRTSDVVDVHFEGPELSIRAAIA*
Ga0075021_1023591213300006354WatershedsVDARAAGVTEITALVSSDNPAAVAVLRRIVNALAVSFEGPELWIRAAIA*
Ga0079222_1010072613300006755Agricultural SoilVTALASSDNPAALRLLRRTANVLDVHLEGPELSIRAAIA*
Ga0066659_1021787623300006797SoilALASSDNPAALALLRRTANVVDVRLEGPELSIRAAIA*
Ga0075436_10155564513300006914Populus RhizosphereITEISALASGDNRAALALIRRSACVLEVRLEGPEQAIRAAIA*
Ga0079219_1126809313300006954Agricultural SoilRMLLDDARAAGITEITAQASGENRAALALIRRSARVLEISLDGPEQSIRAAIA*
Ga0066710_10468748513300009012Grasslands SoilITALTGSDNRATLALLRGIARVLDVRLEGTETSIRAAIA
Ga0099829_1041977913300009038Vadose Zone SoilVASDNPAALALLRRIVNVFDISLDGPEVSIRAAIA*
Ga0099828_1107438613300009089Vadose Zone SoilELLADARAAGIAEITALAGSDNPAALALIRRTANVIDVHFEGPELSIRAAIA*
Ga0105242_1039504433300009176Miscanthus RhizosphereRAAGITEITALVASDNPAALALLRRILTVLDVRFDGPELWIRAALR*
Ga0126374_1170150413300009792Tropical Forest SoilITALVSADNPVAVALLRRIASVLDIRHSGPELSIRAAIA*
Ga0126309_1130573223300010039Serpentine SoilAGELVADARAGGITEITALVSSDNPPAVRVLRRIAGALDIRSEGPDLSIRVSLA*
Ga0136218_100721243300010229SoilXALVSSDNPAAVALLRRVLSRLDIRHEGPELTIRARAEPRF*
Ga0134070_1042524613300010301Grasslands SoilALASSDNPAALALLRRTAKVVDVRLEGPELSIRAAIA*
Ga0134067_1039477813300010321Grasslands SoilQHRGIGSALTGMLLDDARAAVITEISALASGDNRAALALIRRSAFILEVRLEGPEQAIRAAIA*
Ga0134067_1040496413300010321Grasslands SoilTALASSDNPAALALLRRTANVVDVRFEGPELSIRAAIA*
Ga0126376_1143603413300010359Tropical Forest SoilLADARAAGIRQVTALASSDNRAALALLRRTARVLDIRFEGAELSVRAAIA*
Ga0134125_1158290823300010371Terrestrial SoilLVAGDNSAAVTLLRRVARRLQITYEGTELSIRAAI*
Ga0105239_1347922013300010375Corn RhizosphereTALASSDNPAAVALLRRTAKVLDVRLEGPELSIRAAIA*
Ga0134126_1072624413300010396Terrestrial SoilTALASSDNPAALALLRRTTNVTDIRLEGPELSIRAAIA*
Ga0126356_1119028023300010877Boreal Forest SoilLASGDNPAALALIRRTACILEVRLEGPEQAIRAAIA*
Ga0137392_1158023423300011269Vadose Zone SoilASNDNPAVLALLRRSADVVDVRFEGPELSIRAAIA*
Ga0120118_111948013300012010PermafrostELIADARAAGITEITALVSSSNPAAVALLRRIANVLDIRLEGPELSIRAAIA*
Ga0120118_115854023300012010PermafrostRAAGITEITALASSDNPATLALLRRTTDVVDVHFEGPELSIRAAIA*
Ga0120152_107457313300012011PermafrostITEITALASSDNPAVLALLRRSADLLDVRFEGPELSIRAAIA*
Ga0137363_1107775923300012202Vadose Zone SoilAAGITQITALASSDNPAALALLRSTANVTDVRFEGPELSIRAAIA*
Ga0137380_1073003623300012206Vadose Zone SoilELVADARAAGIAEISALVRTDNPNALAVVKRIACVTDVHYEGPELSIRAAIA*
Ga0137376_1119837513300012208Vadose Zone SoilELLADARAAGITQITALASSDNRAALALLRRTANVTDVRFEGPELSIRAAIA*
Ga0137376_1139242223300012208Vadose Zone SoilADARAAGITQITALASSDNPAALALLRRTANVTDVRFEGPELSIRAAIA*
Ga0137376_1176805423300012208Vadose Zone SoilALARELLADARAAGISGITALTGSDNRATLALLRRIARVLDVRLEGTEIWIRAAIA*
Ga0137367_1109548213300012353Vadose Zone SoilKLIADARTAGITEITALVANDNPAALALLRRTLSALDIRLEGPELWIRAALA*
Ga0137384_1144562213300012357Vadose Zone SoilAELIADARAAGITEITALASSDNPAALALLRRTAHVLDVCFEGPQLSIRAAIA*
Ga0137407_1158768013300012930Vadose Zone SoilIREITALASSDNPAAVALLRRTSKVLDIRLEGPELSIRAAIA*
Ga0153915_1177275223300012931Freshwater WetlandsAGITEITALVSSDNHAALALVRRIANVLDIRLEGPELSIRAAIA*
Ga0164300_1082048623300012951SoilAGEFGTNARAPGITEITALATSDNRAAVALLRRIAQVRDITFEGPELSIRAAIA*
Ga0164298_1046031323300012955SoilSGIRVITALASSDNPAAVALLRRTAKVLDVRLEGPELSIRAAIA*
Ga0164298_1079047013300012955SoilDARAAGITEITALATSDNRAAVALLSRIAQVRDITFEGPELSIRAAIA*
Ga0134077_1043643913300012972Grasslands SoilAGIIEITALASSDNPAAVALLRRISNVLDVQLEGPELSIRAAIA*
Ga0164309_1022976613300012984SoilITEITALASSDNPAAVALIRRIARVRDITLEGPELSIRAAIA*
Ga0157371_1118313813300013102Corn RhizosphereITEITALTSSDTPAALALVRRIAKVVDVCFEGPELSIRAAIA*
Ga0157370_1077897723300013104Corn RhizosphereVDARAAGITQVTALASSDNPAALALLRRTTNVTDIRLEGPELSIRAAIA*
Ga0120149_106080313300014058PermafrostYEHRGIGSALAGELLADERAAGITQITALASSDNPAALALLRRTANVTDVRFEGPELSIRAAIA*
Ga0167638_103731113300015197Glacier Forefield SoilSDNPAAVALLRRIANALEIRLEGPELSIRAAIAS*
Ga0134073_1038446113300015356Grasslands SoilARAAGITEISALASGDNRAALALIRRSACILEVRLEGPEQAIRAAIA*
Ga0132256_10321554413300015372Arabidopsis RhizosphereELLVDARAAGITEITALAAGDNPAALRLLRRTSNVVDVHFEGPELSIRAAIA*
Ga0132257_10190223213300015373Arabidopsis RhizosphereDARAACITEITALAASDNRAAVALLRRISKVLDVRLEGPELSIRAAIA*
Ga0187786_1049049523300017944Tropical PeatlandLDDARAAGITEITALASGDNRAALALLRRVARILEVRLEGPEQSIRAAIA
Ga0187776_1078655913300017966Tropical PeatlandARELIADARAAGVTEISALTAADNQAALALIRRCAKVRRVAFEGPELSISAAIA
Ga0187774_1061001813300018089Tropical PeatlandADARAEGVSEITALVASDNRAAVAVLRRTARLLDVRFEGPELSIRAAIV
Ga0066667_1227316223300018433Grasslands SoilAAELLADARAAGITQITALASSDNPAALALLRRTANVVDVRFEGPELSIRAAIA
Ga0193751_115811413300019888SoilAAELLGDARAAGITQITALASSDNPAALALLRRTANVTDVRFEGPELSIRAAIA
Ga0193751_118975213300019888SoilLDDARAAGITEISALASGDNRAALALIRRTACILEVRLEGPEQAIRAAIA
Ga0206352_1078008823300020078Corn, Switchgrass And Miscanthus RhizosphereLVADARLAGITEITALTTPDNPAALALVKRIARVVDVCFEGPELSIRAAIA
Ga0206353_1066574213300020082Corn, Switchgrass And Miscanthus RhizosphereSGIGAALAAELVADARMAGITEITALTSSDNPAALALVRRIAKVVDVCFEGPELSIRAAI
Ga0247661_103278123300024254SoilRAAGVTEITALVSRDNPTALALLRRIANVRDIRFDGPDLSIRAALA
Ga0247687_107830913300024286SoilLAGELVAEARAEGVKEITALVSADNPAAVSLLRRIANVLSVRFEGSDLSIRAAIA
Ga0207682_1049253213300025893Miscanthus RhizosphereVTEITALVANDNPAALALLRRILDVLDIRFEGPELSVRAAL
Ga0207699_1038900623300025906Corn, Switchgrass And Miscanthus RhizosphereALASELLVDARAAGITQITALASSDNPAALALLRRTTNVTDIRLEGPELSIRAAIA
Ga0207693_1078571423300025915Corn, Switchgrass And Miscanthus RhizosphereMTHQAPSPTLASSDNPAALALLRRTANVTDVRFEGPELSIRAAIA
Ga0207663_1022189513300025916Corn, Switchgrass And Miscanthus RhizosphereLIADARAAGVTEITALVSGDNRAAVALLRRVAGALEIRFEGPALSIRARLPEAA
Ga0207649_1033460023300025920Corn RhizosphereIGSALASELLADARAAGITEITAVASSTNSAAVALLRRTANVLDVRLEGPELSIRAAIA
Ga0207652_1045616013300025921Corn RhizosphereALTTPDNPAALALVKRIARVVDVCFEGSELSIRAAIA
Ga0207652_1102477223300025921Corn RhizosphereAAGITQITALASSDNPAALALLRRTTNVTDIRLEGPELSIRAAIA
Ga0207665_1041956913300025939Corn, Switchgrass And Miscanthus RhizosphereRAAGITEITAQASGENRAALALIRRSARVLDIRLEGPEQTIRAAIA
Ga0207661_1148313423300025944Corn RhizosphereRMAGITEITALTSSDNPAALALVRRIAKVVDVCFEGPELSIRAAIA
Ga0209474_1046793213300026550SoilARAAGITEISALASGDNRAALALIRRSACILEVRLEGPEQAIRAAIA
Ga0209488_1028256413300027903Vadose Zone SoilTRAAGITEIVALASNDNPAVLALLRRSADVVDVRFEGPELSIRAAIA
Ga0265322_1003124923300028654RhizosphereALAGELLADARAAGITVVTALAGADNRAALALMRRCATVLDLRLEGPEIAIRAAIA
Ga0307503_1005847733300028802SoilEITALVASDNPAALALLRRILNVLDVRLEGPELWIRAALR
(restricted) Ga0255310_1007781423300031197Sandy SoilELIADARASGIREITALAASDNPAAVALLRRTAKVLDIRLEGPELSIRAAIS
Ga0265332_1026426113300031238RhizosphereITALATGDNRAALALIRRVASISEIRLEGPEQSIRAAIA
Ga0307373_1063415013300031672SoilDARAAGITEITALASSDNPAALALLRRTAHVHDVHFEGPELSIRACIA
Ga0265314_1021806113300031711RhizosphereTRLLLDDARASGITEITALATGDNRAALALIRRVASISEIRLEGPEQSIRAAIA
Ga0318562_1080473613300032008SoilSALTAELIADARASGVTEITALVSTDNPAALALVRRLLSALDIRIDGPELSIRAAIA
Ga0318506_1020637923300032052SoilTEITALVSSDNPAALAVIRPLLRAIDIRFEGPELSIRAAIA
Ga0306920_10153505413300032261SoilLIADARAAGITEITALASSGNSAAVKLLRRTANVLDVRFEGSDLSIRAAIA
Ga0335080_1024926733300032828SoilHRGIGSALTRILLDDARVAGIKEITALATGDNRAAVALIRRIADVLEIRLEGPEQSIRAAIA
Ga0335069_1000375013300032893SoilAAGITEITALVSSNNPAAVALIRRIAGVLAISFEGRELAIRAAIT
Ga0335077_1093155233300033158SoilTALVSSDNPAAVAVLRRIANALDISLEGPELWIRAAIA
Ga0334722_1038403413300033233SedimentGIGSTLAAELIADARASGIREITALASSDNPAAVALLRRTARVLDIRLEGPELSIRAAIA
Ga0314864_0154955_2_1603300033805PeatlandELLADARAAGITEITALVTSGNAAALALLRRVARVLEVTFEGPQLAVRAAIS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.