NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F086818


Overview

Basic Information
Family ID F086818
Family Type Metagenome / Metatranscriptome
Number of Sequences 110
Average Sequence Length 44 residues
Representative Sequence DHWGAGQSFVILGAVLIVLCAPVTLITRRAARTATNQAKPTLSD
Number of Associated Samples 77
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 95.45 %
% of genes from short scaffolds (< 2000 bps) 95.45 %
Associated GOLD sequencing projects 70
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.46

[Figure: Hidden Markov Model logo, powered by Skylign]

Most Common Taxonomy
Group Unclassified (52.727 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (52.727 % of family members)
Environment Ontology (ENVO) Unclassified (60.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (60.909 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 48.61%, β-sheet: 0.00%, Coil/Unstructured: 51.39%

Predicted 3D Structure

[Figure: predicted 3D structure viewer, powered by PDBe Molstar; per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100]
pTM-score: 0.46

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
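
The low-confidence rule stated above (pTM < 0.7) can be sketched as a simple check. This is a minimal illustration, not NMPFamsDB code: the function name `is_low_confidence` and the constant name are hypothetical, and only the 0.7 cutoff and the 0.46 score come from this page.

```python
# Cutoff taken from the page text: models with pTM < 0.7 are labeled low confidence.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a pTM-score falls strictly below the cutoff."""
    return ptm < threshold


family_ptm = 0.46  # pTM-score reported for family F086818
print(is_low_confidence(family_ptm))
```

With the reported score of 0.46 the check returns True, which matches the "Low Quality Model" label on this family.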



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 110 Family Scaffolds
PF07883 | Cupin_2 | 6.36
PF00512 | HisKA | 4.55
PF07690 | MFS_1 | 3.64
PF00216 | Bac_DNA_binding | 3.64
PF01925 | TauE | 2.73
PF03796 | DnaB_C | 2.73
PF02738 | MoCoBD_1 | 2.73
PF00072 | Response_reg | 1.82
PF05443 | ROS_MUCR | 1.82
PF11295 | DUF3096 | 1.82
PF04909 | Amidohydro_2 | 1.82
PF02133 | Transp_cyt_pur | 1.82
PF13424 | TPR_12 | 0.91
PF12697 | Abhydrolase_6 | 0.91
PF13193 | AMP-binding_C | 0.91
PF04185 | Phosphoesterase | 0.91
PF12833 | HTH_18 | 0.91
PF00496 | SBP_bac_5 | 0.91
PF01724 | DUF29 | 0.91
PF00449 | Urease_alpha | 0.91
PF01546 | Peptidase_M20 | 0.91
PF07681 | DoxX | 0.91
PF01243 | Putative_PNPOx | 0.91
PF00027 | cNMP_binding | 0.91
PF13333 | rve_2 | 0.91
PF03237 | Terminase_6N | 0.91
PF01315 | Ald_Xan_dh_C | 0.91
PF00211 | Guanylate_cyc | 0.91
PF00248 | Aldo_ket_red | 0.91
PF13560 | HTH_31 | 0.91
PF04828 | GFA | 0.91
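
The frequency column above is a percentage of the 110 family scaffolds, so each value can be turned back into an approximate scaffold count. The sketch below assumes the percentages are rounded to two decimals and computed over all 110 scaffolds; the helper name `frequency_to_count` is hypothetical.

```python
def frequency_to_count(percent: float, total_scaffolds: int = 110) -> int:
    """Convert a '% Frequency' value into the nearest whole number of scaffolds."""
    return round(percent / 100 * total_scaffolds)


# A few rows from the Pfam table above (Pfam ID, % frequency).
rows = [("PF07883", 6.36), ("PF00512", 4.55), ("PF13424", 0.91)]
for pfam_id, percent in rows:
    print(pfam_id, frequency_to_count(percent))
```

Under that assumption, 6.36 % corresponds to 7 of the 110 scaffolds and 0.91 % to a single scaffold.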

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 110 Family Scaffolds
COG0776 | Bacterial nucleoid DNA-binding protein IHF-alpha | Replication, recombination and repair [L] | 3.64
COG0305 | Replicative DNA helicase | Replication, recombination and repair [L] | 2.73
COG0730 | Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 family | Inorganic ion transport and metabolism [P] | 2.73
COG1066 | DNA repair protein RadA/Sms, contains AAA+ ATPase domain | Replication, recombination and repair [L] | 2.73
COG4957 | Predicted transcriptional regulator | Transcription [K] | 1.82
COG0804 | Urease alpha subunit | Amino acid transport and metabolism [E] | 0.91
COG2114 | Adenylate cyclase, class 3 | Signal transduction mechanisms [T] | 0.91
COG2259 | Uncharacterized membrane protein YphA, DoxX/SURF4 family | Function unknown [S] | 0.91
COG3511 | Phospholipase C | Cell wall/membrane/envelope biogenesis [M] | 0.91
COG3791 | Uncharacterized conserved protein | Function unknown [S] | 0.91
COG4270 | Uncharacterized membrane protein | Function unknown [S] | 0.91



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 52.73 %
All Organisms | root | All Organisms | 47.27 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300005332|Ga0066388_100356365 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2109
3300006804|Ga0079221_10371090 | All Organisms → cellular organisms → Bacteria | 873
3300010046|Ga0126384_10574323 | All Organisms → cellular organisms → Bacteria | 983
3300010046|Ga0126384_11268978 | Not Available | 682
3300010048|Ga0126373_10599179 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1154
3300010048|Ga0126373_11609897 | Not Available | 714
3300010048|Ga0126373_12241418 | All Organisms → cellular organisms → Bacteria | 607
3300010358|Ga0126370_10303133 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1268
3300010359|Ga0126376_12160407 | All Organisms → cellular organisms → Bacteria | 601
3300010361|Ga0126378_13446059 | Not Available | 502
3300010366|Ga0126379_10878024 | All Organisms → cellular organisms → Bacteria | 999
3300010376|Ga0126381_101063644 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1168
3300010376|Ga0126381_101347804 | Not Available | 1031
3300010376|Ga0126381_101436028 | Not Available | 997
3300010376|Ga0126381_101824123 | Not Available | 878
3300010376|Ga0126381_104198755 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 559
3300012363|Ga0137390_11799246 | Not Available | 544
3300012971|Ga0126369_10890681 | Not Available | 975
3300016270|Ga0182036_11562487 | Not Available | 555
3300016294|Ga0182041_10200217 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1589
3300016294|Ga0182041_11926387 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 549
3300016319|Ga0182033_10215752 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1536
3300016319|Ga0182033_10507465 | Not Available | 1037
3300016319|Ga0182033_11074675 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 718
3300016319|Ga0182033_12175895 | Not Available | 506
3300016341|Ga0182035_10184005 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1638
3300016357|Ga0182032_10703926 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 848
3300016357|Ga0182032_11822227 | Not Available | 532
3300016404|Ga0182037_10288444 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1314
3300016422|Ga0182039_10590832 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae | 969
3300016422|Ga0182039_11507918 | Not Available | 612
3300016445|Ga0182038_11100859 | Not Available | 706
3300017822|Ga0187802_10019683 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2309
3300017924|Ga0187820_1244647 | Not Available | 574
3300020580|Ga0210403_10190939 | Not Available | 1680
3300020582|Ga0210395_10460277 | Not Available | 957
3300020583|Ga0210401_10267342 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1569
3300020583|Ga0210401_10684970 | Not Available | 886
3300021168|Ga0210406_10271793 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1385
3300021170|Ga0210400_10174434 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1740
3300021178|Ga0210408_11451553 | Not Available | 515
3300021180|Ga0210396_10269748 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1510
3300021403|Ga0210397_11180062 | Not Available | 595
3300021405|Ga0210387_11071015 | Not Available | 704
3300021405|Ga0210387_11563380 | Not Available | 562
3300021406|Ga0210386_11463271 | Not Available | 570
3300021432|Ga0210384_10868544 | Not Available | 801
3300021432|Ga0210384_11899200 | Not Available | 501
3300021478|Ga0210402_10993733 | Not Available | 766
3300021479|Ga0210410_10086533 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2758
3300021560|Ga0126371_12101711 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 680
3300022718|Ga0242675_1051995 | Not Available | 689
3300025898|Ga0207692_10310505 | Not Available | 962
3300027173|Ga0208097_1041386 | Not Available | 535
3300028047|Ga0209526_10708391 | Not Available | 633
3300029636|Ga0222749_10519692 | Not Available | 646
3300029701|Ga0222748_1070119 | Not Available | 636
3300031545|Ga0318541_10230504 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1028
3300031545|Ga0318541_10763331 | Not Available | 540
3300031561|Ga0318528_10220508 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1016
3300031573|Ga0310915_10216388 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1342
3300031573|Ga0310915_11134777 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 542
3300031640|Ga0318555_10422447 | Not Available | 722
3300031640|Ga0318555_10488928 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 667
3300031640|Ga0318555_10558423 | Not Available | 620
3300031668|Ga0318542_10374041 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 734
3300031680|Ga0318574_10247001 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1032
3300031680|Ga0318574_10653777 | Not Available | 616
3300031681|Ga0318572_10069436 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1937
3300031681|Ga0318572_10613702 | Not Available | 648
3300031682|Ga0318560_10350603 | Not Available | 797
3300031719|Ga0306917_10368719 | Not Available | 1118
3300031736|Ga0318501_10046666 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2004
3300031736|Ga0318501_10412107 | All Organisms → cellular organisms → Bacteria | 731
3300031744|Ga0306918_10553288 | Not Available | 901
3300031765|Ga0318554_10610201 | Not Available | 614
3300031768|Ga0318509_10143476 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1314
3300031781|Ga0318547_10541016 | Not Available | 720
3300031792|Ga0318529_10246706 | All Organisms → cellular organisms → Bacteria | 831
3300031797|Ga0318550_10533441 | Not Available | 565
3300031819|Ga0318568_10937410 | Not Available | 535
3300031832|Ga0318499_10150072 | Not Available | 909
3300031833|Ga0310917_10346735 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1008
3300031833|Ga0310917_10406066 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 926
3300031833|Ga0310917_10949324 | Not Available | 578
3300031897|Ga0318520_10320214 | Not Available | 937
3300031910|Ga0306923_11218603 | Not Available | 803
3300031910|Ga0306923_12192294 | Not Available | 554
3300031910|Ga0306923_12262799 | Not Available | 543
3300031912|Ga0306921_11019741 | All Organisms → cellular organisms → Bacteria | 932
3300031941|Ga0310912_10353517 | Not Available | 1141
3300031942|Ga0310916_10321479 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Rhabditina → Rhabditomorpha → Rhabditoidea → Rhabditidae → Rhabditidae incertae sedis → Diploscapter → Diploscapter pachys | 1312
3300031942|Ga0310916_11102500 | Not Available | 659
3300031942|Ga0310916_11233071 | Not Available | 617
3300031954|Ga0306926_10509797 | All Organisms → cellular organisms → Bacteria | 1477
3300031981|Ga0318531_10012231 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3255
3300032001|Ga0306922_10481640 | All Organisms → cellular organisms → Bacteria | 1324
3300032001|Ga0306922_11401964 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 702
3300032001|Ga0306922_12023603 | Not Available | 560
3300032035|Ga0310911_10499541 | Not Available | 705
3300032052|Ga0318506_10389627 | Not Available | 618
3300032059|Ga0318533_10979173 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 620
3300032060|Ga0318505_10551847 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 542
3300032076|Ga0306924_10739904 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella → Candidatus Entotheonella gemina | 1101
3300032076|Ga0306924_11125060 | All Organisms → cellular organisms → Bacteria | 854
3300032076|Ga0306924_11417706 | Not Available | 740
3300032091|Ga0318577_10087275 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1449
3300032094|Ga0318540_10059155 | Not Available | 1738
3300032515|Ga0348332_11787880 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 688
3300033290|Ga0318519_10243634 | All Organisms → cellular organisms → Bacteria | 1038
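
Each scaffold entry above joins two parts with a `|` character. The numeric prefix appears to be the Taxon OID that is also listed under Associated Samples (an inference from matching values on this page, not a documented format). A minimal parsing sketch under that assumption follows; `ScaffoldRef` and `parse_scaffold` are hypothetical names.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # assumed to be the sample's Taxon OID
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold(identifier: str) -> ScaffoldRef:
    """Split a 'TaxonOID|ScaffoldID' string into its two parts."""
    taxon_oid, sep, scaffold_id = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold("3300005332|Ga0066388_100356365")
print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the Taxon OID as a string avoids any accidental numeric formatting when it is later matched against the sample list.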




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 52.73%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 24.55%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 14.55%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.82%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.82%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.91%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.91%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.91%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.91%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.91%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300016319 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H | Environmental
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016357 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 | Environmental
3300016404 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300017822 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2 | Environmental
3300017924 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_5 | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300022718 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O (Metagenome Metatranscriptome) (v2) | Environmental
3300025898 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes) | Environmental
3300027173 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF036 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300029701 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O (Metagenome Metatranscriptome) | Environmental
3300031545 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26 | Environmental
3300031561 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f26 | Environmental
3300031573 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111 | Environmental
3300031640 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f23 | Environmental
3300031668 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f23 | Environmental
3300031680 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22 | Environmental
3300031681 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20 | Environmental
3300031682 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f22 | Environmental
3300031719 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2) | Environmental
3300031736 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f21 | Environmental
3300031744 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2) | Environmental
3300031765 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22 | Environmental
3300031768 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22 | Environmental
3300031781 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f20 | Environmental
3300031792 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f23 | Environmental
3300031797 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f23 | Environmental
3300031819 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f21 | Environmental
3300031832 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f25 | Environmental
3300031833 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178 | Environmental
3300031897 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f16 | Environmental
3300031910 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2) | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300031941 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080 | Environmental
3300031942 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176 | Environmental
3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental
3300031981 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f25 | Environmental
3300032001 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2) | Environmental
3300032035 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF170 | Environmental
3300032052 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f19 | Environmental
3300032059 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27 | Environmental
3300032060 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f18 | Environmental
3300032076 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2) | Environmental
3300032091 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f25 | Environmental
3300032094 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f25 | Environmental
3300032515 | FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data) | Environmental
3300033290 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f15 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0066388_1003563651 | 3300005332 | Tropical Forest Soil | IADHWGAGQSFVILGIVLIVLCAPVTWITWRAARATTNQTEPSLSD*
Ga0079221_103710903 | 3300006804 | Agricultural Soil | DHWGAGRSFVILGAVLIVLCAPVTLITRRAERTATSQAEPTLSD*
Ga0126384_105743232 | 3300010046 | Tropical Forest Soil | VGSIADQWGADRSFVILAAVLIVLCAPVILIARRAARTATSQAEPTLSD*
Ga0126384_112689781 | 3300010046 | Tropical Forest Soil | ADHWGAGRSFVILGAVLIVLCAPVTLITRRAARTATSRAEPTLSD*
Ga0126373_105991793 | 3300010048 | Tropical Forest Soil | ILGAVLIVLCAPVTLITRRAAGTATSQVEPTLSD*
Ga0126373_116098973 | 3300010048 | Tropical Forest Soil | IADHWGAGQSFVILGAVLIVLCAPVTLITRRAARTATNQA*
Ga0126373_122414181 | 3300010048 | Tropical Forest Soil | FVILGAVLIVLCAPVTLITRRAARMAAQPAEPTLSD*
Ga0126370_103031331 | 3300010358 | Tropical Forest Soil | MVGGIADHWGAGQSFVILGTVLIVLCAPVTLITRRAARTATNQAKPTLSD*
Ga0126376_121604072 | 3300010359 | Tropical Forest Soil | ILGAVLIVLCAPVTLITRRAARTATNQAKPTLSD*
Ga0126378_134460591 | 3300010361 | Tropical Forest Soil | DHWGAGQSFVILGAVLIVLCAPVTLITRRAARTATNQAKPTLSD*
Ga0126379_108780241 | 3300010366 | Tropical Forest Soil | ILGAVLIVLCAPVTLITRRAARTATSQAEPTLSD*
Ga0126381_1010636442 | 3300010376 | Tropical Forest Soil | MVGGIADHWGAGQSFVILGTVLIVLCAPVTLITRRAARTTTNQAEPTLSD*
Ga0126381_1013478043 | 3300010376 | Tropical Forest Soil | PPLVGSIADHWGAGRSFVILGAVLIVLCAPVTLITRRAARTATSRAEPTLSD*
Ga0126381_1014360283 | 3300010376 | Tropical Forest Soil | ADHWGAGQSFVILGTVLIVLCAPVSLITQRAARTATNQTEPTLGA*
Ga0126381_1018241232 | 3300010376 | Tropical Forest Soil | GIADHWGAGQSFVILGAVLIVLCAPVILITRRAARMATNQAEPTLSD*
Ga0126381_1041987552 | 3300010376 | Tropical Forest Soil | GAGRSFVILGAVLIVLCAPVTLITRRAARTATSRAEPTLSD*
Ga0137390_117992461 | 3300012363 | Vadose Zone Soil | GQSFVILGTVLIVLCVPVTLITRRAGRTATNQAEPTVGD*
Ga0126369_108906812 | 3300012971 | Tropical Forest Soil | GIADHWGAGQSFVILGIVLIVLCAPVTWITWRAARATTNQTEPSLSD*
Ga0182036_115624871 | 3300016270 | Soil | AIADHWGAGQSFVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0182041_102002171 | 3300016294 | Soil | RWGAGPSFVILGTILIVLCAPVALITRRAVRTVANQPESTLSD
Ga0182041_119263873 | 3300016294 | Soil | VGGIADHWGAGRSFVILGAVLIVLCAPVTLITRRAARTAAQQAEPTLSD
Ga0182033_102157521 | 3300016319 | Soil | PPAVGAIADHWGAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0182033_105074653 | 3300016319 | Soil | FVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0182033_110746751 | 3300016319 | Soil | PAVGAIADHWGAGPGFVILGTVLILLCAPVTLITRRAARTAANQAAPTLSD
Ga0182033_121758951 | 3300016319 | Soil | SFVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0182035_101840054 | 3300016341 | Soil | GAGPSFVILGGVLIVLCAPVILITRRAAQAATNQAEPTLSD
Ga0182032_107039261 | 3300016357 | Soil | DRWGADHGFVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0182032_118222271 | 3300016357 | Soil | AIADRWGAGHSFVILGALLIVLCAPVTSITRRAARSPSGRNPDPTLAD
Ga0182037_102884441 | 3300016404 | Soil | DHWGAGPGFVILGTVLILLCAPVTLITRRAARTAANQAAPTLSD
Ga0182039_105908321 | 3300016422 | Soil | IADHWGAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0182039_115079181 | 3300016422 | Soil | SFVILGAVLIVLCAPVTLITRRAARTATSQAEPTLSD
Ga0182038_111008591 | 3300016445 | Soil | GFVILGTVLILLCAPVTLITRRAARTAANQAASTLSD
Ga0187802_100196834 | 3300017822 | Freshwater Sediment | MGAIADHWGAGQSFVILGTILIVLCAPVTLITRRAARTPSNRAPTLSD
Ga0187820_12446472 | 3300017924 | Freshwater Sediment | AGQSFVILGTILIVLCAPVTLITRRAARTPSNRAPTLSD
Ga0210403_101909393 | 3300020580 | Soil | AGRSFVILGAVLIALCAPVTLITRRAARTATNARWKIP
Ga0210395_104602771 | 3300020582 | Soil | MGVIADHWGARQSFVILGAILIVLCVPVTLIIRRAARTATQAEPALSD
Ga0210401_102673423 | 3300020583 | Soil | ARQSFVILGAILIVLCVPVTLIIRRAARTATQAEPALSD
Ga0210401_106849703 | 3300020583 | Soil | IPPLVGSIADHWGAGRSFVILGAVLIVLCAPVTLITRRAARTATSQAEPTLSD
Ga0210406_102717931 | 3300021168 | Soil | VIADHWGAGQSFVILGAILIVLCVPVTLIIRRAARTATQSEPALSD
Ga0210400_101744341 | 3300021170 | Soil | AGQSFVILGAILIVLCVPVTLIIRRAARTATQSEPALSD
Ga0210408_114515532 | 3300021178 | Soil | DHWGARQSFVILGAILIVLCVPVTLIIRRAAWTATQAEPALSD
Ga0210396_102697481 | 3300021180 | Soil | HWGAGRSFVILGAVLIVLCAPVTLITRRAARTATSQTEPTLGD
Ga0210397_111800622 | 3300021403 | Soil | PPLVGSIADHWGAGRSFVILGAVLIVLCAPVTLITRRAARTATSQAEPTLSD
Ga0210387_110710151 | 3300021405 | Soil | PPLVGSIADHWGAGRSFVILGAVLIVLCAPVALITRRAARTATSQAEPTLSD
Ga0210387_115633802 | 3300021405 | Soil | GQSFVILGAILIVLCVPVTLIIRRAARTATQAEPALSD
Ga0210386_114632711 | 3300021406 | Soil | VIADHWGAGQSFVILGAILIVLCVPVTLIIRRAARTATQAEPALSD
Ga0210384_108685441 | 3300021432 | Soil | GVIADHWGARQSFVILGAILIVLCVPVTLIIRRAARTATQAEPALSD
Ga0210384_118992002 | 3300021432 | Soil | WGAAPSFVILGAGLIVLCAPVTLIIRRAARTATSQAEPTLSD
Ga0210402_109937332 | 3300021478 | Soil | IADHWGAGRSFVILGAVLIVLCAPVALITRRAARTATSQAEPTLSD
Ga0210410_100865331 | 3300021479 | Soil | LGAGQSFVILGAILIVLCVPVTLIIRRAARTATQSEPALSD
Ga0126371_121017111 | 3300021560 | Tropical Forest Soil | QSFVILGIVLIVLCVPVTSITRRAARTTTNQAEPTLSD
Ga0242675_10519951 | 3300022718 | Soil | SFVILGAILIVLCVPVTLIIRRAARTATQAEPALSD
Ga0207692_103105052 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | AGQSFVILGTVLIVLCAPVILITRRAARTATNRAAPTLND
Ga0208097_10413861 | 3300027173 | Forest Soil | HWGAGRSFVILGAVLIVLCAPVTLITRRAARTATSQAEQTLSN
Ga0209526_107083911 | 3300028047 | Forest Soil | HGGATPSFVIIGAGLIVLCAPVTLIMRRAARTATSQAEPTLSD
Ga0222749_105196921 | 3300029636 | Soil | SIADHWGAGRSFVILGAVLIVLCAPVALITRRVARTATSQAEPTLGD
Ga0222748_10701192 | 3300029701 | Soil | DHWGAGQSFVILGAVLLVLCAPATLITRRAARTVTDQPEPTSSD
Ga0318541_102305041 | 3300031545 | Soil | HWGAGPGFVILGTVLILLCAPVTLITRRAARTAANQAAPTLSD
Ga0318541_107633311 | 3300031545 | Soil | IVGSIADHWGAGPSFVILGGVLIVLCAPVTLITRRAALTATNQAEPTLSD
Ga0318528_102205081 | 3300031561 | Soil | MGAIADHWGAGQSFVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0310915_102163881 | 3300031573 | Soil | AVGAIADHWGAGPGFVILGTVLILLCAPVTLITRRAARTAANQAAPTLSD
Ga0310915_111347772 | 3300031573 | Soil | AGPGFVILGTVLILLCAPVTLITRRAARTAANQAAPTLSD
Ga0318555_104224471 | 3300031640 | Soil | GAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0318555_104889281 | 3300031640 | Soil | VILGAVLIVLCAPVTLITRRAARTAAQQAEPTLSD
Ga0318555_105584231 | 3300031640 | Soil | HWGAGQSFVILGAVLIVLCAPVTLITRRAARTATNQAKPTLSD
Ga0318542_103740412 | 3300031668 | Soil | IADHWGAGPGFVILGTVLILLCAPVTLITRRAARTAANQAAPTLSD
Ga0318574_102470011 | 3300031680 | Soil | ADHWGAGPGFVILGTVLILLCAPVTLITRRAARTAANQAAPTLSD
Ga0318574_106537772 | 3300031680 | Soil | DHWGAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0318572_100694361 | 3300031681 | Soil | AIADHWGAGPGFAILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0318572_106137022 | 3300031681 | Soil | FVILGGGLIVLCAPVTLIARRAALTATNQAEPTLSD
Ga0318560_103506031 | 3300031682 | Soil | IADHWGAGQSFVILGAILIVLCGPVTLITRRAARMATNQAEPTLSD
Ga0306917_103687194 | 3300031719 | Soil | SIADHWGAGPSFVILGGGLIVLCAPVTLIARRAALTATNQAEPTLSD
Ga0318501_1004666613300031736SoilVILGTVLIFLCAPVALITRRAARTAANQAAPTLSD
Ga0318501_1041210713300031736SoilHWGAGPGFVILGTVLILLCAQVTLITRRAARTAANQQAPTLSD
Ga0306918_1055328813300031744SoilIPPAVGAIADHWGAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0318554_1061020123300031765SoilHWGAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0318509_1014347613300031768SoilVGSIADYWGAGPSFVILGGALIALCAPVILIARRAALTATNQAEPTLSD
Ga0318547_1054101613300031781SoilADHWGAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0318529_1024670613300031792SoilGQSFVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0318550_1053344113300031797SoilGPSFVILGTILIVLCAPVALITRRAVRTVANQPESTLSD
Ga0318568_1093741013300031819SoilIVGSIADHWGAGPSFVILGGVLIVLCAPVTLIARRAALTATNQAEPTLSD
Ga0318499_1015007233300031832SoilVGAIADHWGAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0310917_1034673533300031833SoilVVPPIVGDIADHYGAGPSFVILGGVLIVLCAPVILITRRAAQAATNQAEPTLSD
Ga0310917_1040606643300031833SoilGHSFVILGAVLIVLCAPVTLITRRAARTAAQQAEPTLSD
Ga0310917_1094932413300031833SoilPPAVGAIADHWGAGPGFVILGTVLILLCAPVALITRRAARTAANQAAPTLSD
Ga0318520_1032021413300031897SoilAIADHWGAGPGFLILGTVLILLCAPVTLITRRAARTAANQAASTLSD
Ga0306923_1121860323300031910SoilAGQSFVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0306923_1219229413300031910SoilVGSIADHWGAGPSFVILGGVLIVLCAPVTLITRRAALTATNQAEPTLSD
Ga0306923_1226279923300031910SoilPPMVGGIADHWGAGQSFVILGAVLIVLCAPVTLMTRRAARTATNQAKPTLSD
Ga0306921_1101974113300031912SoilFVILGGVLIVLCAPVALITRRAAQTATNQAEPTLSD
Ga0310912_1035351713300031941SoilADRWGAGHSFVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0310916_1032147923300031942SoilGGIADHWGAGRSFVILGAVLIVLCAPVILITRRAARTATTPTEPSSSD
Ga0310916_1110250013300031942SoilGAGQSFVILGAVLIVLCAPVTLITRRAARTATNQAKPTLSD
Ga0310916_1123307113300031942SoilPLVGGIADHWGAGQSFVILGAILIVLCGPVTLITRRAARMATNQAEPTLSD
Ga0306926_1050979733300031954SoilADHWGAGPSFVILGGGLIVLCAPVTLIARRAALTATNQAEPTLSD
Ga0318531_1001223113300031981SoilAIADYWGAGPGFVILGTVLIFLCAPVALITRRAARTAANQAAPTLSD
Ga0306922_1048164013300032001SoilIPPVVGAIADHWGAGPGFVILGTVLILLCAQVTLITRRAARTAANQQAPTLSD
Ga0306922_1140196433300032001SoilVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0306922_1202360313300032001SoilWGAGPGFVILGTVLILLCAPVALITRRAARTAANQAAPTLSD
Ga0310911_1049954123300032035SoilIADHWGAGPSFVILGGVLIVLCAPVTLIARRAALTATNQAEPTLSD
Ga0318506_1038962723300032052SoilVGAIADHWGAGPGFVILGTVLILLCAPVALITRRAARTAANQAAPTLSD
Ga0318533_1097917323300032059SoilPAVGAIADHWGAGPGFLILGTVLILLCAPVTLITRRAARTAANQAAPTLSD
Ga0318505_1055184723300032060SoilWGAGHSFVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0306924_1073990413300032076SoilQSFVILGAVLIVLCAPVTLMTRRAARTATNQAKPTLSD
Ga0306924_1112506013300032076SoilIADHWGAGPGFVILGTVLILLCAPVALITRRAARTAANQAAPTLSD
Ga0306924_1141770613300032076SoilAIADHWGAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0318577_1008727553300032091SoilAIAIPPAVGAIADHWGAGPGFVILGTVLILLCAPVTLITRRAARTPANQAAPTLSD
Ga0318540_1005915553300032094SoilGAIADRWGADHGFVILGTVLIVLCAPVSLITRRAARTATNQTEPTLGA
Ga0348332_1178788023300032515Plant LitterGMIADRWGAGQSFVILGTLMVVLCAPVTLITRRAARTAAEQNVEPTSAD
Ga0318519_1024363413300033290SoilGPGFVILGTVLILLCAPVALITRRAARTAANQAAPTLSD
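
Each row in the listing above runs its fields together without separators. The following is a minimal sketch of one way to split such a row, not part of NMPFamsDB itself. It assumes the layout is a gene identifier, a 10-digit taxon OID beginning with 3300, an ecosystem label, and an uppercase amino-acid sequence. The field names, the `ROW` pattern, and the `split_row` helper are hypothetical and inferred only from the rows shown here. The leading identifier may itself be a scaffold ID with a gene number appended, which this sketch does not try to separate.

```python
import re

# Assumed layout of one fused row (inferred from the listing, not documented):
#   gene ID | 10-digit taxon OID starting with 3300 | ecosystem label | sequence
ROW = re.compile(
    r"^(Ga\d+_\d+?)"            # identifier, e.g. Ga0182041_119263873
    r"(3300\d{6})"              # taxon OID: the last 10 digits before the label
    r"([A-Z][A-Za-z, ]*[a-z])"  # ecosystem label, ends in a lowercase letter
    r"([A-Z]+)$"                # amino-acid sequence, uppercase only
)

def split_row(line):
    """Split one fused row into its four assumed fields, or return None."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    gene_id, taxon_oid, ecosystem, sequence = match.groups()
    return {
        "gene_id": gene_id,
        "taxon_oid": taxon_oid,
        "ecosystem": ecosystem,
        "sequence": sequence,
    }

if __name__ == "__main__":
    row = ("Ga0182041_1192638733300016294Soil"
           "VGGIADHWGAGRSFVILGAVLIVLCAPVTLITRRAARTAAQQAEPTLSD")
    print(split_row(row))
```

The split relies on the sequence being entirely uppercase and the ecosystem label ending in a lowercase letter. A row that breaks either assumption returns None rather than a guessed split.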

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice