NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F104837



Overview

Basic Information
Family ID F104837
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 45 residues
Representative Sequence HLRDVAVAARPSLAGQIDDVFSLEAIVTELAPVVDETLAGIAGEG
Number of Associated Samples 89
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.55


Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (16.000 % of family members)
Environment Ontology (ENVO) Unclassified (28.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (36.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 53.42%, β-sheet: 0.00%, Coil/Unstructured: 46.58%

Predicted 3D Structure

Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.55

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
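
The confidence rules quoted on this page can be expressed as a minimal Python sketch. The 0.7 pTM cutoff and the four pLDDT legend bins come from the page itself; the function names are hypothetical, and the handling of fractional pLDDT values that fall between the integer bin edges (for example 50.5) is an assumption, not something the page specifies.

```python
# Cutoff taken from the note above: pTM < 0.7 marks a low confidence model.
PTM_LOW_CONFIDENCE_CUTOFF = 0.7


def is_low_confidence(ptm: float, cutoff: float = PTM_LOW_CONFIDENCE_CUTOFF) -> bool:
    """Return True when a pTM-score falls below the low-confidence cutoff."""
    return ptm < cutoff


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four legend bins.

    Assumption: a fractional value is placed in the bin whose upper edge
    it does not exceed (e.g. 50.5 goes to "51-70").
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


if __name__ == "__main__":
    # pTM-score reported for family F104837.
    print(is_low_confidence(0.55))
```

With the pTM-score of 0.55 reported for this family, the check flags the model as low confidence, which matches the note above.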



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF01264 | Chorismate_synt | 14.00
PF00535 | Glycos_transf_2 | 7.00
PF08448 | PAS_4 | 7.00
PF05685 | Uma2 | 5.00
PF00296 | Bac_luciferase | 5.00
PF01613 | Flavin_Reduct | 5.00
PF00496 | SBP_bac_5 | 3.00
PF00254 | FKBP_C | 3.00
PF13279 | 4HBT_2 | 2.00
PF07355 | GRDB | 2.00
PF00206 | Lyase_1 | 1.00
PF01055 | Glyco_hydro_31 | 1.00
PF04075 | F420H2_quin_red | 1.00
PF00903 | Glyoxalase | 1.00
PF09084 | NMT1 | 1.00
PF01850 | PIN | 1.00
PF00067 | p450 | 1.00
PF09720 | Unstab_antitox | 1.00
PF00079 | Serpin | 1.00
PF13231 | PMT_2 | 1.00
PF01425 | Amidase | 1.00
PF00126 | HTH_1 | 1.00
PF13185 | GAF_2 | 1.00
PF02515 | CoA_transf_3 | 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds
COG0082 | Chorismate synthase | Amino acid transport and metabolism [E] | 14.00
COG1853 | FMN reductase RutF, DIM6/NTAB family | Energy production and conversion [C] | 5.00
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 5.00
COG4636 | Endonuclease, Uma2 family (restriction endonuclease fold) | General function prediction only [R] | 5.00
COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 1.00
COG0715 | ABC-type nitrate/sulfonate/bicarbonate transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 1.00
COG1501 | Alpha-glucosidase/xylosidase, GH31 family | Carbohydrate transport and metabolism [G] | 1.00
COG1804 | Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferases | Lipid transport and metabolism [I] | 1.00
COG2124 | Cytochrome P450 | Defense mechanisms [V] | 1.00
COG4521 | ABC-type taurine transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 1.00
COG4826 | Serine protease inhibitor | Posttranslational modification, protein turnover, chaperones [O] | 1.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 16.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 11.00%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 8.00%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 8.00%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.00%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 5.00%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.00%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 4.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.00%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.00%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 3.00%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 3.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.00%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 2.00%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 2.00%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.00%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.00%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.00%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.00%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 1.00%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 1.00%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 1.00%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.00%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 1.00%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.00%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.00%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.00%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.00%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.00%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005457 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG | Host-Associated
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005615 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005841 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 | Host-Associated
3300006058 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 | Host-Associated
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006853 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 | Host-Associated
3300006881 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 | Host-Associated
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009802 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_50_60 | Environmental
3300009816 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012514 | Unplanted soil (control) microbial communities from North Carolina - M.Soil.1.old.130510 | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300013100 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C6-5 metaG | Host-Associated
3300013308 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaG | Host-Associated
3300014883 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT760_16_10D | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300018074 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2 | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300019360 | White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaG | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300025899 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes) | Host-Associated
3300025918 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes) | Environmental
3300025923 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes) | Host-Associated
3300025933 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes) | Host-Associated
3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated
3300026116 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes) | Host-Associated
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental
3300027490 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 (SPAdes) | Environmental
3300027654 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio (SPAdes) | Environmental
3300027954 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes) | Environmental
3300028809 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48 | Environmental
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental
3300031573 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111 | Environmental
3300031680 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031768 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22 | Environmental
3300031770 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f17 | Environmental
3300031771 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19 | Environmental
3300031792 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f23 | Environmental
3300031797 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f23 | Environmental
3300031799 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f21 | Environmental
3300031831 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f20 | Environmental
3300031859 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f25 | Environmental
3300032000 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D3 | Environmental
3300032052 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f19 | Environmental
3300032076 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2) | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental
3300033814 | Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17 | Environmental
3300034090 | Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00N | Environmental
3300034666 | Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R2 (Metagenome Metatranscriptome) | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI1027J12803_1009486501 | 3300000955 | Soil | AVKERRHLKDVAVAARPSLAGRIDDVFSFEAVVTELAPVLDETLAGIPADP*
JGI1027J12803_1072747643 | 3300000955 | Soil | ARPSLAGEIDTVFSLETVVAELAPVLEETLSGVGGENP*
Ga0062595_1017465182 | 3300004479 | Soil | VRDRRNLKEVARAARPSLAGEIDTVFSLETVVAELAPVLEETLSGVGGENP*
Ga0066680_104700281 | 3300005174 | Soil | VKERRHLKDVAIAARPSLAGQIDDVFSLDAIVGELAPVLEETLAGIAEG*
Ga0066685_107484632 | 3300005180 | Soil | ERRHLKDVALAARPSLAGQIDAVFSLEAIVAELAPVLEETLEGVAG*
Ga0066678_105244151 | 3300005181 | Soil | KDVALAARPSLAGQIDAVFSLEAIVAELAPVLEETLEGVAG*
Ga0066676_107096032 | 3300005186 | Soil | AAVRERRHLKDVALAARPALAGQIDDVFSLEAIVTELAPVLDETLAGIPSAG*
Ga0070690_1002812192 | 3300005330 | Switchgrass Rhizosphere | VKERRPLRDVALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA*
Ga0066388_1077801761 | 3300005332 | Tropical Forest Soil | RQLRDVALAARPALAGAIDGVFSLEAVVRELAPVLDEVLAEIDTDR*
Ga0070680_1005542582 | 3300005336 | Corn Rhizosphere | LKEVALAARPSLAAHIEEVFSLDALVKELAPVLDETLTGVPGDR*
Ga0070708_1001796823 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | HLRDVALAARPELAGQLDEVFSLEAVVRELSPVLDETLAGIPGDG*
Ga0070662_1000683381 | 3300005457 | Corn Rhizosphere | QLRDVALAAKPELAGAIDNVFSLEAVVTELAPVLSEVLAEVND*
Ga0070662_1013196552 | 3300005457 | Corn Rhizosphere | VTVKEGRHLKDVAVAARPSLAGRIDGVFSFEAVVAELVPVLDETLAGLPADA*
Ga0070707_1018860731 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | RHLRDVALAARPELAGQLDEVFSLEAIVRDLTPVLDETLAGIPSEG*
Ga0066698_107336851 | 3300005558 | Soil | AARPSLAGQIDDVFSLDAIVGELAPVLEETLAGIAEG*
Ga0066708_101453312 | 3300005576 | Soil | PALAGQIDDVFSLEAIVIALAPVLDETLAGIPSDG*
Ga0070702_1008719981 | 3300005615 | Corn, Switchgrass And Miscanthus Rhizosphere | EHRHLRDIAGAARPALARQLDDVFSFEVVVAELAPVLDEVLAPVAGDAAEPP*
Ga0066905_1003195282 | 3300005713 | Tropical Forest Soil | LAVKERRQLKDVALAAKPALAGRIDDVFSLDALVAELAPVLEETLAGIPTE*
Ga0066905_1012863012 | 3300005713 | Tropical Forest Soil | RNLKEVARAARPSLAGEIDTVFSLEAIVAELAPVLEETLSGVGGENP*
Ga0068863_1014050641 | 3300005841 | Switchgrass Rhizosphere | LAARPGLAGRLDDVFSLEAIVRELTPVLDETLAGITADG*
Ga0075432_100355812 | 3300006058 | Populus Rhizosphere | SSLAVKDHRHLKGVALAARPSLAAHIEDVFSMDALVKELAPVLDETLAGVPTEA*
Ga0066665_109019781 | 3300006796 | Soil | VKERRHLKDVAIAARPSLAGQIDDVFSLDAIVGELAPVLEEALAGIADG*
Ga0066665_109498411 | 3300006796 | Soil | DVAIAARPSLAGQIDDVFSLDAIVGELAPVLEETLAGIAEG*
Ga0066665_110754481 | 3300006796 | Soil | IKERRHLRDVALAARPELAGQLDQVFALDAIVVEAAPVLEETLAGIAREG*
Ga0066660_103219811 | 3300006800 | Soil | VAVNERRHLRDVALAARPSLAARIEEVFSLEAIVTDLASVLDETLQSCA*
Ga0075421_1013247192 | 3300006845 | Populus Rhizosphere | RHLRAVALAERPSLGAEIDSVFSLEAVVAELAPVLDETLSGVGGENP*
Ga0075421_1015989822 | 3300006845 | Populus Rhizosphere | DVALAARPDLAGQLADVFSLEAIVAELAPVLDETLAGIP*
Ga0075420_1013344861 | 3300006853 | Populus Rhizosphere | RDVALAAKPELAGAIDDVFSLEAVVTELAPVLSEVLAEVND*
Ga0075434_1022944241 | 3300006871 | Populus Rhizosphere | HLRDVALAARPELAGQLDQVFAPDAIVVEAAPVLEETLAGIAGEG*
Ga0075429_1015398342 | 3300006880 | Populus Rhizosphere | LAARPALAAQIADVFSLEAIVADLAPVLDETLAEVPAER*
Ga0068865_1009346912 | 3300006881 | Miscanthus Rhizosphere | VKERRHLKDVAMAERPSLAGRIDGVFSFEAVVAELVPVLDETLAGLPADA*
Ga0099827_107074541 | 3300009090 | Vadose Zone Soil | RPELADRLDEVFSLEAIVRELTPVLDETLAAIPADG*
Ga0114129_114208932 | 3300009147 | Populus Rhizosphere | LRDVALAAKPELAGAIDDVFSLEAVVTELAPVLSEVLAEVND*
Ga0114129_120962722 | 3300009147 | Populus Rhizosphere | ARPALAAQIADVFSLEAIVADLAPVLDETLAEVPAER*
Ga0105241_125007242 | 3300009174 | Corn Rhizosphere | EGRHLKDVAVAARPSLAGRIDGVFSFEAVVAELVPVLDETLAGLPADA*
Ga0105073_10467551 | 3300009802 | Groundwater Sand | HLRDVAVAARPSLAGQIDDVFSLEAIVTELAPVVDETLAGIAGEG*
Ga0105076_10634721 | 3300009816 | Groundwater Sand | AVREHRHLRDVAVAARPSLAGQIDDVFSLEAIVTELAPVVDETLAGIAGEG*
Ga0126382_106046741 | 3300010047 | Tropical Forest Soil | DWSAAAVKERRHLRDIAVAARPALAGEIDDVFSLEAIVTELAAVLEETLTGIAEA*
Ga0126372_104081061 | 3300010360 | Tropical Forest Soil | TEAIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE*
Ga0126377_101275735 | 3300010362 | Tropical Forest Soil | VKELRHLRDVAVAAQPALAGRIDGVFSLEAVVTELAPVLDETLTEVPLSGDQP*
Ga0126379_101124821 | 3300010366 | Tropical Forest Soil | ARPSLGAEIDSVFSLEAVVAELAPVLEETLSGVGGENP*
Ga0134121_111975403 | 3300010401 | Terrestrial Soil | AARPSLAGRIDDVFSFDAVVTELAPVLDETLAGIPADP*
Ga0137389_104406091 | 3300012096 | Vadose Zone Soil | KDVAIAARPSLAGQIDDVFSLEAIVTELAPVLDETLAAIPGA*
Ga0137383_103223133 | 3300012199 | Vadose Zone Soil | AARPSLAGQIDDVFSLDAVVAELVPVLDETLEGVPG*
Ga0137399_100204816 | 3300012203 | Vadose Zone Soil | LAGGIDTVFSLEAIVAELASLLDETLSGVGGENP*
Ga0157330_10763602 | 3300012514 | Soil | LKEVARAARPALAGEIDTVFSLETVVAELAPVLEETLSGVGGENP*
Ga0137358_103048391 | 3300012582 | Vadose Zone Soil | AQPSLAGHIDDVFSLDAIATELAPILGETLAGIPSDASRL*
Ga0137394_103323982 | 3300012922 | Vadose Zone Soil | VSLAARPELAGQLDQVFAPDAIVVEAAPVLEETLAGIAGEG*
Ga0137419_104372301 | 3300012925 | Vadose Zone Soil | VKERRQLRDVALAARPSLAGQIDDVFSLDAVVAELVPVLDETLEGVPG*
Ga0137416_120413282 | 3300012927 | Vadose Zone Soil | VAVREHRHLRDIAVAARPSLAGEIDDVFSLDAIVTELAPIVDETLTGIAGEG*
Ga0157373_111199301 | 3300013100 | Corn Rhizosphere | RDVALAARPALTGAIDGVFSLESVVTELAPVLDEVLAEIA*
Ga0157375_114201851 | 3300013308 | Miscanthus Rhizosphere | VAVAARPSLAGRIDDVFSFEAVVTELAPVLDETLAGIPADP*
Ga0180086_11746251 | 3300014883 | Soil | RRNLRDVALAARPELASQLDEVFSLEAIVRELAPVLEETLARISGDG*
Ga0134089_103765382 | 3300015358 | Grasslands Soil | STVAVREHRHLRDVAIAAQPSLAGQVDVVFSLEAIVTDLAPVLDETLVGVAGEG*
Ga0182038_110140311 | 3300016445 | Soil | VAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE
Ga0184640_102385161 | 3300018074 | Groundwater Sediment | RHLRDVAIAARTSLAGQIDDVFSLEAVVTELAPVLDEVLAEVAGDA
Ga0190265_126770331 | 3300018422 | Soil | RDVALAARPSLAGQIHDVFSLEAIVGELAPVLDEVLAEVV
Ga0066667_104984251 | 3300018433 | Grasslands Soil | RHLRDVALAARPSLAARIDDVFSLEAIVTDLASVLDETLQSCA
Ga0187894_103930641 | 3300019360 | Microbial Mat On Rocks | VAVAERRHLREVALAARPELAGLLDEVFSLEAIVGELAPVLDETLAGLAGDA
Ga0193755_11398372 | 3300020004 | Soil | VGVADTAGARPSLARRIDHVFSLEAIVTELAPVLDEVLAEVAG
Ga0126371_104692241 | 3300021560 | Tropical Forest Soil | AARPSLASEIAGVFSLETIVTELAPVLEETLAGIPAG
Ga0207642_105873391 | 3300025899 | Miscanthus Rhizosphere | VALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA
Ga0207662_113807221 | 3300025918 | Switchgrass Rhizosphere | KERRPLRDVALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA
Ga0207681_100156545 | 3300025923 | Switchgrass Rhizosphere | VDDWSSVAVKERRPLRDVALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA
Ga0207706_116170582 | 3300025933 | Corn Rhizosphere | SSVAVKERRPLRDVALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA
Ga0207704_108708031 | 3300025938 | Miscanthus Rhizosphere | VKERRHLKDVAMAERPSLAGRIDGVFSFEAVVAELVPVLDETLAGLPADA
Ga0207674_112127673 | 3300026116 | Corn Rhizosphere | RDIARAARPALARQLDDVFSFEVVVAELAPVLDEVLAPVEGDTAEPP
Ga0209803_13362051 | 3300026332 | Soil | AARPSLAGQIDAVFSLEAIVAELAPVLEETLEGVAG
Ga0209160_11432841 | 3300026532 | Soil | VNERRHLRDVALAARPSLASRIDDVFSLEAIVTDLASVLDETLQSCA
Ga0209376_11976381 | 3300026540 | Soil | ERRHLKDVAIAARPSLAGQIDDVFSLDGIVGELAPVLEETLAGIAEG
Ga0209805_11259811 | 3300026542 | Soil | DDWSAVAVNERRHLRDVALAARPSLAARIDDVFSLEAIVTDLASVLDETLQSCA
Ga0209899_10439061 | 3300027490 | Groundwater Sand | PSLAGQIDDVFSLEAVVAELASVLDETLAGIAGEG
Ga0209799_10892001 | 3300027654 | Tropical Forest Soil | LKDVALAAKPALAGRIDDVFSLDALVAELAPVLEETLAGIPTE
Ga0209859_10324101 | 3300027954 | Groundwater Sand | VVKERRHLKEVALAARPTLAGRIDDVFSLESVVVELAPVLDETLAGVAG
Ga0247824_102292211 | 3300028809 | Soil | AVREGRQLRDVALAAKPELAGAIDDVFSLEAVVTELAPVLSEVLAEVND
(restricted) Ga0255311_10818411 | 3300031150 | Sandy Soil | VATTARPSLAGQIDDVFSLEAIVTELAPVLDETLAAIPGD
Ga0310915_107933362 | 3300031573 | Soil | ERLHLKDVATAARPSLASEIGGVFSLETIVTELGPVLEETLAGIPASA
Ga0318574_101407001 | 3300031680 | Soil | ARVVKERLHLKDVATAARPSLASEIGGVFSLETIVTELGPVLEETLAGIPAGA
Ga0307469_109850591 | 3300031720 | Hardwood Forest Soil | LRDVALAARPALAGTIHDVFSLDAVVAEMAPVLDEVLRDVHA
Ga0307469_112316302 | 3300031720 | Hardwood Forest Soil | RRHLKDVAMAARPSRAGRIDGVFSFEAVVAELVPVLDETLAGLPADA
Ga0307469_112804611 | 3300031720 | Hardwood Forest Soil | HLKDVAVAARPSLAGHIDDVFSFEAVIAELAPVLDETLEGIPAGA
Ga0318509_106428001 | 3300031768 | Soil | TAARPSLASEIAGVFSLETIVTELAPVLEETLAGIPAGA
Ga0318521_102702931 | 3300031770 | Soil | VKERLHLKDVATAARPSQASEIAGVFSLETIVTELGPVLEETLAGIPAGA
Ga0318546_104935131 | 3300031771 | Soil | VVKERLHLKDVATAARPSQASEIAGVFSLETIVTELGPVLEETLAGIPAGA
Ga0318529_102983251 | 3300031792 | Soil | DWSTEAIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE
Ga0318550_100633671 | 3300031797 | Soil | WSTEAIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE
Ga0318565_101696292 | 3300031799 | Soil | LLDDAGPSLASEIAGVFSLETIVTELAPVLEETLAGIPAGA
Ga0318564_101697292 | 3300031831 | Soil | HLKDVATAARPSLASEIAGVFSLETIVTELAPVLEETLAGIPAGA
Ga0318527_103116851 | 3300031859 | Soil | AIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE
Ga0310903_100510581 | 3300032000 | Soil | DRRHLKDVALAARPSLAAHIEDVFSLDALVKELAPVLDETLAGVPTEA
Ga0318506_102780291 | 3300032052 | Soil | VDDWSTEAIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE
Ga0306924_121154602 | 3300032076 | Soil | VAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE
Ga0307471_1005797881 | 3300032180 | Hardwood Forest Soil | DWSGVAVRDRRHLKLVALAARPALAGEIDAVFSLEAIVAELAPLLDETLSGVGGENP
Ga0307471_1006335111 | 3300032180 | Hardwood Forest Soil | CRHLRAVALAERPSLGAEIDSVFSLEAVVAELAPVLDETLSGVGGENP
Ga0307472_1002174771 | 3300032205 | Hardwood Forest Soil | RSARPSLAGEIDTVFSLETVVAELAPVLEETLSGVGGESP
Ga0335084_113254822 | 3300033004 | Soil | RQLRDVALAARPELAGTIDDVFSLAAIVTELAPVLDEVLAEIDA
Ga0335084_115612421 | 3300033004 | Soil | GARPALAGQIDAVFSPETMVTELAPVLDETLAVLSEGA
Ga0364930_0146106_3_143 | 3300033814 | Sediment | RHLRDVVIAARPALAGQIDDVFSLEAVVTELAPVLDEVLAEVAADG
Ga0326723_0332406_1_144 | 3300034090 | Peat Soil | RRHLKDVAVAARPSLTGQIDEVFSLEAIVAELAPVLDETLAGIPADA
Ga0314788_044523_705_818 | 3300034666 | Soil | LAAKPELAGAIDDVFSLEAVVTELAPVLSEVLAEVND




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice