NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F091812

Metagenome Family F091812

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F091812
Family Type Metagenome
Number of Sequences 107
Average Sequence Length 43 residues
Representative Sequence APAPETQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA
Number of Associated Samples 82
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 16.82 %
% of genes from short scaffolds (< 2000 bps) 15.89 %
Associated GOLD sequencing projects 74
AlphaFold2 3D model prediction Yes
3D model pTM-score0.37

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (83.178 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(41.121 % of family members)
Environment Ontology (ENVO) Unclassified
(40.187 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(44.860 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Fibrous Signal Peptide: No Secondary Structure distribution: α-helix: 16.42%    β-sheet: 0.00%    Coil/Unstructured: 83.58%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.37
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 107 Family Scaffolds
PF04748Polysacc_deac_2 52.34
PF02517Rce1-like 4.67
PF00701DHDPS 0.93
PF16491Peptidase_M48_N 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 107 Family Scaffolds
COG2861Uncharacterized conserved protein YibQ, putative polysaccharide deacetylase 2 familyCarbohydrate transport and metabolism [G] 52.34
COG1266Membrane protease YdiL, CAAX protease familyPosttranslational modification, protein turnover, chaperones [O] 4.67
COG4449Predicted protease, Abi (CAAX) familyGeneral function prediction only [R] 4.67
COG03294-hydroxy-tetrahydrodipicolinate synthase/N-acetylneuraminate lyaseCell wall/membrane/envelope biogenesis [M] 1.87


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A83.18 %
All OrganismsrootAll Organisms16.82 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300009038|Ga0099829_10151039All Organisms → cellular organisms → Bacteria → Acidobacteria1854Open in IMG/M
3300009143|Ga0099792_10590102All Organisms → cellular organisms → Bacteria → Acidobacteria707Open in IMG/M
3300010320|Ga0134109_10354756All Organisms → cellular organisms → Bacteria → Acidobacteria577Open in IMG/M
3300010322|Ga0134084_10252977All Organisms → cellular organisms → Bacteria → Acidobacteria637Open in IMG/M
3300012096|Ga0137389_10904453All Organisms → cellular organisms → Bacteria → Acidobacteria757Open in IMG/M
3300012096|Ga0137389_11360545All Organisms → cellular organisms → Bacteria → Acidobacteria605Open in IMG/M
3300012200|Ga0137382_10730050All Organisms → cellular organisms → Bacteria → Acidobacteria711Open in IMG/M
3300012203|Ga0137399_10578194All Organisms → cellular organisms → Bacteria → Acidobacteria944Open in IMG/M
3300012206|Ga0137380_11590217All Organisms → cellular organisms → Bacteria → Acidobacteria538Open in IMG/M
3300012206|Ga0137380_11607826All Organisms → cellular organisms → Bacteria → Acidobacteria534Open in IMG/M
3300012362|Ga0137361_11126057All Organisms → cellular organisms → Bacteria → Acidobacteria706Open in IMG/M
3300012363|Ga0137390_11843307All Organisms → cellular organisms → Bacteria → Acidobacteria535Open in IMG/M
3300012582|Ga0137358_10092501All Organisms → cellular organisms → Bacteria2045Open in IMG/M
3300018433|Ga0066667_10970944All Organisms → cellular organisms → Bacteria → Acidobacteria732Open in IMG/M
3300020580|Ga0210403_10353204All Organisms → cellular organisms → Bacteria → Acidobacteria1203Open in IMG/M
3300027562|Ga0209735_1100598All Organisms → cellular organisms → Bacteria → Acidobacteria630Open in IMG/M
3300027875|Ga0209283_10909616All Organisms → cellular organisms → Bacteria → Acidobacteria531Open in IMG/M
3300031720|Ga0307469_11834678All Organisms → cellular organisms → Bacteria → Acidobacteria586Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil41.12%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil9.35%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil7.48%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil7.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil6.54%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil5.61%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil3.74%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment2.80%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.80%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.87%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.87%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.87%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.87%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.93%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.93%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.93%
Permafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost Soil0.93%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa0.93%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.93%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005538Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1EnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005952Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 1 DNA2013-045EnvironmentalOpen in IMG/M
3300006052Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017822Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2EnvironmentalOpen in IMG/M
3300017823Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3EnvironmentalOpen in IMG/M
3300017973Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_20_MGEnvironmentalOpen in IMG/M
3300018006Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027535Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027562Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027669Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028773Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Palsa_N3_2EnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1006499243300001593Forest SoilSSTAKEAPAGPRPLSPEDPIFHKALDLLKTPAKKAA*
JGI12635J15846_1046422623300001593Forest SoilAAAAEEVDTGDEDVAAPETQKEPLAGPRPLSPEDPVYHKALELLKSPAKKAA*
JGIcombinedJ26739_10070235823300002245Forest SoilATNTAKEAPAGPRPLSPEDPIFRKALELLKTPAKKAA*
JGI25616J43925_1003510813300002917Grasslands SoilSDTPDEEAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKATVKKAA*
Ga0066673_1006924033300005175SoilEDTAPASVPEKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKEPVKKAA*
Ga0070694_10145830813300005444Corn, Switchgrass And Miscanthus RhizosphereDVDDDGGKNTPAKVTPSGPRPLSPEDPIFRKALELLKNSAKKAA*
Ga0070731_1050802923300005538Surface SoilPTDNSKEPGLGPRPLSPEDPIYHKALYMLKSPAKKAA*
Ga0070733_1001786563300005541Surface SoilTKKEPLAGPRPLSPEDPVYRKALELLKTPAKKAA*
Ga0066700_1080510113300005559SoilDSDTTEEEAAPPPETQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA*
Ga0066699_1039927623300005561SoilEAAPPPETQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA*
Ga0066705_1033374913300005569SoilKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA*
Ga0066706_1021309013300005598SoilAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKAAVKKAA*
Ga0080026_1028791113300005952Permafrost SoilANSTNKSKVAPTGPRPLSPEDPIFRKALELLKAPPAKKAA*
Ga0075029_10106762723300006052WatershedsAQKEPQAGPRPLSPEDPVYRKALELLKTPAKKAA*
Ga0079222_1119741613300006755Agricultural SoilPPETQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA*
Ga0066659_1053570323300006797SoilEEEAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKALAKKAA*
Ga0079219_1236075213300006954Agricultural SoilTPEAQKEPGLGPRPLSPEDAIFHRALDLLKTPAKKAA*
Ga0099791_1011950923300007255Vadose Zone SoilAPAPETQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA*
Ga0099791_1058383523300007255Vadose Zone SoilSDSTDEEAAPAPETRKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA*
Ga0099791_1066442513300007255Vadose Zone SoilPQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA*
Ga0066710_10266443413300009012Grasslands SoilASVPEKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKEPVKKAA
Ga0099829_1015103933300009038Vadose Zone SoilDDNNSGGSKETLSGPRPLSPEDPVFHKAVDLLKAPAKKAA*
Ga0099830_1013886013300009088Vadose Zone SoilAPETQKEPGLGPRPLSPEDPIFHRALDLLRAPAKKAA*
Ga0099830_1026518413300009088Vadose Zone SoilEDDSDTPDEEAAPPPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPAKKAA*
Ga0099830_1054935723300009088Vadose Zone SoilGDEDTAPAPETQKEPGLGPRPLSPEDPIFHRALDLLRAPAKKAA*
Ga0066709_10062518313300009137Grasslands SoilEEDTAPASVPEKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA*
Ga0066709_10111791223300009137Grasslands SoilEEDTAPASVPEKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKEPVKKAA*
Ga0066709_10182524723300009137Grasslands SoilTSGTPKESPSGPRPLSPEDPIFRKALDLLKTPAKKAA*
Ga0099792_1059010213300009143Vadose Zone SoilETTDEEAAPPAPETQKEPELGPRPLSPEDPIFHRALDLLRATAKKAA*
Ga0134082_1042924223300010303Grasslands SoilPAPEARKEPGQGPRPLSPEDPIFHRALDLLKTPAKKAA*
Ga0134109_1035475613300010320Grasslands SoilEESDAADEEAAPTPQTQKEPGLGPRPLSPEDSIFHRALDLLKVPAKKAA*
Ga0134084_1025297713300010322Grasslands SoilEAAPTPQTQKEPGLGPRPLSPEDSIFHRALDLLKVPAKKAA*
Ga0137392_1078028113300011269Vadose Zone SoilDSSASAKEAPSGPRPLSPEDPVFHKALDLLKAPAKKAA*
Ga0137389_1035150213300012096Vadose Zone SoilDSGDDDAAPAPEPQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA*
Ga0137389_1042589413300012096Vadose Zone SoilPQAQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA*
Ga0137389_1061766623300012096Vadose Zone SoilDDSETTDEEAAPPAPETQKEPGLGPRPLSPEDPIFHRALDLLRATAKKAA*
Ga0137389_1090445313300012096Vadose Zone SoilDDSDTPDEEAAPPPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPAKKAA*
Ga0137389_1136054513300012096Vadose Zone SoilQTQKEPGLGPRPLSPEDPIFHRALDLLKAPAPAKKAA*
Ga0137388_1131056123300012189Vadose Zone SoilDDDAAPAPEPQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA*
Ga0137383_1056812713300012199Vadose Zone SoilETQKGPGLGPRPLSPEDPIFHRALDLLKAPMKKAA*
Ga0137382_1073005013300012200Vadose Zone SoilTQKEPGLGPRPLSPEDAIFHRALDLLKVPAKKAA*
Ga0137399_1006116843300012203Vadose Zone SoilPDDSDSGDDDAAPVPETQKEPGLGPRPLSPEDPIFHRALDRLRTPAKKAA*
Ga0137399_1057819413300012203Vadose Zone SoilPEDDSDSTDEDAAPAPETQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA*
Ga0137362_1033544613300012205Vadose Zone SoilTDEEAAPPAPETQKEPGLGPRPLSPEDPIFHRALDLLRATAKKAA*
Ga0137380_1159021713300012206Vadose Zone SoilEEVAPAPETQKGPGLGPRPLSPEDPIFHRALDLLKAPMKKAA*
Ga0137380_1160782613300012206Vadose Zone SoilTQKGPGLGPRPLSPEDPIFHRALDLLKAPMKKAA*
Ga0137370_1021332713300012285Vadose Zone SoilEAAPTPQTQKEPGLGPRPLSPEDAIFHRALDLLKVPAKKAA*
Ga0137361_1112605713300012362Vadose Zone SoilAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKSPVKKAA*
Ga0137390_1013208513300012363Vadose Zone SoilQAPESQKEPGLGPRPLSPEDPIFHRALDLLKAPAKKAA*
Ga0137390_1076341813300012363Vadose Zone SoilETQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA*
Ga0137390_1108768323300012363Vadose Zone SoilTDEEAAPAPETRKEPGLGPRPLSPEDPIFHRALDLLRTPAKKAA*
Ga0137390_1184330713300012363Vadose Zone SoilPDEEAAPPPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPAKKAA*
Ga0137358_1009250133300012582Vadose Zone SoilAVPEDSDSGDDDAAPAPEPQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA*
Ga0137358_1109521513300012582Vadose Zone SoilSDSGDDDAAPPPETQKEPGLGPRPLSPEDPIFHRALDLMRTPAKKAA*
Ga0137394_1116556513300012922Vadose Zone SoilLPEDSDSGDDDAAPVPEAQKEPGLGPRPLSPEDPIFHRALDLLKTPVKKAA*
Ga0137419_1000652013300012925Vadose Zone SoilTQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA*
Ga0137419_1126626613300012925Vadose Zone SoilPDTADEAVAPTPEKQKEPGLGPRPLSPEDPIFHRALDLLKLPAKKAA*
Ga0137407_1121384213300012930Vadose Zone SoilPETQKEPGLGPRPLSPEDPIFHRALDLLKTPVKKAA*
Ga0137407_1196661023300012930Vadose Zone SoilDDAAPAPEPQKEPGLGPRPLSPEDPIFRRALDLLKTPAKKAA*
Ga0157379_1194484913300014968Switchgrass RhizosphereKNTPAKVTPSGPRPLSPEDPIFRKALELLKNSAKKAA*
Ga0137418_1045350423300015241Vadose Zone SoilTVTQEPSKDPGLGPRPLSPEDPIYHKAIELLKTPAKKAA*
Ga0137418_1089252713300015241Vadose Zone SoilKEPGLGPRPLSPEDPIFHRALDLLKAPAAAKKAA*
Ga0187802_1008266813300017822Freshwater SedimentTQQHEPGLGPRPLSPEDPIYRRALELLKTPAKKAA
Ga0187818_1002404643300017823Freshwater SedimentAPPDTETQQHEPGLGPRPLSPEDPIYRRALELLKTPAKKAA
Ga0187780_1088163623300017973Tropical PeatlandEDVAPTPESQKEPGLGPRPLSPEDPIFHRALDLLKSPAKKAA
Ga0187804_1001099243300018006Freshwater SedimentGDEDAAPPDTETQQHEPGLGPRPLSPEDPIYRRALELLKTPAKKAA
Ga0066667_1017728913300018433Grasslands SoilEEDTAPASVPEKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA
Ga0066667_1097094413300018433Grasslands SoilEEDTAPASVPEKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKEPVKKAA
Ga0066662_1003819943300018468Grasslands SoilSDTTEEDAAPPPEVQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA
Ga0193755_110777423300020004SoilDSGDDDGAPAPEPQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA
Ga0179594_1010251423300020170Vadose Zone SoilSDSGDDDAAPAPEPQKEPGLGPRPLSPEDPIFRRALDLLKTPAKKAA
Ga0179594_1014624723300020170Vadose Zone SoilPEDSDSGDDDAAPAPETQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA
Ga0179592_1046695813300020199Vadose Zone SoilDEEAAPAPETQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA
Ga0210407_1066814813300020579SoilATAPATKKEPLAGPRPLSPEDPVYRKALELLKTPAKKAA
Ga0210403_1035320423300020580SoilRAVAQDESDTGDDESAAASNTEREPSLGPRPLSPEDPIYHKALELLKTPAKKAA
Ga0210399_1034943523300020581SoilDDSSGASKETVSGPRPLSPEDPVFHKAVDLLKAPAKKAA
Ga0210399_1118722123300020581SoilGDDESAAASNTEREPSLGPRPLSPEDPIYHKALELLKTPAKKAA
Ga0210405_1117075123300021171SoilESDTTEEEAAPAPETQKEPGLGPRPLSPEDPIFHRALDLLKAPPVKKAA
Ga0210408_1135667813300021178SoilEDVAAPDTQKEPLAGPRPLSPEDPVYRKALELLKTPAKKAA
Ga0210386_1015556233300021406SoilATDSSKAEPSGPRPLSPEDPIYRKALELLKAPAKKAA
Ga0207700_1146486523300025928Corn, Switchgrass And Miscanthus RhizosphereATNKETPSGPRPLSPEDPVFHKALDLLRTPAKKAA
Ga0209155_115930713300026316SoilETQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA
Ga0209473_114111833300026330SoilTQREPGLGPRPLSPEDPIYHRALDLLKAAAPAKKAA
Ga0209807_113532223300026530SoilKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA
Ga0209805_116854013300026542SoilEAAPPPETQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA
Ga0209161_1008746313300026548SoilVPEKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKEPVKKAA
Ga0209577_1061414413300026552SoilPASVPEKGAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA
Ga0209734_111889113300027535Forest SoilNTPSKVEPSGPRPLSPEDPIYRKALELLKNPTKKAA
Ga0209735_110059813300027562Forest SoilSDTSDEEAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA
Ga0209220_102555233300027587Forest SoilPPAAPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPAKKAA
Ga0209217_119753013300027651Forest SoilGSSAGANKETPSGPRPLSPEDPVYHKALDLLKAPAKKAA
Ga0208981_108870123300027669Forest SoilAPVPETQKEPGLGPRPLSPEDPIFHRALDLLRTPAKKAA
Ga0209073_1050925713300027765Agricultural SoilDDDAAPPPETQKEPGLGPRPLSPEDPIFHRALDLLRTPAKKAA
Ga0209074_1001993313300027787Agricultural SoilRAAPTEPGLGPRPLSPEDPIFHRALDLLKAPAPAKKAA
Ga0209701_1020330823300027862Vadose Zone SoilAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKAPAKKAA
Ga0209283_1090961623300027875Vadose Zone SoilVPEDDSDSTDEDAAPAPETQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA
Ga0209488_1054907413300027903Vadose Zone SoilETTDEEAAPPAPETQKEPELGPRPLSPEDPIFHRALDLLRATAKKAA
Ga0137415_1021842733300028536Vadose Zone SoilADEGDTGDEYATAPETQKEPSAGPRPLSPEDPIFRKALELLKTPAKKAA
Ga0302234_1023360913300028773PalsaSAGAKETPGPRPLSPDDAVYRKALELLKTPAKKAA
Ga0170820_1515698413300031446Forest SoilDGGDDGTVTQEPSKDPGLGPRPLSPEDPIYHKAIELLKTPAKKAA
Ga0310686_11028094743300031708SoilSADAQKEPSLPRPLSPEDPIYRKALELLKAPQKKAA
Ga0307469_1077653513300031720Hardwood Forest SoilPAPEPQKEPGLGPRPLSPEDPIFHRALDLLKAPVKKAA
Ga0307469_1183467823300031720Hardwood Forest SoilRAVPDDADSGEDDAAPAPEPQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA
Ga0307477_1008621533300031753Hardwood Forest SoilPEDDSDVADDDAAPAPQTQKEPGLGPRPLSPEDPIFHRALDLLKTPVKKAA
Ga0307471_10102071113300032180Hardwood Forest SoilDDSDATDEEAAPAPETRKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA
Ga0307471_10267609223300032180Hardwood Forest SoilPEPQKEPGLGPRPLSPEDPIFHRALDLLKTPAKKAA
Ga0307472_10081164213300032205Hardwood Forest SoilPGGSKETPSGPRPLSPEDPVFHKAVDLLKAPAKKAA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.