NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome / Metatranscriptome Family F069943


Overview

Basic Information
Family ID: F069943
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 123
Average Sequence Length: 58 residues
Representative Sequence: MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAE
Number of Associated Samples: 103
Number of Associated Scaffolds: 123

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 100.00 %
% of genes near scaffold ends (potentially truncated): 0.81 %
% of genes from short scaffolds (< 2000 bp): 0.00 %
Associated GOLD sequencing projects: 92
AlphaFold2 3D model prediction: No


Most Common Taxonomy
Group: Unclassified (99.187 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (32.520 % of family members)
Environment Ontology (ENVO): Unclassified (52.033 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (65.041 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution:
α-helix: 17.24%
β-sheet: 20.69%
Coil/Unstructured: 62.07%
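
The percentages above can be turned into approximate residue counts. The sketch below assumes they were computed over the 58-residue representative sequence, which this page does not state explicitly; the function and variable names are illustrative only.

```python
# Convert reported secondary-structure percentages into residue counts.
# ASSUMPTION: the percentages refer to the 58-residue representative sequence.
SEQ_LEN = 58
reported = {"alpha_helix": 17.24, "beta_sheet": 20.69, "coil": 62.07}


def to_residue_counts(percentages, length):
    """Round each percentage of `length` to the nearest whole residue."""
    return {name: round(pct * length / 100) for name, pct in percentages.items()}


counts = to_residue_counts(reported, SEQ_LEN)
print(counts)  # {'alpha_helix': 10, 'beta_sheet': 12, 'coil': 36}
```

Under that assumption the counts add up to 58, which is consistent with the percentages summing to 100.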

Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 123 Family Scaffolds
PF02698 | DUF218 | 77.24
PF00496 | SBP_bac_5 | 11.38
PF08352 | oligo_HPY | 4.07
PF01047 | MarR | 0.81
PF12811 | BaxI_1 | 0.81
PF04264 | YceI | 0.81
PF00528 | BPD_transp_1 | 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 123 Family Scaffolds
COG1434 | Lipid carrier protein ElyC involved in cell wall biogenesis, DUF218 family | Cell wall/membrane/envelope biogenesis [M] | 77.24
COG2949 | Uncharacterized periplasmic protein SanA, affects membrane permeability for vancomycin | Cell wall/membrane/envelope biogenesis [M] | 77.24
COG2353 | Polyisoprenoid-binding periplasmic protein YceI | General function prediction only [R] | 0.81



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 99.19 %
All Organisms | root | All Organisms | 0.81 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300026537|Ga0209157_1006055 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 8739




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 32.52%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 16.26%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 11.38%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 8.94%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 8.13%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.88%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.06%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.44%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 2.44%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.63%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.63%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.63%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.81%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.81%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.81%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.81%
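
Because the habitat rows are split across many fine-grained GOLD paths, it can help to roll them up to a coarser level. The following is a minimal sketch that sums the listed percentages over the first two levels of each path; the `HABITATS` list is transcribed from the rows above, and the `summarize` helper is a hypothetical name, not part of NMPFamsDB.

```python
from collections import defaultdict

# (GOLD ecosystem path, % of family members), transcribed from the habitat rows above.
HABITATS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 32.52),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 16.26),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 11.38),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 8.94),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 8.13),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 4.88),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 4.06),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil", 2.44),
    ("Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost", 2.44),
    ("Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment", 1.63),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 1.63),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 1.63),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 0.81),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil", 0.81),
    ("Environmental → Terrestrial → Soil → Loam → Grasslands → Soil", 0.81),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere", 0.81),
    ("Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment", 0.81),
]


def summarize(rows, depth=2):
    """Sum percentages over the first `depth` levels of each GOLD path."""
    totals = defaultdict(float)
    for path, pct in rows:
        levels = [part.strip() for part in path.split("→")]
        totals[" → ".join(levels[:depth])] += pct
    # Round to two decimals to hide floating-point noise from the summation.
    return {key: round(value, 2) for key, value in totals.items()}


summary = summarize(HABITATS)
for key, value in sorted(summary.items(), key=lambda item: -item[1]):
    print(f"{key}: {value:.2f} %")
```

Changing `depth` gives a finer or coarser grouping; the totals inherit the rounding of the percentages shown on the page, so they may not sum to exactly 100.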




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001538 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A10-PF 4A)- 1 week illumina | Environmental
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300011987 | Permafrost microbial communities from Nunavut, Canada - A20_80cm_0M | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300014829 | Permafrost microbial communities from Nunavut, Canada - A10_35cm_6M | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019279 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30 (Metagenome Metatranscriptome) | Environmental
3300019885 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2 | Environmental
3300020006 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m2 | Environmental
3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300025918 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental
3300026312 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026324 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes) | Environmental
3300026327 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes) | Environmental
3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental
3300026342 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes) | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental
3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300028711 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150 | Environmental
3300028718 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194 | Environmental
3300028791 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144 | Environmental
3300028799 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123 | Environmental
3300028819 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153 | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300028884 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300033814 | Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
A10PFW1_101322541 | 3300001538 | Permafrost | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGEC
JGI25385J37094_102206641 | 3300002558 | Grasslands Soil | VPSALVLAVTAENAESLLSGERDRDHRRIPPRKLPARAYLAVVGTGSVVGECRL
JGI25382J37095_102489491 | 3300002562 | Grasslands Soil | VPSALVLAVTAENAESLLSGERDRDHRRIPPRKLPARAYLAV
JGI25382J43887_105175571 | 3300002908 | Grasslands Soil | VPSALVLAVTAENAESLLSGERDRDHRRIPPRKLPARAYLAVVGTGSVVGECRLGAP
Ga0062589_1028921592 | 3300004156 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRVPPKKLPARAYLAVVGTGSIVGECELGAAERKTAKGWILPVSKPRRYRKPR
Ga0062592_1003256411 | 3300004480 | Soil | VDAIVLAVSQENADALLDGKRTADHRALPPTRLPARAYLAVVGTGTVVGECVLG
Ga0062591_1013268961 | 3300004643 | Soil | VDAIVLAVSQENADALLDGKRTADHRALPPTRLPARAYLAVVGTGTVVGEC
Ga0066674_100800143 | 3300005166 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTSK
Ga0066677_102124471 | 3300005171 | Soil | VATALVLAISAEHADAILAGDRDTDHRRFPPKKLPARAYLAVVGTGSVVGECQLGEAERNTAK
Ga0066683_107587142 | 3300005172 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVG
Ga0066680_100439413 | 3300005174 | Soil | VASALVLAVTAENAESLLSGERDRDHRRMPPKKLPARAYLAV
Ga0066680_100926793 | 3300005174 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKRLPARAYLAVVGTGSVVG
Ga0066679_100140091 | 3300005176 | Soil | MSSAIVLATTAENAEALLSGARDRDHRRFPPKKLPARAYLAVVG
Ga0066690_108557892 | 3300005177 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECEL
Ga0070694_1005729122 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | MPSALVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSIVGECELGTAERHTSKGWALPVTKPRRYRK
Ga0066686_101930891 | 3300005446 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTG
Ga0066682_108043091 | 3300005450 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGT
Ga0066687_104244301 | 3300005454 | Soil | MSSALVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTAS
Ga0066687_107671062 | 3300005454 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVDGECELGAAERRTAKGWALPVSKPRR
Ga0070706_1007907672 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTAKGWALPVSKPRR
Ga0070706_1015323922 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VVNQTRAIVLAITAENAEALLSGARDRDHRRSPPKELPARGYLAVVGTGSVVGECRLGVP
Ga0070707_1007924651 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MSSAIVLATTAEHAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGEC
Ga0070697_1006573403 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | VPSALVLAVTAENAESLLSGERDRDHRRIPPKKLPARAYLAVVGTASVVGE
Ga0066697_100757753 | 3300005540 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAE
Ga0066697_103091851 | 3300005540 | Soil | MSSAIVLATTAENADALLSGERDKDHRRFPPKKLPARAYLAVVGTGSVVGECELGA
Ga0066701_109146902 | 3300005552 | Soil | LSEAIVLAISAENAEAILSGERDRDHRRFPPKKLP
Ga0066695_102482401 | 3300005553 | Soil | MSSAIVLATTAENADALLSGERDKDHRRFPPKKLPARAY
Ga0066692_102629021 | 3300005555 | Soil | LSEAIVLAISAENAEAILSGERDRDHRRFPPKKLPARAYLAVVGTGSVIGECHLGEPERN
Ga0066692_108866591 | 3300005555 | Soil | VNPTRAIVLAITAENAEALLSGARDRDHRKSPPKELPARGYLAVVGTGSVVGECRLGVP
Ga0066704_105803742 | 3300005557 | Soil | MSSAIVLATTAENAEAILSGERERDHRRFPPKKLPARAYLAVVGTGSVIGECELGAAERHTAKGWALPVSKPRRYR
Ga0066704_109635452 | 3300005557 | Soil | VVNPTRAIVLAITAENAEALLSGARDRDHRRSPPKELPARGYLAVVGTGSVVG
Ga0066698_107152871 | 3300005558 | Soil | MSSAIVLATTAENAEALLSGDRDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGVAERHTAKGWALPVSKPRRYRK
Ga0066700_106510072 | 3300005559 | Soil | MASALVLALTAANADSILSGERENDYRRFPPKKLPARAYLAVVGTASVVGECELGVAERHTAKGWALPVSRP
Ga0066703_107803902 | 3300005568 | Soil | MSSAIVLATTAENAEAILSGERERDHRRFPPKKLPARAYLAVVGTGSVIGECELGAAERHTAKGWALPVSK
Ga0066708_100822681 | 3300005576 | Soil | MASALVLALTAANADSILSGERENDYRRFPPKKLPARAYLAVVGTASVVGECELGVAERHTA
Ga0066652_1011734222 | 3300006046 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSIIGECTLGIAERNTAKGWALPVSKPRR
Ga0066653_106109412 | 3300006791 | Soil | MSSALVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTASIVGECQLGPAERHSSKGWALPVSKPRRY
Ga0066659_117939332 | 3300006797 | Soil | VASALVLAVTAENAESLLSGERDRDHRRMPPKKLPARAYLAVVGTGSVVGECLLGAA
Ga0075425_1010066613 | 3300006854 | Populus Rhizosphere | MSSAIVLAITAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSV
Ga0075436_1001561983 | 3300006914 | Populus Rhizosphere | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTAKGWALPVSKPRRYR
Ga0075435_1008842052 | 3300007076 | Populus Rhizosphere | MSSAIVLATTAENAEALLSGERDRDHRRFPPKNLPARAYLAVVGTGSVVGECELGAAER
Ga0099791_106873032 | 3300007255 | Vadose Zone Soil | MSSAIVLATTAENAEAILSGERDRDHRRFPPKKLPARAYL
Ga0099793_100205201 | 3300007258 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSIVGE
Ga0099793_102930911 | 3300007258 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTAKGWALPVSKPRRYRKP
Ga0099794_104212951 | 3300007265 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSV
Ga0066710_1011408613 | 3300009012 | Grasslands Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPAR
Ga0066710_1012139003 | 3300009012 | Grasslands Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVG
Ga0099829_110086741 | 3300009038 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPTRAHLAVVGTGSVVGECELGAAERHTAKG
Ga0099829_112870842 | 3300009038 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGKRDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGPAERHTAKGWALPVSKPRRYRKP
Ga0099830_101816703 | 3300009088 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGVRDRDHRRFPPKKLPARAYL
Ga0099827_106113081 | 3300009090 | Vadose Zone Soil | VPSALVLAVTAENAESLLSGERDRDHRRMPPKKLPAR
Ga0099827_110280562 | 3300009090 | Vadose Zone Soil | LLPAIVLAITAENAEALLSGARDREHRHKPPKRLPARAYLAVVGTGQVVGECRLGPPERKTA
Ga0066709_1005249303 | 3300009137 | Grasslands Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGVAERHTAKGWALPVSKPRRYR
Ga0066709_1008021153 | 3300009137 | Grasslands Soil | MSSAIVLATTAENAEALLSGDRDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGVAERHTAKGWALPVSKPRRYR
Ga0114129_131657902 | 3300009147 | Populus Rhizosphere | MSSAVVLTTTAENAEALLSGERDTDHRRFPPKKLPSRAYLAVVGTGSVVGECELGAAERR
Ga0075423_102308093 | 3300009162 | Populus Rhizosphere | MSSAVVLATTAENAEALLSGERDTDHRRFPPKKLPSRAYLAVVGTGSVFGECELGAAERK
Ga0134088_104829992 | 3300010304 | Grasslands Soil | VASALVLAVTAENAESLLSGERDRDHRRMPPKKLPARAYLAVVGTGSVVGECLLGAAERHTAKGWA
Ga0134086_103708041 | 3300010323 | Grasslands Soil | VASALVLAISAENAEALLSGARDRDHRTSPPRALPARAYLAVVGTGAVVG
Ga0134080_100249553 | 3300010333 | Grasslands Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTAKGWALPVSK
Ga0134071_101694361 | 3300010336 | Grasslands Soil | MSSAIVLATTAENAEALLSGARDRDHRRFPPKKLPAR
Ga0134071_102777302 | 3300010336 | Grasslands Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECTLGIAERNTAKGWALPV
Ga0134126_109361633 | 3300010396 | Terrestrial Soil | MPSALVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERNTAKGWVLPVSKPRRYRKPR
Ga0120164_10183221 | 3300011987 | Permafrost | MPSALVLATTAENAEALLSGERDRDHRRVPPKKLPARAYLAVVGTGSIVGECELGAAERHTSK
Ga0137383_109437152 | 3300012199 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTSKGWALPVTKT
Ga0137382_113528271 | 3300012200 | Vadose Zone Soil | VPSALVLAVTAENADALLSGERDRDHRRFPPKKLPARAYL
Ga0137399_103465951 | 3300012203 | Vadose Zone Soil | MSSAIVLATTAENAEAILSGERERDHRRFPPKKLPARAYLAVVGTGTVIGEC
Ga0137376_114852141 | 3300012208 | Vadose Zone Soil | VPSALVLAVTAENAESLLSGERDRDHRRIPPKKLPARAYLAVVGTGSVVGECRLGA
Ga0137378_117333842 | 3300012210 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPRKLPARAYLAVVGTGSVVGECTLGIAERKTAK
Ga0137370_106658082 | 3300012285 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSIIGECTLGIAERKTAKG
Ga0137416_109526132 | 3300012927 | Vadose Zone Soil | MSSAIVLATTAENAEAILSGERERDHRRFPPKKLPARAYL
Ga0137410_102693941 | 3300012944 | Vadose Zone Soil | MSSAIVLATTAENAEAILSGERDRDHRRFPPKKLPARAYLAVVGTGSIVGECELGAAERNTARG
Ga0134110_102367611 | 3300012975 | Grasslands Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGLVVGE
Ga0134075_100580661 | 3300014154 | Grasslands Soil | MSSAIVLATTAENAEALLSGARDRDHRRFPPKKLPARAYLAVVGTGSVVG
Ga0134079_103402392 | 3300014166 | Grasslands Soil | MASALVLALTAANADSILSGERENDYRRFPPKKLPARAYLAVVGTASVVGEC
Ga0120104_10299363 | 3300014829 | Permafrost | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERKTAKGWV
Ga0137418_1001249710 | 3300015241 | Vadose Zone Soil | MSSAIVLATTAENAEAILSGERERDHRRFPPKKLPARA
Ga0134085_100355063 | 3300015359 | Grasslands Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGLVVGECELGAAERHTAKGWALPVSKP
Ga0134069_10974263 | 3300017654 | Grasslands Soil | VASALVLAVTAENAESLLSGERDRDHRRIPPRKLPARAYLAVAGTGSVVGECRLGAPLRHSAKGW
Ga0184604_100876442 | 3300018000 | Groundwater Sediment | MSSAIVLATTAENAEALLAGERDRDQRRFPPKKLPARAYLAVVGTGSVVGECELGPAERNTAKGWVLPVS
Ga0184618_103913281 | 3300018071 | Groundwater Sediment | VPSALVLAVTAETAESLLSGERDRDHRRIPPKKLPARAYLAVVGTGSVVGECRLGAAEQHSAKGWALPVSQPR
Ga0066667_104094221 | 3300018433 | Grasslands Soil | MSSALVLATTAENAEALLSGARDRDHRRFPPKKLPARAYLA
Ga0066667_105850453 | 3300018433 | Grasslands Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTAKGW
Ga0066669_115954892 | 3300018482 | Grasslands Soil | LSEAIVLAISAENAEAILSGERDRDHRRFPPKKLPARAYLAVVGTGSVIGECHLGEPERNTAK
Ga0184642_15279921 | 3300019279 | Groundwater Sediment | VPSALVLAVTAENAESLLSGERDRDHRRIPPKKLPARAYLAVVGTGSVVGECRLGAAEQHSAKGWALPVLQPRR
Ga0193747_10295921 | 3300019885 | Soil | MSSAIVLATTAENAEALLSGARDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAA
Ga0193747_10405163 | 3300019885 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGPVVGECELGAAERNTAKGWAS
Ga0193735_11797732 | 3300020006 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSIVGECELGIAERKTAR
Ga0210382_101378703 | 3300021080 | Groundwater Sediment | VPSALVLAVTAENAESLLSGERDRDHRRIPPKKLPARAYLAVVGTG
Ga0222622_101038791 | 3300022756 | Groundwater Sediment | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLP
Ga0207662_106892652 | 3300025918 | Switchgrass Rhizosphere | MPSALVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSIVGECELGAAERHTSKGWALPVTKPRRYRK
Ga0207646_109390172 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVIGECELGPAERHTSKGWALP
Ga0209237_10306961 | 3300026297 | Grasslands Soil | VPSALVLAVTAENAESLLSGERDRDHRRIPPRKLPARAYLAVVGTGSVVGE
Ga0209237_10603923 | 3300026297 | Grasslands Soil | VPAALVLAVTAENAESLLSGERDRDHRRIPPKKLPARAYLAVVGTASVV
Ga0209239_10281674 | 3300026310 | Grasslands Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGTIIGECTLGIAERNTAKGWALPI
Ga0209153_10814173 | 3300026312 | Soil | MSSALVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVV
Ga0209153_12567292 | 3300026312 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAER
Ga0209761_11526013 | 3300026313 | Grasslands Soil | MSSAIVLATTAENAAALLSGERDRDHRRFPPKRLPARAYLAVVGTGRMA
Ga0209471_10096287 | 3300026318 | Soil | MSSAIVLATTAENAEALLSGKRDRDHRRIPPKRLPARAYLAVVGTGSV
Ga0209470_10591543 | 3300026324 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAV
Ga0209266_10418463 | 3300026327 | Soil | VPSALVLAVTAENAESLLSGERDRDHRRIPPKKLPARAYLAVVGTGSVVGECRL
Ga0209802_10558923 | 3300026328 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKRLPARAYLAVVGTGSVVGECELGTAERRSAKGWALPVSKPRRYRKPR
Ga0209057_12336332 | 3300026342 | Soil | VASALVLAVTAENAESLLSGERDRDHRRMPPKKLPARAYLAVVGTG
Ga0209806_10104147 | 3300026529 | Soil | MSSALVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTASIVGECQLGAAERHTS
Ga0209058_10184526 | 3300026536 | Soil | MSSAIVLATTAENADALLSGERDKDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERH
Ga0209157_10060551 | 3300026537 | Soil | MSSAIVLATTAENAEALLSGARDRDHRRFPPKKLPARAYLAVVGTG
Ga0209157_11158991 | 3300026537 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGLVVGECELG
Ga0209056_100130961 | 3300026538 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTRSVVGECELGAAERHTAKGWALPVSKPRR
Ga0209376_10276855 | 3300026540 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTAKGWALPVIKP
Ga0209376_10932761 | 3300026540 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTSKGWALPVTKTRRYR
Ga0209474_1000977211 | 3300026550 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTAKGWALPVIKPR
Ga0209180_102657041 | 3300027846 | Vadose Zone Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTA
Ga0209590_105632172 | 3300027882 | Vadose Zone Soil | LLPAIVLAITAENAEALLSGARDREHRHKPPKRLPARAYLAVVGTGQVVGECRLGPPERKTAKG
Ga0307293_100080384 | 3300028711 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAEKHTAKGWALPVTKP
Ga0307293_100835063 | 3300028711 | Soil | MPSAIVLATTAENAEAILSGERDRDHRKFPPKKLPARAYLAVVGTGSVVGECTIGVAERNTARGWALPVSKPRRYRKP
Ga0307307_100211041 | 3300028718 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAEKHTAKGWALPVTKPRRYRK
Ga0307290_102172282 | 3300028791 | Soil | MSSAIVLATTAENAEALLSRERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAEKHTAKGWAL
Ga0307284_100564381 | 3300028799 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLA
Ga0307296_103555291 | 3300028819 | Soil | MSSAAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYL
Ga0307312_102352041 | 3300028828 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGIAERN
Ga0307308_104430061 | 3300028884 | Soil | MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVIGE
Ga0307471_1022061962 | 3300032180 | Hardwood Forest Soil | LATALVLAISAENADAILAGERDTDHRRFPPKRLPARAYLAVV
Ga0307471_1028151372 | 3300032180 | Hardwood Forest Soil | MSSAIVLATTAENAEALLSGARDRDHRRFPPKKLPTRAY
Ga0364930_0312957_2_229 | 3300033814 | Sediment | MDALVLAVTAENADAILDGERRFDHRRIPPKRLPARAYLAVSGEGVVGECELGAAERETADGWALPVSKPRRYRSA
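
Many of the member sequences listed above differ from the representative sequence at only a few positions. One rough way to quantify that is sketched below, assuming an ungapped comparison that starts at the first residue of both sequences. This is a simplification and not the alignment method used by NMPFamsDB; `prefix_identity` is a hypothetical helper, and the two example sequences are copied from the rows for sample taxon IDs 3300005166 and 3300002558.

```python
REPRESENTATIVE = "MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAE"


def prefix_identity(member, reference=REPRESENTATIVE):
    """Fraction of identical residues over the positions both sequences cover.

    ASSUMPTION: ungapped, position-by-position comparison from residue 1.
    A proper analysis would use a real pairwise or multiple alignment.
    """
    overlap = min(len(member), len(reference))
    if overlap == 0:
        return 0.0
    # zip() stops at the shorter sequence, so only the overlap is compared.
    matches = sum(a == b for a, b in zip(member, reference))
    return matches / overlap


# Two member sequences copied from the listing above.
soil_member = "MSSAIVLATTAENAEALLSGERDRDHRRFPPKKLPARAYLAVVGTGSVVGECELGAAERHTSK"
grassland_member = "VPSALVLAVTAENAESLLSGERDRDHRRIPPRKLPARAYLAVVGTGSVVGECRL"

for name, seq in [("soil", soil_member), ("grassland", grassland_member)]:
    print(f"{name}: {prefix_identity(seq):.3f} identity over {min(len(seq), len(REPRESENTATIVE))} residues")
```

Because several members are truncated or start with a different residue, the identity values are only comparable over the overlapping prefix.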




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice