NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F087470

Metagenome Family F087470

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F087470
Family Type Metagenome
Number of Sequences 110
Average Sequence Length 45 residues
Representative Sequence VQVPFRAYYALNVLRLDRSIRLSDKSIFFARNRFSAFGWGLGSLF
Number of Associated Samples 92
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.91 %
% of genes from short scaffolds (< 2000 bps) 0.91 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction Yes
3D model pTM-score0.30

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(35.455 % of family members)
Environment Ontology (ENVO) Unclassified
(31.818 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.818 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 54.79%    β-sheet: 0.00%    Coil/Unstructured: 45.21%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.30
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 110 Family Scaffolds
PF03976PPK2 90.00
PF14310Fn3-like 2.73
PF01833TIG 0.91
PF07876Dabb 0.91
PF08818DUF1801 0.91
PF01425Amidase 0.91
PF01915Glyco_hydro_3_C 0.91
PF064393keto-disac_hyd 0.91

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 110 Family Scaffolds
COG2326Polyphosphate kinase 2, PPK2 familyEnergy production and conversion [C] 90.00
COG0154Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidaseTranslation, ribosomal structure and biogenesis [J] 0.91
COG1472Periplasmic beta-glucosidase and related glycosidasesCarbohydrate transport and metabolism [G] 0.91
COG4430Uncharacterized conserved protein YdeI, YjbR/CyaY-like superfamily, DUF1801 familyFunction unknown [S] 0.91
COG5646Iron-binding protein Fra/YdhG, frataxin family (Fe-S cluster biosynthesis)Posttranslational modification, protein turnover, chaperones [O] 0.91
COG5649Uncharacterized conserved protein, DUF1801 domainFunction unknown [S] 0.91


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300012582|Ga0137358_10150078Not Available1587Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil35.45%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil18.18%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil7.27%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil6.36%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.45%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil5.45%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.55%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.64%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil2.73%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment1.82%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.82%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil0.91%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.91%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.91%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.91%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.91%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil0.91%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.91%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.91%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001089Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3EnvironmentalOpen in IMG/M
3300001867Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)EnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005538Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1EnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009520Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_1_NS metaGEnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300016357Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017926Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_2EnvironmentalOpen in IMG/M
3300017955Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2EnvironmentalOpen in IMG/M
3300018064Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300027034Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027061Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM1H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027669Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031122Oak Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031679Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032039Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f21EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12683J13190_102536813300001089Forest SoilNILRLNRFLSLSDKSLFHANNRFSAFGWGLGSLF*
JGI12627J18819_1032311813300001867Forest SoilNVLRMNRFISLSQNSVFHARNRFSAFGWGLGTLF*
JGI25617J43924_1019586423300002914Grasslands SoilVPVRAYYAVNVLRMNRFFSLSDNSLFHAHNRFSVFGWGLGALF*
JGI25617J43924_1033376923300002914Grasslands SoilTIPGVQVPVRVYYAVNVLRMNRFFSLSNNSLFHARNRFSAFGWGLGSLF*
JGI25616J43925_1012931623300002917Grasslands SoilGIELQWTVPAIQVPVRGYYAVNVLRLNRFLALPGGSVFHAHNRFSAFGWALGTLF*
Ga0066688_1081835623300005178SoilRWTVPGVQVPVRAYYAANVLGMNRFPPLSDISLFHARNRFSAFGWGLSSLF*
Ga0066675_1011999613300005187SoilQLQWTIPGVQVPFRSYYALNVLRLDRVIPLSSRSLLHPHNRLGAFGWGLGSLF*
Ga0066675_1100988923300005187SoilFRSYYALNVLRLDRLIPLPDKSLLHARNRFGAFGWGLGWLF*
Ga0066687_1091180223300005454SoilWTLPGIQVPVRAYYALNVLRLNRVIPLSSTSSFVAHNRFSAFGWGLGSLF*
Ga0070706_10184499713300005467Corn, Switchgrass And Miscanthus RhizosphereGVNVPFRAYYARNLLRLDRSIRLSDKSIFFARNRFSAFGWGLGSLF*
Ga0070731_1108455513300005538Surface SoilVRAYYALNVLRLNRWVPMPDGSLFHARDRFSAIGWGLGSLF*
Ga0070732_1064059013300005542Surface SoilGVQVPFRSYYAVNVLRLDRWIPLSEKSLLHPHNRLGAFGWGLGSFF*
Ga0066701_1064215523300005552SoilTGVELRWTVPEVQVPVRAYFAFNVLRLDHPIALSDKSRFLARNRFSAFGWGLGALF*
Ga0066692_1038738923300005555SoilRAYYALNLLRLDRSIRLSDKSIVFARNRFSAFGWGLGSLF*
Ga0066703_1063574913300005568SoilGIELRWTVPGVQVPVRAYYALNVLRMNRFIYLTENSVFHARNRFSAFGWGLGTLF*
Ga0066705_1073004123300005569SoilYYALNVLRLNRIIPLSSTSSFAAHNRFSAFGWGLGSLF*
Ga0066708_1020031713300005576SoilALNVLRLNRVIPLSSTSSFAAHNRFSAFGWGLGSLF*
Ga0066691_1012007013300005586SoilRWTVPSVQVPIRAYYSLNVLRLNRFFSLSEDSLFHAHNRFSAFGWGLGALF*
Ga0070765_10190090623300006176SoilNVLRMNRFFSLSDNSPFHAHNRFSAFGWGLGSLF*
Ga0066658_1002310143300006794SoilVQVPVRAYYAVNVLRMNRFISLSENSLFHVHNRFSAFGWGLGTLF*
Ga0066659_1139681923300006797SoilYYSLNVLRLNRYIVLSDKPRLLAHNRFSAFGWGLGTLF*
Ga0075436_10027157423300006914Populus RhizosphereVQVPFRSYYAVNVLRLDRWIPLSEKSLLHAHNRFGAFGWGLGSFF*
Ga0079219_1118933113300006954Agricultural SoilTAPGVHLPVRAYYALNVVRLNRFSELPNGAIFHARNRLSAFGWALGSLF*
Ga0099793_1023611023300007258Vadose Zone SoilWTVPGVNVPFRAYYALNLLRLDRSIRLSDKSIVFARNRFSAFGWGLGSLF*
Ga0099793_1036841213300007258Vadose Zone SoilALNVLRLDRSIRLSDKSIFFARNRFSAFGWGLGSLF*
Ga0066710_10013245613300009012Grasslands SoilRWTVPSVQVPIRAYYSLNVLRLNRFFSLSEDSLFHAHNRFSAFGWGLGALF
Ga0099829_1085209613300009038Vadose Zone SoilSLNVLRLNRFVSLSDKSRLLAHNRFSAFGWGLGTLF*
Ga0099830_1019252313300009088Vadose Zone SoilIELRWTAPGVQVPVRAYYSLNVLRLDRFVSLSDKSRLLAHNRFSAFGWGLGTLF*
Ga0099830_1147597823300009088Vadose Zone SoilRWTVPGVQVPVRVYYAVNVLRMNRFFSLSNNSLFHAHNRFSAFGWGLGSLF*
Ga0099827_1029317313300009090Vadose Zone SoilALNVLRLNRYIALSDSAVFHAHNRFSAFGWGLGTLF*
Ga0099792_1075860523300009143Vadose Zone SoilWTVPGVQVPVRAYYSLNVLRLNRFVVLSDKSRLLAHNRFSAFGWGLGTLF*
Ga0116214_140988113300009520Peatlands SoilEVQVPVRAYYAFNVLRLDRYIFLSGKSSFFAHDRLSAFGWGLGSLF*
Ga0126373_1271412623300010048Tropical Forest SoilYAINVLRLNRAIPLSDKSILHAHNRFAAFGWGLGSLF*
Ga0134111_1024397113300010329Grasslands SoilFQWTIPGVQVPFRSYYALNVLRLDRLIPLPDKSLLHARNRFGAFGWGLGWLF*
Ga0134062_1024839013300010337Grasslands SoilIQLQWTIPGIQVPFRSYYALNVLRLDHPVPLSEKSPLHAHNRFGAFGWGLGSLF*
Ga0126370_1049320023300010358Tropical Forest SoilVPFRSYYAVNVLRLDRWIPLSEKSFLHVHNRFGAFRWGLGSLF*
Ga0126372_1062039223300010360Tropical Forest SoilWTVPGVQVPLRTYYAINVLRLDRVIPLSDKSLLHAHNRFAVFGWGLGSLF*
Ga0126378_1304542613300010361Tropical Forest SoilTGIQLQWTIPGVQVSFRSYYAVNVLRLDRWIPLSGKSLLRAHNRFGAFGWGLGSLF*
Ga0126381_10271901813300010376Tropical Forest SoilVQVPFRSYYAVNVLRLDRLIPLSEKSLLHAHNRFGAFGWGLGSLF*
Ga0126383_1229325723300010398Tropical Forest SoilPFRSYYAVNVWRFDRWIPFSEKSLLHLHNRLGAFGWGLGSLF*
Ga0137393_1088039713300011271Vadose Zone SoilTVPGVQVPVRAYYAVNVLRLNRFFSLSENSIFHAHNRFSAFGWGLGTLF*
Ga0137388_1070981723300012189Vadose Zone SoilVPEVQVPVRAYYAFNVLRLDHPIALSDKSRFLARNRFSAFGWGLGALF*
Ga0137363_1100656923300012202Vadose Zone SoilVQVPVRAYYAVNVLRMNRIIPLSENSLFHARNRFSAFGWGLGTLF*
Ga0137399_1088927423300012203Vadose Zone SoilALNILRLNRFLSLSDKSLFHANNRFSAFGWGLGSLF*
Ga0137362_1011554613300012205Vadose Zone SoilVPVRAYYAVNVLRLDRFITLSDKSRFFAHNPFSAFGWGLGSLF*
Ga0137377_1083063123300012211Vadose Zone SoilYAFNVLRLDRSLLLPDNSLFRAHNRFAAFGWALGSLF*
Ga0137377_1169551213300012211Vadose Zone SoilVPIRAHYAFNVLRLDRSLLLPDNSLFRAHNRFSAFGWALGSLF*
Ga0137387_1023774913300012349Vadose Zone SoilRAYYALNVLRLNRVMPLSSTSSFLAHNRFSAFGWGLGSLF*
Ga0137387_1103424223300012349Vadose Zone SoilVQVPIRAHYAFNVLRLDRSLLLPDNSLFRAHNRFSAFGWALGSLF*
Ga0137361_1043855223300012362Vadose Zone SoilVQVPVRAYYSLNVLRLNRYIVLSDKTRLLAHNRFSAFGWGLGTLF*
Ga0137358_1015007843300012582Vadose Zone SoilHGSAGIELRWTVPGVQVPVRAYYALNVLRLNRVIPLSSTSSFLAHNRFSAFGWGLGSLF*
Ga0137396_1019976233300012918Vadose Zone SoilWTIPGVQVPFRAHYALNVLRLDRSIRLSDKSIFFPRNRFSAFGWGLGSLF*
Ga0137394_1149676623300012922Vadose Zone SoilYYALNLLRLDRSIRLSDKSIFFARNRFAAFGWGLGSLF*
Ga0137419_1069316813300012925Vadose Zone SoilVPFRAYYALNLLRLDRSIRLSDKPIFFARNRFSAFGWGLGSLF*
Ga0137416_1002733353300012927Vadose Zone SoilWTVPGIQVPVRGYYALNVLRLNRFLPLPNGSVFHAHNRFSAFGWALGALF*
Ga0137416_1089475213300012927Vadose Zone SoilGFELRWTVPGVRVPVRAYYAANVLHMNRFISLSENSVFHARNRFSAFGWGLGPLF*
Ga0137416_1166741113300012927Vadose Zone SoilALNVLRLSRVIPLSSTSSFLAHNRFSAFGRGLGSLF*
Ga0137416_1169147613300012927Vadose Zone SoilYALNLLRLDRSIRLSDKSIVFARNRFSAFGWGLGSLF*
Ga0134076_1050707413300012976Grasslands SoilLNVLRLNRFFSLSEDSLFHAHNRFSAFGWGLGALF*
Ga0137405_128968213300015053Vadose Zone SoilSAGIELRWTIPGVQVPFRAYYALNVLRLDRSIRLSDKSIFFARNRFSAFGWGLGSLF*
Ga0137405_134663813300015053Vadose Zone SoilVQVPFRAYYALNVLRLDRSIRLSDKSIFFARNRFSAFGWGLGSLF*
Ga0137420_120482823300015054Vadose Zone SoilSLNVLRLNRYIALSDKSRLLAHNRFSAFGWGLGTLF*
Ga0182032_1170512523300016357SoilILHGSTGVQLQWTIPGVQVPFRSYYALNVLRLDRLIPLSHKSFLHPHNRFGAFGWGLGSL
Ga0134083_1020801413300017659Grasslands SoilLRWTVPEVQVPVRAYFAFNVLRLDHPIALSDKSRFLARNRFSAFGWGLGALF
Ga0187807_134464623300017926Freshwater SedimentYAINVLRLDRTIHLSDKSIFFARNPFSTFGWGLGSLF
Ga0187817_1048606913300017955Freshwater SedimentVPGIQVPFRVYYAINVLRLDRTIHLSDKSIFFARNPFSAFGWGLGSLF
Ga0187773_1017388423300018064Tropical PeatlandPGIEVPLRSYYALNVLRLNRRISLSDKSLFLARNRFSAFGWGLGSLF
Ga0066662_1103438613300018468Grasslands SoilRWTVPGVQVPVRAYYAANVLGMNRFPPLSDISLFHARNRFSAFGWGLSSLF
Ga0066669_1054274713300018482Grasslands SoilVQVPVRAYYAVNILRMNRFFFLSDNSLFHARNRFSAFGWGLGTLF
Ga0179594_1009276713300020170Vadose Zone SoilELRWTVPGVNVPFRAYYALNLLRLDRSIRLSDKSIFFARNRFSAFGWGLGSLF
Ga0179592_1020569713300020199Vadose Zone SoilAGIELRWTVPGVQVPVRAYYAVNVLRLDRFISLSDKSRFFAHNRFSAFGWGLGSLF
Ga0179592_1031264723300020199Vadose Zone SoilGIELRWTVPGVQVPVRAYYAVNVLRMNRFISLSENSVFHARNRFSAFGWGLGTLF
Ga0179592_1040019223300020199Vadose Zone SoilHWTAPRVQVPVRAYYAVNILRMNRFFFLSDNSLFHARNRFSAFGWGLGALF
Ga0210399_1034874723300020581SoilVPIRAYYAINVLRLNRSIRLSDKSVLFARNRFSAFGWGLGSLF
Ga0179596_1047685013300021086Vadose Zone SoilPGVQVPVRAYYAVNVLRLDRFITLSDKSRFFAHNRFSAFGWGLGSLF
Ga0210406_1015425213300021168SoilVPVRAYYAVNVLRMNRFISLSENSVFHARNRFSAFGWGLGTLF
Ga0210400_1049564623300021170SoilYYALNILRLNRFLSLSDKSLFHASNRFSAFGWGLGSLF
Ga0210400_1116316313300021170SoilHVPVRAYYALNVLRLNRFLSLSDKSLFHANNRFSAFGWGLGSLF
Ga0210405_1049137413300021171SoilVPGVNVPIRAYYAVNVLRLDRSIRLSNKSVFFARNRISAFGWGLGSLF
Ga0210397_1004462113300021403SoilYAVNVLRLDRSIRLSDKSVFFARNRISAFGWGLGSLF
Ga0126371_1290554323300021560Tropical Forest SoilFRSYYSLNVLRLNRVIPLSEKSFLHAHNRFGAFGWGLGSLF
Ga0209055_126406713300026309SoilTIPGVQVPFRSYYAVNELRLDRWIPLSEKSLLHAHNRLGAFGWGLGSLF
Ga0209155_112019123300026316SoilIELRCTVPGVQVPVRAYYALSVLRLNRVIPLSNTSSFVAHNRFSAFGWGLGSLF
Ga0209472_107023413300026323SoilPFRSYYAVNVLPLDRWIPLSEKSLLHAHNRLGAFGWGLGSLF
Ga0209808_115984113300026523SoilHGSTGIQLQWTIPGIQVPFRSFYSLNVLRLDRLVPLSEKSLLHAHNRFGAFGWGLGSLF
Ga0209808_123208823300026523SoilTGIEFRWTVPGVQVPVRTYYALNVLRLNRVIPLSSTSSFAAHNRFSAFGWGLGSLF
Ga0209157_106085013300026537SoilYYSLNVLRLDRLVPLSEKSLLHAHNRFGAFGWGLGSLF
Ga0209157_136257423300026537SoilQVPIRAHYAFNVLRLDRSLLLPDNSLFRAHNRFAAFGWALGSLF
Ga0209056_1019066013300026538SoilQFQWTIPGIQVPFRSYYALNVLRLDRVIPLSGRSLLHPHNRLGAFGWGLGSLF
Ga0209156_1006287013300026547SoilQLQWTIPGVQVPFRSYYALNVLRLDRVIPLSSRSLLHPHNRLGAFGWGLGSLF
Ga0209730_104104413300027034Forest SoilGVQVPLRSYYAFNVLRLDRAIPLSGKSFLRAGNRFAAFGWGLGPLF
Ga0209729_104866513300027061Forest SoilPVRAYYALNVLRLDRYLSLSDKSRLFAHGRFATFGWGLGSLF
Ga0209076_109911413300027643Vadose Zone SoilYYAFNVLRLDHPIALSDKSRFLARNRFSAFGWGLGALF
Ga0209388_116980913300027655Vadose Zone SoilPFRAYYALNLLRLDRSIRLSDKSTFFARNRFSAFGWGLGSLF
Ga0208981_115467513300027669Forest SoilVRAYYAVNVLRLDRAIRLSDKSVFFAHNRFSAFGWGLGTLF
Ga0209701_1058856723300027862Vadose Zone SoilPGVQVPVRAYYAVNVLRMNRFFSLSDNSLFHAHNRFSVFGWGLGALF
Ga0209283_1090475813300027875Vadose Zone SoilYYAVNVLRMNRFISLSDNSVFHAHNRLSAFGWGLGALF
Ga0209590_1087688413300027882Vadose Zone SoilPVRAYYAVNVLRLNRFFSLSDNSIFHAHNRFSAFGWGLGTLF
Ga0209526_1048341123300028047Forest SoilVNVPFRAYYALNLLRLDRSIRLSDKSIFFARNRFSAFGWGLGSLF
Ga0137415_1005768813300028536Vadose Zone SoilWTVPGIQVPVRGYYALNVLRLNRFLPLPNGSVFHAHNRFSAFGWALGALF
Ga0170822_1071213523300031122Forest SoilVNVLRLDRFISLSGKSRFLAHDRFSAFGWGLGCLF
Ga0170822_1146830713300031122Forest SoilSLNVLRLNRFVSLSDKSRLLARNRFSVFGWGLGSLF
Ga0170824_11522928913300031231Forest SoilIRFSLNVLRLNRFVSLSDKSRLLARNRFSVFGWGLGSLF
Ga0318561_1021459213300031679SoilSYYARNVLRLDRVIPLSDRSLLRLHNRLGAFGWGLGSLF
Ga0307475_1006858343300031754Hardwood Forest SoilAYYAVNVLRMNRFISLSENSVFHAHNRFFAFGWGLGSLF
Ga0307475_1072586823300031754Hardwood Forest SoilPVRAYYAINVPRLDRFISLSGKSRFFAHDRFSAFGWGLGSLF
Ga0307475_1134731113300031754Hardwood Forest SoilPVRAYYALNVLRLNRFFSLSENSLFHARNRFSSFGWGLGSLF
Ga0307478_1156642413300031823Hardwood Forest SoilPLRSYYAFNVLRLDRAIPLSGKSFLRAGNRFAAFGWGLGPLF
Ga0307479_1074475623300031962Hardwood Forest SoilRAYYSLNVLRLNRYIVLSDKSRLLAHNRFSAFGWGLGMLF
Ga0318559_1027476113300032039SoilRSYYARNVLRLDRVIPLSDRSLLRLHNRLGAFGWGLGSLF


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.