NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F095795

Overview

Basic Information
Family ID F095795
Family Type Metagenome / Metatranscriptome
Number of Sequences 105
Average Sequence Length 75 residues
Representative Sequence MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALALWITFNESIGAFLIG
Number of Associated Samples 87
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.39

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (13.333 % of family members)
Environment Ontology (ENVO) Unclassified (25.714 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (55.238 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 60.95%, β-sheet: 0.00%, Coil/Unstructured: 39.05%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.39

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
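
The confidence rule stated above can be sketched in a few lines of Python. The 0.7 pTM cut-off and the four pLDDT bins come from this page; the function names are hypothetical, and assigning fractional scores (for example 50.5) to a bin by its upper bound is an assumption, since the page lists only integer ranges.

```python
# Sketch only: names are illustrative, not part of NMPFamsDB.
PTM_CUTOFF = 0.7  # cut-off quoted on this page for a low confidence model


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when the model falls below the pTM cut-off."""
    return ptm < cutoff


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT score to one of the four bins listed above.

    Assumption: fractional scores are assigned by the upper bound of each bin.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"
```

With the value reported for this family, `is_low_confidence(0.39)` evaluates to `True`, which matches the "Low Quality Model" label.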


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 105 Family Scaffolds
PF04173 | DoxD | 20.95
PF07681 | DoxX | 8.57
PF12833 | HTH_18 | 2.86
PF12706 | Lactamase_B_2 | 1.90
PF07676 | PD40 | 1.90
PF00210 | Ferritin | 1.90
PF08240 | ADH_N | 1.90
PF00571 | CBS | 0.95
PF13387 | DUF4105 | 0.95
PF03544 | TonB_C | 0.95
PF04191 | PEMT | 0.95
PF04982 | HPP | 0.95
PF02661 | Fic | 0.95
PF00005 | ABC_tran | 0.95
PF00128 | Alpha-amylase | 0.95
PF04140 | ICMT | 0.95
PF00248 | Aldo_ket_red | 0.95
PF14357 | DUF4404 | 0.95
PF00165 | HTH_AraC | 0.95
PF06250 | YhcG_C | 0.95
PF13193 | AMP-binding_C | 0.95
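
The frequency column can be read back as scaffold counts. The sketch below assumes that each percentage is the number of family scaffolds carrying the neighboring domain, divided by the 105 family scaffolds and rounded to two decimals; the page does not state this explicitly, and `scaffold_count` is a hypothetical helper name.

```python
def scaffold_count(percent: float, total_scaffolds: int = 105) -> int:
    """Convert a '% Frequency' value back to a whole number of scaffolds.

    Assumption: percent = count / total_scaffolds * 100, rounded to 2 decimals.
    """
    return round(percent / 100 * total_scaffolds)


# Values taken from the table above.
for pfam_id, percent in [("PF04173", 20.95), ("PF07681", 8.57), ("PF00571", 0.95)]:
    print(pfam_id, scaffold_count(percent))
```

Under that assumption, a frequency of 0.95 % corresponds to a single scaffold, so most entries in the table are one-off neighbors.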

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 105 Family Scaffolds
COG2259 | Uncharacterized membrane protein YphA, DoxX/SURF4 family | Function unknown [S] | 29.52
COG4270 | Uncharacterized membrane protein | Function unknown [S] | 8.57
COG0296 | 1,4-alpha-glucan branching enzyme | Carbohydrate transport and metabolism [G] | 0.95
COG0366 | Glycosidase/amylase (phosphorylase) | Carbohydrate transport and metabolism [G] | 0.95
COG0810 | Periplasmic protein TonB, links inner and outer membranes | Cell wall/membrane/envelope biogenesis [M] | 0.95
COG1523 | Pullulanase/glycogen debranching enzyme | Carbohydrate transport and metabolism [G] | 0.95
COG3280 | Maltooligosyltrehalose synthase | Carbohydrate transport and metabolism [G] | 0.95
COG3448 | CBS-domain-containing membrane protein | Signal transduction mechanisms [T] | 0.95
COG4804 | Predicted nuclease of restriction endonuclease-like (RecB) superfamily, DUF1016 family | General function prediction only [R] | 0.95


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %


Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 13.33%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 12.38%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 9.52%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 7.62%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.71%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.71%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 5.71%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 4.76%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.76%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.76%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.86%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 2.86%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.90%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.90%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.90%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.90%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 1.90%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.95%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.95%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.95%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil | 0.95%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.95%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.95%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.95%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.95%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.95%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.95%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.95%

Associated Samples

Taxon OID | Sample Name | Habitat Type
2162886007 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 | Host-Associated
2170459002 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 direct MP BIO 1O1 lysis 0-21 cm | Environmental
2170459004 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect MP BIO 1O1 lysis 0-21cm (2) | Environmental
2170459006 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect in plug lysis (for fosmid construction) 0-10cm | Environmental
2170459011 | Grass soil microbial communities from Rothamsted Park, UK - July 2009 indirect Gram positive lysis 0-10cm | Environmental
2189573004 | Grass soil microbial communities from Rothamsted Park, UK - FG2 (Nitrogen) | Environmental
2228664022 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300002916 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005564 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG | Host-Associated
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012951 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MG | Environmental
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012988 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MG | Environmental
3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300014969 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaG | Host-Associated
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300015372 | Soil combined assembly | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300019868 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s1 | Environmental
3300019870 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m1 | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300026469 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-B | Environmental
3300026498 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-A | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026791 | Grasslands soil microbial communities from Kansas, USA that are Nitrogen fertilized - NN591 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300030916 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA12 EcM (Eukaryote Community Metatranscriptome) | Environmental
3300030945 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA10 EcM (Eukaryote Community Metatranscriptome) | Environmental
3300031128 | Oak Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031469 | Fir Spring Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031538 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031941 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080 | Environmental
3300031942 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
SwRhRL2b_0276.00005740 | 2162886007 | Switchgrass Rhizosphere | MREQKDLALLIFRGAGLLLVATFGVQKIGWYWSALLAGKSLTSSGLAQLIAKMGFPIPVALALWITFNESIGAFLVAC
E1_07145150 | 2170459002 | Grass Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWKAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGCGFLTR
E4B_03987120 | 2170459004 | Grass Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGCGF
E4B_03713560 | 2170459004 | Grass Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGXGF
L01_02280730 | 2170459006 | Grass Soil | PSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGCGFLT
F64_01308620 | 2170459011 | Grass Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALL
FG2_08791670 | 2189573004 | Grass Soil | MREQKDFGLLILRGAGFLLAGTFGIQKIGWYWSLFHASKSLSSAGLAPLIARMGFPIPFALALWITFNESIAVVSVGCGF
INPgaii200_10504191 | 2228664022 | Soil | VNGRRIKMSGFQRFPDRDLGLLILRGAGFLLAATFGLQKIGWYWTAFHTGKALSAIGLAPLIATMGFPIPVVLAFWITFNESIGALLIGCGFLTRILAAS
INPgaii200_11608543 | 2228664022 | Soil | MLDLGLLALRSAGFLLALTFGFQKIGWYISAFHSDKAFSSVGLAPLIAHVGFPAPDPRCVDYVQ
JGI25389J43894_10232702 | 3300002916 | Grasslands Soil | MLDIGLLLFRAAGFLLAFTFGVQKIGWYVTAFHAGKPLLSSIGLTPLIAHMGLPLPVILAFWITLNESIGA
Ga0062593_1031351842 | 3300004114 | Soil | MIDPGLLLLRAAGFLLAFTFGIQKIGWYVTAFHAGKPLSSIGLTPLIAHVGFPLPMILAL
Ga0062590_1011985851 | 3300004157 | Soil | MRKQTDLGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALAIWITFNESIGA
Ga0066683_101853553 | 3300005172 | Soil | MRKHKDLGLLLLRGSGLLLALTFGVQKIGWYCSALHVGKSFLSIGLAPLIAKMGFPIPVVLAVWITFNESIGAFL
Ga0066673_104472132 | 3300005175 | Soil | MSAFQRFPSRDLGFLILRGAGFLLAATFGVQKIGWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLAL*
Ga0066685_100201211 | 3300005180 | Soil | MLDIGLLLFRAAGFLLAFTFGVQKIGWYVTAFHAGKPLLSSIGLTPLIAHMGLPLPVILALWITLNESIG
Ga0066676_110199681 | 3300005186 | Soil | MIYSGLLLLRAAGFLLAFTFGIQKIGWYVTGLHAGKPFSSIGLTPLIAHVGFPLPVILALWITLNES
Ga0066675_109747501 | 3300005187 | Soil | MREQKDLGLLILRGAGPLLALTFGVQKIGWYWSALHAGKPFSSIGLAPLIAKMGFPIPVALAIWITFNESIGAFLIGCGFL
Ga0065705_100457421 | 3300005294 | Switchgrass Rhizosphere | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIARMGFPIPFALALWITFNESIGAFLIGCGVLTRLLAASAALGMA
Ga0065705_104479341 | 3300005294 | Switchgrass Rhizosphere | MREQKDFGLLVLRGAGLLLAVTFGLQKIGWYWSVFHAGKSLSSAGLTPLIARMGFPIPFALALWITFNESIGAFL
Ga0066388_1009467223 | 3300005332 | Tropical Forest Soil | MRKQNDLGLLVLREAGLLLALTFGLQKIGWYWSALHARKSFSSIGLAPLIAKMGFPIPPALALWITFNESIGAF*
Ga0066388_1085276262 | 3300005332 | Tropical Forest Soil | MLDLGLLALRSAGFLLAFTFGIQKIGWYVMALHANKPFSSIGLAPLIAKFGFPIPVILA
Ga0070709_113687461 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MINIGLLLSRAAGFLLAFTFGIQKIGWYVAAFHAGKPLASVGLAPLIAKVGFPFPIILALWI
Ga0070714_1009981972 | 3300005435 | Agricultural Soil | MSAFQRFPSRDLGLLILRGAGFLLAATFGLQKIGWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGCGFLTRI
Ga0070711_1014783012 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MLDLGLLALRSAGFLLALTFGFQKIGWYLSAFHSDKAFSSVGLAPLIAHVGFPAPAILAVWITFNE
Ga0070708_1008902971 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | LKASPPFGYGSVSLHMRKQTDLGLLLLRGSGFLLALTFGAQKIGWYWSGLHAGKSFSSIGLAPLIAKIGFPIPVALAIWTTFNESIGAF
Ga0070697_1009421342 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MKEQKDLGLLILRGAGLLLAVTFGVQKLGWYWAAFHAGKSLFHAGLAPLIARMGFPIPIVLALWITFN
Ga0070664_1021469411 | 3300005564 | Corn Rhizosphere | MLDLGLLVLRTAGFFLAFTFGIQKIGWYIAGLHSDKAFSSTGLAPLIAKMGFPAAVILALWVTFNESVGAFLI
Ga0066693_101179691 | 3300005566 | Soil | MIDIGRLLFRAAGLLLALTFGVQKIGWYVTAFHAGKPLLSSIGLTPLIAHMGLPLPVILALWITLNESIGAFF
Ga0066703_107995182 | 3300005568 | Soil | MRKQADLGLLLLRGSGFLLALTFGVQKIGWYWSSLHAGKSFSSIGLAPLIAKIGFPIPVALAIWITFNESI
Ga0066708_101711911 | 3300005576 | Soil | MKEQRDLGLLILRGWYWTALHAGKSLSHAGLAPLIARMGFPIPVVLALWVTFNESIGAFLIGCGFLTRVM
Ga0066905_1001467605 | 3300005713 | Tropical Forest Soil | MIDLGLLLLRAAGFLLAFTFGIQKIGWYVTAFHSGKPLSSIGLAPLIGHVGFPLPVILA
Ga0066903_1001996764 | 3300005764 | Tropical Forest Soil | MPSLINIGLLLSRAAGFLLAFTFGIQKIGWYVTAFHAGKPVASIGLAPLIAKVGFPFPIILALWITLNESIGAFFVGIGL
Ga0066903_1083405991 | 3300005764 | Tropical Forest Soil | MIDRGLLTLRSAGFLLAFTFGIQKIGWYISAFHSEKPFASIGLTPLIAHMGFPVPVVLAL
Ga0070717_100390676 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSALHAGKSLSSAGLAPLIAKMGFPIPVALAVWITFNESIGAFLIGGGFLTRLLAASAALGMAGA
Ga0066696_102289251 | 3300006032 | Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIAKMGLPIPFALALWITFNEIGVFLVGCGFLTRLLAASAALGMAGAL
Ga0066652_1005557283 | 3300006046 | Soil | MRKHKDLGLLLLRGSGLLLALTFGVQKIGWYCSALHVGKSFLSIGLAPLIAKMGFPIPVVLAVWITFNESIG
Ga0066652_1010897311 | 3300006046 | Soil | MREQKDFGLLILRGVGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIARMGFPIPVALALWITFNESIGAFLIGCGFLTRLLA
Ga0070765_1006728943 | 3300006176 | Soil | MLDLGLLVLRTAGFLLAFTFGIQKIGWYMAAFHSDKAFSSIGLVPLIAKMGFPAAVILA
Ga0070765_1014006971 | 3300006176 | Soil | MSAFQRFPSRDLGFLILRGAGFLLAATFGVQKIGWYWTAFHAGKSLSAIGLASLIARMGFPIPVVLALWITFNESIGAFLIGCGFLTRILAASAALGMAG
Ga0066653_104011021 | 3300006791 | Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALALWITFNESIGAFLIG
Ga0075433_119232092 | 3300006852 | Populus Rhizosphere | MAYSLVSLPMIKQKDLGLLVLRGAGFLLALTFGVQKIGWYWSALHAGKSFQSIGLAPLIAKMGFPIPVVLSIWIMFNESIGAFFVGC
Ga0099791_100692034 | 3300007255 | Vadose Zone Soil | MLDLGLLALRSAGILLAFTFGIQKIGWYIAAFHSDKPLSSVGLAPLIAQIGFPAPAILALWVTFNESIGALLI*
Ga0066710_1005182443 | 3300009012 | Grasslands Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSAVHAGKSFSSIGLAPLIAKMSFPIPVALAIWITFDESIGAFLIGRGFLARL
Ga0126374_116759791 | 3300009792 | Tropical Forest Soil | MLNVGLLVLRATGFLLAFTFGIQKIGWYIAALHSDKAFSSIGLAPLISKMGFPASVILALWI
Ga0126373_123373632 | 3300010048 | Tropical Forest Soil | MKITERLMNRDLGLLILRCAGLLLAATFGVQKIGWYWTGLHAGKDLSHIGLATLIARMGFPVPVLLALSVTFNESIGAFL
Ga0134088_104060591 | 3300010304 | Grasslands Soil | MLDIGLLLFRAAGFLLAFTFGVQKIGWYVRALHAGKPWSSIGLAPLIAHVGFPLPVVLALWITLNESIGA
Ga0134088_106993421 | 3300010304 | Grasslands Soil | MKEQRDLGLLILRGAGLLLAVTFGVQKVGWYWTALHAGKSLSHAGLAPLIARMGFPIPVVLALWVTFNESIGAFLIGCGF
Ga0134065_104755122 | 3300010326 | Grasslands Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAIHAGKSLSSAGLAPLIAKMGLPIPFALALWITFNESIGAFLVGCGFLTRLLAASAALGM
Ga0134063_101102073 | 3300010335 | Grasslands Soil | MLDIGLLLFRAAGFLLAFTFGVQKIGWYVTAFHAGKPLLSSIGLTPLIAHMGLPLPVILAL*
Ga0126372_117490422 | 3300010360 | Tropical Forest Soil | MIDPGLLVLRAAGFLLAFTFGIQKIGWYVTAFHAGKPLSSIGLAPLIADVGFPIPVILALWITLNESIGAFLV
Ga0126379_101992615 | 3300010366 | Tropical Forest Soil | MFDLGLLALRGAGFLLAFTFGIQKIGWYVMAFHSNKPFSSIGLAPLIAKVGFPMAVILALWIT
Ga0126379_110731302 | 3300010366 | Tropical Forest Soil | MRKQKDLGLLLLRGAGLLLALSFGVQKIGWYWSALHAEKPFSSIGLAPLIARMGFPIPVALAIWITFNESIGAFLIGCGFLTRSLSASLALG
Ga0134125_112765001 | 3300010371 | Terrestrial Soil | MLDLGLLVLRAAGFLLAFTFGIQKIGWYIAGLHSDKAFSSIGLAPLIAKMGFPAAILLAL
Ga0137391_101518705 | 3300011270 | Vadose Zone Soil | MLDLGLLALRSAGILLAFTFGIQKIGWYIAAFHSDKPLSSVGLAPLIAQIGFPAPAILALWVTFNESIGAL*
Ga0137383_108389281 | 3300012199 | Vadose Zone Soil | MIDTGLLLLRAAGFLLAFTFGVQKIGWYVTAFHAGKPWSSIGLAPLIAHVGFPLPVILALWITLNESIGAFLIG
Ga0137363_106266881 | 3300012202 | Vadose Zone Soil | MREQKDLGLLILRGAGFLLAGTFGVQKIGWYWSAFHAGKSLSTVGLAPLIARMGFPIPVALALWITFNESIGAFLIGCGF
Ga0137386_102557431 | 3300012351 | Vadose Zone Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSAVHAGKSFLSIGLAPLIAKMGFPIPVVLAVWITFNESIGAFLIGCGVLTR
Ga0137360_110094501 | 3300012361 | Vadose Zone Soil | MREQKDFGLLILRGVGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIARMGFPIPVALALWITFNESIGAFLIGCGFLTRLLAA
Ga0137360_110221941 | 3300012361 | Vadose Zone Soil | MIDLGLLFLRAAGFLLAFTFGIQKIGWYVTAFHTGKPLSSIGLAPLIAHVGFPLPFILALWITLNESIGG
Ga0137416_100496643 | 3300012927 | Vadose Zone Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIAKMGFPIPVALAIWITFNELIGAFLIGCGFRHDY*
Ga0137407_100039941 | 3300012930 | Vadose Zone Soil | MIYSGLLLLRAAGFLLAFTFGIQKIGWYVTGLHAGKPFSSIGLTPLIAHVGFPLPVIL
Ga0137407_101905303 | 3300012930 | Vadose Zone Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSAVHAGKSFSSIGLAPLIAKMSFPIPVALAIWITFNESIGAFL
Ga0164300_108742082 | 3300012951 | Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLPSDGLATLIAKIGFPILVALAIWITLDNSIGA
Ga0164302_115971132 | 3300012961 | Soil | MREQKDSGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALALWITFNESIGAFLIGCGRLTRY*
Ga0126369_105180854 | 3300012971 | Tropical Forest Soil | VYDLGLLAVRSAGFLLAFTFGIQKIGWYISAFHSKKPFASIALTPLITHMGFPVPVILPLWVTFDTSVGAFLIGCGVFTRVF
Ga0126369_131062181 | 3300012971 | Tropical Forest Soil | MRKQKDLGLLLLRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPGVLAIWITFNESIGAFL
Ga0164306_109211701 | 3300012988 | Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLTPLIAKMGFPIPFALALWITFNESIGAFLIGCG
Ga0164305_115594131 | 3300012989 | Soil | MREQKDSGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALALWITFNESIGAFLIGCGLLT
Ga0157378_118148481 | 3300013297 | Miscanthus Rhizosphere | MTKQKDLGLLVLRGAGFLLALTFGVQKIGWYWSALHARRPFSSIGLAPLIAKMGFPIPVALAIWVTFNESI
Ga0134079_103072412 | 3300014166 | Grasslands Soil | MREQKDFGLLILRGAGLLLAGTFGIQKIGWYWSAFHAGKSLSTAGLAPLIARMGFPIPFALALWITFNESIGAFLIGCGFLT
Ga0157376_104197631 | 3300014969 | Miscanthus Rhizosphere | MREQKDFGLLILRGAGFLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIAKMGFPIPFALALWITFNE
Ga0157376_128483101 | 3300014969 | Miscanthus Rhizosphere | MINIGLLLSRAAGFLLAFTFGIQKIGWYVAAFHAGKPFASVGLAPLIAKVGFPFPIF
Ga0137403_111248382 | 3300015264 | Vadose Zone Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSAVHAGKSFSSIGLAPLIAKMSFPIPVALAIWITF
Ga0132256_1017494712 | 3300015372 | Arabidopsis Rhizosphere | MKEQKDFGLLILRGAGLLLAGTFGIQKIGWYWSAFHAGKSLSSAGLAPLIARMGFPIPFALAFWITFNESIGAFL
Ga0132255_1046581462 | 3300015374 | Arabidopsis Rhizosphere | MREQKDFGLLILRGAGFLLAGTFGVQKIGWYWSAFHAGKSLSSGGLAPLIARMGFPIPFALALWITF
Ga0134083_100275364 | 3300017659 | Grasslands Soil | MFDWGLLALRSAGFLLAFTFGLQKIGWYISAFQSDKSFSSIGLAPLIAQVGFPAPVILA
Ga0184604_101419502 | 3300018000 | Groundwater Sediment | MREQKDFGLLVLRGAGLLLAGTFGVQKIGWYWSAIHAGKSLSSAGLAPLIARMGFPIPFALALWITFNESIGAFLIGCGVLTRRWLRGASNEMN
Ga0066655_108480891 | 3300018431 | Grasslands Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAVHAGKSLSSAGLAPLIARMGFPIPFALALWITFNESIGAILIGCGFLTRLLA
Ga0193720_10433552 | 3300019868 | Soil | MRAGFLLAFTFGLQKIGWYISAFQSDKPFSSIGLAPLIAQVGFPAPVILALWITFNESIGALFLGCGLF
Ga0193746_10114411 | 3300019870 | Soil | MREQKDLGLLILRGAGLLLTLTFGVQKIGWYWPALHAGKSFSSIGLAPLIAKMGLPIPVALALWITFNESVGAFLIG
Ga0193707_11875291 | 3300019881 | Soil | MREQKDFGLLVLRGAGLLLAGTFGVQKIGWYWSAIHAGKSLSSAGLAPLIARMGFPIPFALALWITFNESIGAFLIGCGVLTRRWLHGASNEMN
Ga0179596_100601461 | 3300021086 | Vadose Zone Soil | MLDLGLLALRSAGILLAFTFGIQKIGWYIAAFHSDKPLSSVGLAPLIAQIGFPAPAILALWV
Ga0126371_108274142 | 3300021560 | Tropical Forest Soil | VFDFSLLVLRSAGFLLAFTFGVQKIGWYLIAFHSNKPFSSIGLAPLIAKMGFPAAGYSCALDNIQ
Ga0126371_111936772 | 3300021560 | Tropical Forest Soil | MRQQKDLGLLVLRTAGFLLTFTFGIQKIGWYVTALRSGKHLSFIGLAPLISQIGFPVSVSVVLAIWVTFNESIGAFLIGCGLF
Ga0126371_135365952 | 3300021560 | Tropical Forest Soil | VFDIGLLALRSAGFLLAFTFGIQKIGWYIAAFHSDKPLSSIGLAPLIAHVGFPVPVILALWITFNESIAS
Ga0222622_114810061 | 3300022756 | Groundwater Sediment | MIDSGLLLLRAAGFLLAFTFGVQKIGWYVTAFHAGKPWSSIGLAPLIAHVGFPLPVILALWITLNESIGAFLIGI
Ga0257169_10847911 | 3300026469 | Soil | MSAFQRFPSRDLGLLILRGAGFLLAATFGLQKIGWYWTAFHPGKSLSAIGLAPLIGRMGFPIPVVLALWITFNESTGALLIACGFLTRILAA
Ga0257156_10007831 | 3300026498 | Soil | MREQRDFGLLILRIAGFLLVFTFGIQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALAMWITFNESIGAFL
Ga0209157_12493513 | 3300026537 | Soil | MFDWGLLALRSAGFLLAFTFGLQKIGWYISTFQSDKSFSSIGLAPLIAQVGFPAPVILALWITFNESIGALF
Ga0208072_1081551 | 3300026791 | Soil | MRKQRDLGLLILRGAGLLLALTFGVQKIGWYWSGLHAGKSFSSIGLAPLIAKMGFPIPVALAIWITFNESIGAFLIGCGFLTRSL
Ga0209488_108278582 | 3300027903 | Vadose Zone Soil | MRGQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIAKMGFPIPVALAIWITFNESIGAFLIACGLTRLLAASL
Ga0307312_104253411 | 3300028828 | Soil | MFDWGLLALRSAGFLLAFTFGLQKIGWYVSAFQSDKPFSSIGLAPLIAQVGFPAAVILALWITFNESIG
Ga0075386_122061021 | 3300030916 | Soil | MSAFQRFPTRDLGFLILRGAGFLLAVTFGVQKIGWYWTAFHAGKSFSAVGLAPLIARMGFPIPVVLALW
Ga0075373_113400642 | 3300030945 | Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIG
Ga0170823_136878712 | 3300031128 | Forest Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALAIWITFNESIGAFLIGCGFLT
Ga0170824_1020581585 | 3300031231 | Forest Soil | LLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKIGFPIPVALAIWITFNESIGAFLIGCGFL
Ga0170824_1045280111 | 3300031231 | Forest Soil | MSALQRFLSRDLGLLILRGAGFLLAVTFGVQKIGWYWTAFHAGKSFSAVALAPLIARMGFPIPVVLALWITFNESIGALL
Ga0170824_1181160203 | 3300031231 | Forest Soil | MKEQKDFGLLILRGTGLLLAGTFGIQKIGWYWSAVHAGKSLSSAGLAPLIAKMGFPIPFALALWITFNESIGAFLVGCGFLTR
Ga0170824_1232579881 | 3300031231 | Forest Soil | AGFLLALTFGFQKIGWYLSAFHSDKAFASVGLAPLIAHLGFPAPAILAVWITFNE
Ga0170819_144409651 | 3300031469 | Forest Soil | MSAFQRFPSRDLGFLILRGAGFSLAATFGLQKIGWYWTAFHTGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGAFLIGCGFLTRILAA
Ga0310888_104630281 | 3300031538 | Soil | MKEQKDSGLLILRGAGLLLALTFGVQKIGWYWSALHAGKPFSSIGLAPLIAKMGFPISVALALWITFN
Ga0307475_115563542 | 3300031754 | Hardwood Forest Soil | MKEQKDLGLLILRGAGLLLAVTFGVQKIGWYWTAFHAGKPLSHAGLAPLIARMGFPIPFILAWWVTFNESIGAFLIGCGFLTRTLAAS
Ga0310912_102710412 | 3300031941 | Soil | MIDFGLLLLRAARFVLAFTFGIQKMGWYVTAFHAGKPLRSVGLAPLIAHVGFPLPIILALWITL
Ga0310916_106051461 | 3300031942 | Soil | MIDFGLLLLRAARFVLAFTFGIQKMGWYVTAFHAGKPLRSVGLAPLIAHVGFPLPIILALWI
Ga0307479_114745391 | 3300031962 | Hardwood Forest Soil | LLAVTFGVQKLGWYWAAFHAGKSLLHAGLAPLIARMGFPISAALALWITFNESIGAFLIGCGFLTRIMAGSAALGMAGALYT
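
For downstream tools, the four fields of a row above can be written out as a FASTA record. The sketch below is illustrative only: `to_fasta` is a hypothetical helper, the header layout (`taxon_oid=`, `habitat=`) is an assumption rather than an NMPFamsDB convention, and the trailing `*` seen on some sequences is assumed to mark a translated stop codon and is stripped.

```python
import textwrap


def to_fasta(protein_id: str, taxon_oid: str, habitat: str,
             sequence: str, width: int = 60) -> str:
    """Format one family member as a FASTA record.

    Assumption: a trailing '*' marks a stop codon and is not a residue.
    """
    residues = sequence.rstrip("*")
    header = f">{protein_id} taxon_oid={taxon_oid} habitat={habitat}"
    # textwrap.wrap breaks long unspaced strings at `width` characters.
    return "\n".join([header, *textwrap.wrap(residues, width)])


# Example using one row from the table above.
print(to_fasta(
    "Ga0066673_104472132", "3300005175", "Soil",
    "MSAFQRFPSRDLGFLILRGAGFLLAATFGVQKIGWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLAL*",
))
```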



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"