NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F073411


Overview

Basic Information
Family ID: F073411
Family Type: Metagenome
Number of Sequences: 120
Average Sequence Length: 165 residues
Representative Sequence: AFESIQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGGLFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL
Number of Associated Samples: 94
Number of Associated Scaffolds: 120

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 0.00 %
% of genes from short scaffolds (< 2000 bps): 0.00 %
Associated GOLD sequencing projects: 89
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.66

Most Common Taxonomy
Group: Unclassified (100.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (18.333 % of family members)
Environment Ontology (ENVO): Unclassified (34.167 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (74.167 % of family members)
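
The percentages above are fractions of the 120 family sequences, so they can be converted back into whole-number member counts. The following is a minimal sketch of that arithmetic. The function name `members_from_percent` and the dictionary labels are hypothetical, chosen for illustration; only the percentages and the family size of 120 come from this page.

```python
def members_from_percent(percent: float, family_size: int = 120) -> int:
    """Convert a reported percentage of family members into a whole-number count."""
    # The page rounds percentages to three decimals, so round the product
    # to the nearest integer rather than truncating.
    return round(percent / 100 * family_size)


# Percentages as reported in the "Most Common Ecosystem" block.
reported = {
    "GOLD (Grasslands → Soil)": 18.333,
    "ENVO (Unclassified)": 34.167,
    "EMPO (Soil, non-saline)": 74.167,
}

for label, pct in reported.items():
    print(f"{label}: {members_from_percent(pct)} of 120 sequences")
```

Under these assumptions the three figures correspond to 22, 41, and 89 sequences respectively.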



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 2.70%, β-sheet: 35.68%, Coil/Unstructured: 61.62%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.66

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
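
The low-confidence rule in the note above can be expressed as a single comparison. This is a minimal sketch, not the database's own code: the names `PTM_THRESHOLD` and `is_low_confidence` are hypothetical, and only the 0.7 cutoff and the 0.66 score are taken from this page.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the note above (pTM < 0.7 means low confidence)


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a predicted model falls below the pTM cutoff."""
    return ptm_score < threshold


# pTM-score reported for family F073411.
print(is_low_confidence(0.66))
```

The comparison is strict, matching the "pTM < 0.7" wording, so a score of exactly 0.7 would not be flagged.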


Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name        % Frequency in 120 Family Scaffolds
PF07687   M20_dimer   3.33
PF03928   HbpS-like   0.83
PF07676   PD40        0.83
PF01288   HPPK        0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name  Functional Category  % Frequency in 120 Family Scaffolds
COG0801  7,8-dihydro-6-hydroxymethylpterin pyrophosphokinase (folate biosynthesis)  Coenzyme transport and metabolism [H]  0.83


Phylogeny

NCBI Taxonomy

Name          Rank  Taxonomy  Distribution
Unclassified  root  N/A       100.00 %


Environmental Properties

Associated Habitat Types

Habitat               Taxonomy  Distribution
Tropical Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil  18.33%
Soil                  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil  18.33%
Soil                  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  15.83%
Grasslands Soil       Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil  10.00%
Agricultural Soil     Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil  7.50%
Tropical Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil  6.67%
Vadose Zone Soil      Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  5.83%
Grasslands Soil       Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  5.83%
Soil                  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  4.17%
Populus Rhizosphere   Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere  2.50%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  1.67%
Forest Soil           Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil  1.67%
Forest Soil           Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil  0.83%
Soil                  Environmental → Terrestrial → Soil → Loam → Grasslands → Soil  0.83%


Associated Samples

Taxon OID   Sample Name  Habitat Type
3300000793  Forest soil microbial communities from Amazon forest - 2010 replicate II A001  Environmental
3300002910  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm  Environmental
3300004633  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio  Environmental
3300005166  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123  Environmental
3300005167  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121  Environmental
3300005171  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126  Environmental
3300005172  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132  Environmental
3300005177  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139  Environmental
3300005179  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133  Environmental
3300005180  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134  Environmental
3300005184  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120  Environmental
3300005332  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)  Environmental
3300005540  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146  Environmental
3300005552  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150  Environmental
3300005566  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142  Environmental
3300005764  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)  Environmental
3300006031  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100  Environmental
3300006034  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105  Environmental
3300006046  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101  Environmental
3300006755  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter  Environmental
3300006794  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107  Environmental
3300006800  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109  Environmental
3300006804  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200  Environmental
3300006903  Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5  Host-Associated
3300006954  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control  Environmental
3300007076  Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4  Host-Associated
3300009012  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159  Environmental
3300009137  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158  Environmental
3300009792  Tropical forest soil microbial communities from Panama - MetaG Plot_12  Environmental
3300010048  Tropical forest soil microbial communities from Panama - MetaG Plot_11  Environmental
3300010304  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015  Environmental
3300010320  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015  Environmental
3300010321  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015  Environmental
3300010326  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG  Environmental
3300010329  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015  Environmental
3300010335  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015  Environmental
3300010337  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015  Environmental
3300010358  Tropical forest soil microbial communities from Panama - MetaG Plot_3  Environmental
3300010359  Tropical forest soil microbial communities from Panama - MetaG Plot_15  Environmental
3300010360  Tropical forest soil microbial communities from Panama - MetaG Plot_6  Environmental
3300010361  Tropical forest soil microbial communities from Panama - MetaG Plot_23  Environmental
3300010362  Tropical forest soil microbial communities from Panama - MetaG Plot_22  Environmental
3300010364  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015  Environmental
3300010376  Tropical forest soil microbial communities from Panama - MetaG Plot_28  Environmental
3300010398  Tropical forest soil microbial communities from Panama - MetaG Plot_35  Environmental
3300012199  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG  Environmental
3300012207  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG  Environmental
3300012209  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG  Environmental
3300012211  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG  Environmental
3300012349  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG  Environmental
3300012359  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG  Environmental
3300012971  Tropical forest soil microbial communities from Panama - MetaG Plot_1  Environmental
3300014157  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG  Environmental
3300014166  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015  Environmental
3300016357  Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000  Environmental
3300016422  Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111  Environmental
3300018482  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118  Environmental
3300021560  Tropical forest soil microbial communities from Panama - MetaG Plot_4  Environmental
3300026304  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)  Environmental
3300026310  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)  Environmental
3300026324  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)  Environmental
3300026334  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)  Environmental
3300026342  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)  Environmental
3300026523  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)  Environmental
3300026532  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)  Environmental
3300027061  Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM1H0_M3 (SPAdes)  Environmental
3300027576  Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M3 (SPAdes)  Environmental
3300027725  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)  Environmental
3300027765  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)  Environmental
3300027787  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)  Environmental
3300027874  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)  Environmental
3300031564  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f21  Environmental
3300031573  Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111  Environmental
3300031680  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22  Environmental
3300031744  Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)  Environmental
3300031764  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f27  Environmental
3300031765  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22  Environmental
3300031780  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f21  Environmental
3300031793  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f21  Environmental
3300031795  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f19  Environmental
3300031797  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f23  Environmental
3300031799  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f21  Environmental
3300031819  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f21  Environmental
3300031820  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515  Environmental
3300031821  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f20  Environmental
3300031845  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f18  Environmental
3300031860  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f25  Environmental
3300031879  Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)  Environmental
3300031880  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f25  Environmental
3300031962  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515  Environmental
3300032008  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f18  Environmental
3300032025  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f20  Environmental
3300032041  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f22  Environmental
3300032054  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f23  Environmental


Family Sequences

Protein ID  Sample Taxon ID  Habitat  Sequence
AF_2010_repII_A001DRAFT_101379831  3300000793  Forest Soil  KKLLFTREAGGVRTINEYIYDLNSVKELLRGEVRNPLWSPDDSRVAYLHHQGGKWQLWTFPSNDPAKAAVLSADAFESVQGWSDPHTLLTVMANQADLAWIGEDGKVAQTLPIRDLCGPDFVPGGKLAVRLHPTNPDLVLVSALFVHPPTGIPAVENGQAGALFLYEF
JGI25615J43890_10539761  3300002910  Grasslands Soil  ESIQGWADPHTVLLTTLNQTDLAWIGEDGKPTQTLPIKDLCGPDFAPGANLTVRLHPTNPDLALVSATFVHPPTGVPTAEREGQAGALFFYEFRSKRRALLPIASLSASDAEWSRDGFQILFTGTDSAKRRTTYRIFWDGLGLQKYLSATSLVVGL*
Ga0066395_104540291  3300004633  Tropical Forest Soil  TNPDLVLVSALLVRATAGVPAVENGLAGGLFFYEFRSRRRTLLNIPNLSATDGEWSRDGFQVLFTGTDSSRRKSTYRIFWDAIGLQKYAAGTSLVVGL*
Ga0066674_100692551  3300005166  Soil  DPHTVLVTTLNQADLAWIGEDGKPTQTLPIKDLCGPDFVPGTKLTVRLHPTNPDLALVSALFVHPPAGVPTAEKEGQAGGLFFYEFRSKRRTLLPIANLSASEAEWSRDGFQILFTGTDSSKRKATYRVFWDGIGLQKYLSATSLVVGL*
Ga0066672_100753741  3300005167  Soil  WSPDDSRIAYLNQQGGKWQIWAFPSSDSTKAAVLSSDSFESIQDWADPHTLLGTTPNQVALAWIAEDGRATQTLPIKDLCEPDFACGGMLTARLHPTNPDLVLVSAAFLRPPTGTPAVENGQAGALFFYEFHSKRRSVLTLPNLSATDGEWSRDGFQILFTGTDSAKRRTTYRVFWDGIGLQKYAPGTSLVVGL*
Ga0066672_106236862  3300005167  Soil  QGGKWQLWAFPANDPAKAAVVSADSFENIQGWSDPHTFLALTASPAALAWIGEDGRATQTLSISDLCGTDFVAGNKLVVRLHPTNPDLVLVSASFARPPTGVPAVENGQAGGLFFYEFRARRRAVLTIPNLCAADGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL*
Ga0066677_100195734  3300005171  Soil  SISDLCGTDFVAGNKLVVRLHPTNPDLVLVSASFARPQAGVPAVENGQAGGLFFYEFRSRRRAVLTIPNLSATDGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL
Ga0066683_101808842  3300005172  Soil  DVRSPVWSPDDSRVAYLNHQDGKWQLWTFPSNDPTKAVVLSPDAFERTQGWVDPHTVLVTTLNQADLAWIGEDGKPTQTLPIKDLCGPDFVPGTKLTARLHPTNPDLALVSALFVHPPAGVPTAEKEGQAGGLFFYEFRSKRRTLLPIANLSASEAEWSRDGFQILFTGTDSSKRKATYRVFWDGIGLQKYLSATSLVVGL*
Ga0066690_105330722  3300005177  Soil  TLLAVTVSQTDLASIGEDGKVTQALPIKDLCGPDFLPGAKLTIRVHPTNPDLVLVSALFVRAPAGVPTVESGQAGGVFFYEFRSKRRAPLLIANLSATDAEWSRDGFQILFTGTDSSKRRGTYRVFWDGIGLQKYAAGTSHVVGL*
Ga0066684_102704452  3300005179  Soil  CGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGGLFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0066684_107572381  3300005179  Soil  TLPIKDLCEPDFACGGMLTARLHPTNPDLVLVSAAFLRPPAGTPAVENGQAGALFFYEFHSKRRSVLTIPNLSATDGEWSRDGFQILFTGTDSAKRRTTYRVFWDGIGLQKYAPGTSLVVGL*
Ga0066685_100163941  3300005180  Soil  RTINEYNYDSNSVKELLRGEVRNPVWSPEDSRAAYLNHQGGKWQLWTFPSNDPTKAAALSADAFESIQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0066671_105604792  3300005184  Soil  WQLWAFPANDPAKAAVVSADSFENIQGWSDPHTLLALTASPAALAWIGEDGRATQTLSISDLCGTDFVAGNKLVVRLHPTNPDLVLVSASFARPQAGVPAVENGQAGGLFFYEFRSRRRAVLTIPNLSATDGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL*
Ga0066388_1033293411  3300005332  Tropical Forest Soil  EGPVRTISEYDCDSNSVKELVRGDVRNPVWSPDDFGVAYLNHQGGKWQLWAFPANDPAKAAVASPDDFESIQGWSDPHTLLALKANPAALAWIGQDGKTTQTLSISDLCGADFVPGTRLTVRLHPTNPDLVLVSAWFAHPSTGVPAVENGQAGALFLYEFRSRRRSVLAIPNLSATDGEWSRDGFQIFFTGADSSKRKATYKVFWDGIGLQKYVAGTSLVVGL*
Ga0066388_1037900961  3300005332  Tropical Forest Soil  SRIAYLNHQNGQWQLWTFPPNDATKASVLSADAYEAIQGWADAHAVLVTTLNQANLVWIGEDGKATQTVSVKDLCGPDFVCSARLTVRPHPTNRDLVLVSALFAHAPAGLPPAERDGQTGGLFLYEFRARRRAVLSFPNLSASDGEWSRDGFQILFTGTDSAKRRTTYRVFWDGIGLQKYVSGTSLVAGL*
Ga0066388_1054299322  3300005332  Tropical Forest Soil  FAAGGRLTVRLHPTNPDLVLVSALFVRPAPGVPAVENGLAGGLFFYEFRSRRRAVLAIPNLSATDGEWSRDGLQVLFTGTDSSRRKSTYRIFWDAIGLQKYAPGTSLVVGL*
Ga0066388_1068173901  3300005332  Tropical Forest Soil  NVKELLRGDIRKPVWSPDDFGIAYLNHQGGKWQLWAFPANDPAKAAVASPEAFESIQGWIDSHTLLALTANPPALAWIGQDGKVTQTLPISDLCGTDFAGGAKLTVRLHPTNPDLVLVSALFVRPAAGVPAMENGLAGGLFFYEFRSRRRTVLSIPNLSATDGEWSRDGFQIFFTGTDSARRKSTYRIFWD
Ga0066697_102408582  3300005540  Soil  AVVSADSFENIQGWSDPHTLLALTASPAALAWIGEDGRATQTLSISDLCGTDFVAGTKLVVRLHPTNPDLVLVSASFARPQAGVPAVENGQAGGLFFYEFRSRRRAVLTIPNLSATDGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL*
Ga0066701_105976701  3300005552  Soil  WSPEDSRAAYLNHQGGKWQLWTFPSNDPTKAAALSADAFESIQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0066693_104535031  3300005566  Soil  ANQADLVWISEDGKVTQTLPVRDLCGPDFICGPSFTLRLHPSNLDLVLVSAAFAHPPAGAPAVENGQAGALFLYEFHSKRRSPLLIPNLSASDAEWSRDGFQILFTGTDPARHKATYRVFWDGLGLQKYVTGTSLVVGL*
Ga0066903_1010464801  3300005764  Tropical Forest Soil  STDDSRVAYLHHQSGKWQLWAFPANDPAKAAVVSADAFESIQGWSDAHTVLALMASPAALAWIGEDGRPTQTLAISDLGGADFVPGTRITVRLHPTNPDLVLVSAMFAHPPAGVPAVENGQAGALFFYEFRSKRRSVLNIPNLSATDGEWSRDGFQVFFTGTDSSRRKTTYKVFWDAIGLQKYVAGTSLVVGL*
Ga0066651_105531311  3300006031  Soil  VDPHTVLVTTLNQADLAWIGEDGKPTQTLPIKDLCGPDFVPATKLTARQHPTNPDLALVSALFVHPPAGVPTAEKEGQAGGLFFYEFRSKRRTLLPIANLSASEAEWSRDGFQILFTGTDSSKRKATYRVFWDGIGLQKYLSATSLVVGL*
Ga0066656_104616171  3300006034  Soil  AVVLSPDAFERIQGWVDPHTVLVTTLNQADLAWIGEDGKPTQTLPIKDLCGPDFVPGTKLTVRLHPTNPDLALVSALFVHPPAGVPTAEKEGQAGGLFFYEFRSKRRTLLPIANLSASEAEWSRDGFQILFTGTDASKRKATYRVFWDGIGLQKYLSATSLVVGL*
Ga0066652_1001575453  3300006046  Soil  VLSPDAFERIQGWVDPHTVLVTTLNQADLAWIGEDGKPTQTLPIKDLCGPDFVPGTKLTARLHPTNPDLALVSALFVHPPAGVPTAEKEGQAGGLFFYEFRSKRRTLLPIANLSASEAEWSRDGFQILFTGTDSSKRKATYRVFWDGIGLQKYLSATSLVVGL*
Ga0079222_101364151  3300006755  Agricultural Soil  QLWAFPANDPTKAALVNSDSFESIQGWSDPRTLLVLTASPSALEWIGEDGRATQTLPVSDLCGTDCVLGTKLAVRPHPTNPDLMLVSAIFAHPPAGVPAVENGQTGGLFFYEFRSRRRAVLAIPNLSATEGEWSRDGFQILFTGTDSSRRKATYRVFWDAIGLQKYAAGTSLVVGL*
Ga0079222_112780351  3300006755  Agricultural Soil  AALVNADSFESIEGWSDPRTLLVFTANPAALAWIGEDGKAMQTLPVSDLCGTDFVPGAKLVVRPHPTNPDLVLVSTLFVHPPAGVPAVENGQAGALFFYEFRSRRRTVLSVPNLSATDGEWSRDGFQVLFTGTDSSRRKTTYKVFWDAIGLQKYVAGTSLAVGL*
Ga0066658_101626843  3300006794  Soil  FLPGAKLTIRVHPTNPDLVLVSALFVRAPAGVPTVESGQAGGVFFYEFRSKRRAPLLIANLSATDAEWSRDGFQILFTGTDSSKRRGTYRVFWDGIGLQKYAAGTSLVVGL*
Ga0066660_101773753  3300006800  Soil  AVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0079221_104771461  3300006804  Agricultural Soil  KAAVISTDGFESIQGWSDPHTVLALTSNPAALAWIGEDGKPTQTLPLSDLCGTDFVPGPRIAVRLHPTNPDLVLVSASFARPPAGIPPVENGQAGGLFFYEFRSRRRAFLAIPNLSATEGEWSRDGFQILFTGTDSSRRKATYRVFWDAIGLQKYVAGTSLVVGL*
Ga0079221_110645781  3300006804  Agricultural Soil  PSNDSTKAAILSADAFEAIQGWADAHTVLVTTLNQANLAWIGEDGKLTQSVPVKDLCEPDFVCGARLVIRLHPTNRDLVLVSALFVRAPTGVPAAEKEGQAGGLFYYEFRAKRRAVLPISNLSASDGEWSRDGFQILFTGTDSAKRRTTYRVFWDGIGLQKYITGTSLVVGL*
Ga0075426_102197963  3300006903  Populus Rhizosphere  SNPAALAWIGEDGKPTQTLPLSDLCGTDFVPGPRITVRLHPTNPDLVLVSASFARPPAGVPPVENGQAGGLFFFEFRSRRRAVLTIPNLSATEGEWSRDGFQILFTGTDSSRRKATYRVFWDAIGLQKYVAGTSLVVGL*
Ga0079219_103247382  3300006954  Agricultural Soil  RVEGAVRAINEYDYDSNSVKELLRGDVRNPVWFPDDSRIVYLNHQGVKWQLWAFPANDPSRAALVNADSFESIEGWSDPRTLLVFTANPAALAWIGEDGKAMQTLPVSDLCGTDFVPGAKLVVRPHPTNPDLVLVSTLFVHPPAGVPAVENGQAGALFFYEFRSRRRTVLSVPNLSATDGEWSRDGFQVLFTGTDSSRRKTTYKVFWDAIGLQKYVAGTSLAVGL*
Ga0079219_111632862  3300006954  Agricultural Soil  KAALVNSDSFESIQGWSDPRTLLVLTASPSALEWIGEDGRATQTLPVSDLCGTDCVPGTKLAVRPHPTNPDLVLVSAIFAHPPAAVPAVENGQTGGLFFYEFRSRRRAVLAIPNLSATEGEWSRDGFQILFTGTDSSRRKATYRVFWDAIGLQKYAAGTSLVVGL*
Ga0075435_1003813971  3300007076  Populus Rhizosphere  LHPTNPDLVLVSASFARPPAGVPPVENGQAGGLFFFEFRSRRRAVLTIPNLSATEGEWSRDGFQILFTGTDSSRRKATYRVFWDAIGLQKYVAGTSLVVGL*
Ga0075435_1015988201  3300007076  Populus Rhizosphere  SNSVKELLRGDVRNPVWSPDDSRVAYLNHQGGKWQLWAFPTNDPAKAAVVSADSFESIQGWSDPHTLLVLTASPAALAWIGEDGRATQTLSISDLCGTDFVAGTKLMVRLHPTNPDLVLVSASFARPPAGVPAVENGQAGGLLFYEFRSRRRAVLTIPNLSATDGEWSRDGFQVFFTGTDSSRRKATYRV
Ga0066710_1009457292  3300009012  Grasslands Soil  LNNLKELLRGDVQSPVWSPDDSRVAYLNHQDGKWQLWTFPSNDPTKAVVLSPDAFERIQGWVDPHTVLVTTLNQADLAWIGEDGKPTQTLAIKDLCGPDFVPGTKLTARLHPTNPDLALVSALFVHPPAGVPTAEKEGQAGGLFFYEFRSKRRTLLPIANLSASEAEWSRDGFQILFTGTDASKRKATYRVFWDGIGLQKYLSATSLVVGL
Ga0066709_1004536313  3300009137  Grasslands Soil  DSFENIQGWSDPHTLLALTASPAALAWIGEDGRATQTLSISDLCGTDFVAGNKLVVRLPPTNPDLVLVSASFARPQAGVPAVENGQAGGLFFYEFRSRRRAVLTIPNLSATDGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL*
Ga0066709_1012377412  3300009137  Grasslands Soil  DLHTLLAVTDSQTDLASIGEDGKVTQALPLKDLCGPDFLPGAKLTIRVHPTNPDLVLVSALFVRAPAGVPTVESGQAGGVFFYEFRSKRRAPLLIANLSATDAEWSRDGFQILFTGTDSSKRRGTYRVFWDGIGLQKYAAGTSLVVGL*
Ga0126374_106074602  3300009792  Tropical Forest Soil  FESIQGWSDPHTLVTVTANQGDLAWIGEDGKLAQALSIRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFAHSASGIPAVESGQAGALFLYEFRSKRRTLVPAPNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYIAGTSLVVGL*
Ga0126373_107460571  3300010048  Tropical Forest Soil  LSADAFESIQGWSDPHTVLAVTAGQADLAWIAEDGKAAQTVPVRELCGPDFVPGGKLAVRQHPRNPDLVLVSVLFVHPPAGIPPVENGQAGGLFLYELRSKRRTLIPAPNLSASEAEWSRDGLQIFFCGTDSARRRATYRIFWDGIGLQRYVVGTSLVVGL*
Ga0126373_132811671  3300010048  Tropical Forest Soil  GAKLTARLHPTNPDLVLVSALFVRPAAGVPAVENGLAGGLLFYEFRSRRRAVLNIPNLSATDGEWSRDGFQIFFTGTDSSRRKSTYRVFWDAIGLQKYAAGTSLVVGL*
Ga0134088_105755791  3300010304  Grasslands Soil  GKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGGLFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0134109_103614102  3300010320  Grasslands Soil  ISDLCGTDFVAGTKLVVRLHPTNPDLVLVSASFARPPAGVPAVENGQAGGLFFYEFRSRRRAVLTIPNLSATDGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL*
Ga0134067_100219791  3300010321  Grasslands Soil  HDGKKLLFTREAGAVRTINEYNYDSNSVKELLRGEVRNPVWSPEDSRAAYLNHQGGKWQLWTFPSNDPTKAAALSADAFESIQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0134067_104953191  3300010321  Grasslands Soil  QDWADPHTLLGTTPNQVALAWIGEDGRATQTLPIKDLCEPDFACGGMLTARLHPTNPDLVLVSAAFLRPPAGTPAVENGQAGALFFYEFHSKRRSVLTLPNLSATDGEWSRDGFQILFTGTDSAKRRTTYRVFWDGIGLQKYAAGSSLVVGL*
Ga0134065_102486601  3300010326  Grasslands Soil  EERAVRIIDEYDYDLNNLKELLRGDVRSPVWSPDDSRVAYLNHQDGKWQLWTFPSNDPTKAVVLSPDAFERTQGWVDPHTVLVTTLNQADLAWIGEDGKPTQTLPIKDLCGPDFVPGTKLTARLHPTNPDLALVSALFVHPPAGVPTAEKEGQAGGLFFYEFRSKRRTLLPIANLSASEAEWSRDGFQILFTGTDSSKRKATYRVFWDGIGLQKYLSATS
Ga0134111_103541861  3300010329  Grasslands Soil  EGAVRIIDEYDYDLNNLKELLRDDVRSPVWSPDDSRVAYLNHQDGKWQLWTFPSNDPTKAVVLSPDAFERTQGWVDPHTVLVTTLNQAYLAWIGEDGKPTQTLPIKDLCGPDFVPGTKLTVRLHPTNPDLALVSALFVHPPAGVPTAEKEGQAGGLFFYEFRSKRRTLLPIANLSAGEAEWSRDGFQILFTGTDSSKRKATYRVFWD
Ga0134063_102269612  3300010335  Grasslands Soil  SISDLCGTDFVAGNKLVVRLHPTNPDLVLVSASFARPQAGVPAVENGQAGGLFFYEFRSRRRAVLTIPNLSAADGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL
Ga0134062_103482132  3300010337  Grasslands Soil  PVWSPEDSRAAYLNHQGGKWQLWTFPSNDPTKAAALSADAFESIQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0126370_102472881  3300010358  Tropical Forest Soil  VWSPDDSQIAYLNHQHGQWQLWILPSNDPTKAAVLSADAFEAIQDWADAHTVLVTTLNQGNLAWIGEDGKTSQNVPVKDLCGPDFACGARLTVQLHPTNRDLLLVSALFARVPAGLPAAEREGQAGGLFLYEFRARRRAVLPIPNLSASDGEWSRDGFQVLFTGTDSAKRRSTYRVFWDGIGLQKYVAGTSLVVGM*
Ga0126376_111168061  3300010359  Tropical Forest Soil  NSVKELLHGEVRNPLWSPDDSRVAYLNHQGGKWQLWTFPSNDPAKAAVLSADAFESIQGWSDPHTLLTVMANHADLSWIGEDGKPAQALSIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPTGIPTVENGQAGALFLYEFRSKRRTLIPVPNLSASEAEWSRDGFQIFFTGTDSARRKATYRIFWDGIGLQRYIVGTSLLVGL*
Ga0126372_105615031  3300010360  Tropical Forest Soil  AAALAWIGEDGKTTQTLLISDLCGTDFVRGARFAVRLHPTNPDLVLVSATFARPPAGVPTVENGQAGALFFYEFRSRRRSVLTIPNLSGTDGEWSRDGFQVFFTGTDSLRRKTTYKVFWDAIGLQKYAVGTSLVVGL*
Ga0126372_109474921  3300010360  Tropical Forest Soil  SNSVKELVRGDVRNPVWSPDDFGVAYLNHQGGKWQLWAFPANDPTKAAVAGPDAFEGIQGWSDPHTLLALKANPAALAWIGQDGKTTQTLSISDLCGTDFVPGTRLTVRLHPTNPDLALVSAWFARPPAGVPAVENGQAGALFLYEFRSRRRSVLAVPNLSATDAEWSRDGFQILFTGTDSSRRKATYKVFWDAIGLQKYVAGTSLVVGL*
Ga0126372_121594732  3300010360  Tropical Forest Soil  DLCGPDFVPGAKLAVRLHPSNPDLVLVSALFARPPTGIPAVENGQAGALFLYEFRSKRRTLIPVPNLSASEAEWSRDGFQIFFTGTDSARRKATYRIFWDGIGLQRYIAGTSLLVGL*
Ga0126372_132550611  3300010360  Tropical Forest Soil  QLHPTNRDLLLVSALFARVPAGVPAAEREGQAGGLFLYEFRARRRAVLPIPNLSASDGEWSRDGFQVLFTGTDSAKRRSTYRVFWDGIGLQKYVAGTSLVVGM*
Ga0126378_102850753  3300010361  Tropical Forest Soil  GDVRNPVWSPDDFGVAYLNHQGGKWQLWAFPANDPAKAAVASPEAFESIQGWIDSHTLLALTANPPALAWIGQDGKVTQTLPISDLCGTDFAGGAKLTVRLHPTNPDLVLVSALFVRPAAGVPAMENGLAGGLFFYEFRSRRRTVLSIPNLSATDGEWSRDGFQIFFTGTDSARRKSTYRIFWDAIGLQKYAAGTSLVVGL*
Ga0126378_108200721  3300010361  Tropical Forest Soil  LVLVSASFVHPPSGTPTGVNSGQSGGLFLYESRAKRRVPVPLPNLSAGEAEWSRDGLQIFFTASDSAHRNTTYRIFWDGIGLQKYVVGVGLVVGL*
Ga0126378_110919041  3300010361  Tropical Forest Soil  HQGGKWQLWAFPANDPAKAAVVNADAFENIQGWSDSHTILALTSNPAALAWIGEDGKTTQTLLISDLCGTDFVRGARFAVRLHPTNPDLVLVSATFARPPAGVPTVENGQAGALFFYEFRSRRRSVLTIPNLSGTDGEWSRDGFQVFFTGTDSLRRKTTYKVFWDAIGLQKYAVGTSLVVGL*
Ga0126378_121905332  3300010361  Tropical Forest Soil  DFAGGAKLTVRLHPTNPDLVLVSALFVRPAAGVPAVENGLAGGLFFYEFRSKRRAALGIPNLSATEGEWSRDGFQIFFTGTDSARRKATYRVFWDAVGLQKYVPGTSLVVGL*
Ga0126378_132411171  3300010361  Tropical Forest Soil  PDDSRIAYLNHQSGKWQLWAFPINDPAKAAVISTDSFQGIQSWSDPHTLLVLTASPTSLAWVSEDGKAMQTLLISDLCGTDFAGGAKLMVRLHPTNPDLVLVSALSVRPAAGVPAVENGLAGGLFFYEFRSRRRAVLNVPNLSATDGEWSRDGFQIFFTGTDSARRKSTYRI
Ga0126377_129912671  3300010362  Tropical Forest Soil  FPLNDPTKAAVISADSFESIQGWSDPRTLLVLTGNTLAWIGEDGRTTQTLPISDLCGTDFAAGAKLTVRVHPTNPDLVLVSALFVRPAAAVPAVENGLAGGLFFYEFRAKRRAVLAIPNLSAMDAEWFRDGFQIFFIGTDFSKRKSTYRVFWDAIGLQKYAAGTSLVVGL*
Ga0134066_101518651  3300010364  Grasslands Soil  KVGPDFVPGTKLTARLHPTNPDLALVSALFVHPPAGVPTAEKEGQAGGLFFYEFRSKRRTLLPIANLSASEAEWSRDGFQILFTGTDSSKRKATYRVFWDGIGLQKYLSATSLVVGL*
Ga0126381_1022386111  3300010376  Tropical Forest Soil  ISDLCGADFAAGGRLTVRLHPTNPDLVLVSALFVRPAPGVPAVENGLAGGLFFYEFRSRRRAVLAIPNLSATDGEWSRDGLQVLFTGTDSSRRKSTYRIFWDAIGLQKYAPGTSLVVGL*
Ga0126383_100308631  3300010398  Tropical Forest Soil  LNSVKELLRGEVRNPLWSPDDSRVAYLNHQGSKWQLWTFPSNDPTKAAVLSADAFESIQGWSDPRTLLTVMANQAELAWIGEDGKVAQILPIRDLCGPDFVPGAKLVVRLHPTNPDLVLVSALFVHPPTGIPAVESGQAGALFLYEFRSKRRTLIPVPNLSASEAEWSRDGFQIFFTGTDSARRRATYRIFWDGIGLQRYIVGTSLFVGL*
Ga0126383_119571801  3300010398  Tropical Forest Soil  AKAAVLSADAFESIQGWSDPHTLVAVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGVKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQAGALFLYEFRSKRRTLVPAPNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYIAGTSLVVGL*
Ga0137383_113240311  3300012199  Vadose Zone Soil  QGGKWQLWAFPTNDPAKAAVVSADSFESIQGWSDPHTLLVLTASPAALAWIGEDGRATQTLSISDLCGTDFVAGTKLMVRLHPTNPDLVLVSASFARPPAGVPAVENGQAGGLLFYEFRSRRRAVLTIPNLSATDGEWSRDGFQVFFTGTDSSRRKATYRVFWDAIGLQK
Ga0137381_101213123  3300012207  Vadose Zone Soil  DSRVAYLNHQDGKWQLWSFPANDPTKAAALSKDAFESIQGWVDPHTMLLTTLSKTDLAWIGEDGKPTQTLPIKDLCGPDFAPGAKLTVRLHPTNPDLALVSATFVHPPTGVPTAEREGQAGALLFYEFRSKRRALLPIPGLSASDAEWSRDGFQILFTGTDSAKRRATYRIFWDGIGLQKYLSATSLVVGL*
Ga0137379_106474212  3300012209  Vadose Zone Soil  VWSPDDSRVAYLNHQDAKWQLWTFPSNDPTKTAFLSIDTFESIQGWADPHTVLVTTLNQADLAWIGEDGKSTQTLPVKDLCGPDFVPGAKLTVRLHPSNPDLALVSALFVHPPAGLPTVEREGQAGGLFFYEFRSKRRAPLPIPNLSASDAAWSRDGFQILFTGTDSSKRKATYRVFWDDIGLQKYLSATSLVVGL*
Ga0137379_117786242  3300012209  Vadose Zone Soil  AEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0137377_103270133  3300012211  Vadose Zone Soil  GKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFTHPPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0137387_101134413  3300012349  Vadose Zone Soil  TQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGALFLYELRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0137385_115466021  3300012359  Vadose Zone Soil  PTQTLPIKDLCGPDFAPGAKLTVRLHPTNPDLALVSATFIHPPTGVPTAERESQAGALLFYEFRSKRRALLPIPGLSASAAEWSRDGFQILFTGTDSAKRRATYRIFWDGIGLQKYLSATSLVVGL*
Ga0126369_107849272  3300012971  Tropical Forest Soil  LRNPVWSPDDSRVAYLNHQGGKWQLWAFPANDPAKAAVVNADAFENIQGWSDSHTILALTSNPAALAWIGEDGKTTQTLLISDLCGTDFVRGARFAVRLHPTNPDLVLVSATFARPPAGVPTVENGQAGALFFYEFRSRRRSVLTIPNLSGTDGEWSRDGFQVFFTGTDSLRRKTTYKVFWDAIGLQKYAVGTSLVVGL*
Ga0126369_108541892  3300012971  Tropical Forest Soil  LNSVKELLHGEVRNPLWSPDDSRVAYLNHQGGKWQLWTFVSNDPAKAAVLSADAFESIQGWSDPHTLLTVMANHADLSWIGEDGKPAQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFARPPTGIPAVENGQAGALFLYEFRSKRRTLIPVPNLSASEAEWSRDGFQIFFTGTDSARRKATYRIFWDGIGLQRYIVGTSLLVGL*
Ga0126369_109814511  3300012971  Tropical Forest Soil  TQTVSVKDLCGPDFVCGARLTVRPHPTNRDLVLVSALFAHAPAGLPPAERDGQTGGLFLYEFRARRRAVLSFPNLSASDGEWSRDGFQILFTGTDSAKRRTTYRVFWDGIGLQKYVSGTSLVAGL*
Ga0134078_106182531  3300014157  Grasslands Soil  AFESIQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGGLFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0134079_100721091  3300014166  Grasslands Soil  LLRGEVRNPVWSPEDSRAAYLNHQGGKWQLWTFPSNDPTKAAALSADAFESIQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGGLFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL*
Ga0134079_1051098613300014166Grasslands SoilHTLLALTASPAALAWIGEDGRATQTLSISDLCGTDFVAGTKLVVRLHPTNPDLVLVSASFARPPAGVPAVENGQAGGLFFYEFRSRRRAVLTIPNLSATDGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL*
Ga0182032_1041210113300016357SoilNPVWSPDDSRVAYLHHQGGKWQLWAFPANDPAKAAVVNADAFESIQGWSDSHTILAITGDPAALAWVGEDGRATQTLPISNLCGTDFVPGAIFAVRLHPTNPDLVLVSAMFARPPTGVPAVENGQAGALFFYEFRPRRRSVLNIPNLSATDGEWSRDGFQVFFTGTDSSRRKATYKVFWDAIGLQKYVAGTSLVVGL
Ga0182032_1050989423300016357SoilAYLNHQGGKWQLWTFPSNDPAKAAVLSADAFESIQGWSDPHTLLTVMANHADLSWIGEDGKPAQALPIRDLCGPDFVPGGKLAVRLHPSNPDLVLVSALFARPPTGIPTVENGQAGALFLYEFRSKRRTLIPVPNLSASEAEWSRDGFQIFFTGTDSARRKATYRIFWDGIGLQRYIVGTSLLVGL
Ga0182039_1051272213300016422SoilLWTFPSNDPAKAAVLSADAFESIQGWSDPHTLLTVMANHADLSWIGEDGKPAQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFARPPTGIPTVENGQAGALFLYEFRSKRRTLIPVPNLSASEAEWSRDGFQIFFTGTDSARRKATYRIFWDGIGLQRYIVGTSLLVGL
Ga0066669_1093642013300018482Grasslands SoilHDGKKLLFTREAGAVRTINEYNYDSNSVKELLRGEVRNPVWSPEDSRAAYLNHQGGKWQLWTFPSNDPTKAAALSADAFESIQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL
Ga0126371_1033335623300021560Tropical Forest SoilYEYNYDSNSVKELLRGDVRNPVWSPDDFGVAYLNHQGGKWQLWAFPANDPTKAAVASPEAFESIQGWNDSHTLLAVTANPPALAWIGQDGKVTQTLPISDLCGTDFAGGAKLTVRLHPTNPDLVLVSALLVRATAGVPAVENGLAGGLFFYEFRSRRRTLLNIPNLSATDGEWSRDGFQVLFTGTDSSRRKSTYRIFWDAIGLQKYAAGTSLVVGL
Ga0209240_107560613300026304Grasslands SoilESIQGWADPHTVLLTTLNQTDLAWIGEDGKPTQTLPIKDLCGPDFAPGANLTVRLHPTNPDLALVSATFVHPPTGVPTAEREGQAGALFFYEFRSKRRALLPIASLSASDAEWSRDGFQILFTGTDSAKRRTTYRIFWDGLGLQKYLSATSLVVGL
Ga0209239_134559313300026310Grasslands SoilQGWADPHTLLVTTPNQVALAWIGEDGRATQTLPIKDLCEPDFACGGMLTARLHPTNPDLVLVSAAFLRPPAGTPAVENGQAGALFFYEFHSKRRSVLTIPNLSATDGEWSRDGFQILFTGTDSAKRRTTYRVFWDGIGLQKYAAGTSLAVGL
Ga0209470_127505713300026324SoilANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGGLFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL
Ga0209377_100572983300026334SoilCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHSPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL
Ga0209057_106768213300026342SoilTQTLSISDLCGTDFVAGTKLVVRLHPTNPDLVLVSASFARPQAGVPAVENGQAGGLFFYEFRSRRRAVLTIPNLSATDGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL
Ga0209808_109891323300026523SoilNPDLVLVSASFARPQAGVPAVENGQAGGLFFYEFRSRRRAVLTIPNLSATDGEWSRDGFEVFFTGTDSSRRKASYRVFWDAIGLQKYVAGTSLVVGL
Ga0209160_107936413300026532SoilREAGAVRTINEYNYDSNSVKELLRGEVRNPVWSPEDSRAAYLNHQGGKWQLWTFPSNDPTKAAALSADAFESIQGWSDPHTLLTVTANQADLAWIAEDGKPTQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFAHPPAGIPTVENGQAGALFLYEFRSKRRTLIPLPGLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGLGLQRYIAGTSLVVGL
Ga0209729_103418213300027061Forest SoilNDPAKAAVISTDGFESIQGWSDPHTVLALTSNPAALAWLGEDGKPTQTLPLSDLCGTDFVPGPRIAVRLHPTNPDLVLVSASFARPPAGILPVENGQAGGLFFYEFRSRRRAVLAIPNLSATEGEWSRDGFQILFTGTDSSRRKATYRVFWDAIGLQKYVAGTSLVVGL
Ga0209003_106244513300027576Forest SoilQVYSGGEVELWYTPFEQGVSTRGEKLFTREEGAVRVINEYDYDSNSVKELLRGDIRNPVWSLDDSRVAYLNHQGGKWQLWAFPANDPARAAVISADGFESIQSWSDPHTVLALTSNPAALAWIGEDGRPTQTLPLSELCGTDFVPGAKLVVRLHPTNPDLVLVSASFARPPSGVPPVENGQAGGLFFYEFRSRRRAALAIPGLSATEGEWSRDGFQVLFTGTDSS
Ga0209178_126984613300027725Agricultural SoilALTSNPAALAWIGEDGKPTQTLPLSDLCGTDFVPGPRIAVRLHPTNPDLVLVSASFARPPAGIPPVENGQAGGLFFYEFRSRRRAFLAIPNLSATEGEWSRDGFQILFTGTDSSRRKATYRVFWDAIGLQKYVAGTSLVVGL
Ga0209073_1037247723300027765Agricultural SoilSDLCGTDFVPGPRITVRLHPTNPDLVLVSASFARPPAGVPPVENGQAGGLFFFEFRSRRRAVLTIPNLSATEGEWSRDGFQILFTGTDSSRRKATYRVFWDAIGLQKYVAGTSLVVGL
Ga0209074_1030981613300027787Agricultural SoilIRNPVWFPDDSRVVYLNHQDGKWQLWAFPANDPTKAALVNSDSFESIQGWSDPRTLLVLTASPSALEWIGEDGRATQTLPVSDLCGTDCVLGTKLAVRPHPTNPDLMLVSAIFAHPPAGVPAVENGQTGGLFFYEFRSRRRAVLAIPNLSATEGEWSRDGFQILFTGTDSSRRKATYRVFWDAIGLQKYAAGTSLVVGL
Ga0209465_1038554923300027874Tropical Forest SoilGGAKLTVRLHPTNPDLVLVSALLVRATAGVPAVENGLAGGLFFYEFRSRRRTLLNIPNLSATDGEWSRDGFQVLFTGTDSSRRKSTYRIFWDAIGLQKYAAGTSLVVGL
Ga0209465_1056657013300027874Tropical Forest SoilGWSDPRTLLVLTGNTLAWIGEDGRTTQTLPISDLCGTDFAAGAKLTVRVHPTNPDLVLVSALFVRPAAAVPAVENGLAGGLFFYEFRAKRRAVLAIPNLSAMDGEWSRDGFQIFFTGTDPSRRKSTYRVFWDAIGLQKYAAGTSLVVGL
Ga0318573_1004354633300031564SoilRNPVWSPEDSRVAYLNHQGGKWQLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0310915_1114373613300031573SoilNHADLSWIGEDGKPAQALPIRDLCGPDFVPGAKLAVRLHPSNPDLVLVSALFARPPTGIPTVENGQASALFLYEFRSKRRTLIPVPNLSASEAEWSRDGFQIFFTGTDSARRKATYRIFWDGIGLQRYIVGTSLLVGL
Ga0318574_1051432913300031680SoilTYDSNSVKELLRGEVRNPVWSPEDSRVAYLNHQGGKWQLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0306918_1069419713300031744SoilEVRNPVWSPEDSRVAYLNHQGGKWQLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318535_1052104013300031764SoilGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318554_1009124413300031765SoilDLRLSHDGKKLLFTREAGAVRTINEYTYDSNNVKELLRGEVRNPVWSPEDSRVAYLNHQGGKWQLWTFPSNDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318508_108926913300031780SoilKWQLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318548_1010879813300031793SoilRNPVWSPEDSRVAYLNHQGGKWQLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYIAGTSLVVGL
Ga0318557_1007944913300031795SoilLLRGEVRNPVWSPEDSRVAYLNHQGGKWQLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318550_1043480113300031797SoilPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318565_1063860413300031799SoilSWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318568_1086670013300031819SoilEYTYDSNSVKELLRGEVRNPVWSPEDSRVAYLNHQGGKWQLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTD
Ga0307473_1093921613300031820Hardwood Forest SoilVAYLNHQGGKWQLWAFPTNDPAKAAVVSADSFESIQGWSDPHTLLVLTASPAALGWIGEDGRATQTLSISDLCGTDFVAGTKLMVRLHPTNPDLVLVSASFARQAVGVPAVENGQAGGLFSYEFRSRRRAVLTIPNLSATDGEWSRDGFQVFFTGTDSSRRKATYRVFWDAIGLQKYVAGTSLVVGL
Ga0318567_1024722423300031821SoilLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318511_1010275423300031845SoilSIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318495_1009106223300031860SoilRDIPGGTDSDLRLSHDGKKLLFTREAGAVRTINEYTYDSNSVKELLRGEVRNPVWSPEDSRVAYLNHQGGKWQLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0306919_1001322813300031879SoilSNSVKELLRGEVRNPVWSPEDSRVAYLNHQGGKWQLWTFPSNDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318544_1029381613300031880SoilTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0307479_1077107613300031962Hardwood Forest SoilVAYLNHQGGKWQLWTFPASDPTKAAVFSADAFESIQGWSDPHSFLTVTANQADLAWIGEDGKPAQTLPIRDLCGPDFVPGTKLAVRLHPSNPDLVLVSALFVRPPTGIPTVENGQAGALFLYEFRSRRRTLIPVPNLSASEAEWSRDGFQILFTGTDSAKRRATYRIFWDGIGLQKYIAGTSLVVGL
Ga0318562_1067791913300032008SoilIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318507_1000204313300032025SoilLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318549_1028133313300032041SoilAYLNHQGGKWQLWTFPSSDPAKAAVLSADAFESIQGWSDPHTLVTVTANQADLAWIGEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL
Ga0318570_1013605613300032054SoilEDGKPAQALPVRDLCGPDFVPGAKLTVRLHPSNPDLVLVSALFARPAAGIPAVESGQPGALFLYEFRAKRRTLVPVSNLSASEAEWSRDGFQIFFTGTDSAKRRATYRIFWDGIGLQRYLAGTSLVVGL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice