NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F101635



Overview

Basic Information
Family ID F101635
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 242 residues
Representative Sequence RLTRPLSPSLLWIAFGFLMGAGGAAIAELAAKHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRTSPALHVVAGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV
Number of Associated Samples 94
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.72


Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (32.353 % of family members)
Environment Ontology (ENVO) Unclassified (44.118 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (56.863 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 71.70%, β-sheet: 0.00%, Coil/Unstructured: 28.30%

Predicted 3D Structure

pTM-score: 0.72

Structural matches with SCOPe domains

| SCOP family | SCOP domain | Representative PDB | TM-score |
|---|---|---|---|
| f.24.1.1: Cytochrome c oxidase subunit I-like | d3s8ga_ | 3s8g | 0.6839 |
| f.24.1.1: Cytochrome c oxidase subunit I-like | d7coha_ | 7coh | 0.64275 |
| f.72.1.1: Double antiporter-like subunits from respiratory complex I | d3rkoc_ | 3rko | 0.59871 |
| f.72.1.1: Double antiporter-like subunits from respiratory complex I | d3rkod_ | 3rko | 0.59229 |
| f.72.1.0: automated matches | d6khid_ | 6khi | 0.58313 |
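
TM-scores above roughly 0.5 are commonly read as suggesting that two structures share the same overall fold. The sketch below applies that rule of thumb to the matches listed above. The 0.5 cutoff and the names `matches` and `above_cutoff` are assumptions made for illustration; they are not NMPFamsDB settings or functions.

```python
# Structural matches copied from the SCOPe table above: (SCOPe domain, TM-score).
matches = [
    ("d3s8ga_", 0.6839),
    ("d7coha_", 0.64275),
    ("d3rkoc_", 0.59871),
    ("d3rkod_", 0.59229),
    ("d6khid_", 0.58313),
]

# Assumption: 0.5 is a widely used rule-of-thumb cutoff, not a database parameter.
SAME_FOLD_CUTOFF = 0.5


def above_cutoff(rows, cutoff=SAME_FOLD_CUTOFF):
    """Return the rows whose TM-score exceeds the cutoff, best match first."""
    kept = [row for row in rows if row[1] > cutoff]
    return sorted(kept, key=lambda row: row[1], reverse=True)


if __name__ == "__main__":
    for domain, score in above_cutoff(matches):
        print(f"{domain}\t{score:.3f}")
```

All five listed matches pass this cutoff, so under this assumption each of them is a candidate fold-level match rather than a confirmed homolog.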



Gene Neighborhood

Neighboring Pfam domains

| Pfam ID | Name | % Frequency in 102 Family Scaffolds |
|---|---|---|
| PF06798 | PrkA | 27.45 |
| PF08298 | AAA_PrkA | 15.69 |
| PF00196 | GerE | 4.90 |
| PF00383 | dCMP_cyt_deam_1 | 1.96 |
| PF00012 | HSP70 | 0.98 |
| PF03706 | LPG_synthase_TM | 0.98 |
| PF13365 | Trypsin_2 | 0.98 |
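
Because the frequencies are given as percentages of the 102 family scaffolds, they can be converted back to approximate scaffold counts. The following is a minimal sketch of that arithmetic; it assumes each percentage is a count divided by 102 and rounded to two decimals, and the names `pfam_frequency` and `to_count` are illustrative only.

```python
TOTAL_SCAFFOLDS = 102  # "Number of Associated Scaffolds" from the Overview

# Percent frequencies copied from the Pfam table above.
pfam_frequency = {
    "PF06798": 27.45,
    "PF08298": 15.69,
    "PF00196": 4.90,
    "PF00383": 1.96,
    "PF00012": 0.98,
    "PF03706": 0.98,
    "PF13365": 0.98,
}


def to_count(percent, total=TOTAL_SCAFFOLDS):
    """Convert a rounded percentage back to the nearest whole number of scaffolds."""
    return round(percent * total / 100)


pfam_counts = {pfam_id: to_count(pct) for pfam_id, pct in pfam_frequency.items()}

if __name__ == "__main__":
    for pfam_id, count in pfam_counts.items():
        print(f"{pfam_id}: {count} of {TOTAL_SCAFFOLDS} scaffolds")
```

Under this assumption, the smallest listed frequency (0.98 %) corresponds to a single scaffold.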

Neighboring Clusters of Orthologous Genes (COGs)

| COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds |
|---|---|---|---|
| COG2766 | Predicted Ser/Thr protein kinase | Signal transduction mechanisms [T] | 43.14 |
| COG0392 | Predicted membrane flippase AglD2/YbhN, UPF0104 family | Cell wall/membrane/envelope biogenesis [M] | 0.98 |
| COG0443 | Molecular chaperone DnaK (HSP70) | Posttranslational modification, protein turnover, chaperones [O] | 0.98 |



Phylogeny

NCBI Taxonomy

| Name | Rank | Taxonomy | Distribution |
|---|---|---|---|
| Unclassified | root | N/A | 100.00 % |


Environmental Properties

Associated Habitat Types

| Habitat | Taxonomy | Distribution |
|---|---|---|
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 32.35% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 25.49% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 5.88% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 4.90% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.94% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.94% |
| Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.94% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.94% |
| Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 2.94% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.94% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.96% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.96% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.96% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.96% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.98% |
| Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.98% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.98% |
| Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.98% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.98% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.98% |
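
The habitat distribution above can be summarised at a coarser level of the GOLD ecosystem path, for example by its first element (Environmental versus Host-Associated). The sketch below shows one way to do that; the names `HABITAT_ROWS` and `totals_by_level` are hypothetical, and the values are simply the rounded percentages from the table.

```python
from collections import defaultdict

# (GOLD ecosystem path, % of family members), copied from the habitat table above.
HABITAT_ROWS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 32.35),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 25.49),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 5.88),
    ("Environmental → Terrestrial → Soil → Loam → Grasslands → Soil", 4.90),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil", 2.94),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 2.94),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 2.94),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 2.94),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere", 2.94),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 2.94),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 1.96),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil", 1.96),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 1.96),
    ("Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere", 1.96),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 0.98),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil", 0.98),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil", 0.98),
    ("Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil", 0.98),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil", 0.98),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere", 0.98),
]


def totals_by_level(rows, level=0):
    """Sum the percentages of all rows that share the same path element at `level`."""
    totals = defaultdict(float)
    for path, percent in rows:
        parts = [part.strip() for part in path.split("→")]
        totals[parts[level]] += percent
    return dict(totals)


if __name__ == "__main__":
    for name, percent in sorted(totals_by_level(HABITAT_ROWS).items()):
        print(f"{name}: {percent:.2f} %")
```

Grouping at `level=0` gives about 95.08 % Environmental and 4.90 % Host-Associated. The listed percentages sum to 99.98 % rather than 100 % because each row is rounded to two decimals.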




Associated Samples

| Taxon OID | Sample Name | Habitat Type |
|---|---|---|
| 3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental |
| 3300001686 | Grasslands soil microbial communities from Hopland, California, USA | Environmental |
| 3300002568 | Grasslands soil microbial communities from Hopland, California, USA - 2 | Environmental |
| 3300003321 | Sugarcane bulk soil Sample H1 | Environmental |
| 3300004081 | Grasslands soil microbial communities from Hopland, California, USA - 2 (version 2) | Environmental |
| 3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental |
| 3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental |
| 3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental |
| 3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental |
| 3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental |
| 3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental |
| 3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental |
| 3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental |
| 3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental |
| 3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental |
| 3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental |
| 3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental |
| 3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental |
| 3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental |
| 3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental |
| 3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental |
| 3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental |
| 3300005587 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 | Environmental |
| 3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental |
| 3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental |
| 3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental |
| 3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental |
| 3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental |
| 3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental |
| 3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated |
| 3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated |
| 3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental |
| 3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated |
| 3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental |
| 3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental |
| 3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental |
| 3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental |
| 3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental |
| 3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental |
| 3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental |
| 3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental |
| 3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental |
| 3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental |
| 3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental |
| 3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental |
| 3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental |
| 3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental |
| 3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental |
| 3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental |
| 3300015053 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction) | Environmental |
| 3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental |
| 3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental |
| 3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental |
| 3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental |
| 3300020018 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s2 | Environmental |
| 3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental |
| 3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental |
| 3300025917 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes) | Environmental |
| 3300025918 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes) | Environmental |
| 3300026023 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes) | Host-Associated |
| 3300026295 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes) | Environmental |
| 3300026301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes) | Environmental |
| 3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental |
| 3300026315 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes) | Environmental |
| 3300026323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes) | Environmental |
| 3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental |
| 3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental |
| 3300026523 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes) | Environmental |
| 3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental |
| 3300026527 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes) | Environmental |
| 3300026530 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes) | Environmental |
| 3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental |
| 3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental |
| 3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental |
| 3300027643 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) | Environmental |
| 3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental |
| 3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental |
| 3300027678 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes) | Environmental |
| 3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental |
| 3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental |
| 3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental |
| 3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental |
| 3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental |
| 3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental |
| 3300031939 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R2 | Environmental |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental |
| 3300032954 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2 | Environmental |


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12053J15887_1044168513300001661Forest SoilVAFGFLMGAGGAAVAELAVGKGDAGFWAHEMGRDLVIQGLFTGVAVAAARVMRGDDRSHPAVHVGAGLIFMASFWVGRRFGVHLGFALRAAVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASF
C688J18823_1082699013300001686SoilSLIWIAAGFVMGAGGAVVAELAAAHGETWFWVHEMGRDLVIQGLFTGIAVGAGRVMRGDERSHPALHLAAAAIFVGSFWIGRRYGAHLGFALRALVTLWLAVPLRPAWEFGPHNLRRSFSHLALWMLAIGNAWVAFAPQIRRAGLHVIFLGCFTALLLAALFPRPGEQAAFPLRKLAWAGGLVALSLVGRVMVEL
C688J35102_12039659013300002568SoilCAAVGHWELGQAAWLLLLAVCVEFTWRRLQRPLSNALLWIPLAFAMGAGGAILAEVAAQLGDQWFWVHETGRDLVMQGLFTGLAVGAARALRGDDRVHPALHLVAGALFVASFWVGRRFGQHLGFALRAAITIWLAQPLRPRWEFGPRNRRRSFAHVALWMLAAGNAWVAVAPQIRRAGLHVIFLGCFTALLLATLFPPAGEKAAFPLRKLAWAGGLVALAMVGRVMVELDPTWFHLWMGVSAAAFLAATVACVRVPVTLSREGSRDFVSS*
soilH1_1027741723300003321Sugarcane Root And Bulk SoilGAALAGLAAPDRFWMHEMGRDLVMQGLFSGLAIAAARAMRGSDRTSLLLHLAAGIIFIASFWVGRRYGMHLGFALRAAVTVWLALPLRPEWEFGPRNLRRSFAHLALWFLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRRGERQAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGLSAASFLAATIACLRVPVRAPQSV*
Ga0063454_10123927113300004081SoilGETWFWVHEMGRDLVIQGLFTGLAVAAARVMRGNDRRHPALHLAAGAVFIASFFIGRRYGAHLGFAVRAAVTVWLALPLRPEWELGPRSLRRSFAHLAIWMLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGEQPAFPLRKLAWAGGLIALSLIGRVMVELDPTSFHLWMGVSAASFLAATVACVRVPVGTRTQSV*
Ga0066672_1060071913300005167SoilALAPLISATCAAIGQWQLGQVASLSLLVVMLQFTLRRLSRPLSPSLLWIAFGFLMGAGGAAVAEIAARHGGSWFWMQEMGRDLVIQGLFTGLAVAAGGVMRTGDRTHPALHVAAGAVFIASFWVGPRFGMHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMV
Ga0066677_1003777813300005171SoilLISATCAALGHWQLGQMASLALLAVMLEFALRRLTRPLSPSLLWIAFGFLMGAGGAAIAELAAKHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRRGDRTSPVLHVIAGAVFVASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAVQSV*
Ga0066680_1040618723300005174SoilWVHEMGRDLVIQGLFTGLAVGAGRVMRTGDRAHPALHLVAGAVFIASFWVGPRFGTHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASFLAATIACSRVPVRTPQSV*
Ga0066688_1035793813300005178SoilHGGSWFWMQEMGRDLVIQGLFTGLAVGAGRVMRTGDRTHPALHLVAGAVFIASFWVGPRFGTHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRRFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASFLAATIACLRVPVRTPQSV*
Ga0066388_10344354313300005332Tropical Forest SoilFTLRRLQRPFSSSLIWIALGFVMGAGGAAVAALASIRGEGWFWVHEMGRDLVIQGLFTGLAVAAARAMRGDDHSRPGLHLVAGAIFIASFWIGRRYGIHLGYAIRAAVTVWLALPLRPAWEFGPRNLRRSFAHLALWMLAVGNAWVAIAPQIRRAGLHVIFLGCFTALMLGALFPHDGERAAFSLRKLAWAGGLIALSLVGRVMVELDPTSFHLWMGLSAASFLAGTLTCVRVPVKVAQSV*
Ga0070680_10110686813300005336Corn RhizosphereTLRRLSRPWPPSLIWIAFGFLMGAGGAALAEIAAAHGRDWFWLHEMGRDLVIQGSFTGVAVAAARVMRRERKSHLALHVAAGAIFIASFWVGHRYGQHLGFAIRAAVTVWLALPFRPDWEFGPRNLRRSFAHLALWMLAMGNAWVAIAPQIRRAGLHVIFLGCFTALMIGALFPRAGEQPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATL
Ga0068868_10058732323300005338Miscanthus RhizospherePSLVWIGFGFLMGAGGAAIAELAATRGGGWFWAHEMGRDLVIQGLFTGLAIAAARVMRGGDRTNLALHLVAGAIFIASFWVDRRYGKMHLGFAVRAAVTVWLALPLRPAWEFGPRNLRRSFAHLALWFLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGERPAFPLRKLAWAGGLVALSMVGRVMVELDPLSFHLWMGLSAASFLAATIACLRVPVRTAQSV*
Ga0066689_1015583713300005447SoilLISATCAALGHWQLGQMASLALLAVMLEFTLRRLTRPLSPSLLWIAFGFLMGAGGAAIAELAAKHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRRGDRTSPVLHVIAGAVFVASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSATSFLAATLACLRVPVRAVQSV*
Ga0066681_1009023323300005451SoilGDAWFWVHEMGRDLVIQGLFTGVAVAAGRVMRGDDRSHPAVHLAAGLIFVASFWVGRRFGVHLGFALRAAVTIWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATIACVRVPVRTPQSV*
Ga0066687_1098679613300005454SoilHEIGRNLVIQGSFTGLAVAAARTMRRDDRAPSALHLFAGAIFIASFWIPKTHLGFAVRAAVTVWLALPLRPTWEFGPSNLRRSFAHLALWMLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLGALFPRTGEKPAFPLRKLAWAGGLVALSLVGRVMVELDPESFH
Ga0070681_1012333223300005458Corn RhizosphereVSATCAALGHWQVGQLASLLLLVVLLEFTLRRLSGPWPAGLIWIAFGFLMGAGGAALAEIAAAHGRDWFRLHEMGRDLVIQGSFTGVAVAAARVMRRERTSHLALHVAAGAVFIASFWVGHRYGQHLGFAIRAAVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAMGNAWVAIAPQIRRAGLHVIFLGCFTALMIGALFPRPGEQPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRIPVGGRAQSV*
Ga0070697_10016322423300005536Corn, Switchgrass And Miscanthus RhizosphereGYWQLGQAASLLLLAVLLQFTLPRLQRPLSPSLIWVALGFLMGAGGAVIAELAVTRGESWFWVHEMGRDLVIQGLFTGLAVAAGRVMRGDDRTHPAIHLVAGILFIASFWIGRRYGTHLGFALRAAVTVWLALPLRPTRWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLGALFPRAGERPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGLSAASFLAATIACLRVPVRAAQSV*
Ga0070697_10057201713300005536Corn, Switchgrass And Miscanthus RhizosphereLRHVYEPVNGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPALWQILVAAIAPLISATCAALGHWQLGQMASLALLAVMLEFTLRRLRRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRTSPALHVVAGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLGALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV*
Ga0066701_1000623523300005552SoilVLPWLFFAFRLRHVYEPVNGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPALWQILVAAIAPLISATCAALGHWQLGQVASLALLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSDWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHVVSGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV*
Ga0066692_1007558223300005555SoilLISATCAALGHWQLGQMASLALLAVMLDFTLRRLMRPLSPSLLWIAFGFVMGAGGAAIAELASTHGNEWFWVHEMGRDLVMQGLFTGLAVAAARVMRSGDRSSPALHIAAGVVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV*
Ga0066704_1004373023300005557SoilNEWFWVHEMGRDLVMQGLFTGLAVAAARVMRSGDRSSPALHIAAGVVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV*
Ga0066700_1083548113300005559SoilPSLIWIALGFVMGAGGAAFAELAVTRGDSWFWVHEMGRDLVIQGLFTGLAAAAGRVMRGDDRTHPALHVAAGLVFMASFWVGRRYGMHLGFALRAAVTVWLALPLRPTQWDFGPRNLRRSFAHLALWMLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLGALFPRAGEQPAFPLRKLAWAGGFVALSLVGRVMVELDPTSFHLW
Ga0066670_1022738023300005560SoilLLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSEWFWLHEMGRDLVMQGLFTGLAVAAGRVMRSGERRSPAFHIVAGAVFIASFWVGRRFGAHLGFAIRAALTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRTIQSV*
Ga0066670_1038285723300005560SoilRGDAWFWVHEMGRDLVIQGLFTGVAVAAGRVMRGDDRSHPAVHLAAGLIFVASFWVGRRFGVHLGFALRAAVTIWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATIACVRVPVRTPQSV*
Ga0066699_1014919523300005561SoilPWLFFAFRLRHVYEPVNGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPALWQILVAAIAPLISATCAALGHWQLGQMASLALLAVMLDFTLRRLMRPLSPSLLWIAFGFVMGAGGAAIAELASTHGNEWFWVHEMGRDLVMQGLFTGLAVAAARVMRSGDRSSPALHIAAGVVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV*
Ga0066699_1038045023300005561SoilCAALGQWQLGQVASLALLAVMLQFTLRRLQRPLSPSLLWVAFGFLMGAGGAAVAEVAATRGPSWFWVHEMGRDLVIQGLFTGLALAAGRVMRSGDRAHPAMHVVAGAVFMASFWVGRQFGTHLGFAIRAAVTIWLALPLRPQWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGETPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASFLAATVACLRVPVRTRQSV*
Ga0066705_1000618523300005569SoilVLPWLFFAFRLRHVYEPVSGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPALWQILVAAIAPLISATCAALGHWQLGQVASLTLLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSEWFWLHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHIVAGAVFIATFWVGRRFGAHLGFAIRAALTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV*
Ga0066654_1025468913300005587SoilAFGFVMGAGGAAIAELAATHGSEWFWLHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHIVAGAVFIATFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV*
Ga0066706_1007409813300005598SoilRGSSWFWVHEMGRDLVIQGLFTGLAVGAGRVMRTGDRTHPALHLVAGAVFIASFWVGPRFGTHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASFLAATIACLRVPVRTPQSV*
Ga0066651_1024228713300006031SoilRDLVIQGLFTGVAVAAGRVMRGDDRSHPAVHLAAGLIFVASFWVGRRFGVHLGFALRAAVTIWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATIACVRVPVRTPQSV*
Ga0066696_1004291623300006032SoilCGPRRRSGRSWWPPSPLISATCAALGHWQLGQVASLTLLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSEWFWLHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHIVAGAVFIATFWVGRRFGAHLGFAIRAALTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV*
Ga0070712_10073548223300006175Corn, Switchgrass And Miscanthus RhizosphereSLALLLVLLEFTLRRLQRPLTSSLLWIALGFVMGAGGAGLAAFASARGEGWFWLHEMGRDLVIQGLFTGLAVAAARAMRGDDGRHAALHLAAGAIFIASFFVGRPWMHLGFAIRAAVTIWLALPLRPDWEFGPRNLRRSFSHLALWMLAVGNAWVAIAPQIRRAGLHVIFLGCFTALLLAALFPRSGERPAFPLRKLAWAGGLIALSLVGRVMVELDPTSFHLWMGLSAASFLAATVTCVRVPVAARQSV*
Ga0079222_1043002323300006755Agricultural SoilTARPLSPSLIWVPLAFAMGAGGAALAEIATAHGERWFWLHEMGRDLVMQGLFTGLAVAAARTLRGDDRTSPALHLAAGALFIASFYVGSRFGAHLGFALRAAVTVWLAQPLRPPWEFGPRNRRRAFAHVALWMLAAGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEQPAFPLKKLALAGGLVALALVGRVMVELDPTWFHLWMGLSAAAFLAATAACVRVPVLGARDAAR*
Ga0066659_1053798213300006797SoilLGQVASLALLVVVLQFTLRRLSRPLSLSLLWIAFGFLMGAGGAAVAEVAATRGGSWFWMQEMGRDLVIQGLFTGLAVAAGRVMRTGDRTHPALHVAAGAVFIASFWVGPRFGMHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGGLVALSMIGRVMVELDPTWFHLWMGISAASFLAATIACLRVPVRTAQSV*
Ga0066660_1025930313300006800SoilEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRRGDRTSPVLHVIAGAVFVASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAVQSV*
Ga0075425_10016662613300006854Populus RhizosphereFVMGAGGAAVAEMAVSRGESWFWVHEMGRDLVIQGLFTGLAVAAARVMRGDDRTHPALHLAAGVLFVASFWIGRRYGAHLGFALRAAVTIWLALPLRPQWELGPRNLRRSFAHLALWMLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGERPAFPLRKLAWAGGLVAASLVGRVMVELDPTSFHLWMGLSAASFLAATVACVRVPVRAPQSV*
Ga0075426_1061739113300006903Populus RhizosphereAELAVTRGESWFWVHEMGRDLVIQGLFTGLAVAAGRVMRGDDRTHPGIHLVAGILFIASFWIGRRYGTHLGFALRAAVTVWLALPLRPTRWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLGALFPRAGERAAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGLSAASFLAATIACLRVPVRAAQSV*
Ga0099791_1030954423300007255Vadose Zone SoilVIQGLFTGLAVAAARAMRGDDGRHPLLHVAAGAIFIASFFVGRRYGMHLGFAIRAAVTIGLALPLRPHWEFGPRNLRRSFSHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGERPAFPLRKLAWAGGLIALSLVCRVMVELDPTSFHLWMGLSAASFLAATVTCVRVPVAARQSV*
Ga0099794_1018404023300007265Vadose Zone SoilWAHEMGRDLVIQGLFTGLAVAAAGVMRGDDRTHPALHVAAGLVFMASFWVGRRYGMHLGFALRAAVTVWLALPLRPTQWEFGPRNLRRSFAHLALWMLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEQPAFPLRKLAWAGGLVALSLIGRVMVELDPTSFHLWMGLSAASFLAATIACLRVPVRAAQSV*
Ga0099794_1025022323300007265Vadose Zone SoilQVAVAAVAPVFSAVCAAVGHWQLGQAASLALLAVMLEFTLRRMRRPVSPSLTWVALGFLMGAGGAAVAELAAGRGDAWFWVHEMGRDLVIQGLFTGVAVAAARVMRGDDRSHPALHLAAGLIFVASFWVGRRFGAHLGFALRAVVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATIACVRVPVRTPQSV*
Ga0066710_10074132523300009012Grasslands SoilCAALGHWQLGQMASLALLAVMLDFTLRRLMRPLSPSLLWIAFGFVMGAGGAAIAELASTHGNEWFWVHEMGRDLVMQGLFTGLAVAAARVMRSGDRSSPALHIAAGVVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV
Ga0075423_1135295213300009162Populus RhizosphereEMAVSRGESWFWVHEMGRDLVIQGLFTGLAVAAARVMRGDDRTHPALHLAAGVLFVASFWIGRRYGAHLGFALRAAVTIWLALPLRPQWELGPRNLRRSFAHLALWMLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGERPAFPLRKLAWAGGLVAASLVGRVMVELDPTSFHLWMGLSAASFLAATVACVRVPVRAPQSV*
Ga0134082_1010562923300010303Grasslands SoilILVAAIAPLISATCAALGHWQLGQVASLTLLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSEWFWLHEMGRDLVMQGLFTGLAVAAGGVMRSGDRSSPALHIVAGAVFIATFWVGRRFGAHLGFAIRAALTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGISAASFLAATLACLRVPVRAIQSV*
Ga0134128_1084652223300010373Terrestrial SoilGFLGCFAAGVVFTTLRPPPAAWQVAIAAIAPLISATCAALGHWQAGQLASLILLAVVLEFTLRRLSRPLPPSLVWIGFGFLMGAGGAAIAELAATRGGGWFWAHEMGRDLVIQGLFTGLAIAAARVMRGGDRTNLALHLVAGAIFIASFWVDRRYGKMHLGFAVRAAVTVWLALPLRPAWEFGPRNLRRSFAHLALWFLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGERPAFPLRKLAWAGGLVALSMVGRVMVELDPLSFHLWMGLSAASFLAATIACLRVPVRTAQSV*
Ga0134124_1027524613300010397Terrestrial SoilSPSLVWIGFGFLMGAGGAAIAELAATRGGGWFWAHEMGRDLVIQGLFTGLAIAAARVMRGGDRTNLALHLVAGAIFIASFWVDRRYGKMHLGFAVRAAVTVWLALPLRPAWEFGPRNLRRSFAHLALWFLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGERPAFPLRKLAWAGGLVALSMVGRVMVELDPLSFHLWMGLSAASFLAATIACLRVPVRTAQSV*
Ga0134127_1170174813300010399Terrestrial SoilHWQAGQLASLILLAVVLEFTLRRLSRPLPPSLVWIGFGFLMGAGGAAIAELAATQGGAWFWAHEMGRDLVIQGLFTGLAIAAARVMRGGDRTRLALHLVAGAIFIASFWVDRRYGKMHLGFAVRAAVTVWLALPLRPAWEFGPRNLRRSFAHLALWFLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGERPAFPLRKLAWAGGLVALSMVGRVMVELDPLSFHLWM
Ga0137399_1046184023300012203Vadose Zone SoilFTLGRMRRPLSPSLIWVALGFLMGAGGAAVAELAVGRGDAGFWAHEMGRDLVIQGLFTGVAVAAGRVMRADDRSHPTLHVAAGMIFMASFWVGRRFGVHLGFALRAAVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRTPQSV*
Ga0137362_1078001713300012205Vadose Zone SoilHEMGRDLVIQGLFTGLAVAAAGVMRGDDRTHPALHVAAGLVFMASFWVGRRYGMHLGFALRAAVTVWLALPLRPTQWEFGPRNLRRSFAHLALWMLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEQPAFPLRKLAWAGGLVALSLIGRVMVELDPTSFHLWMGLSAASFLAATIACLRVPVRAAQSV*
Ga0137376_1012892413300012208Vadose Zone SoilAAIGQWQLGQVASLALLVVMLQFTLRRLSRPLSPSLLWIAFGFLMGAGGAAVAEVAATRGGSWFWMQEMGRDLVIQGLFTGLAVAAGGVMRTGDRTHPALHVAAGAVFIASFWVGPRFGMHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASFLAATIACLRVPVRTAQSV*
Ga0137386_1104258113300012351Vadose Zone SoilMGAGGAAIAELAAKHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRTSPALHVVAGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHL
Ga0137397_1065849923300012685Vadose Zone SoilLTWVALGFLMGAGGAAVAELAAGRGDAWFWVHEMGRDLVIQGLFTGVAVAAARVMRGDDRSHPALHLAAGLIFVASFWVGRRFGAHLGFALRAVVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATIACVRVPVRTPQSV*
Ga0137395_1024188223300012917Vadose Zone SoilMRRPLSPSLIWVAFGFLMGAGGAAVAELAVGRGDGWFWAHEMGRDLVIQGLFTGVAVAAGRVMRGDDRSRPALHVAAGLIFMASFWVGRRFGVHLGFALRAAVTVWLAFPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRAPQSV*
Ga0137396_1045067813300012918Vadose Zone SoilRMRRPLSPSLIWVALGFLMGAGGAAVAELAVGRGDAGFWAHEMGRDLVIQGLFTGVAVAAGRVMRADDRSHPALHVAAGMIFMASFWVGRRFGVHLGFALRAAVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRTPQSV*
Ga0137419_1049964923300012925Vadose Zone SoilMRRPLSPSLIWVALGFLMGAGGAAVAELAVGRGDAGFWAHEMGRDLVIQGLFTGVAVAAGRVMRADDRSHPALHVAAGMIFMASFWVGRRFGVHLGFALRAAVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRTPQSV*
Ga0137407_1119126623300012930Vadose Zone SoilGWFWAHEMGRDLVIQGLFTGVAVAAGRVMRGDDRSHPVLHVVAGLVFMASFWVGRRFGVHLGFALRAAVTVWLALPLRPHWEFGPRHLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATVACLRVPVRTAQSV*
Ga0137410_1015549413300012944Vadose Zone SoilAPVFSAACAALGHWQLGQVASLALLAVMLEFTLRRMRRPLSPSLIWVALGFLMGAGGAAVAELAVGRGDAGFWAHEMGRDLVIQGLFTGVAVAAGRVMRADDRSHPALHVAAGMIFMASFWVGRRFGVHLGFALRAAVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRAPQSV*
Ga0137410_1080682813300012944Vadose Zone SoilLLLVLLEFTLRRLQRPFSSSLIWIALGFVMGAGGAAMAALASIRGEGWFWAHEMGRDLVIQGFFTGLAVAAARAMRGNDHSRPALHLFAGAIFTASFWIGRRYGIHLGYAVRAAVTVWLAMPLRPGWEFGPLNLRRSFAHLALWMLAVGNAWVAIAPQIRRAGLHVIFLGCFTALLLAALFPRAGERAAFSLRKLAWAGGLIALSLVGRVMVELDPTSFHLWMGLSAASFLAATLTCVRVPVKAAQSV*
Ga0134087_1013442823300012977Grasslands SoilWVHEMGRDLVIQGLFTGLAVGAGRVMRTGDRTHPALHLVAGAVFIASFWVGPRFGTHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASFLAATIACLRVPVRTPQSV*
Ga0134075_1027608813300014154Grasslands SoilRLTRPLSPSLLWIAFGFLMGAGGAAIAELAAKHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRTSPALHVVAGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV*
Ga0137405_117743323300015053Vadose Zone SoilVMLEFTLRRMRRPLSPSLIWVALGFLMGAGGAAVAELAVGRGDAGFWAHEMGRDLVIQGLFTGVAVAAGRVMRADDRSHPALHVAAGMIFMASFWVGRRFGVHLGFALRAAVTVWLTLPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRTPQSV*
Ga0137420_101129213300015054Vadose Zone SoilMRRPLSPSLTWVAFGFLMGAGGAAVAELAVGKGDAGFWAHEMGRDLVIQGLFTGVAVAAARVMRGDDRSRPALHVAAGLIFMASFWVGRRFGVHLGFALRAAVTVWLAFPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRTPQSV*
Ga0137420_112200723300015054Vadose Zone SoilFWAHEMGRDLVIQGLFTGVAVAAARVMRGDDRSRPALHVAAGLIFMASFWVGRRFGVHLGFALRAAVTVWLAFPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRTPQSV*
Ga0137420_122187623300015054Vadose Zone SoilPAAWQVAVAAVAPVFSAACAALGQWQLGQVASLALLAVMLEFTLRRMRRPLSPSLTWVAFGFLMGAGGAAVAELAVGKGDAGFWAHEMGRDLVIQGLFTGVAVAAARVMRGDDRSRPALHVAAGLIFMASFWVGRRFGVHLGFALRAAVTVWLAFPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRAPQSV*
Ga0137409_1078067713300015245Vadose Zone SoilLLLVLLEFTLRRLQRPFSSSLIWIALGFVMGAGGAAMAALASIRGEGWFWAHEMGRDLVIQGFFTGLAVAAARAMRGDDHSRPALHLFAGAIFTASFWIGRRYGIHLGYAVRAAVTVWLAMPLRPGWEFGPLNLRRSFAHLALWMLAVGNAWVAIAPQIRRAGLHVIFLGCFTALLLAALFPRAGERAAFSLRKLAWAGGLIALSLVGRVMVELDPTSFHLWMGLSAASFLAATLTCVRVPVKAAQSV*
Ga0184619_1002789723300018061Groundwater SedimentAATRGSSWFWLHEMGRDLVIQGLFTGIAVGAGRMMRTGDRTHPALHLVAGAVFIASFWVGPRFGTHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASFLAATIACLRVPVRTPQSV
Ga0066667_1016509813300018433Grasslands SoilAFRLRPVYESVNGMLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPASWQILVAAIAPLISATCAALGHWQLGQVASLTLLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSEWFWLHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHIVAGAVFIATFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV
Ga0066662_1222071913300018468Grasslands SoilAAVAELAATRGSSWFWVHEMGRDLVIQGLFTGLAVGAGRVMRTGDRTHPALHLVAGAVFIASFWVGPRFGTHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGI
Ga0137408_148285713300019789Vadose Zone SoilMVLGARDGRDLVIQGLFTGVAVAAARVMRGDDRSHPALHLAAGLIFVASFWVGRRFGAHLGFALRAVVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATIACVRVPVRTPQSV
Ga0193721_102885913300020018SoilAAVAEIAATRGSSWFWLHEMGRDLVIQGLFTGLAVGAGRMMRTGDRTHPALHLVAGAVFIASFWVGPRFGTHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAIGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASCLAATIACLRVPVRTPQSV
Ga0179594_1007303013300020170Vadose Zone SoilAPVFSAACAALGHWQLGQVASLALLAVMLEFTLRRMRRPLSPSLIWVALGFLMGAGGAAVAELAVGRGDGWFWAHEMGRDLVIQGLFTGVAVAAGRVMRGDDRSHPVLHVVAGLVFMASFWVGRRFGVHLGFALRAAVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRTPQSV
Ga0137417_111257913300024330Vadose Zone SoilVLPWLFFAFRLRHVYEPVNGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPARWQILVAAIAPLISATCAALGHWQLGQVASLALLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGASGAAIAELAATHGSDWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHVVSGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPCVSPAQARL
Ga0207660_1103242913300025917Corn RhizosphereLEFTLRRLSRPWPPSLIWIAFGFLMGAGGAALAEIAAAHGRDWFWLHEMGRDLVIQGSFTGVAVAAARVMRRERKSHLALHVAAGAIFIASFWVGHRYGQHLGFAIRAAVTVWLALPFRPDWEFGPRNLRRSFAHLALWMLAMGNAWVAIAPQIRRAGLHVIFLGCFTALMIGALFPRAGEQPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAA
Ga0207662_1087084113300025918Switchgrass RhizosphereGFLMGAGGAAIAELAATRGGGWFWAHEMGRDLVIQGLFTGLAIAAARVMRGGDRTNLALHLVAGAIFIASFWVDRRYGKMHLGFAVRAAVTVWLALPLRPAWEFGPRNLRRSFAHLALWFLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGERPAFPLRKLAWAGGLVALSMVGRVMVELDPLSFHLWMGLSAASFLAATIAC
Ga0207677_1067869623300026023Miscanthus RhizosphereHEMGRDLVIQGLFTGLAIAAARVMRGGDRTNLALHLVAGAIFIASFWVDRRYGKMHLGFAVRAAVTVWLALPLRPAWEFGPRNLRRSFAHLALWFLAIGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGERPAFPLRKLAWAGGLVALSMVGRVMVELDPLSFHLWMGLSAASFLAATIACLRVPVRTAQSV
Ga0209234_109313913300026295Grasslands SoilEMGRDLVIQGLFTGLAVGAGRVMRTGDRTHPALHLVAGAVFIASFWVGPRFGTHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASFLAATIACLRVDNRCTTQSE
Ga0209238_115332513300026301Grasslands SoilWQLGQMASLALLAVMLEFTLRRLTRPLSPSLLWIAFGFLMGAGGAAIAELAAKHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRTSPALHVVAGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWM
Ga0209761_128513213300026313Grasslands SoilAIAELAATHGSDWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHVVSGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASF
Ga0209686_108872323300026315SoilPWLFFAFRLRPVYESVNGMLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPAGWQILVAAIAPLISATCAALGHWQLGQMASLALLAVMLEFALRRLTRPLSPSLLWIAFGFLMGAGGAAIAELAAKHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRRGDRTSPVLHVIAGAVFVASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAVQSV
Ga0209472_103893723300026323SoilWLFFALRMRQVYEPVNGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPAPASWQVGVAAVAPVFSAACAALGHWQLGQVASLALLAVMVEFTLRRVWRPLSPSLTWIALGFLMGAGGAAVAELAVPRGDAWFWVHEMGRDLVIQGLFTGVAVAAGRVMRGDDRSHPAVHLAAGLIFVASFWVGRRFGVHLGFALRAAVTIWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATIACVRVPVRTPQSV
Ga0209152_1001588613300026325SoilGEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRRGDRTNPVLHVIAGAVFVASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAVQSV
Ga0209803_102133433300026332SoilAAIAELAAKHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRTSPALHVVAGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPDEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSATSFLAATLACLRVPVRAVQSV
Ga0209808_100586813300026523SoilQGLFTGLAVGAGRVMRTGDRTHPALHLVAGAVFIASFWVGPRFGTHLGFAIRAAVTIWLALPLRPEWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEAPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFHLWMGISAASFLAATIACLRVPVRTPQSV
Ga0209808_101381413300026523SoilPPPALWQILVAAIAPLISATCAALGHWQLGQVASLTLLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSEWFWLHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHIVAGAVFIATFWVGRRFGAHLGFAIRAALTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV
Ga0209690_102676633300026524SoilVLPWLFFAFRLRHVYEPVNGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPARWQILVAAIAPLISATCAALGHWQLGQVASLALLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSDWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHVVSGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV
Ga0209059_102887913300026527SoilMLDFTLRRLMRPLSPSLLWIAFGFVMGAGGAAIAELASTHGNEWFWVHEMGRDLVMQGLFTGLAVAAARVMRSGDRSSPALHIAAGVVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV
Ga0209807_101678933300026530SoilVLPWLFFAFRLRHVYEPVSGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPALWQILVAAIAPLISATCAALGHWQLGQVASLTLLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGAGGAAIAELAATHGSEWFWLHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHIVAGAVFIATFWVGRRFGAHLGFAIRAALTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV
Ga0209160_106642233300026532SoilVLPWLFFAFRLRHVYEPVNGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPARWQILVAAIAPLISATCAALGHWQLGQVASLALLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGTSGAAIAELAATHGSDWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHVVSGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGV
Ga0209805_108570323300026542SoilDGFLGCFAAGVVLTALRPPPALWQILVAAIAPLISATCAALGHWQLGQMASLALLAVMLDFTLRRLMRPLSPSLLWIAFGFVMGAGGAAIAELASTHGNEWFWVHEMGRDLVMQGLFTGLAVAAARVMRSGDRSSPALHIAAGVVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQS
Ga0209474_1072758013300026550SoilWVHEMGRDLVIQGLFTGLALAAGRVMRSGDRAHPAMHVVAGAVFMASFWVGRQFGTHLGFAIRAAVTIWLALPLRPQWEFGPRNLRRSFAHLALWMLAVGNAWIAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGETPAFPLRKLAWAGSLVALSMIGRVMVELDPTWFH
Ga0209076_110411313300027643Vadose Zone SoilRPRPVGWQILVAAIAPLISATCAALGHWQLGQMASLALLAVMLEFALRRLTRPLSPSLLWIAFGFLMGAGGAAIAELAAKHGSEWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRTSPALHVIAGAVFVASFWVGRRFGAHLGFAIRAAVTVWLALPLRPQWELGPRNLRRSFAHLALWMLAAGNAWIAVAPHVRRAGLHVIFLGCFTALLLAALFPRPGEAPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFL
Ga0209588_111430313300027671Vadose Zone SoilFSAVCAALGHWQLGQAASLALLAVMLEFTLRRMRRPVSPSLTWVALGFLMGAGGAAVAELAAGRGDAWFWVHEMGRDLVIQGLFTGVAVAAARVMRGDDRSHPALHLAAGLIFVASFWVGRRFGAHLGFALRAVVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEQPAFPLRKLAWAGGLVALSLIGRVMVELDPTSFHLWMGLSAASFLAATIACLRVPVRAAQSV
Ga0209118_109002013300027674Forest SoilGSSWFWVHEMGRDLVIQGLFTGLAVAAGRVMRGDDRRPPALHFVAGLVFIGSFWIGRQFGTHLGFALRAAVTVWLALPVRPAWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRPGELPAFPLRKLAWAGGLVALSLVGRVMVELDPASFHLWMGVSSASFLAATVVCLRVPVKTAQSV
Ga0209011_115754013300027678Forest SoilFAMGAGGAALAELAVARGSSWFWVHEMGRDLVIQGLFTGLALAAGRVMRGDDRRPPALHVVAGLVFIDSFWIGRQFGTHLGFALRAAVTVWLALPVRPAWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGELPAFPLRKLAWAGGLVALSLIGRVMVELDPASFHLWMGVSAASFLAATVACV
Ga0209689_108083213300027748SoilVLPWLFFAFRLRHVYEPVNGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPARWQILVAAIAPLISATCAALGHWQLGQVASLALLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGTSGAAIAELAATHGSDWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHVVSGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV
Ga0209074_1036547813300027787Agricultural SoilRPLSPSLIWVPLAFAMGAGGAALAEIATAHGERWFWLHEMGRDLVMQGLFTGLAVAAARTLRGDDRTSPALHLAAGALFIASFYVGSRFGAHLGFALRAAVTVWLAQPLRPPWEFGPRNRRRAFAHVALWMLAAGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRAGEQPAFPLKKLALAGGLVALALVGRVM
Ga0209488_1047542123300027903Vadose Zone SoilVIQGLFTGVAVAAARVMRGDDRSRPALHVAAGLIFMASFWVGRRFGVHLGFALRAAVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGELPAFPLRKLAWAGGLVALSLVGRVMVELDPTSFHLWMGVSAASFLAATLACVRVPVRTPQSV
Ga0137415_1004089723300028536Vadose Zone SoilVLPWLFFAFRLRHVYEPVNGLLAYRSFLHPLAELDGFLGCFAAGVVLTALRPPPARWQILVAAIAPLISATCAALGHWQLGQVASLALLAVMLEFTLRRLTRPLSPSLLWIAFGFVMGASGAAIAELAATHGSDWFWVHEMGRDLVMQGLFTGLAVAAGRVMRSGDRSSPALHVVSGAVFIASFWVGRRFGAHLGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAAGNAWIAIAPHVRRAGLHVIFLGCFTALLLAALFPRPGEEPAFPLRKLAWAGGLVALSMVGRVMVELDPTSFHLWMGVSAASFLAATLACLRVPVRAIQSV
Ga0307282_1027475413300028784SoilPLAELDGFLGCFAAGVVLTSLRPPPAAWQVVVAAIAPLISATCAALGHWQVGQLASLLLLAVVLEFTLQRLSRPWSASLLWIVLGFLMGAGGAALAELAAAKGPEWFGAHEMGRDLVIQGLFTGLAIAAGRVMRGGDRTHPALHLAAGAVFITSFWIGRRFGMHLGFAIRAAVTVWLALPLRPDWEFGPRNLRRSFAHLALWMLAVGNAWVAVAPTIRRAGLHVIFLGCFTALLVGALFPRHGERPAFPLRKLAWAGGLVALSMVGRVM
Ga0307473_1034958923300031820Hardwood Forest SoilVLLEFTLRRLQRPFSSSLIWISLGFVMGAGGAAVAALASIRGEGWYWVHEMGRDLVIQGLFTGLAIAAARAMRGDDHSRPALHLFAGAIFIASFWIGRRYGIHLGYAIRAAVTVWLALPLRPGWEFGPLNLRRSFAHLALWMLAVGNAWVAIAPQIRRAGLHVIFLGCFTALLLAALFPRAGEHAGFSLRKLAWAGGLIALSLVGRVMVELDPTSFHLWMGLSAASFLAATLTCVRVPVKAAQSV
Ga0308174_1062709413300031939SoilAAATGHWQLGQVAWLALLLVSVEFAWRRTARPLAPSLVWVALAFAMGAGGAILAEVAAARGASWFWVHEMGRDLVMQGLFTGLAVAAARTLRGDDKTRPALHLAAGMLFAASFYVGRRFGAHLGFALRAAITVWLAQPLRPDWEFGPRNRRRAFAHVALWMLAAGNAWVAVAPQIRRAGLHVIFLGCFTALLLAALFPRSGEQPAFPLRKLAWASGLVALSMVGRVMVELDPTWFHLWMGLSAASFLCATATCVRMPVRAIQSV
Ga0307471_10096562323300032180Hardwood Forest SoilALGYWQLGQAASLALLLVLLEFTLRRLQRPFSSSLIWIALGFLMGAGGAVIAALASIRGEGWFWAQEMGRDLVIQGLFTGLAVAAARAMRGDDHSRPALHLFAGAIFTASFWIGRRYGIHLGYAIRAAVTVWLALPLRPGWEFGPLNLRRSFAHLALWMLAVGNAWVAIAPQIRRAGLHVIFLGCFTALLLAALFPRAGERAAFSLRKLAWAGGLIALSLVGRVMVELDPTSFHLWMGLSAASFLAATLTCVRVPVKAAQSV
Ga0335083_1092411213300032954SoilAARGNDWFRMHEIGRNLVIQGSFTGLAVAAARVMRRDEHSPAALHLIAGVVFIASFFLARTHIGFAIRAAVTVWLALPLRPEWELGPRNLRRSFAHLALWMLAVGNAWVALAPQIRRAGLHVIFLGCFTALLLGALFPRSGEKPSFPLRKLAWAGGLVALSMVGRVMVELDPSQFHLWMGVSSASFLAATLACVRVPVGAPQSV




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice