NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F068619

Metagenome / Metatranscriptome Family F068619

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F068619
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 85 residues
Representative Sequence MKRHLLILAATLSLAGCAGGYTSAEFVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRDGPNRIIWARRGDDDVVRIFATPRGDR
Number of Associated Samples 100
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 1.61 %
% of genes from short scaffolds (< 2000 bps) 0.81 %
Associated GOLD sequencing projects 93
AlphaFold2 3D model prediction Yes
3D model pTM-score0.34

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (97.581 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(33.871 % of family members)
Environment Ontology (ENVO) Unclassified
(40.323 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(45.161 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 5.08%    β-sheet: 19.49%    Coil/Unstructured: 75.42%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.34
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 124 Family Scaffolds
PF01120Alpha_L_fucos 11.29
PF06496DUF1097 8.87
PF14326DUF4384 5.65
PF11138DUF2911 3.23
PF13424TPR_12 2.42
PF03807F420_oxidored 2.42
PF16757Fucosidase_C 1.61
PF01966HD 1.61
PF09900DUF2127 1.61
PF11716MDMPI_N 1.61
PF00459Inositol_P 1.61
PF04307YdjM 0.81
PF00892EamA 0.81
PF02652Lactate_perm 0.81
PF14255Cys_rich_CPXG 0.81
PF12770CHAT 0.81
PF02368Big_2 0.81
PF06271RDD 0.81
PF08139LPAM_1 0.81
PF13649Methyltransf_25 0.81
PF12697Abhydrolase_6 0.81
PF13440Polysacc_synt_3 0.81
PF12867DinB_2 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 124 Family Scaffolds
COG3669Alpha-L-fucosidaseCarbohydrate transport and metabolism [G] 11.29
COG1620L-lactate permeaseEnergy production and conversion [C] 0.81
COG1714Uncharacterized membrane protein YckC, RDD familyFunction unknown [S] 0.81
COG1988Membrane-bound metal-dependent hydrolase YbcI, DUF457 familyGeneral function prediction only [R] 0.81


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A97.58 %
All OrganismsrootAll Organisms2.42 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300011270|Ga0137391_10128699All Organisms → cellular organisms → Bacteria2201Open in IMG/M
3300012685|Ga0137397_10009031All Organisms → cellular organisms → Bacteria6939Open in IMG/M
3300026324|Ga0209470_1060841All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes1786Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil33.87%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil16.94%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil12.10%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil11.29%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere8.87%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment5.65%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.03%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.42%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.42%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.61%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.81%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018072Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300019869Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m2EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019882Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300021951Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300031226Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_SEnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1002186143300002558Grasslands SoilMKRYLLTLAGTLSLAGCAGGYTRADFVYAEPVEYEYVVPVDRVVVVTRDVLVARGWAVYRVQRSGPNRIIWARRGDDEVVRIFATPRGERGERVAVRGLREERERGDHGK
JGI25383J37093_1020456213300002560Grasslands SoilMKRELIMLAATLSLAGCAPGYTSAEFVYAEPVEYEYVVPVDRVVVVTRDVLITRGWTVYRVQRSGPNRIIWARRGDDEIVRIFATPNGQRVAVRGLWEQRDRRKHGDKDDQGEDEH
JGI25389J43894_105579823300002916Grasslands SoilMKRDLIMLAATLSLAGCAPGYTNAEFVYAEPVEYEYVVPVDRVVVVTRDVLITRGWTVYRVQRSGPNRIIWARRGDDEIVRIFATP
Ga0066674_1030079013300005166SoilMKRHLLILAATLSLAGCAGGYTSAQFVYAEPVEYEYAVPVDHVVVVTREVLVTRGWTVYRVQRSGPNRIIWARRGDDRIVRIFATPRGERVAVRGLQEERE
Ga0066683_1078052913300005172SoilMKRHLLILAATLSLAGCAMGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVL
Ga0066688_1031439923300005178SoilMKRDLIMLAATLSLAGCAPGYTNAEFVYAEPVEYEYVVPVDRVVVVTRDVLITRGWTVYRVQRSGPNRIIWARRGDDEIVRIFATPRGERGERV
Ga0066685_1035839613300005180SoilMKRTPHYLLVAGALSLAGCAPGYTSASVVYAEPAEYVYVVPVDRVVVVTREVLVNRGWVVYREERSGPNRIIWAR
Ga0066685_1113704713300005180SoilMKRHLLILAATLSLAGCAGGYTSAEFVYAEPVEYEYAVPVDHVVVVTREVLVTRGWTVYRVQRS
Ga0066676_1003366743300005186SoilMQRSLIFLGAALGVAGCAPGYTSASFVYAEPARYVYVVPVDRVVVVTREVLVNRGWTVYRVERDGPNRIIWARQGDDHIVRIFANPDGERVVVRGLA
Ga0070705_10032786223300005440Corn, Switchgrass And Miscanthus RhizosphereMKRLLLVLAATLSLAGCAMGGGGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVERSGPNRIIWARRGDND
Ga0066686_1014268033300005446SoilMKRTRHYLLVAGALSLAGCAAGYTSASVVYTEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRVQRSGPNRIIWARRGDDEVVRIFATPQGQ
Ga0066689_1099460013300005447SoilMKRTPHYLLVAGALSLAGCAPGYTSASVVYAEPAEYVYVVPVDRVVVVTREVLVNRGWVVYREERSGPNRIIWARRGDDEVVRIFATPQGQQVAVR
Ga0066681_1005091413300005451SoilMKNTLPLLLVAGALSLAGCAPGYTSASVVYAEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRVQRSGPNRIIWARRGDDEV
Ga0070696_10059876623300005546Corn, Switchgrass And Miscanthus RhizosphereMKRTQQCLLVAGALSLAGCAGGYTSAAVVYSEPAEYVYVVPVDRVVVVTREVLVNRGWAVVRVERSGPNRIIWAR
Ga0070704_10000245713300005549Corn, Switchgrass And Miscanthus RhizosphereMKRYLLMPAAALSLAGCAPGYTSAEFVYAEPVRYEYVVPVDRVVVVSQDVLVTRGWTVYRVQRSGPNRIIWAQRGDDDVVRIFA
Ga0066695_1016587613300005553SoilMKRTPHYLLVAGALSLAGCVPGYTSASVVYAEPAEYVYVVPVDRVVVVTREVLVNRGWVVYREERSGPNRIIWAR
Ga0066707_1000966863300005556SoilMKRQLLVLAATLSLAGCAAGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRSGPNRIIWARRGDDQVVRIFATP
Ga0066698_1031241713300005558SoilMKRHLLILAATLSLAGCAGGYTSAEFVYAEPVEYEYAVPVDHVVVVTREVLVTRGWTVYRVQRSGPNRIIWARRGDDRIVRIFATPRGERV
Ga0066698_1083531323300005558SoilMKRYLLMPAAALSLAGCAGYTSAEFVYAEPVQHEYVVPVDRVVIVTRDVLITRGWTVYRVQRSGPNRIIWARRGDDEIVRIFATPRGERGERVAVRGLWEQRDRRK
Ga0066694_1011593413300005574SoilMKTIFRFLLGAGAVSFTACAPGYTSASFVYAEPAEYVYEVPVDRVVVVTRDVLVDRGWTVYRVERSGPNRIIWARRGDDDVVRIFASPTGQRVAVRGLWEARERKARREHG
Ga0066708_1106620813300005576SoilMKNTLPLLLVAGALSLAGCAPGYTSASVVYAEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRVQRSGPNRIIWARRGDDEVVR
Ga0066656_1055311113300006034SoilMKRYLLTLAGTLSLAGCAGGYTRADFVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRSGPNRIIWARRGDD
Ga0075417_1070919913300006049Populus RhizosphereMKRYLLMPAAALSLAGCAPGYTSAEFVYAEPVRYEYVVPVDRVVVVSQDVLVARGWTVYRVQHSGPNRIIWARRGDDDVVRIFATPRGGRVGVRGLW
Ga0066659_1191272723300006797SoilMKRYLLTLAGTLSLAGCAGGYTRADFVYAEPVEYEYVVPVDRVVVVTRDVLVARGWAVYRVQRSGPNRIIWARRGDDQIVRIFATPRG
Ga0075425_10156899513300006854Populus RhizosphereMKRHLLILAAPLSLAGCLAGGGGYASAEYVYAEPAQYEYVAPVDHVVVVTRDVL
Ga0075434_10125026413300006871Populus RhizosphereMKRQLLGLAATLSLAGCAMGGGGYASAEYVYAEPAEYEYVVPVDRVVVVTRQVLVTRGWTVYRVQRSGPNRIIWARRGDNDVVRIFATPRGERVT
Ga0075424_10073190233300006904Populus RhizosphereMKQYLMLAATLGLAGCAGGYTSATVMYGEPAEYEYVVPVDRVVVVSREVLVERGWRVYRVERSGPNRILWARRGD
Ga0075436_10004488633300006914Populus RhizosphereMKRDLILLAATLSLAGCAPGYTSAEFVYAEPVEYEYVVPVDRVVVVTQDVLVTRGWAVYRVQRSGPNRIIWARRGDDEIVRIFATPNGQRVAVR
Ga0075436_10074485423300006914Populus RhizosphereMKRHLLILAAPLSLAGCLAGGGGYASAEYVYAEPAQYEYVAPVDHVVVVTRDVLVTRGWTVYRVQRDGPNRIIWARRGD
Ga0075435_10069514413300007076Populus RhizosphereMKRYLLMPAAALSLAGCAPGYTSAEFVYAEPVRYEYVVPVDRVVVVSQDVLVARGWTVYRVQRSGPNRIIWARRGDDDVVRIFATPRGGRVGVRGLWE
Ga0099791_1045002513300007255Vadose Zone SoilMKRYLMLAAALGLAGCAGGFTSATVVYGEPAEYVYLVPVDRVVVVTREVLVNRGWVVYRVERSGPNRIIWARRGDDEVVRIFATPQGDRVAVRGLWEGREQEERG
Ga0099793_1018008733300007258Vadose Zone SoilMKTICQFLLGAGAVSLTACAPGYTSASVVYAEPAEYVYEVPVDRVVVVTREVLVDRGWTVYRVERSGPNRIIWARRGDDDVV
Ga0099793_1071207813300007258Vadose Zone SoilMKRDLIMLAATLSLAGCAPGYTSAEFVYAEPVEYEYVVPVDRVVVVTREVLVTRGWTVYRVQ
Ga0099794_1022141323300007265Vadose Zone SoilMKTICQFLLGAGAVSLTACAPGYTSASVVYAEPAEYVYEVPVDRVVVVTREVLVDRGWTVYRVERSGPNRIIWARRGDDDVVRIFASPDGRRV
Ga0099794_1063435013300007265Vadose Zone SoilMKQHLLILAATLSLAGCAGGYYTNADFVYAEPVEYEYVVPVDRVVVVTQEVLVTRGWTVYRVQRSGPNRIIWARRGEDEIVRIFATPRGDRVA
Ga0066710_10077157313300009012Grasslands SoilMKRYLPSLVATLSLAGCAGGYARAEYVYAEPAEYVYVVPVERVVVVTRDVLVARGWTVYRVE
Ga0066710_10243480813300009012Grasslands SoilMKRYLPSLAATLSLVGCAGGYARAEYVYAEPAEYVYVVPVERVVVVTRDVLVARGWTVYRVERSGPNRIIWARRGDDQVVRIFATPQGER
Ga0066710_10396967013300009012Grasslands SoilMKRHLIILAATLSLAGCAGGYTSAEFVYAEPVEYEYAVPVDHVVVVTREVLVTRGWTVYRVQ
Ga0075418_1007252313300009100Populus RhizosphereMKRYLLILAAPAALSVASCAPGYTRAEYVYTEPAEHVYVVPVDRVVVVSRDVLVTHGWTVYRVERSGPNRVIWARRGDDHVVRIFASPHGE
Ga0066709_10209930213300009137Grasslands SoilMKRQLLILAATLSLAGCAGYGYARAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRDGPNRIIWARRGDDDVVRIFATPR
Ga0099792_1042544723300009143Vadose Zone SoilMERQLLVLAATLSLAGCAAGGGYASAEYVYAEPVEYEYVVPVDRVVVVTREVLVTRDVLVARGWAVYRVQRSGPNRIIWARRGDDEVVRIFATPRGERGER
Ga0114129_1053284833300009147Populus RhizosphereMKRYLMLAATLSLAGCAGGFTSATVVYGEPVEYEYVVPVDRVVVVTREVMVNRGWVVYRVERAGPNRIIWGRRGDG
Ga0114129_1222884023300009147Populus RhizosphereMKRYLLILAAPAALSVASCAPGYTRAEYVYTEPAEHVYVVPVDRVVVVSRDVLVTHGWTVYRVERSGPNRIIWARRGDDHVVRIF
Ga0134070_1001065743300010301Grasslands SoilMKRHLLILAATLSLAGCAMGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRSGPNRIIWARRGDNDVVRIFATPRGQRVAVRGLWEERDR
Ga0134088_1027389113300010304Grasslands SoilMKRYLLTLAGTLSLAGCAGGYTRADFVYAEPVEYEYVVPVDRVVVVTRDVLVARGWAVYRVQRSGPDRIIWA
Ga0134088_1042533423300010304Grasslands SoilMKRTRHYLLVAGALSLAGCAAGYTSASVVYTEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRV
Ga0134086_1014608923300010323Grasslands SoilMKNTLPLLLVAGALSLAGCAPGYTSASVVYAEPAEYVYVVPVDRVVVVTREALVNRGWVVYRVQRLGPNRIIWARRG
Ga0134086_1032033923300010323Grasslands SoilMKRTRHYLLVAGALSLAGCAAGYTSASVVYTEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRVQRSGPNRIIWARRGDDEVVRIFATPQGQQVAVRGLWEVRE
Ga0134111_1050756723300010329Grasslands SoilMKRQLLILAATLSLAGCAGYGYARAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRDGPNRIIWARRGDDDVVRIFA
Ga0134062_1004261233300010337Grasslands SoilMKPHLLSLAATLSLAGCAGGYTSAEFVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWRVYRVQRSGPNRIIWARRGDDQLVRIFATPRG
Ga0134126_1308177613300010396Terrestrial SoilMLTFMVLALPIVSGVSCAPGYVYTDADVVYADPPARVYVVPVDRVVVVTRDVLVRRGWSVWRVERRGPDRIVWARRGDDVVRIFATPQGDRVAVRGIVEGRDRDDG
Ga0134122_1331729613300010400Terrestrial SoilMTLIKEVRSMKRLLLVLAATLSLAGCAMGGGGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVERSGPNRIIWARRGDNDVVRIFATPHGQRVAVRGLWE
Ga0134123_1344630613300010403Terrestrial SoilMTLIKEVRSMKRLLLVLAATLSLAGCAMGGGGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVERSGPNRIIWARRGDHNVVRIFATPHGQRVAVRGLWEARDHDEDRGGP
Ga0137391_1012869933300011270Vadose Zone SoilMQRYLPVMVAALGVAGCVPGYTRASVVYAEPAEYVYVVPMDRVVVVTQEVLVNRGWTVYRVERAGPNRIIWARRGDDEVVRIFANPDRERVVVRGLWEARGRRKHGDKDER
Ga0137364_1064889123300012198Vadose Zone SoilMMKPIPRYLLVASSLGLAGCAGGFTSATVVYGEPAEYAYVVPVDRVVVVTRDVLVTRGWVVYRVQRSGP
Ga0137364_1125233923300012198Vadose Zone SoilMKTTLPVLLVAGALSLAGCAPGYTSASVVYAEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRVQRSGPNRIIWARRGDDEVVRIFATPQGQQVGVRG
Ga0137364_1129482113300012198Vadose Zone SoilMKTTLPLLLVAGALSLAGCAPGYTSASVVYAEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRVQRSGPNRIIWARRGDDEVVRIFATPQGQQV
Ga0137382_1107885123300012200Vadose Zone SoilMKRHLLILAATLSLAGCAMGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGW
Ga0137365_1119104613300012201Vadose Zone SoilMKRHLLILAATLSLAGCAMGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRSGPNRIIWARRGDNDVVRIFATPRG
Ga0137374_1028688313300012204Vadose Zone SoilMKRQLLTLAATLSLAGCAMGGGYASAEYVYAEPVEYEYVVPVNRVVVVTREVLVTRG
Ga0137374_1035943533300012204Vadose Zone SoilMMKPIPRYLLVASSLGLAGCAGGFTSATVVYGEPAEYAYVVPVDRVVVVTRDVLVTRGWVVYRVQRSGPN
Ga0137374_1067706123300012204Vadose Zone SoilMKRYLMLAATLGLAGCAGGFTSATVVYGEPAEYEYVVPVDRVVVVTREVLVTRGWVVYRVERSGPNRIIWGRRGDGEIV
Ga0137380_1120929113300012206Vadose Zone SoilMKTIFQFLLGAGAVSFTACAPGYTSASFVYAEPAQYEYEVPVGRVVVVTREVLVDRGWTVYRVERSGPNRIIWARRGDDDVVRIFASPTGQR
Ga0137376_1006065213300012208Vadose Zone SoilMKRQLLVLAATLSLAGCAAGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRV
Ga0137376_1014299213300012208Vadose Zone SoilMKRHLLILAATLSLAGCAMGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTV
Ga0137379_1100207613300012209Vadose Zone SoilMKRHLLILAATLSLAGCAGGYTSAEFVYAEPVEYEYAVPVDHVVVVTREVLVTRGWTVYRVQRSGPNRIIWARRGDDRIVRIFATPRGERVAVRGLQ
Ga0137377_1025174823300012211Vadose Zone SoilMKRDLIMLAATLSLAGCAPGYTRADFVYAEPVEYEYVVPVDRVVVVTRDVLVARGWAVYRVQRSGPNRIIWARRGDD
Ga0137386_1086346523300012351Vadose Zone SoilMKRYLLSLAALPSLAGCAGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRSGPDRIIWARRGDDQIVRIFATPRGQRV
Ga0137366_1064519213300012354Vadose Zone SoilMKNTLPLLLVAGALSLAGCAPGYTSAGVVYAEPAEYVYVVPVDRVVVVTREVLVNRGWVAYRVQRSGPNRIIWARRGDDE
Ga0137369_1024997833300012355Vadose Zone SoilMKRYLMLAATLGLAGCAGGFTSATVVYGEPAEYEYVVPVDRVVVVTREVLVNRGWVVYRVERSGPNRIIWGRRGDGEMVRIFA
Ga0137384_1062507813300012357Vadose Zone SoilMKRHLLILAATLSLAGCAGGYTSAEFVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRDGPNRIIWARRGDDDVVRIFATPRGDR
Ga0137390_1062210013300012363Vadose Zone SoilMKRDLIMLAATLSLAGCAPGYTSADFVYAEPVEYEYVVPVDRVVVVTRDVLVARGWAVYRVQRSGPNRII
Ga0137373_1026958713300012532Vadose Zone SoilMKRYLLSLAALPSLVGCAGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRSGPNRII
Ga0137373_1114169523300012532Vadose Zone SoilMKTIFQFLLGAGAVSFTACAPGYTSASFVYAEPAQYEYEVPVDRVVVVTREVLVDRGWTVYRVERSGPNRIIWARRGDDDVVRIFASPTGQRVVVRGLWEARERKA
Ga0137398_1001254113300012683Vadose Zone SoilMQFITKEATMKTTLQFLLGAGALSLAACAPGYTSASVVYGEPAQYVYEVPVDRVVVVTREVLIDRGWTVYRVERSGRNRVIWARRGDDDVVRIFASPNGQRVAVRGLWEARERR
Ga0137397_1000903143300012685Vadose Zone SoilMTLIKEVRSMKRQLLVLAGTLSLAGCAAGGGYASAEYVYAEPVEYEYVVPVDRVVVVTREVLVTRGWTVYRVER*
Ga0137394_1036442313300012922Vadose Zone SoilMKRYAMLAATLGLAGCAGGFTSATVVYGEPAEYEYVVPVDRVVVVTREVLVNRGWVVYRVERSGPNRIIWGRRGDGEMV
Ga0137419_1147638213300012925Vadose Zone SoilMKRQLLVLAGTLSLAGCAAGGGYASAEYVYAEPVEYEYVVPVDRVVVVTREVLVTRGWTVYRVERSGPNRIIWARRGDDQVVRIFA
Ga0137416_1134637313300012927Vadose Zone SoilMKRDLIMLAATLSLAGCAPGYTSAEFVYAEPVEYEYVVPVDRVVVVTREVLVTRGWTVYRVQRSGPNRVIWARRGDDEIVRIFATPRGDRVAVRGLWEAREHDDHGE
Ga0137416_1186259423300012927Vadose Zone SoilMKRYLLTLAGTLSFAGCAGGYTRADFVYAEPVEYEYVVPVDRVVVVTRDVLVERGWTVYRVQRSGPNRIIWARRG
Ga0137407_1033405013300012930Vadose Zone SoilMTLIKEVRSMERQLLVLAATLSLAGCAAGGGYASAEYVYAEPVEYEYVVPVDRVVVVTREVLVTRGWTVYRVERSGPNRIIWARRGDDQVVRIFATPRGQRVAV
Ga0137407_1168609413300012930Vadose Zone SoilMKRYLMLAATLSPVGCAGGFTSATVVYGEPAEYEYVVPVDRVVVVTREVLVNRGWVVYRVERS
Ga0137410_1026232623300012944Vadose Zone SoilMKRYLLTLAGTLSLAGCAGGYTRADFVYAEPVEYEYVVPVDRVVVVTREVLVTRGWTVYRVQRSGPNRIIWARRGDDEI
Ga0137410_1105016913300012944Vadose Zone SoilMKQYLMLAATLSLAGCAGGFTSATVVYGEPAEYEYVVPVDRVVVVTREVLVNRGWVVYRVERSGPNRIIWGRRGDGEIVRIFANPNRDRVAVRGLWE
Ga0137410_1179141713300012944Vadose Zone SoilMKRYAMLAATLGLAGCAGGFTSATVVYGEPAEYEYVVPVDRVVVVTREVLVNRGW
Ga0134077_1009876433300012972Grasslands SoilMKRTRHYLLVAGALSLAGCAAGYTSASVVYTEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRVQRSGPN
Ga0134077_1033076713300012972Grasslands SoilMKRQLLVLAATLSLAGCAMGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRDGPNRIIW
Ga0134076_1031582213300012976Grasslands SoilMKRYLLTLAGTLSLAGCAGAYTSADFVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYR
Ga0137418_1007160713300015241Vadose Zone SoilMKRQLLVLAGTLSLAGCAAGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRS
Ga0137409_1156637513300015245Vadose Zone SoilMQRYLMLAATLGLAGCAGGFTSATVVYGEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRVERSGPNRIIWARRGDDEVVRIFATPQGERVVLRGLWE
Ga0137403_1024617633300015264Vadose Zone SoilMTLIKEVRSMERQLLVLAATLSLAGCAAGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRE
Ga0134089_1041103123300015358Grasslands SoilMKRYLLTLAGTLSLAGCAGGYTRADFVYAEPVEYEYVVPVDRVVVVTRDVLVARGWTVYRVQRSGPNRIIWARRGDDEIVRIFATPRGDRVA
Ga0134085_1027345713300015359Grasslands SoilMKRTRHYLLVAGALSLAGCAAGYTSASVVYTEPAEYVYVVPVDRVVVVTREVLVNRGWVVYRVQRSGPNRIIWARRGD
Ga0134069_135659313300017654Grasslands SoilMQRYLPIMFVALGVAGCAPGYMRASVVYAEPAEYVYVVPMDRVVVVTQEVLVNRGWTVYRVERAGPNRIIWARRGDDDVVRIFANPDGDRVVVRGLWEARGRRKHG
Ga0134074_124443513300017657Grasslands SoilMKPHLLSLAATLSLAGCAGGYTSAEFVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWRVYRVQRSGPNRIIWARRGDDQLVRIFATPRGERVAVRGL
Ga0134083_1029402323300017659Grasslands SoilMKRTPQCLLVAGALSLAGCAGSYTSAAVVYGEPAEYVYVVPVDRVVVVTREVLVDRGWVVYRVERSGPNRIIWARR
Ga0184610_125629213300017997Groundwater SedimentMQRHFMLAATLGLAGCAGGYTSATLVYGEPAEYEYVVPVDRVVVVTREVLVTRG
Ga0184638_124004913300018052Groundwater SedimentMKRYLMIFAATLSLAGCAMGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRSGPNRIIWARRGDNDVVRIFATPRGQRVAVRGLWEERE
Ga0184623_1007597223300018056Groundwater SedimentMRNLWLLALAVTLPACAGGYFTRAEFVYGEPAPYAYVTEVDRVVVVTRDVLVAHGYVVWRVERSGPRRIVWARRGDNEVVRIFVTPEGRRVYLR
Ga0184618_1027894523300018071Groundwater SedimentMKRYAMLAATLCLAGCAGGYTSATLVYGEPAEYEYVVPVDRVVIVTREVLVNRGWVVYRVERSGPNRIIWGRRGDGEIV
Ga0184635_1008521913300018072Groundwater SedimentMKRYAMLAATLSLAGCAGGFTSATVAYGEPAEYAYVVPVDRVVVVSREVLVDRGWRVYRVERSGPN
Ga0184632_1048958023300018075Groundwater SedimentMKRQLLTLAATLSLAGCAMGGGYASAEYVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTVYRVQR
Ga0184612_1056123513300018078Groundwater SedimentMKRYLMIFAATLSLAGCAGGYGRAEYVYAEPAEHVYVVPVERVVVVTRDVLVTRGWTVYRVERSGRNRIIWARRGDDQIVRIFANPRGERVAV
Ga0066655_1038808533300018431Grasslands SoilMKRYLLTLAGTLSLAGCAGGYTRADFVYAEPVEYEYVVPVDRVVVVTRDVLVTRGWTV
Ga0066662_1292071823300018468Grasslands SoilMKRELIMLAATLSLAGCAPGYTSAEFVYAEPVEYEYVVPVDRVVVVTRDVLITRGWTVYRVQRSGP
Ga0066669_1085729323300018482Grasslands SoilMKTIFRFLLGAGAVSFTACAPGYTSASFVYAEPAEYVYEVPVDRVVVVTRDVLVDRGWTVYRVERSGPNRIIWARRGDDDVVRIFASPTGQRVAVRGLWEARER
Ga0137408_112715333300019789Vadose Zone SoilMKRQLLTLAATLSLAGCAMGGGGGYASAEYVYAEPVEYEYVVPVDRVVVVTREVLVTRGWTVYRVQRSGPNRIIWARRGDDQVVRIFATPRGQRVGVRGLWE
Ga0193705_108337313300019869SoilMKRYAMLAATLSLAGCAGGFTSATVAYGEPAEYAYVVPVDRVVVVSRDVLVDRGWRVYRVERSGPNRIVWARRGDDDVIRIFATPQGERVVVRGLREVREERHDRGR
Ga0193723_100251873300019879SoilMKRQLLVLAATLSLAGCAMGGGGGGYAGAEYVYAEPAQYEYVVPVDRVVVVTRDVLVTRGWTVYRVQRSGPNRIIWARRGDDQVVRIFATPRGERVAVR
Ga0193713_120179413300019882SoilMKTILQFLLVLGAVSCTACAPGYTSAGVVYAEPAEYVYEVPVDRVVVVTREVLVDRGWTVYRVERSGPNRIIWARRGDDDVVRIFATPTGQRVAVRGLWEARERKAQRE
Ga0193755_122899823300020004SoilMKTILQFLLVLGAVSCTACAPGYTSAGVVYAEPAEYVYEVPVDRVVVVTREVLVDRGWTVYRVERSGPNRIIWARRGDGDVVRIFAT
Ga0222624_139467233300021951Groundwater SedimentMKRYAMLAATLCLAGCAGGYTSATLVYGEPAEYEYVVPVDRVVIVTREVLVNRGWVVYRVERSGPNRIIWGRRGDGEIVRIFATPNRDRVAVRGLWEVR
Ga0209234_127248913300026295Grasslands SoilMQRYLPIMLAALGVAGCAPGYMRASVVYAEPAEYVYVVPVDRVVVVTQEVLVNRGWTVYRVERAGPNRIIWARRGDDDVVRIFANPDGERVVVRGLWE
Ga0209236_115627623300026298Grasslands SoilMKRELIMLAATLSLAGCAPGYTSAEFVYAEPVEYEYVVPVDRVVVVTRDVLITRGWTVYRVQRSGPNRIIWARRGDDEIVRIFATPSGQRVAVRGLW
Ga0209236_121450423300026298Grasslands SoilMKRYLLSFAATLSLAGCAGGYARAEYVYAEPAEYVYVVPVERVVVVTRDVLVSRGWTVYRVEG
Ga0209761_101211863300026313Grasslands SoilMKRYLLTLAGTLSLAGCAGGYTRADFVYAEPVEYEYVVPVDRVVVVTREVLVTRGWTVYRVQRSGPNRIIWARRGDDE
Ga0209470_106084133300026324SoilMKRTRHYLLVAGALSLAGCAAGYTSASVVYTEPAEYVYVVPVDRVVVVTREVLVNRGWVVYREERSGPNRIIW
Ga0209470_106192733300026324SoilMQRSLIFLGAALGVAGCAPGYTSASFVYAEPARYVYVVPVDRVVVVTREVLVNRGWTVYRVERDGPNRIIWARQGDDHIVRIFANPDGERVVVRGL
Ga0209803_104510413300026332SoilMKRDLIMLAATLSLAGCAPGYTNAEFVYAEPVEYEYVVPVDRVVVVTRDVLITRGWTVYRVQRSGPNRIIWARRGDDEIVRIFATPRGERGERVAVRGL
Ga0209378_101139213300026528SoilMKRYLPSLAATLSLVGCAGGYARAEYVYAEPAEYVYVVPVERVVVVTRDVLVARGWTVYRVERSG
Ga0209382_1088400413300027909Populus RhizosphereMKRYLMLAATLSLAGCAGGFTSATVVYGEPVEYEYVVPVDRVVVVTREVMVNRGWVVYRVERSGPNRIIWGRRGDGEIVRIFATPNRD
Ga0307497_1005573713300031226SoilMNRNMLFLAATLGVAGCAPGYARAEYVYAEPAEYEYVVPVDRVVVVTQDVLVRRGWTVYRVQRSGEGRIIWARRGDDHVVRIFASPHGERVAVRGLAEEREHSDHG
Ga0307473_1026673723300031820Hardwood Forest SoilMKRTQQCLLVAGALNLAGCAGGYTSAAVVYSEPAEYVYVVPVDRVVVVTREVLVNRGWAVVRVERSGPNRIIWARRNDNDDEIVRIFATPQGQQVAVRGL
Ga0307473_1039580623300031820Hardwood Forest SoilMKRDLILLAATLSLAGCAPGYTSAEFVYAQPVEYEYVVPVDRVVVVTQDVLVTRGWTVYRVQRD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.