NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F078541

Metagenome Family F078541

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F078541
Family Type Metagenome
Number of Sequences 116
Average Sequence Length 101 residues
Representative Sequence VTSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGLDPVLPRLWAQAEFDAVIHLTGA
Number of Associated Samples 84
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.86 %
% of genes from short scaffolds (< 2000 bps) 0.86 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction Yes
3D model pTM-score0.66

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.138 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(40.517 % of family members)
Environment Ontology (ENVO) Unclassified
(52.586 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 9.09%    β-sheet: 44.63%    Coil/Unstructured: 46.28%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.66
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 116 Family Scaffolds
PF04773FecR 5.17
PF00072Response_reg 3.45
PF13473Cupredoxin_1 1.72
PF06041DUF924 1.72
PF05193Peptidase_M16_C 0.86
PF00392GntR 0.86
PF02796HTH_7 0.86

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 116 Family Scaffolds
COG3803Uncharacterized conserved protein, DUF924 familyFunction unknown [S] 1.72


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.14 %
All OrganismsrootAll Organisms0.86 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005167|Ga0066672_10114039All Organisms → cellular organisms → Bacteria → Proteobacteria1664Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil40.52%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil21.55%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil8.62%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil5.17%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil4.31%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil3.45%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil1.72%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.72%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.72%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.72%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.72%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil1.72%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.86%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.86%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.86%
Exposed RockEnvironmental → Terrestrial → Rock-Dwelling (Subaerial Biofilms) → Unclassified → Unclassified → Exposed Rock0.86%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.86%
Plant RootsHost-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots0.86%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere0.86%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001867Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300003219Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM3EnvironmentalOpen in IMG/M
3300003505Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924)EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300006102Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016319Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00HEnvironmentalOpen in IMG/M
3300016341Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170EnvironmentalOpen in IMG/M
3300016371Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172EnvironmentalOpen in IMG/M
3300016387Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176EnvironmentalOpen in IMG/M
3300016404Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082EnvironmentalOpen in IMG/M
3300016422Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021362Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R09EnvironmentalOpen in IMG/M
3300021384Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9Host-AssociatedOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300025898Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026356Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-AEnvironmentalOpen in IMG/M
3300026489Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-AEnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027173Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF036 (SPAdes)EnvironmentalOpen in IMG/M
3300027583Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027874Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031469Fir Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031561Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f26EnvironmentalOpen in IMG/M
3300031564Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f21EnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031640Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f23EnvironmentalOpen in IMG/M
3300031679Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23EnvironmentalOpen in IMG/M
3300031736Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f21EnvironmentalOpen in IMG/M
3300031764Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f27EnvironmentalOpen in IMG/M
3300031765Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22EnvironmentalOpen in IMG/M
3300031805Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f23EnvironmentalOpen in IMG/M
3300031819Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f21EnvironmentalOpen in IMG/M
3300031821Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f20EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031846Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f19EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300031942Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176EnvironmentalOpen in IMG/M
3300031945Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082EnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300031959Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f24EnvironmentalOpen in IMG/M
3300031981Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f25EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032010Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f22EnvironmentalOpen in IMG/M
3300032041Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f22EnvironmentalOpen in IMG/M
3300032044Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f20EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032060Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f18EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12627J18819_1004696513300001867Forest SoilFIRTQTGLTQAGVLAGKIDLTSTATGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ*
JGIcombinedJ26739_10043354633300002245Forest SoilSDSADWFIKAQAGSAQVGVLAGSVDFTSTVTGQSVSIPAHWGTSLETGRAPVLPRVWAQIEFDSVIRLTECCQSVQPKAPSPTR*
JGI26341J46601_1017923923300003219Bog Forest SoilGSADWFIVAEAGSAQVGVLAGTVDLTSAATRGSVAIPAHWGTRLESGLAPMLPRVWAQMEFNAVIRLTECCQSAQPKLEMAPVR*
JGI26341J46601_1018200223300003219Bog Forest SoilSTFEVSTAVGTASFRSADCYIDAEAGSAQVGVLAGIVDLTSAATGQSVAIPAHWGTRLEAGRDPVPPRVWAQVEFDAFSRRTE*
JGIcombinedJ51221_1018716723300003505Forest SoilGRDVKLSLTQGLLRAQVSSVTGPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTTTGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ*
Ga0066672_1011403913300005167SoilSTAVGTAAVGSDSAEWFIKAEAGSAQVAVLAGTVDLTSRVTGGSVSIPAHWGTRLEAGRTPVPPRVWTQVEFNAFIRVTQ*
Ga0066672_1054373513300005167SoilSLTRGALRAQVTSATGPSTFEVSTAVGTASVDSASADWFIKAQSGSGQVGVLAGTIDLRSAVTGESVSIPAHWGTRLEKGLNPVLPRVWLQREFNAVIRLTRA*
Ga0070714_10051316313300005435Agricultural SoilEVSTAVGAAAVRSGSADWFINAQPGSAQVGVLDGDVDLTSAATGRLVSIVSHWGTRLEAGRDPVPPRNWTEAEFDAVTDSTK*
Ga0070711_10072526023300005439Corn, Switchgrass And Miscanthus RhizosphereSSGRDVKLSLTQGLLRAQVSSVTGPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ*
Ga0066681_1032838613300005451SoilPFEVSTAVGTAAVGSDSAEWFIKAEAGSAQVAVLAGTVDLTSRVTGESVSIPAHWGTRLEAGRTPVPPRVWTQVEFNAFIRVTQ*
Ga0066687_1033180213300005454SoilSATRPFEVSTAVGTAAVGSDSAEWFIKAEAGSAQVAVLAGTVDLTSRVTGESVSIPAHWGTRLEAGRTPVPPRVWTQVEFNAFIRVTQ*
Ga0066702_1078856623300005575SoilAVGTAAVGSDSAEWFIKAEAGSAQVAVLAGTVDLTSRVTGESVSIPAHWGTRLEAGRTPVPPRVWTQVEFNAFIRVTQ*
Ga0075015_10072267013300006102WatershedsQGLLRAHITPIGGPSTFDVSTAVGTASVRSGFADWFIKAQAGSAQVGVLDGTVDLTSAATGQSVSIPSHWGTRLEAGLDVMPPRRWGKTDFDPVIGLTECCQSAQPKVEPSTGAETR*
Ga0079222_1246985513300006755Agricultural SoilTAVGIASVDSASADWFIKRQADAAQAGVFAGKIDLTSIATGQSVSIPAHWGTRLEPGRDPLPPRNWTQAEFEEIVHSTQ*
Ga0079219_1075936623300006954Agricultural SoilDAPGSSVTVVSYNIGGSGRYVRLSLTQGLLRAYVTSVTGPSMFEVSTAVGTTSVGSDSADWFIKAQAGLAQVGVLAGTIDLTNVVTGQSVSIPARWGTRLETGRVPVVPRVWTQMEFNAVIRVTE*
Ga0099792_1021302413300009143Vadose Zone SoilEVSTAVGTASVGSDSADWFIKAQAGSAQVGVLAGTVDLTSTVTGQSVSIPAHWGTSLETGRAPVLPRVWAQREFDAVIRLTECCQPKVETAPSPIR*
Ga0099792_1108703013300009143Vadose Zone SoilVGTAQVGSDSADWFIKVDAASAQVGVLAGIVALSSPTGTSVSIPAHWGTRLEAGRAPLPPRVWNQMEFNAVMRVTE*
Ga0126374_1163971313300009792Tropical Forest SoilLRITSATRPFEVSSAVGTAAASSASADWFVESKAGSARVAVLAGIVDLTSNFIGQSVSIPAHWGARLEAGRAPVPPRVWNQMEFNAFIRITQ*
Ga0126378_1171436413300010361Tropical Forest SoilRLSLTQGLLRAHVTSAIGPSTFEVSTAGGTASVASDSADWFIKAQIGSAQVGVLTGTVDLTSTETKQAVSIPTRWGTRLEGGLDPVLPRVWTQREFTGVIRLTDVQSGGQGPSPSNTDPINLSPR*
Ga0150985_10015709923300012212Avena Fatua RhizosphereSTAVGTASVDSASADWFIEAQPGSGQVGVLAGTIDLRSAVTGESVSIPTHWGTRLERGLNPVLPRVWLQREFNAVIRLTGA*
Ga0150984_10590386923300012469Avena Fatua RhizosphereLLRARVTPVRGPSTFEVSTAVGTASVRSSSADWFVKAQADAAQVGVLEGIVDLRSAATGRSVSIPSHWGTRIQAGLDSMLPRRWGESDFNPLIALVPFAK*
Ga0137396_1086136113300012918Vadose Zone SoilRHVRLSLTQGLLRAHVTPVTGPSTFEVSTAVGTASVGSDSADWFIKAQAGSAQVGALAGTVDLTSTVTGQSVSIPAHWGTRLESGLDPVLPRVWTQREFSAVIRLTEVQSGGNGSARLNINAVNVSRR*
Ga0137419_1084632613300012925Vadose Zone SoilLTQGLLRAHVTSVTGPSMFEVSTAVGTASVGSDSADWFIKAQAGSAQVGALAGTVDLTSTVTGQSVSIPAHWGTRLESGLDPVLPRVWTQREFSAVIRLTEVQSGGNGSASLNINAVNVSRR*
Ga0137418_1063579513300015241Vadose Zone SoilGGSGRHVRLSLTQGLLRAHVTSVTGPSMFEVSTAVGTASVGSDSADWFIKAQAGSAQVGALAGTVDLTSTVTGQSVSIPAHWGTRLESGLDPVLPRVWTQREFSAVIRLTEVQSGGNGSASLNINAVNVSRR*
Ga0137412_1118112913300015242Vadose Zone SoilGTASVDSASADWFIKAQPDSGQVGVLAGTIDLRSAVTGESVSIPAHWGTRLEKGLNPMLPRVWLQREFNAVIRLTGA*
Ga0182036_1006584213300016270SoilMGGALRAQVTSVTGPSTFEVSTAGGTASLGSAPADWFIKAQVDSAQVGVLAGTVDLTSIVTRQSVSIPAHWGTRLEAGRAPMLPRLWPQVDFNPLSRLTECCQPVQPQTEPRSSPSR
Ga0182036_1152852523300016270SoilLRAQVTSVTGPSTFEVSTASGTASVGSASADWFIKAQVDSAQVAVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGLDPVLPRLWAQAEFDVVIQLTGT
Ga0182033_1039396423300016319SoilSGRSVSLRLAGGALRAQVTSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEP
Ga0182035_1117157213300016341SoilEVSTAGGTASVGSASADWFIKAQVDSAQVAVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSVQPQTEPKSSPNR
Ga0182035_1164131713300016341SoilGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0182034_1108412313300016371SoilTASLGSAPADWFIKAQVDSAQVGVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0182040_1096814913300016387SoilADWFIKAQVDSAQVGVLAGTVDLTSIVTRQSVSIPAHWGTRLEAGRAPMLPRLWPQVDFNPLSRLTECCQPVQPQTEPRSSPSR
Ga0182040_1101414713300016387SoilMTVASYNIGASGRHVRLSLTQGLLRAQVTSAIGPSTFEVSTAGGTASVASGSADWFIKAQTGSAQVGVLTGTVDLTSTVTEEAVSIPARWGTDLESGRAPVLPRRWAQNEFDAVIRLTECCQSVLPKLGPAPVPIR
Ga0182037_1011587833300016404SoilDSSISVTSYSIDGSGRNVRLRLMGGALRAQVTSVTGPSTFEVSTAGGTASLGSAPADWFIKAQVDSAQVGVLAGTVDLTSIVTRQSVSIPAHWGTRLEAGRAPMLPRLWPQVDFNPLSRLTECCQPVQPQTEPRSSPSR
Ga0182039_1037175513300016422SoilSYSIDGSGRNVRLRLMGGALRAQVTSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0182039_1042460713300016422SoilSYSIDGSGRNVRLRLMGGALRAQVTSVTGPSTFEVSTAGGTASLGSAPADWFIKAQVDSAQVGVLAGTVDLTSIVTRQSVSIPAHWGTRLEAGRAPMLPRLWPQVDFNPLSRLTECCQPVQPQTEPRSSPSR
Ga0066662_1012936613300018468Grasslands SoilTSATRPFDVSTAVGTGAVGSDSAEWFIKAEAGSAQVAVLAGTVDLTSRVTGESVSIPAHWGTRLEAGRTPVPPRVWTQAEFNAFIRVTQ
Ga0066662_1099314213300018468Grasslands SoilRAQVTSATGPSTFEVSTAVGTASVDSASADWFIKAQSGSGQVGVLAGTIDLRSAVTGESVSLPAHWGTRLEKGLNPVLPRVWLQREFNAVIRLTRA
Ga0179592_1007661213300020199Vadose Zone SoilTVVSYNIGGSGRHVRLSLTQGLLRAHVTSVTGPSMFEVSTAVGTASVGSDSADWFIKAQAGSAQVGALAGTVDLTSTVTGQSVSIPAHWGTRLESGLDPVLPRVWTQREFSAVIRLTEVQSGGNGSASLNINAVNVSRR
Ga0210406_1036975313300021168SoilGHRRGSASADWFIRTQTGLAQAGVLAGKIDLTSTTTGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0210405_1007699143300021171SoilSLTQGLLRAQVSSVTGPSTFDVSTAAGTASVGSASADWFIRMQTGLAQAGVLAGKIDLTSTTTGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0210408_1060840013300021178SoilAVGTASVASPSADWFIKAQPDSAQVGVLAGTIDLTSTVTRESVSIPAHWGTSLEAGFDPVLPRVWAQREFNAIIRLTRA
Ga0213882_1053769913300021362Exposed RockGLLRVRVARVAGPSTFVVSTAAGTASVSSTSADWFIKAESDSAQVGVLAGTVDLTSAARRQSVSIPGHWGTRLETGRAPVLPRVWSQVEFSAVIRLTECCQSAQPKIEMPVR
Ga0213876_1023648523300021384Plant RootsSSTSADWFIKAESDSAQVGVLAGTVDLTSAARRQSVSIPGHWGTRLETGRAPVLPRVWSQVEFSAVIRLTECCQSAQPKIETPVR
Ga0210397_1102526613300021403SoilGSVISVAPGSSITVERYNIASSGRDVKLSLTQGLLRAQVSSVTGPSTFDVSTAAGTAAVGSASADWFIRTQTGLAQAGVLAGKIDLTSTTTGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0210387_1110699013300021405SoilSTFEVSTAVGTGSVGSDSADWFIKAQAGSAQVGVLAGSVDFTSTVTGQSVSIPAHWGTSLETGRAPVLPRVWAQIEFDAVIRLTECCQSVQPKAPSPTR
Ga0210387_1125343423300021405SoilSLTQGLLRAHVTSVTGPSTFEVSTAVGTGSVGSDSADWFIKAQAGSAQVGALAGTVDLTSTVTGQSVSIPAHWGTRLESGLDPVLPRVWTQREFSAVIRLTEVQSGGNGSASLNINAVNVSRR
Ga0210387_1154971213300021405SoilVERYNIAGSGRDVKLSLTQGLLRAQVSSVTGPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTTTGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0210386_1015298213300021406SoilERYNIASSGRDVKLSLTQGLLRAQVSSVTGPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0210390_1057710913300021474SoilPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTTTGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0126371_1138514613300021560Tropical Forest SoilIMTVTSYGVGGSGRDVRLSLTQGVLRANVTSVGGPSTFEVLTAAGTASLASESADWFVKAQPVAVQVGVLAGSIDLTSAATGQSVSIPAHWGTRLETGLDPVLPRIWAQKEFNAVTNLTGVEH
Ga0126371_1296479713300021560Tropical Forest SoilTASVASESADLFAEALSDSAQVGVLSGTVDLTSTATGQSVSIPAHWGTRLENGLDPVLPRVWAQREFNAVTRLTGA
Ga0207692_1020586713300025898Corn, Switchgrass And Miscanthus RhizosphereFGYRSLHLDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPARWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0207664_1012636633300025929Agricultural SoilLLRAQVSSVTGPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPARWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0257150_105403813300026356SoilSYNYGGSGRHVRLSLTQGLLRAHVTPVTGPSTFEVSTAVGTASVGSDSADWFIKAQAGSAQVGALAGTVDLTSTVTGQSVSIPAHWGTRLESGLDPVLPRVWTQREFSAVIRLTEVQSGGNGSASLNINAVNVSRR
Ga0257160_109720813300026489SoilIGGSGRHVRLSLTQGLLRAHVTSVTGPSMFEVSTAVGTASVGSDSADWFIKAQAGSAQVGVLAGSVDFTSTVTGQSVSIPAHWGTSLETGRAPVLPRVWAQIEFDSVIRLTECCQSVQPKAPSPTR
Ga0179587_1095520013300026557Vadose Zone SoilVTSVTGPSTFEVSTAVGTASVGSDSADWFIKAQAGSAQVGVLAGTVDLTSTVTGQSVSIPAHWGTSLETGRAPVLPRVWTQMEFDAVIRLTECCQPKVETAPSPIR
Ga0208097_104086713300027173Forest SoilSSGRDVKLSLTQGLLRAQVSSVTGPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTTTGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0209527_113470113300027583Forest SoilSLTQGLLRAHVTPVTGPSTFEVSTAVGTGSVGSDSADWFIKAQAGSAQVGALAGTVDLTSTVTGQSVSIPAHWGTRLESGLDPVLPRVWTQREFSAVIRLTEVQSGGNGSASLNINAVNVSRR
Ga0209465_1055877623300027874Tropical Forest SoilGPGRDVKLLLTQGVLRAEVTSVRGASKFEVSIPGGTASVTSESADLFIEALPDSAQVGVLAGSVDLTSTATGQSVSIPAHWGTHLETGLDPVLPRVWAQREFNAVTHLTGT
Ga0209488_1050521613300027903Vadose Zone SoilSGRHVRLSLAQGLLRAHVTSVTGPSMFEVSTAVGTASVGSDSADWFIKAQAGSAQVGALAGTVDLTSTVTGQSVSIPAHWGTRLESGLDPVLPRVWTQREFSAVIRLTEVQSGGNGSASLNINAVNVSRR
Ga0209488_1093044413300027903Vadose Zone SoilSIASSGRDVKLSLTQGLLRAQVSSVTGPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0209526_1020390723300028047Forest SoilTQGLLRAHVTPVTGPSTFEVSTAVGTGSVGSDSADWFMKAQAGSAQVGVLAGSVDFTSTVTGQSVSIPAHWGTSLETGRAPVLPRVWAQIEFDSVIRLTECCQSVQPKAPSPTR
Ga0170834_10270904613300031057Forest SoilSVTGPSMFEVSTAVGTGSVGSDSADWFIKAQAGSAQVGVLAGSVDFTSTVTGQSVSIPAHWGTSLETGRAPVLPRVWAQIEFDAVIRLTECCQSVQPKAPSPTR
Ga0170819_1803792123300031469Forest SoilPGSSMTVVSYNIGGSGRHVRLSLTQGLLRAHVTSVTGPSMFEVSTAVGTGSVGSDSADWFIKAQAGSAQVGALAGTVDLTSTVTGQSVSIPAHWGTRLESGLDPVLPRVWTQREFSAVIRLTEVQSGGNGSASLNINAVNVSRR
Ga0318528_1026531823300031561SoilGGTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0318528_1029397713300031561SoilVNFIRDPSTFEVATASGTASVASASADWFIKAQPDAAQVAVLVGTIDLRSAVTGDSVSIPARWGTRLETGLDPVLPRIWAQREFNAVIRLTGTNDPAAL
Ga0318528_1046308713300031561SoilVTSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVAVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSVQPQTEPKSSPNR
Ga0318528_1072686423300031561SoilVGTASVRSGSADWFVVAEAGSAQVGVLAGTVDLTGAATRGSVSIPAHWGTRLESGLAPMMPRVWAQMEFNAVIRLTECCQSAQPKLETPPAR
Ga0318573_1024098213300031564SoilVMLSLTQGVLRVRVTSVTVPSTFEVSTAVGTASVASESADWFIRAQAGSAQVGVLTGIVDLTSTTTRQSVSIPAHWGTRLETGLDPVLPRIWAHTEFDAVIRLTGG
Ga0310915_1036031313300031573SoilWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0318555_1068277323300031640SoilSSAVGTAAASSASADWFVESKAGSARVAVLAGIVDLTSNLIGQSVSIPAHWGARLEAGRAPVPPRVWNQMEFNAFIRITQ
Ga0318561_1022417323300031679SoilAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0318501_1033291113300031736SoilVTSVTVPSTFEVSTAVGTASVASESADWFIRAQAGSAQVGVLTGIVDLTSTTTRQSVSIPAHWGTRLETGLDPVLPRIWAHTEFDAVIRLTGG
Ga0318535_1052199813300031764SoilSISVTSYSIDGSGRNVRLRLMGGALRAQVTSVTGPSTFEVSTAGGTASLGSAPADWFIKAQVDSAQVGVLAGTVDLTSIVTRQSVSIPAHWGTRLEAGRAPMLPRLWPQVDFNPLSRLTECCQPVQPQTEPRSSPSR
Ga0318535_1056575313300031764SoilGRNVRLRLMGGALRAQVTSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGLDPVLPRLWAQAEFDAVIHLTGA
Ga0318554_1054646413300031765SoilVRLRLMGGALRAQVTSVTGPSTFEVSTAGGTASLGSAPADWFIKAQVDSAQVGVLAGTVDLTSIVTRQSVSIPAHWGTRLEAGRAPMLPRLWPQVDFNPLSRLTECCQPVQPQTEPRSSPSR
Ga0318497_1038911923300031805SoilRVTSVTVPSTFEVSTAVGTASVASESADWFIRAQAGSAQVGVLTGIVDLTSTTTRQSVSIPAHWGTRLETGLDPVLPRIWAHTEFDAVIRLTGG
Ga0318568_1044900513300031819SoilTAAAASDSADWFVKAEARSAQVGVLAGTVNLTSSPTGESVSIPTHWGTRLEAGRAPVPPRVWNQMEFNAVIRVTE
Ga0318567_1039558023300031821SoilVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0307478_1167804213300031823Hardwood Forest SoilRDVRLSLTQGLLRAQVSSVTGPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTTTGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0318512_1034172513300031846SoilAQTGSAQVGVLTGTVDLTSTVTEEAVSIPARWGTDLESGRAPVLPRRWAQNEFDAVIRLTECCQSVLPKLGPAPVPLR
Ga0318512_1044036013300031846SoilISVTSYSIDGSGRNVRLRLMGGALRAQVTSVTGPSTFEVSTAGGTASLGSAPADWFIKAQVDSAQVGVLAGTVDLTSIVTRQSVSIPAHWGTRLEAGRAPMLPRLWPQVDFNPLSRLTECCQPVQPQTEPRSSPSR
Ga0306919_1013950333300031879SoilLSLTHGVLRAQVTPVKGLSKFEVSIPVGTASVGSDSADWFIKAEAGSAQVGVLAGTVDLTSIPTGQSVSIPARWGTHLETGLDPVLPRVWTQREFSAVTRLTGA
Ga0306919_1032715813300031879SoilTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGLDPVLPRLWAQAEFDAVIHLTGA
Ga0306925_1064912323300031890SoilSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0306923_1022263233300031910SoilLTRGALRAQVNFIRDPSTFEVATASGTASVASASADWFIKAQPDAAQVAVLVGTIDLRSAVTGDSVSIPARWGTRLETGLDPVLPRIWAQREFNAVIRLTGTNDPAAL
Ga0306923_1040120533300031910SoilEVSTAGGTASVGSASADWFIKAQVDSAQVGVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGLDPVLPRLWAQAEFDAVIHLTGA
Ga0310912_1008181553300031941SoilLTDTSTFEIATTAGTASVASASADWFVEAQRDAAQVGVLAGTIDLRSALTGDSVSIPARWGTLLETGLDPVLPRVWAQREFNAVIRLTGT
Ga0310916_1021331313300031942SoilRYKITSSGRDVKLSLTQGLLRAQVSSVSSPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0310916_1056831723300031942SoilDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0310916_1060505813300031942SoilGVLRVRVTSVTVPSTFEVSTAVGTASVASESADWFIRAQAGSAQVGVLTGIVDLTSTTTRQSVSIPAHWGTRLETGLDPVLPRIWAHTEFDAVIRLTGG
Ga0310913_1038646323300031945SoilEVSTAGGTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0310910_1014259343300031946SoilGLLRVASVTRPFEVSTAVGTAAAASDSADWFVKAEARSAQVGVLAGTVNLTSSPTGESVSIPAHWGTRLEAGPPVPPRVWNQMEFNAVIRVTE
Ga0310910_1052149713300031946SoilSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0310910_1054625913300031946SoilTSVTVPSTFEVSTAVGTASVASESADWFIRAQAGSAQVGVLTGIVDLRSTTTRQSVSIPAHWGTRLETGLDPVLPRIWAHTEFDAVIRLTGG
Ga0306926_1096533713300031954SoilVTPVKGLSQFEVSIPVGTASVGSDSADWFIKAEAGSAQVGVLAGTVDLTSIPTGQSVSIPARWGTHLETGLDPVLPRVWTQREFSAVTRLTGA
Ga0306926_1169927423300031954SoilKAQVDSAQVAVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSVQPQTEPKSSPNR
Ga0318530_1048657413300031959SoilIRDPSTFEVATASGTASVASASADWFIKAQPDAAQVAVLVGTIDLRSAVTGDSVSIPARWGTRLETGLDPVLPRIWAQREFNAVIRLTGTNDPAAL
Ga0318531_1040114813300031981SoilVSTAGGTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0318531_1058573813300031981SoilVRVTSLAVPSTFEVSTAVGTASVASESADWFIRAQAGSAQVGVLTGIVDLRSTTTRQSVSIPAHWGTRLETGLDPVLPRIWAHTEFDAVIRLTGG
Ga0306922_1026213613300032001SoilSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGLDPVLPRLWAQAEFDAVIHLTGA
Ga0306922_1144402623300032001SoilDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0306922_1159512013300032001SoilALRAQVTSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVAVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGLDPVLPRLWAQAEFDVVIQLTGT
Ga0306922_1236640423300032001SoilAVVAPVGGPSTFEVSTAVGTASVRSGSADWFVVAEAGSAQVGVLAGTVDLTSAATRGSVSIPAHWGTRLESGLAPMMPRVWAQMEFNAVIRLTECCQSAQPKLETPPAR
Ga0318569_1060740213300032010SoilSSAVGTATASSASADWFVESKAGSARVAVLAGIVDLTSNLIGQSASIPAHWGARLEAGRAPVLPRVWSQMEFNAFIRITQ
Ga0318549_1039551013300032041SoilTSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0318558_1015853213300032044SoilVSTAGGTASLGSAPADWFIKAQVDSAQVGVLAGTVDLTSIVTRQSVSIPAHWGTRLEAGRAPMLPRLWPQVDFNPLSRLTECCQPVQPQTEPRSSPSR
Ga0318533_1070019213300032059SoilTVERYKITSSGRDVKLSLTQGLLRAQVSSVSSPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHSTQ
Ga0318505_1012972823300032060SoilINGSGRDVTLSLTQGVLRVRVTSLAVPSTFEVSTAVGTASVASESADWFIRAQAGSAQVGVLTGIVDLTSTTTRQSVSIPAHWGTRLETGLDPVLPRIWAHTEFDAVIRLTGG
Ga0318505_1037304413300032060SoilRNVRLSLTRGALRAQVNFIRDPSTFEVATASGTASVASASADWFIKAQPDAAQVAVLVGTIDLRSAVTGDSVSIPARWGTRLETGLDPVLPRIWAQREFNAVIRLTGTNDPAAL
Ga0306920_10050951743300032261SoilVTSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLAGTVDLTSIVTGQSVSIPAHWGTRLEAGLDPVLPRLWAQAEFDAVIHLTGA
Ga0306920_10078477733300032261SoilRLSLTRGALRAQISSIRDPSTFEVATASGTASVASASADWFIKAQPDAAQVAVLVGTIDLRSAVTGDSVSIPARWGTRLETGLDPVLPRIWAQREFNAVIRLTGTNDPAAL
Ga0306920_10113787313300032261SoilVTSVTGPSTFEVSTAGGTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0310914_1050763823300033289SoilTASVGSASADWFIKAQVDSAQVGVLVGTVDLTSIVTGQSVSIPAHWGTRLEAGRAPMLPRVWPQVDFNALSRLTECCQSAQPQTEPR
Ga0310914_1102584213300033289SoilSSITVERYKITSSGRDVKLSLTQGLLRAQVSSVSSPSTFDVSTAAGTASVGSASADWFIRTQTGLAQAGVLAGKIDLTSTATGESVSIPAHWGTRLEAGRSPVPPRNWTQAEFEEVTHST


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.