NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F094174

Metagenome / Metatranscriptome Family F094174

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F094174
Family Type Metagenome / Metatranscriptome
Number of Sequences 106
Average Sequence Length 181 residues
Representative Sequence SRGLYWRGFALWRRAINGFNESPSPQDLEEDLNAAIADFNDSLAQDHAFVESKIAEASCFGYLAYLSMKDPARMQDMIQHSSPLLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYNRGLEAVRSRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAAAALILVPYWHYVRDILLPQIQDAQAKAR
Number of Associated Samples 92
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score0.83

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(21.698 % of family members)
Environment Ontology (ENVO) Unclassified
(35.849 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(66.981 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 67.39%    β-sheet: 0.00%    Coil/Unstructured: 32.61%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.83
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
a.118.8.0: automated matchesd3fdha_3fdh0.67383
a.118.8.1: Tetratricopeptide repeat (TPR)d1w3ba_1w3b0.66922
a.118.8.2: Transcription factor MalT domain IIId1hz4a_1hz40.65839
a.102.1.2: Cellulases catalytic domaind1ks8a11ks80.65467
a.102.1.2: Cellulases catalytic domaind1clca11clc0.64053


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 106 Family Scaffolds
PF01680SOR_SNZ 11.32
PF030614HBT 1.89
PF13545HTH_Crp_2 0.94
PF01882DUF58 0.94
PF07883Cupin_2 0.94
PF02075RuvC 0.94
PF14534DUF4440 0.94
PF15902Sortilin-Vps10 0.94
PF13435Cytochrome_C554 0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 106 Family Scaffolds
COG0214Pyridoxal 5'-phosphate synthase subunit PdxSCoenzyme transport and metabolism [H] 11.32
COG0817Holliday junction resolvasome RuvABC endonuclease subunit RuvCReplication, recombination and repair [L] 0.94
COG1721Uncharacterized conserved protein, DUF58 family, contains vWF domainFunction unknown [S] 0.94


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil21.70%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil16.04%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil10.38%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil9.43%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil5.66%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil4.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil3.77%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil3.77%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds2.83%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.83%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.89%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.89%
Permafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost Soil1.89%
FenEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Fen1.89%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa1.89%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.94%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.94%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.94%
PalsaEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Palsa0.94%
SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Soil0.94%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.94%
RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere0.94%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300004082Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3EnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004635Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005533Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1EnvironmentalOpen in IMG/M
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005712Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300005952Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 1 DNA2013-045EnvironmentalOpen in IMG/M
3300006047Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013EnvironmentalOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006086Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300014495Permafrost microbial communities from Stordalen Mire, Sweden - 712P3M metaGEnvironmentalOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300018007Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021433Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-OEnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300022533Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300023226Peat soil microbial communities from Stordalen Mire, Sweden - 717 E2 1-5EnvironmentalOpen in IMG/M
3300024225Spruce rhizosphere microbial communities from Bohemian Forest, Czech Republic ? CZU5Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026217Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 1 DNA2013-045 (SPAdes)EnvironmentalOpen in IMG/M
3300026984Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF048 (SPAdes)EnvironmentalOpen in IMG/M
3300027064Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF003 (SPAdes)EnvironmentalOpen in IMG/M
3300027109Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF008 (SPAdes)EnvironmentalOpen in IMG/M
3300027505Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_O3 (SPAdes)EnvironmentalOpen in IMG/M
3300027535Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027562Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027565Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027727Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027853Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 (SPAdes)EnvironmentalOpen in IMG/M
3300027855Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027867Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027879Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027895Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 (SPAdes)EnvironmentalOpen in IMG/M
3300028739Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Fen_E1_3EnvironmentalOpen in IMG/M
3300028795Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Palsa_N1_1EnvironmentalOpen in IMG/M
3300028800Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-21-26 metaGHost-AssociatedOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030491Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Fen_E3_3EnvironmentalOpen in IMG/M
3300030580II_Palsa_N1 coassemblyEnvironmentalOpen in IMG/M
3300030741Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada ANR Co-assemblyEnvironmentalOpen in IMG/M
3300030743Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VCO Co-assemblyEnvironmentalOpen in IMG/M
3300030862Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSE5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030940Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSI1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300032805Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033158Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.1EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1086928913300001593Forest SoilEDLKGAIADFNDSLAHDPAFVESKIGAASCFGYLAYMSMKDPARMEDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRSRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAAAALILVPYWHYVRDILLPQIQDAQAK
Ga0062384_10042898413300004082Bog Forest SoilDELAPIPEDNQLASRVLYWRGFALWRSAINGFNESPTPTDLEADLKQAVTDFNDAIVRAPGWVEPKIGAGSALGYLMYLNKKDPTRVQELLQQSSSILKEAMETAPDNPRLLWVLGPIRWSSPSERGGGQDKAIEGYNKGLEVIRKQKRNVVYPLEPSWGEPELLMSLAWSNLNRTTPDLNAAAQDAEAALKLVPYWHYVRDILMPQIRAAQVKALHSGRIVGNSEVMA*
Ga0062384_10149018513300004082Bog Forest SoilVLYWRGFALWRRAINGFNESPTPEDLEEDLTLAVTDFKDAIARDPAFVEPKIGAGSSLGYLMYLHKKDPTRVEELLQQSSPLLKDAMTTAPDNPRLLWVLGPIRWLSPPERGGGQEKAIETYNKGLEAVHKQMRAASDPMEPSWGEPELLMSLAWSNLNRTTPDLN
Ga0062387_10054795013300004091Bog Forest SoilRVLYWRGFALWRSAINGFNESPTPTDLEADLKQAVTDFNDAIVRAPGWVEPKIGAGSALGYLMYLNKKDPTRVQELLQQSSSILKEAMETAPDNPRLLWVLGPIRWSSPSERGGGQDKAIEGYNKGLEVIRKQKRNVVYPLEPSWGEPELLMSLAWSNLNRTTPDLNAAAQDAEAALKLVPYWHYVRDILMPQIRAAQVKALHSGRIVGNSEVMA*
Ga0062389_10356109813300004092Bog Forest SoilDYEGDRPDLKRLHDELTPIPEDNKLASRVLYWRGFALWRRAINGFNESPTPTDLEEDLKQAVTDFKDAIARDPAFVEPKIGAASSLGYLMFLHRKDPTLMQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRNQKRDVVDPLEPSWGEPELLMSLAWSNLNRTTPDLK
Ga0062388_10159903313300004635Bog Forest SoilDYEGDRPDLKRLHDELTPIPEDNKLASRVLYWRGFALWRRAINGFNESPTPTDLEEDLKQAVTDFKDAIARDPAFVEPKIGAASSLGYLMFLHRKDPTLMQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRNQKRDVVDPLEPSWGEPELLMSLAWSNLNRTTPDLKAAEQYAQSALKLVPYWHYVRDILMSQ
Ga0070703_1041090113300005406Corn, Switchgrass And Miscanthus RhizosphereAFVESKIGAASCFGYLAYLSMKDPARMQDMIQNSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLDPSWGEPELLMNRAWSNLHRTTPDLKAAHNDAAAALILVPYWHYVRDILLPQIQDAQAKAR*
Ga0070734_1032119113300005533Surface SoilVLYWRGFAMWRRAINGFNETPTPTDLEDDLNQAIADFRDSLAQDPKYVESKIAEASCLGYVMYLHRNDQSRIQELIQQSSPLLKDAMATDPDNPRLLWVLGPIRWSSPSERGGGQDNAIELYNRGLDIVRRQNPATDPLEPSWGEPELLMNRAWSNLNRTTPELTAAQQDADAALKSVPYWHYVRDILVPQIQQAKAKAKTP*
Ga0070735_1070031113300005534Surface SoilRAINAFNETPTPTDIPDDLNAAIADFKDSLARDSTFVESKIAEASCLGYLMYLNMKDQSRVQELIQQSSPLLKDAMATDPDNPRLLWVLGPIRWSTPLERGGGQEKAFDLYDRGLAAARKRLASTDPLDPSWGEPELLMNRSWSHLHEKTPDLKAAQKDAEAALAIVPYWHYVKDILLPQIQEAQKKASNLSG*
Ga0070697_10001571453300005536Corn, Switchgrass And Miscanthus RhizosphereMLASRGLYWRGFALWRRAINGFNESPSPQDLEEDLKGAIADFNNSLAHDPAFVESKIGAASCFGYLAYLSMKDPARMQDMIQNSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLDPSWGEPELLMNRAWSNLHRTTPDLKAAQHDAAAALILVPYWHNVRDILLPQIQDAQAKAR*
Ga0070733_1053938713300005541Surface SoilEEDLNGAVTDFNDSLAQDATFVESKIAEGSCFGYLAYLNMKNPARAQEQIQHSSPLLKEAMAAAPENPRLLWVLGPIRWSSPPERGGGQDKAFDLYNRGLEAIRKTAAVTDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQTDAAAALALVPYWHYVRDILIPQIQDAQSQSH*
Ga0066707_1042937013300005556SoilAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAGSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAAAALILVPYWHYVRDILLPQIQDAQAKAR*
Ga0070761_1036098323300005591SoilFVESKIAEGSCFGYLAYLNMKDPARAQEQIQHSSPLLKEAIAAEPENPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPAVSDPVQPSWGEPELLMSRAWSNLHRTAPDLKAAQNDAEAALVLVPYWHYVRDILMPQIKDAQAKAR*
Ga0070761_1072402613300005591SoilHDELTPIPEDNKLASRVLYWRGFALWRRAINGFNESPTPTDLEGDLTQAVTDFKDAIARDPAFVEPKIGAGSSLGYLMYLHRKDPTVVQELLEQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRNQKRGVVDPLEPSWGEPELLMNLAWSNLNRTTPDLKAADQYAQAALKLVPYWHYVRDILMPQ
Ga0070762_1051798113300005602SoilAQDATFVESKIAEGSCFGYLAYLNMKDPARAQEQIQHSSPLLKEAIAAEPDNPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPAVSDPVQPSWGEPELLMSRAWSNLHRTAPDLKAAQNDAEAALVLVPYWHYVRDILMPQIKDAQAKVR*
Ga0070764_1103836013300005712SoilPIPEDNKLASQVLYWRGFALWRRAINGFNESRAPTDLDEDLTQAVTDFKDAIARNPAFVEPKIGAVSSLGYLMYLNKKDPTRVQELLQQLSPLLKEATAAAPNNPRLLWVLGPIRWSSPPDRGGGQDKAFESYNRGLEAVRNQKRDTSDPLEPAWGEPELLMSLAWSNLNRT
Ga0070766_1066551613300005921SoilPTPTDLETDLTQAIADFKDSIVRDPGFVEPKIGAGSSIGYLMYLHRKDQARVQELFQQSSPLLKEALATDPDNPRLLWVLGPIRWSSPPDRGGGQDKAFELYNRGLQEIRKQKPSSDPLEPSWGEPELLMSLAWSYLNRATPDLKAAQEDAQAALELVPYWHYVRDILMPQIQAAQLKAR
Ga0080026_1019347013300005952Permafrost SoilTPKDLEEDLNGAITDFNESLTQDPAFVESKIAEGSCYGYLMYLNMKDPARMQEMIQHSSPLLKDAMAAAPDNPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPSATDPLEPAWGEPELLMNRAWSHLHSTSPDLKAAQSDAAAALAQVPYWHYVRDILMPQIQEAQAKAH*
Ga0075024_10059144213300006047WatershedsEGDRATLKRLHDELTAIPKDNRLASRVLYWRGFALWRRAINGFNESPTPTDLEEDLTQAVTDFKDGIARDPAFVEPKIGAGSCLGYLMYLSKKDPKRVQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAIEIYNKGLEAVRNHKRDVVDPLEPSWGEPELLMNLAWSNLNRTAPDLKAAEQYA
Ga0075028_10050919313300006050WatershedsDLTQAVTDFKGGIARDPAFVEPKIGAGSCLGYLMYLSKKDPKRVQELLQQSSPLLKEAMAAAPDNPRLLWVLGPIRWSSPPERGEGQDKAFEIYNKGLEVVRNQKRGVVDPREPEWGEPELLMSLAWSNLNRTTPDLKAAEQDAQAALKIVPYWHCVRDILMPQVRAAQAKAILRQKSE*
Ga0075019_1113448813300006086WatershedsLWRRGINGFNDNVSPQELEDDFKAAIADFNASLAQDPSFVESKIAEGSCYSNLVYLYRSDPARMQEMIQHSSPLLKDAMAADPDNPRLAWVLGPIRWNQPPERGGGQDKAFDLYDRALDAIHKKPSSADPLEPSSWGEPELLMSRSWSNLNKIKPDPKAAQSDAEAALQ
Ga0070765_10023045033300006176SoilDATFVESKIAEGSCFGYLAYLNMKDPARMQEQIQHSSPLLKEAIASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYNRGLDIVRKQAAVSDPLEPSWGEPELLMSRAWSNLHRTAPDLKAAQNDAEAALVLVPYWHYVRDILMPQIKDAQAKVR*
Ga0070765_10156288013300006176SoilPIPEDNKLASRVLYWRGFALWRRAINGFNESPTPTDLEEDLTQAVTDFKGAIAREPAFVESKIGAGSSLGYLMYLNRKDATRVQKLFQQSSPLLKEAMAATPDNPRLLWVLGPIRWSSPPERGGGQDKAVELYNKGLEAVRNQKRDVVDPLEPSWGEPELLMSLAWSNLNRTAPDLNAAEQGAQAALKMVPYWHYVRDILMPQIRA
Ga0070765_10206582913300006176SoilPEDNKLASRVLYWRGFALWRRAINGFNESPIPTDLEQDLMQAVSDFNDATARDLAFVEAKIGASSSLGYLMYLNKKDPTRVQELLQRLSPLLKEAMATAPDNPRLLWVLGPIRWASPPERGGGQDKAIEIYNKGLEAVRDQKRDVVDPLEPSWGEPELLMSLAWSNLNRTAPDLNA
Ga0070765_10209492213300006176SoilSQVLYWRGFALWRRAINGFNESPTPTDLEEDLTQAVTDFKDAIARNPAFVEPKIGGVSSLGYLMYLNKKDPTHVQELLQQLSPLLKEATAAAPNNPRLLWVLGPIRWSSPPDRGGGQDKAFENYNRGLEAIRIQKRDASDPLEPSWGEPELLMSLAWSNLNRTTPDLNAAEQDAQ
Ga0066659_1069380123300006797SoilYLSMKDPARVQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAAAALILVPYWHYVRDILLPQIQDAQAKAR*
Ga0099829_1036140413300009038Vadose Zone SoilLWRRAINGFNESPTPKDLEEDLNSAISDFNNSLAQDPAFVDSKIAEGSCYGYLMYLNMKDPARMQELMQHSSPLLKEAMAAAPDNPRLLWVLGPIRWSSPPERGGGQDKAFVLYDRGLAAIRNSPAVSDPLEPSWGEPELFMNRAWSNLHRTTPDLKAAQTDAAAALTLVPYWHYVRDILIPQIQDAQSKSH*
Ga0099792_1075679113300009143Vadose Zone SoilMLASRGLYWRGFALWRRAINGFNESPSPKDLEEDLNGAIADFNDSLAQDPAFVESKIGEASCFSYLAYLRMKDPARMQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDVYRRGLEAVRTKPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAHTDAAAALLLVPYWHYVRDVLLPQIQEAQSKSR*
Ga0126381_10385488413300010376Tropical Forest SoilSLWRRAINGFNDSVPPKELEDDLNEAISDFNDSLAQDPGFVESKIGAASCYSNLVYLNRKNPTHVQELIQHSSPLLKEVMAADPDNPRLLWVLGPIRWSAPPERGGGQDKAFELYNRGLEAIRKRPAVSDPLEPSWGEPELLMARAWSNLNKTTPDPKAAQKDAEAALQLVPYWHYVRDILMRQILDAQGKAK*
Ga0137393_1131630913300011271Vadose Zone SoilELYWRGFALWRRAINGFNESPTPQDLEEDLKGAIADFNNSLAHDPAFVESKIGAASCFGYLAYLSMKDPARMHDMIQNSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLDPSWGEPELLMNRAWSNLHRTTPDLKAAQHDAAAALILVPYWHYVRDVLLPQIQEAQSKSR*
Ga0137381_1130089113300012207Vadose Zone SoilTQIQRADYEGDRPALRRLHDELTPISGDNKMAARVLYWRGFALWRRAINGFNDSPTPTDLEADLTQAVADFKDSIARDPTFVEPKIGAGSSLGYLMFLHRKDPTLMQELLEQSSPLLKEAMATAPDNRQLLWVLGPIRWSSPPERGGGQDRAFEIYNKGLEALRNQKRGVVDPLEPSWGEPELLMSLAWSNLNRTTPDLKAADQYA
Ga0137376_1139880313300012208Vadose Zone SoilLYWRGFALWRRAINGFNESPSPKDLEEDLNGAIADFNDSLTDDPAFVESKIGEASCFGYLAYLSMKDPARVQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRSSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAAAALILVPYWHYVRDILLPQIQDAQA
Ga0137384_1055257513300012357Vadose Zone SoilTPVPDSKTLASREFYWRGFALWRRAINGFNESPTPKDLEEDLNSAISDFNNSLAQDPAFVDSKIAEGSCYGYLMYLNMKDPARMQELMQHSSPLLKEAMAAAPDNPRLLWVLGPIRWSSPPERGGGQDKAFVLYDRGLAAIRNSPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQTDAAAALTLVPYWHYVRDILIPQIQDAQSKSH*
Ga0137384_1067845113300012357Vadose Zone SoilGFALWRRAINGFNESPSPKDLEEDLNGAIADFNDSLTDDPAFVESKIGEASCFGYLAYLSMKDPARMQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRSSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAAAALILVPYWHYVRDILLPQIQDAQAKAR*
Ga0137385_1072225323300012359Vadose Zone SoilDSKIAEGSCYGYLMYLNMKDPARMQELMQHSSPLLKEAMAAAPDNPRLLWVLVPIRWSSPPERGGGQDKAFVLYDRGLAAIRNSPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQTDAAAALTLVPYWHYVRDILIPQIQDAQSKSH*
Ga0137360_1056821913300012361Vadose Zone SoilMLASLGLYWRGFALWRRAINGFNESPSPQDLEEDLKGAIADFNNSLAHDPAFVESKIGAASCFGYLAYLSMKDPARMHDLIQNSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLDPSWGEPELLMNRAWSNLHRTTPDLKAAQHDAAAALILVPYWHYVRDILLPQIQDAQAKAR*
Ga0137396_1052589113300012918Vadose Zone SoilSRGLYWRGFALWRRAINGFNESPSPQDLEEDLNAAIADFNDSLAQDHAFVESKIAEASCFGYLAYLSMKDPARMQDMIQHSSPLLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYNRGLEAVRSRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAAAALILVPYWHYVRDILLPQIQDAQAKAR*
Ga0182015_1006815523300014495PalsaVLYWRGFTLWRRAINGFNEFPTPTDLEADLTEAVTDFKDAITRDPAFVEPKIGAGSSLGYLMFLNKKNPSRVQELLQQSSLLLKGAMATAPDNPRLLWVLGPIRWSSPPESGGGQDKAFEIYNKGLETVRNHERDVLDPLEPSWGEPELLMSLAWSNLNRTVPDLNAAEKYAEAALKLIPYWHYVRDILIAQIRAAQAKAH*
Ga0182041_1140669513300016294SoilARVEYWRGFALWRRAINAFNETPTPADIPDDLNGAIADFNDSLASNPSFIESKIAAASCYGYLVFINRKEPERMQEFIQHSSLLLKEAMAADPDNPRLKWVLGPIRWSSPPERGGGQDKAIELYTQGLETIRKRPKPTDPLEPAWGEPELLMNRAWSNLNRTAPELKSAQVDAEAALALVPYWHYVKDILIPQIQTAQTKASQSK
Ga0187805_1036078523300018007Freshwater SedimentDAMASDPDNPRLLWVLGPIRWSSPPGRGGGQDKAFELYDRGLEAIRKKPAASDPLEPTWGEPELLMNRSWSYLHRTTPDLKAARKDAEAALAIVPDWHYVRDILLPQIRDAQSKAR
Ga0066667_1191336913300018433Grasslands SoilASCFGYLAYLSMKDPARVQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAAAALILVPYWHYVRDILLPQIQDAQAKAR
Ga0193735_111566213300020006SoilVGYLAYLSMKDPARMQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQTDAGAALTLVPYWHYVRDILLPQIQDAQAKAR
Ga0210407_1024244523300020579SoilFALWRRAINGFNQSPSPKDLEEDLNGAIADFSDSLTHDPAFVESKIGEASCFGYLAYLSIKDPARMQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAGAALTLVPYWHYVRDILLPQIQDAQAKAR
Ga0210407_1112517113300020579SoilIQRADYEGDCPTLRRLHDELTPIPEDNKLASRVLYWRGFALWRRAINGFNESPTPTDLEGDLTQAVTDFKDAIARDPAFVEPKIGAGSGLGYLMYLHRKDTTVVQELLAQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRHQKPGVVDPLEPSWGEPELLMNLAWSNLNRTTPDL
Ga0210403_1129334013300020580SoilDELTPIPEDDKLASRVLYWRGFALWRRAINGFNESPTPTDLEGDLTQAVTDFKDAIARDPAFVEPKIGAGSSLGYLMYLHRKDTTVVQELLEQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRHQKPGVADPLEPSWGEPELLMNLAWSNLNRTTPDLKAADG
Ga0210399_1124686013300020581SoilLKRLHDELTPIPEDNRLASRVLYWRGFALWRRAINGFNESPTPTDLEEDLKQAVTDFKGAIARDPAFVESKIGAGSSLGYLMYLHRKDPTVVQELLEQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFELYNKGLEAVRNQKRDVVDPLEPSWGEPELLMSLAWSNLNRTTPDLNAAEQDGQAA
Ga0210395_1094319213300020582SoilAIRMVEQIKRADYEGDRSALKRLHDDFTAAPGDKTLASRELYWRGFALWRRAISGFNESPTPKDLEEDLNGAISDFNNSLARDPGFVESKIAEGSCYGYLMYLNMKDPTRMQEMMQHSSPLLKEATAAAPDNPRLLWVLGPIRWSSPPERGGGHDKAFELYNRGLEIIRKSPAATDPLEPSWGEPELLMNRAWSYLHRTAPDLKAAQADATAA
Ga0210401_1115766013300020583SoilRVALKRLRGDLTPLPDDKVLASRELYWRGFALWRRAINGFNESPSPKDLEEDLNGAIADFNDSLIHDPAFVESKIGEASCFGYLAYLSMKDPARMQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDVYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAHTDAAAALLLVPYWHYVRDI
Ga0210401_1146813613300020583SoilLNSVHAELAPIPEDNKLASRVLYWRGFALWRRAINGFNESPTPTDLEEDLTQAVTDFNDAIARDLAFVEPKIGAGSSLGYLMYLNKKDPTRVQELLQKSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAIEIYNKGLEVVRDHKRDVVDPLEPSWGEPELLMSLAWSNLN
Ga0215015_1081255833300021046SoilMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFRTGPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAAAALILVPYWH
Ga0179596_1035649613300021086Vadose Zone SoilALWRRAINGFNESPTPKDLEEDLNSAISDFNNSLAQDPAFVDSKIAEGSCYGYLMYLNMKDPARMQELMQHSSPLLKEAMAAAPDNPRLLWVLGPIRWSSPPERGGGQDKAFVLYDRGLAAIRNSPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQTDAAAALTLVPYWHYVRDILIPQIQDAQSKSH
Ga0210405_1098744913300021171SoilEGDRPALKRLHDGLSPIPEDNKLASQVLYWRGFALWRRAINGFNETPTPTDLEADLTQAVTDFKGAIARDPASVEPKIGAGSSLGYLMYLNRKDPTRVQELFQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFAGYNRGLEAIRNQKRDASDPLDPSWGEPELLMSLAWSNLNRTTPDLNAAEQNAQAALKIVPHWHYV
Ga0210396_1156370413300021180SoilLHDELTPIPEDNKLAARVLYWRGFALWRRAINGFNESPTPTDLEGDLTQAVTDFKDAMGRDPAFVEPKIGAGSSLGYLMFLHRKDPSLMQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEALRNQKRGVVDPLEPSWGEPELLMNLAWSNLNRTTP
Ga0210397_1029878123300021403SoilGYLMYLNMKDQSRVQELIQQSSPLLKDAMATDPDNPRLLWVLGPIRWSTPPERGGGQDKAFDLYNRGIESIRKRPAATDPLEPSWGEPELLMNRAWSYLHAKTPDPPAAQKDAEAALAIVPYWHYVKDILLPQIREAQTKSK
Ga0210397_1121871113300021403SoilIIASSILYRRGFALWRSAINGFNESPTPKDLEEDLNGAITDFNDSLVQSPAFVESKIAEGSCYGYLAYLNLKDPARMQEMIQHSSPLLKEAMASAPDNPRLLWVLGPIRWSSPPERGGGQDKAFALYDRGLEIIRKSPAINDPLEPSWGEPELLMNRAWSHLHSTTPDLKAAQADASAALALVPYWHYVRDILI
Ga0210386_1000462013300021406SoilLKRLHDELAPIPEDSKLASRVLYWRGFALWRRAINGFNETPTPIDLEADLTQAVTDFNAAIARDPTWVEPKMGAGSSLGYLMYLNKKDPTRVQELLQQSSALLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAIEGYNKGLEVMRDHKRDVADPLEPSWGEPELLMSLAWSNLNRTAPDLKAAEQYAEAALKLVPYWHYVRDILMPQIRAAQAKALHSGRIVGNSEVMA
Ga0210383_1124620513300021407SoilLHDELAPIPEDNKLASRVLYWRGFALWRRAINGFNETPTPTDLEADLKQAVTDFNDSIARDPAFVESKIGAGSSLGYLMYLNMKNPTRVQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAIEIYNKGLEVARDHKRDVVDPLEPSWGEPELLMSLAWSNLNRTTPDLQAAEQYAEAALKLVPYWHYVRDILM
Ga0210394_1167252713300021420SoilGDKTLASRVSYWRGFALWRRAINGFNETPTPSDLADDFKAAIVDFDNSLAQDPTFVDSKIAAASCYGYLLYLNMKDQARMQEYIEKGRPYMQDAMTAAPNNPRLLWVLGPYRWTAPPEHGGGQDKAFAVYNHGLDIIRKTPAVTDPLEPSWGEPELLMNRSWSYLHQTTPDLKAAQQ
Ga0210391_1078511013300021433SoilLTPIPEDNKLASRVLYWRGFASWRRAINGFNETPTPKDLEEDLTQAVTDFKDSIARDSAFVESKIGASSSLGYLMYLHKNDPTRVQELLQQSSPLLKEAIVTAPDNPRLLWVLGPIRWSSPPELGGGQDKAFEIYDKGLKAIRNQRRDGSDPLEPSWGEPELLMSLAWSNLNRTTPDLNAAEQDAQGALKIVPYWHYVRDILMAQIRAAQAKAILRRQSG
Ga0210391_1106363513300021433SoilADYEGDRATLKRLHDELTPIAEDNKLASRVLYWRGFALWRRAINGFNDSPTPTDLEADLTQAVADFKDSIARDPAFVEPKIGAGSSLGYLMFLHRKDPTLMQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRNQKRGVVDPLEPSWGEPELLMNLAWSNLNRTTPDLKAADGYAQAALKLVP
Ga0210392_1111233713300021475SoilVAPAVAVAARAEPTVHEQPARAVPENNKLASRVLYWRGFALWRRAINGFNDSPTPTDMEADLTQAVTDFKDAIARDPAFVEPKIGAGSSLGYLMFLHRKDPTIMQELLQQSSPHLKEALATAPDNPRLLWVLGPIRWSSPPERGEGQDKAFEIYNKGLEVVRNQKRGVVDPLEPEWGEPELLMSLAWSNLNRTTPDL
Ga0210402_1190390213300021478SoilWRGFALWRSAINGFNETPTPKDLEEDLNGAIADFNDSLTQDPTFVESKIAEGSCYGYLMYLNMKDPARMQEMIQHSSPLLKEAMASAPDNPRLLWVLGPIRWSSPPERGGGQDKAFALYDRGLDAIRKSPAATDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQADASAALA
Ga0242662_1029584213300022533SoilLASRVLYWRGFALWRRAINGFNESPTPTDLEEDLKQAVTDFKGAIARDPAFVESKIGAGSSLGYLMYLHRKDPTVVQELLEQSSPLLKEAMATAPDNPRLLWVRGPIRWSSPPERGGGQDKAFELYNKGLEAVRNQKRDVVDPLEPSWGEPELLMSLAWSNLNRTTPDLNAAEQDGQAA
Ga0224536_103094013300023226SoilGFNESPTPTDLEEDLTQAVTDFKDGIARDPAFVEPKIGAGSCLGYLMYLSKKDPKRVQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAIEIYNKGLEAVRNDKRDVVDPLEPSWGEPELLMNLAWSNLNRTAPDLKAAEQYAAAALNLVPY
Ga0224572_110382013300024225RhizosphereEDLNGAIADFNDSLAQDATFVESKIAEGSCFGYLAYLNMKDPARAQEQIQHSSPLLKEAIAAEPENPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPAVSDPLQPSWGEPELLMSRAWSNLHRTAPDLKAAQNDAEAALALVPYWHYVRDILMPQIKDAQAKVR
Ga0207684_1001114133300025910Corn, Switchgrass And Miscanthus RhizosphereMLASRGLYWRGFALWRRAINGFNESPSPQDLEEDLKGAIADFNNSLAHDPAFVESKIGAASCFGYLAYLSMKDPARMQDMIQNSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLDPSWGEPELLMNRAWSNLHRTTPDLKAAQQDAAAALILVPYWHYVRDILLPQIQDAQAKAR
Ga0209871_110835313300026217Permafrost SoilFNETPTPKDLEEDLNGAITDFNESLTQDPAFVESKIAEGSCYGYLMYLNMKDPARMQEMIQHSSPLLKDAMAAAPDNPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPSATDPLEPAWGEPELLMNRAWSHLHSTSPDLKAAQSDAAAALAQVPYWHYVRDILMPQIQEAQ
Ga0208732_103057513300026984Forest SoilPIPEDNKLASRVFYWRGFALWRRAINGFNESPTPTDLEADLTQAVADFKDSIARDPAFVEPKIGAGSSLGYLMFLHRKDPPLMQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEALRNQKRGVVDPLEPSWGEPELLMNLAWSN
Ga0208724_103332013300027064Forest SoilPLLKEAIAAEPDNPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPAVSDPLQPSWGEPELLMSRSWSNLHRTAPDLKAAQNDAEAALALVPYWHYVRDILMPQIKDAQAKVR
Ga0208603_105826813300027109Forest SoilLYWRGFALWRRAINGFNESPTPTDLEEDLTQAVTDFKDAIARSPAFVEPKIGAVSSLGYLMYLNKKDPTRVQELLQQLSPLLKEATGAAPNNPRLLWVLGPIRWSSPPERGGGQDKAFESYNRGLEAVRNQTRDASDPLEPSWGEPELLMSLAWSNLNRTTPDLKAAEQDAQAALKIVPYWHYVRDILMPQIQA
Ga0209218_106111913300027505Forest SoilKLASRVLYWRGFASWRRAINGFNETPTPKDLEEDLTQAVTDFKDSIARDSAFVESKIGASSSLGYLMYLHKNDPTRVQELLQQSSPLLKEAIVTAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYDKGLKAVRNQKGDGSGPLEPSWGEPELLMSLAWSNLNRTTPDLNAAEQDAQAALKIVPYWHYVRDILMPQIRAAQAKVILPRQSG
Ga0209734_108619113300027535Forest SoilKRADFEGDRVALKRLHGELTPLRDNKMLASRGFYWRGFALWRRAINGFNESPSPQDLEEDLNGAIADFNDSLTHDPAFVESKIGEASCFGYLAYLSMKDPARMQDMIQHSSSFLKDATASDSDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQN
Ga0209735_111421413300027562Forest SoilIPEDNTLAARVLYWRGFSLWRRAINGFNESPTPTDLEEDLKQAVTDFKDSIARDPAFVEPKIGAGSSLGYLMYLHRKEPSVVQELLEQSSPLLKQAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNNGLEAVRNHKRDVVDPLEPSWGEPELLMSLAWSNLNRTTPDLNAAEQDAQGGLSTTTTTTEAL
Ga0209219_100330783300027565Forest SoilLATSLGRNLKGNKPPEFRERLHHELTPIPEDNKLAARVLYWRGFALWRRAINGCNESPTPTDLEVDLTQAVTDFKMAIARDPASVEPKIGAGSSLGYLMYLYRKDPSVVPELLEQSSPLLREAIATAPDNPRLLWVLGPIRLSSPPERGGGQDKAFEIYNKGLEALRNQKRGVVDPLDPSWGETELLMNLAEPRRI
Ga0209328_1010949013300027727Forest SoilVLYWRGFALWRRAINGFNESPTPTDLEADLTQAVADFKDSIVRDPAFAEPKIGAGSSLGYLMFLHRKDPTLMKELLEQSSPLLKEAMAIAPDNPRLLWVLGPIRWSSSPERGGGQDKAFEIYNKGLEALRNQKRGVVDPLEPSWGEPELLMNLAWSNLNRTAPDLNAAEQYAEAAVKLVPDWHYVRDILMVQIRDAKAKQNPTG
Ga0209274_1053370313300027853SoilIVTQIQRADYEGDRPTLRRLHDELTPIPEDNKLASRVLYWRGFALWRRAINGFNESPTPTDLEGDLTQAVTDFKDAIARDPAFVEPKIGAGSSLGYLMYLHRKDPTVVQELLEQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRNQKRGVVDPLEPSWGEPELLMNLAWSNLNRTTPDLK
Ga0209693_1048741813300027855SoilLWRRAINGFNESPTPTDLEGDLTQAVTDFKDAIARDPAFVEPKIGAGSSLGYLMFLNKKNPSRVQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRNHKRDAVDALEPSWGEPELLMSLAWSNLNRTVPDLNTAEQYAEAALKLIPYWHYVRDILMAQIRAAQAKAH
Ga0209167_1034294023300027867Surface SoilPTPKDLEEDLNGAVTDFNDSLAQDATFVESKIAEGSCFGYLAYLNMKNPARAQEQIQHSSPLLKEAMAAAPENPRLLWVLGPIRWSSPPERGGGQDKAFDLYNRGLEAIRKTAAVTDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQTDAAAALALVPYWHYVRDILIPQIQDAQSQSH
Ga0209169_1065563113300027879SoilPALKRLNDELTPIPEDNKLASQVLYWRGFALWRRAINGFNESRAPTDLDEDLTQAVTDFKDAIARNPAFVEPKIGAVSSLGYLMYLNKKDPTRVQELLQQLSPLLKEATAAAPNNPRLLWVLGPIRWSSPPERGGGQDKALEIYNKGLEAIRNQKRDTSDPLHPSWGEPELLMSLAWSNL
Ga0209275_1033175023300027884SoilLWRRAINGFNESPTPKDLQEDLNGAIADFNDSLAQDATFVESKIAEGSCFGYLAYLNMKDPARAQEQIQHSSPLLKEAIAAEPDNPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPAVSDPVQPSWGEPELLMSRAWSNLHRTAPDLKAAQNDAEAALVLVPYWHYVRDILMPQIKDAQAKVR
Ga0209275_1067269813300027884SoilPSARSVPTSNSANVVAPAVAVAARAEPTVHEQPARAVPENNKLASRVLYWRGFALWRRAINGFNDSPTPTDMEADLTQAVTDFKDAIARDPAFVEPKIGAGSSLGYLMFLHRKDPTIMQELLQQSSPHLKEALATAPDNPRLLWVLGPIRWSSPPERGEGQDKAFEIYNKGLEVVRNQKRGVVDPREPEWGEPELLMS
Ga0209275_1076487813300027884SoilKLASRVLYWRGFAMWRRAINGFNETPTPKDLEEDLMQAVTDFKDAITRDPAFVESKIGAGSSLGYLTYLHKNDPTRVQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYDKGLKAIRNQKRDGSDPLEPSWGEPELLMSLAWSNLNRTLPDLNAAEQDAQAALKIVP
Ga0209624_1110689113300027895Forest SoilNETPTPKDLEEDLTQAVTDFKDAIARDPAFVEPKIGAGSSLGYLMYLHRKDPSVVQELFEQSSPLLKEAMAMAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLGLEAVRNHKRDVVDPLEPSWGEPELLMSLAWSNLNRTTPDLKAADQYAQSALKLVPYWHYV
Ga0302205_1009547113300028739FenPTDLEEDLTQAVTDFKDGIARDPAFVEPKIGAGSCLGYLMYLSKKDPKRVQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAIEIYNKGLEAVRNDKRDVVDPLEPSWGEPELLMNLAWSRLI
Ga0302227_1041889313300028795PalsaRVLYWRGFALWRRAINGFNESPTPTDLEEDLKQAVTDFKDAIAQDPAFVEPKIGAASSLGYLMFLHRKDPTLMQGLLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRNQKRDVVDPLEPSWGEPELLMSLAWSNLNRTTPDLKAA
Ga0265338_1026041313300028800RhizosphereHSSPLLKEAIAAEPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYNRGVDIIRKSPPAADPLEPSWGEPELLMNRAWSHLHSTTPDLKAAQADATAALALVPYWHYVRDILIPQIQEAQTKSH
Ga0308309_1127134913300028906SoilRASAIRVVEQIKRADYEGDGAALKRLHDELNPPHDDKMIASRVFYWRGFALWRRAINGFNESPTPKDLEEDLNGAVADFNDSLAYDATFVESKIAEGSCFGYLAYLNMKDPARMQEQIQHSSPLLKEAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYNRGLDIVRKQAAVSDPLEPSWGEPELLMSRAWSNLHRTAPDLKAAQN
Ga0308309_1173271713300028906SoilVLYWRGFALWRRAINGFNESPIPTDLEQDLMQAVSDFNDATARDLAFVEAKIGASSSLGYLMYLNKKDPTRVQELLQRLSPLLKEAMATAPDNPRLLWVLGPIRWASPPERGGGQDKAIEIYNKGLEAVRDQKRDVVDPLEPSWGEPELLMSLAWSNLNRTAPDLNAAEQYAGAAL
Ga0222749_1035430213300029636SoilPDDKVLASRELYWRGFALWRRAINGFNESPAPKDLEEDLNGAIADFNDSLIHDPAFVESKIGEASCFGYLAYLSMKDPARMQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDVYKRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAHTDAAAALLLVPYWHYVRDILLPQIQEAQSKSR
Ga0302211_1016101613300030491FenDRATLKRLHDELTAIPEDNRLASRVLYWRGFALWRRAINGFNESPTPTDLEEDLTQAVTDFKDGIARDPAFVEPKIGAGSCLGYLMYLSKKDPKRVQELLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAIEIYNKGLEAVRNDKRDVVDPLEPSWGEPELLMNLAWSNLNRTAPDLKAAEQYAAAALNLVPYWHYVRDILMLQIRT
Ga0311355_1182629513300030580PalsaRVLYWRGFALWRRAINGFNESPTPTDLEEDLKQAVTDFKDAIAQDPAFVEPKIGAASSLGYLMFLHRKDPTLMQGLLQQSSPLLKEAMATAPDNPRLLWVLGPIRWSSPPERGGGQDKAFEIYNKGLEAVRNQKRDVVDPLEPSWGEPELLMSLAWSNLNRTTPDLKAAEQ
Ga0265459_1142731313300030741SoilQEQIQHSSPLLKEAIAAEPENPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPAVSDPLQPSWGEPELQMSRAWSNLHRTAPDLKAAQNDAEAALVLVPYWHYVRDILMPQIKDAQAKAR
Ga0265461_1322012723300030743SoilAAEPDNPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPAVSDPLQPSWGEPELLMSRAWSNLHRTAPDLKAAQNDADAALVLVPYWHYVRDLLMPQIKDAQAKVR
Ga0265753_110528513300030862SoilNGAIADFSDSLAQDPAFVESKIAEGSCYGYLMYLNMKDPARMQEMIQHSSPLLKEAMAAEPDNPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPAVSDPLQPSWGEPELLMSRAWSNLHRTAPDLKAAQNDAEAALVLVPYWHYVRDILMPQIKDAQAKVR
Ga0265740_100087313300030940SoilQLHGELAPAPDNKTLASREFYWRGFALWRRAINGFNESPTPKDLQEDLNGAIADFNDSLAQDATFVESKIAEGSCFGYLAYLNMKDPARAQEQIQHSSPLLKEAIAAEPENPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSPAVSDPLQPSWGEPELLMSRAWSNLHRTAPDLKAAQNDAEAALALVPYWHYVRDILMPQIKDAQAKVR
Ga0307477_1015540313300031753Hardwood Forest SoilASRQLYWRGFALWRRAINGFNESPSPQDLEEDLNGAIADFNDSLTDDPAFVESKIGEASCFGYLAYLSMKDPARMQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAGAALTLVPYWHYVRDILLPQIQDARAKVR
Ga0307478_1026881413300031823Hardwood Forest SoilCFGYLAYLSMKDPARMQDMIQHSSSFLKDAMASDPDNPRLLWVLGPIRWSSPPERGGGQDKAFDLYQRGLEAVRTRPAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQNDAGAALTLVPYWHYVRDILLPQIQDAQAKVR
Ga0307471_10417365113300032180Hardwood Forest SoilDLEEDLNSAISDFNNSLARDPGFVESKIAEGSCYGYLMYLNMKDPARMQEMIQHSSPLLKEAMAAAPDNPRLLWVLGPIRWSSPPERGGGQDKAFELYDRGLAAIRKSSAVSDPLEPSWGEPELLMNRAWSNLHRTTPDLKAAQTDAAAALTLVPYWHYVRDILIPQIQD
Ga0306920_10428692813300032261SoilDDLNGAISDFNDSIASDPSFIESKIAAASCYGYLVFINRKEPERMQEFIQHSSLLLKEAMAADPDNPRLKWVLGPIRWSSPPERGGGQDKAIELYTQGLETIRKRPKPTDPLEPAWGEPELLMNRAWSNLNRTAPELKSAQVDAEAALALVPYWHYVKDILIPQIQSAQT
Ga0335085_1035019013300032770SoilDLNGAIADFNDSIASDPSFIESKIAAGSCYGYLIYVHRKEPERMQEFMQHSSPLLKDAMAADPENPRLKWVLGPIRWSSPPERGGGQDKAIELYTQGLETIRKRPKPTDPLEPAWGEPELLMNRAWSNLNRTTPELKSAQVDAEAALALVPYWHYVRDILIPQIQAAQTKASQPK
Ga0335079_1024101713300032783SoilLGYLFYLNMKDPARSGEFIKQSSPLLKDAMAADPDNPRLLWVLGPIRWSSPPERGGGQDKAFDVYNRGLESIRKGPASTDPLDPNWGEPELLMNRAWSNLHRTEPDLKAAQKDAEAALTLVPYWHYVRDILLPQIREAQAKTDAKAGAH
Ga0335078_10001215103300032805SoilMKDPARSGEFIKQSSPLLKDAMAADPDNPCLLWVLGPIRWSSPPERGGGQDKAFDVYNRGLESIRKGPASTDPLDPNWGEPELLMNRAWSNLHRTEPDLKAAQKDAEAALTLVPYWHYVRDILLPQIREAQAKTDAKAGAH
Ga0335084_1022557723300033004SoilALKRLHDELKVPADNKQLASRLEYWRGFALWRRAINAFNETPTPTDIPDDLNGAIADFNDSIASDPSFIESKIAAGSCYGYLIYVHRKEPERMQEFMQHSSPLLKDAMAADPENPRLKWVLGPIRWSSPPERGGGQDKAIELYTQGLETIRKRPKPTDPLEPAWGEPELLMNRAWSNLNRTTPELKSAQVDAEAALALVPYWHYVRDILIPQIQAAQTKASQPK
Ga0335077_1015923043300033158SoilQARAQELLQRVIPLTKELQAEAADNPRFLWIMGPVRWSTPPERGGGQDKAFELYNRGLEIVRKPKPQSDPLEPSWGEPELLMNRAWSNLNRTTPDLKAAAEDAQAALQLVPYWHYVRDILIPQIKTAQAKAH
Ga0335077_1039292623300033158SoilMAADPDNPCLLWVLGPIRWSSPPERGGGQDKAFDVYNRGLESIRKGPASTDPLDPNWGEPELLMNRAWSNLHRTEPDLKAAQKDAEAALTLVPYWHYVRDILLPQIREAQAKTDAKAGAH
Ga0310914_1155786113300033289SoilYGYLVFINRKEPERMQEFIQHSSLLLKEAMAADPDNPRLKWVLGPIRWSSPPERGGGQDKAIELYTQGLETIRKRPKPTDPLEPAWGEPELLMNRAWSNLNRTAPELKSAQVDAEAALALVPYWHYVKDILIPQIQTAQTKASQSK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.