NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F014754

Metagenome / Metatranscriptome Family F014754

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F014754
Family Type Metagenome / Metatranscriptome
Number of Sequences 260
Average Sequence Length 97 residues
Representative Sequence KGLALSSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKALAQEKLKASAGAAPARRKTP
Number of Associated Samples 173
Number of Associated Scaffolds 260

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.77 %
% of genes from short scaffolds (< 2000 bps) 0.38 %
Associated GOLD sequencing projects 158
AlphaFold2 3D model prediction Yes
3D model pTM-score0.61

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.231 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(36.154 % of family members)
Environment Ontology (ENVO) Unclassified
(35.769 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(48.077 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 48.78%    β-sheet: 0.00%    Coil/Unstructured: 51.22%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.61
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 260 Family Scaffolds
PF01625PMSR 65.77
PF04255DUF433 3.08
PF07719TPR_2 2.69
PF00006ATP-synt_ab 2.69
PF03712Cu2_monoox_C 2.31
PF02397Bac_transf 1.15
PF14384BrnA_antitoxin 0.77
PF00873ACR_tran 0.38
PF01734Patatin 0.38
PF07498Rho_N 0.38
PF01636APH 0.38
PF07992Pyr_redox_2 0.38
PF02698DUF218 0.38
PF07238PilZ 0.38
PF14559TPR_19 0.38
PF01565FAD_binding_4 0.38
PF00069Pkinase 0.38
PF08241Methyltransf_11 0.38
PF02719Polysacc_synt_2 0.38
PF13557Phenol_MetA_deg 0.38
PF13185GAF_2 0.38

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 260 Family Scaffolds
COG0225Peptide methionine sulfoxide reductase MsrAPosttranslational modification, protein turnover, chaperones [O] 65.77
COG2442Predicted antitoxin component of a toxin-antitoxin system, DUF433 familyDefense mechanisms [V] 3.08
COG0515Serine/threonine protein kinaseSignal transduction mechanisms [T] 1.54
COG2148Sugar transferase involved in LPS biosynthesis (colanic, teichoic acid)Cell wall/membrane/envelope biogenesis [M] 1.15
COG0451Nucleoside-diphosphate-sugar epimeraseCell wall/membrane/envelope biogenesis [M] 0.77
COG0702Uncharacterized conserved protein YbjT, contains NAD(P)-binding and DUF2867 domainsGeneral function prediction only [R] 0.77
COG1086NDP-sugar epimerase, includes UDP-GlcNAc-inverting 4,6-dehydratase FlaA1 and capsular polysaccharide biosynthesis protein EpsCCell wall/membrane/envelope biogenesis [M] 0.77
COG1087UDP-glucose 4-epimeraseCell wall/membrane/envelope biogenesis [M] 0.38
COG1088dTDP-D-glucose 4,6-dehydrataseCell wall/membrane/envelope biogenesis [M] 0.38
COG1089GDP-D-mannose dehydrataseCell wall/membrane/envelope biogenesis [M] 0.38
COG1091dTDP-4-dehydrorhamnose reductaseCell wall/membrane/envelope biogenesis [M] 0.38
COG1434Lipid carrier protein ElyC involved in cell wall biogenesis, DUF218 familyCell wall/membrane/envelope biogenesis [M] 0.38
COG1752Predicted acylesterase/phospholipase RssA, containd patatin domainGeneral function prediction only [R] 0.38
COG2949Uncharacterized periplasmic protein SanA, affects membrane permeability for vancomycinCell wall/membrane/envelope biogenesis [M] 0.38
COG3621Patatin-like phospholipase/acyl hydrolase, includes sporulation protein CotRGeneral function prediction only [R] 0.38
COG4667Predicted phospholipase, patatin/cPLA2 familyLipid transport and metabolism [I] 0.38


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.23 %
All OrganismsrootAll Organisms0.77 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005542|Ga0070732_10001983All Organisms → cellular organisms → Bacteria11171Open in IMG/M
3300015053|Ga0137405_1264348All Organisms → cellular organisms → Bacteria → Acidobacteria552Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil36.15%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil13.46%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil6.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil6.54%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.62%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil4.62%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.85%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil3.08%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment2.69%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.31%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.92%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.54%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil1.15%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.77%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil0.77%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.77%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil0.77%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.77%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.77%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.38%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring0.38%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.38%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil0.38%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.38%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.38%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.38%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.38%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.38%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.38%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.38%
Switchgrass, Maize And Mischanthus LitterEngineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter0.38%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2065487018Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
2170459010Grass soil microbial communities from Rothamsted Park, UK - December 2009 direct MP BIO1O1 lysis 0-9cm (no DNA from 10 to 21cm!!!)EnvironmentalOpen in IMG/M
2170459013Grass soil microbial communities from Rothamsted Park, UK - July 2010 direct MP BIO 1O1 lysis soil at the rocks surface 0-21cmEnvironmentalOpen in IMG/M
2170459020Litter degradation NP2EngineeredOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002910Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300004080Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300004152Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3EnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010341Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM2EnvironmentalOpen in IMG/M
3300010343Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010379Sb_50d combined assemblyEnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300016357Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000EnvironmentalOpen in IMG/M
3300017823Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3EnvironmentalOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300017943Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4EnvironmentalOpen in IMG/M
3300017972Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP02_20_MGEnvironmentalOpen in IMG/M
3300018006Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4EnvironmentalOpen in IMG/M
3300018007Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5EnvironmentalOpen in IMG/M
3300018032Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP10_20_MGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022531Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022722Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300024331Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK09EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027070Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF004 (SPAdes)EnvironmentalOpen in IMG/M
3300027109Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF008 (SPAdes)EnvironmentalOpen in IMG/M
3300027562Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027575Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O1 (SPAdes)EnvironmentalOpen in IMG/M
3300027590Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_O1 (SPAdes)EnvironmentalOpen in IMG/M
3300027610Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027663Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027768Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM1 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300027911Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030862Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSE5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031018Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZE5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031945Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082EnvironmentalOpen in IMG/M
3300031959Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f24EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032035Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF170EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033290Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f15EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPINP_041220902065487018SoilESYAQKAVALADSAPRPANLTDEQWSQQTALQRGLALSALGQVNLQKKNNLQAVQNFQAAARLLKSNDASYARNAYRMGFALINLKKIPEARAAFTEAASVNSPYKGPAQDKLKTLPARAAAPRKPS
F62_085482702170459010Grass SoilNAQAVENLKAAAPLVKVDDASYGRNQYRLGFALANLKRTAEAKEAFTQAASVNGPYKALAQDKLRAAAGPATARKKTP
N57_033411902170459013Grass SoilLALSALGQVNLQKKDNATAAQNLQTAAPLLKANDVSYARNQYRLGFAYLNLKRLPEAKQALTEAASVNSPYKQLALEKLKTLPPKAPVTRKKAA
2NP_003404002170459020Switchgrass, Maize And Mischanthus LitterQQVSLQKGLALSSLGQVNLQKKDNAQAAENFKSAAPLLKADEGSFGRNQYRLGFALLNLKKNAEAREAFTQAASVNSAYKGLAQAKLKSFEAAAKQKP
JGI1027J12803_10693343523300000955SoilVGVLETTKKPNEMTDDQWKQQSQLQKGLALSSLGQINIQKKDNAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAMEAFTQAASVNSPYKGPAQEKLKAMAAPVRKKAS*
JGI12635J15846_1061537123300001593Forest SoilVTDEQWTQQKGLQKGLALSSLGQVNIQKKDNAQAAENLRAAAPLLKSDDGSYGRNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYRALAQEKLRGLSAATPAHRKKS*
JGI25615J43890_107916423300002910Grasslands SoilYAKKSVSLLETAKKPDGVTDEQWKQQSGLQKGLALSALGQVNIQKKNNAQAVDNLRSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS*
JGI25617J43924_1011426323300002914Grasslands SoilQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKALAQEKLRASAGPASSRKKAP*
JGI25617J43924_1015297713300002914Grasslands SoilNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS*
Ga0062385_1040361913300004080Bog Forest SoilLSCLGQVNIQKKDNEQAVQNLKAAAPLLKPDEGSYGRNQYRLGFALLNLKKNAEARDAFVQSASVNSAYKALAQAKLKTFDAAAKKKT*
Ga0062386_10102213023300004152Bog Forest SoilLGQVNIQKKDNATAADNFKAAAPLLKADENSYGKNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQAKLKTFDTAAAAKKKS*
Ga0062591_10061355223300004643SoilQKKDNAQAAENFKSAAPLLKADEGSFGRNQYRLGFALLNLKKNAEAREAFTQAASVNSAYKGLAQAKLKSFEAAAKQKP*
Ga0066680_1017269613300005174SoilGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKALAQEKLRASAGAAPRKTP*
Ga0066680_1047893423300005174SoilSSLGQVNIEKKDNAQASENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0066690_1073964423300005177SoilEQWAQQKSLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNMKKNAEAKEAFAQAASVNSPYKALAQEKLKARAGAAPARRKTP*
Ga0066685_1016753113300005180SoilKDNAQAVENLKAAAPLVRPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGLAQEKLRGLAGPAAAKRKPY*
Ga0066685_1087646213300005180SoilSALGQINIQKKDNVQAVQNLRAAAPLLKPDEGSYARNQYRLGFALLNLKRLPEAKVAFTEAASVNSPYKALAQEKLRSAPATTAGRN*
Ga0066675_1043440113300005187SoilAVENLRTAAPLVKADDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKGLAQDKLKGLAAPARRKAS*
Ga0066675_1139438713300005187SoilADYYSEKGEQLDKAEAEAKKALSVLDSATKPEGVSDEQWQAQNALQKGLALSALGQINIQKKDNATAAQNFKAAAPLLKSDAGSYARNQYRLGFALLNLKKMPEAKAALTEAASLNTPYKALAQDKLKGLPATTAGKN*
Ga0066388_10698341713300005332Tropical Forest SoilQWQQQTMLQKGLALSDLGQINIQKKDNAQAVENLKAAAPLLKSDANGYARNQYRLGFALLNLKKLPEARAAFTDAASVNSPYKGPAQEKLKALGTAHTATARAKS*
Ga0066388_10850370123300005332Tropical Forest SoilDGVTDEQWTQQKALQKGLALSSLGQVDIEKKDNAQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAGAGPARKKAA*
Ga0070730_1088921023300005537Surface SoilLALSSLGQVNLQKKDNAQAVESFKSAAPLLKADEGSFGRNEYWLGFALLNMKKNAEAKEAFTQSASVNSAYKGLAQAKLKSFEASSRKRP*
Ga0070732_10001983183300005542Surface SoilEGVTDEQWKQQSALQKGLALSALGQVNIQKKNNADAVDNLKSAAPLLKPDEVSYARNQYRLGFAFLNLKRIPEAKAAFTDAASVNSPYKALAQDKLKTLPGAAAAKKP*
Ga0066707_1043230723300005556SoilQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNLEAKDAFTQAASVNSPYRGPAQEKLKAMAAPPRRKAS*
Ga0066699_1127680223300005561SoilKKENAQAVDNLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS*
Ga0066654_1071381023300005587SoilKDNAQAVENLKAAAPLVRPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGLAQEKLKGLAGPAAAKRKPY*
Ga0070763_1074723113300005610SoilQKGLALSSLGEVDIQKKDNATAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGPAQEKLKGLAGASAAPARKKAS*
Ga0070766_1003999713300005921SoilLSALGQVNIQKKNNAQAVDNLKAAAPLLKPNANSYARNQYRLGFALLNLKRVPEAKAALTDAASVNSPYKSLAQDKLKALSGATPAHAKS*
Ga0066656_1063958523300006034SoilMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0075023_10007502113300006041WatershedsAQAVENLKAAEPLLKPDDGSYARNQYRLGFAFLNLKKNAEAKEALTQAASVNSPYKQVALDKLKSLSAPPKRKAR*
Ga0070765_10066234323300006176SoilKAASPLLKADDGSYGRNQYRLGFALLNLKRNAEAKDAFAQSASVNSAYRALAQAKLKTFDAAAKRKS*
Ga0070765_10084211113300006176SoilNIQKKNNAQAVDNLKAAAPLLKPNANSYARNQYRLGFALLNLKRVPEAKAALTDAASVNSPYKSLAQDKLKALSGATAAHAKS*
Ga0070765_10222917423300006176SoilALGQVNIQKKDNAQAAENFKAAAPLLKADENSYGRNQYRLGFALLNLKRNAEAKDAFTQSASVNSAYKTLAQAKLKTFDAAAAKKKS*
Ga0079222_10000194113300006755Agricultural SoilALSALGQINLQKKDNAQAVQNFVAAAPLLKGNEASYGRNQYRLGFAYLNLKKTAEARAAFSEAAAAKGPYQTLAQEKLKSLPAAPSRRRRAS*
Ga0079222_1026536713300006755Agricultural SoilGQVNIEKKDNAQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGLAQEKLKGLTGAVTAKRKAS*
Ga0079221_1003486533300006804Agricultural SoilLDKAESSAKKAITLLGTAQKPEGVTDEQWTQQVSLQKGLALSSLGQVNIQKKDNAGAVDNFKSAAPLLKADENSYGRNQYRLGFALLNLKKNAEAKQAFTEAAAVNSPYRPLAQAKLKTFVGAAKQ*
Ga0079221_1049499913300006804Agricultural SoilQQSALQKGLALSALGQVNIQKKNNAQAVENFRSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFIDAASVNSPYKALAQEKLKALPGTSAAK*
Ga0073928_1091884223300006893Iron-Sulfur Acid SpringQQQSTLQKGLALSALGQINIQKKDNAQAAEHLEAAAPMLKSDATSYARNEYRLGFALLNLKKVPEAKAAFTEAASVNSPYKAMALDKLKSLPGGGATTTHKKS*
Ga0075436_10140325423300006914Populus RhizosphereQKGLALSTLGQINLQKKNNTQAVQNFQAAAPLLKSDDTSYARNQYRLGFAYINLKNAPGARAAFTECASVSSPYKQYALEKLKGIPATATAKRTR*
Ga0099793_1047938023300007258Vadose Zone SoilNAQAVTSFRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0066710_10208605713300009012Grasslands SoilQWKQQTALQKGLALSTLGQVNLQKKNNVGAVQNFQAAAPLLKANDTSYARNQYRLGFALLNLKKIPEAKAAFTEAASVNSPYKSYAQDKLKALPATAATRKKTP
Ga0099829_1148592513300009038Vadose Zone SoilAEAYARKSVALLETAKKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNLKSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS*
Ga0099830_1002182513300009088Vadose Zone SoilIQKKDNAQAVENLKTAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKAGAGAAQARRKSP*
Ga0099830_1031234513300009088Vadose Zone SoilQAVDNLKSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFSDAASVNSPYKALAQEKLKALPGTAANKS*
Ga0099830_1076867023300009088Vadose Zone SoilNNAQAVDNFKSAAPLLKSDDGGYARNQYRLGFALLNLKRVPEAKAAFTEAASVNSPYKALAQEKLKALPATAANKS*
Ga0099828_1124307213300009089Vadose Zone SoilQAVENLKAAAPLLKPDDGSYARNQYRFGFALLNLKRNAEAKEAFTQAASVNSPYKALAQEKLRASAGPATARKKAP*
Ga0099828_1165791313300009089Vadose Zone SoilSLLETAQKPEGMTDEQWAKQKGLQKGLALSALGQVNIEKKENAQAVENLKVAAPLVKADDVSYAHNQYRLGFALLNLKKNAEAKEAFARAASVNSPYKTLAMEKLKGLATPAKRKPS*
Ga0099828_1179268713300009089Vadose Zone SoilKAISLLDAAPKPEGVTDEQRAQQKSLQKGLALRSLGQVKIEKKDNAQAVGNLKAAAPLVKPDDVSYAKNQYRLGFALLNLKKNAEAKEAFTQEASVNSPYKALAQDKLKAAAGAAPARRKKP*
Ga0099827_1050220623300009090Vadose Zone SoilALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0099827_1063060423300009090Vadose Zone SoilGLALSSLGQVNIEKKDNAQAVTNLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKDAFTQAASVNSPYKALAQDKLKTFVGAGTAQRKKP*
Ga0066709_10279257413300009137Grasslands SoilAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFAQAASVNSPYKALAQEKLKGLAGAAPARRKPS*
Ga0099792_1010704953300009143Vadose Zone SoilKSLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKALAQEKLKASAGAAPVRRKTP*
Ga0126382_1158411913300010047Tropical Forest SoilNIEKKDNAQAVDNLKAAAPLVKADNTSYAHNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKDRALEKLKGLAAPARRKPS*
Ga0126373_1118062613300010048Tropical Forest SoilDNAQAADNLRAAAPLVKSDNMSYAHNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKDRALEKLKGLAAPARRKPS*
Ga0099796_1027695823300010159Vadose Zone SoilYAQKAATLADSAPRPENVTEDQWKQQTALQKGLALSTLGQVNMQKKNNAQAIQNFQAAAPLLKSSDVGYARNEYRLGFAFINLRKIPEAKTAFTQAASVNSPYKQPALDKIKALPASPPTRRKAS*
Ga0134082_1020954613300010303Grasslands SoilSYAKKAIATLETAAKPEGVTDEQWTQQKALQKGLALSSLGQVNIAKKDNAQAVENLRTAAPLVKADDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKGLAQDKLKGLAAPARRKAS*
Ga0134088_1003785813300010304Grasslands SoilMLLLLSDYYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0134067_1006001213300010321Grasslands SoilLALSSLGQVNIGKKDNAQAVENLKAAAPLLKLDDGSYARNQYRLGFALLNMKKNAEAKEAFAQAASVNSPYKALAQEKLKASAGAAPARRKTP*
Ga0134065_1017876013300010326Grasslands SoilENLKAAAPLVRPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGLAQEKLRGLAGPAAAKRKPS*
Ga0074045_1055738123300010341Bog Forest SoilADYYSEKGTQLDKAEEYAKKVSAICDAAKKPDNVSDDDWKKQNGLKKGLALSALGQVNLQKKDNLTAVKNLSSAAPLVKSDPVSFARNQYRLGFAYLNLKRNAEAKQAFTDAASVNTPYKGPAEDKLKELGATRAGAAKKPA*
Ga0074044_1024090723300010343Bog Forest SoilAYATKAAALADAAPKPEGMDDAAWAQQKALQKGLALSSLGQVNIEKKDNASAAENFKAAAPLLKADENSYGKNQYRLGFALLNLKKNAEAKDAFTQAASVNSAYKALAQAKLKTFDTAAKKKS*
Ga0126370_1036726233300010358Tropical Forest SoilENLKEAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAGPATAKRKPA*
Ga0126370_1205483713300010358Tropical Forest SoilMLLLLVGYSTEKGGQLDRAEAYAKKSMDLPTAAKKPEGVTDEQWAAQAALQKGLALSALGQVNIQRKDNAQAVENLKSAALLLKSDATSYARNQYRLGFALLNLKRIPEAKAALTEAASVNSPYKSLAQDKLRSLPATTATTKKP*
Ga0126376_1203188713300010359Tropical Forest SoilQMTLQKGIALSALGRVGVQKKDYSTAVQNFLAAGPMLKSNDASYAGNQFWLGYAYLSLKKLPEAKQALTDAASVNSPYKQPALDKLKTLPAKAPVTRKRAA*
Ga0126376_1238241723300010359Tropical Forest SoilEKKDNAQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKRNEEAREALTQAASVNSPYKALAQDKLKGLAASARRKPS*
Ga0126372_1175693713300010360Tropical Forest SoilKAEAYAKKAISALETAPKPEGVTDEQWTQQKSLQKGLALSSLGQVNIVKKDNTQAVENLKAAAPLVKADNTSYAHNQYRLGFALLNLKKNAEAKEAFTQAASVSSPYKDRALEKLKGLAAPARRKAS*
Ga0126378_1005212913300010361Tropical Forest SoilQVNIEKKDNAQAVENLKAAAPLLRADDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKALAQEKLKGLAGAGAPRRKPA*
Ga0126378_1033990913300010361Tropical Forest SoilENLKAATPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKSLAQEKLRGLAGVRPARRKPA*
Ga0126378_1037842033300010361Tropical Forest SoilGQVNIEKKANAQAVENLKAAAALVKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKPLAQDKLKGLAEPPRRKAS*
Ga0126378_1039700813300010361Tropical Forest SoilKKAISTLETAAKPEGVTNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVESLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS*
Ga0126378_1046718113300010361Tropical Forest SoilEKKENAQAVDNLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS*
Ga0126378_1051999413300010361Tropical Forest SoilENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAPPARRKAS*
Ga0126379_1284888113300010366Tropical Forest SoilTNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS*
Ga0126381_10102595413300010376Tropical Forest SoilTAAKPEGLTDEQWTQQKALQQGLALRSLGQIEIEKKENAQAVENLRAAAPLVKTDDGSYARNQYRSGFELLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS*
Ga0126381_10135305623300010376Tropical Forest SoilDEQWTQQKALQKGLALSSLGQVNIEKKANAQAVENLKAAAALVKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKPLAQDKLKGLAEPPRRKAS*
Ga0126381_10310946813300010376Tropical Forest SoilYGEKGEQLDKAEGYAKKAIAVLGTAQKPEGVADDQWKRQTSLQKGLALSALGQVNMEKKDNASAVQNLRAAAPLVQADAVSYARNQYRLGFALINLKRMPEAKEAFTQAASVNSPYKSLAQDKLKSITTAAKKGSS*
Ga0136449_10086371333300010379Peatlands SoilQKKDNAKAVENLKAAAPLLKSDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKGPAQDKLKAMATPAKHKAS*
Ga0134126_1080567633300010396Terrestrial SoilVESFKSAAPLLKPDEGSYGRNQYWLGFALLNLKKNAEAKEAFTEAASVNSAYKGLAQAKLKTFEAASRKKS*
Ga0126383_1100412713300010398Tropical Forest SoilKDNAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKPLAQDKLKGLAGTPRRKAS*
Ga0137392_1009875813300011269Vadose Zone SoilKTAAPLLKPDDGSFARNQYRLGFALLNLKRTAEAREAFTQAASVNSPYKAVAQDKLKAMAAPARRKPS*
Ga0137392_1045215523300011269Vadose Zone SoilVENLRAAAPLLKPDDGSYARNQYRLGFALLNLKKTAEAKEAFTQAASVNSPYKALAQEKLKAGAGAAQARRKTP*
Ga0137392_1096645713300011269Vadose Zone SoilLALSSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKALAQEKLKASAGAAQARRKTP*
Ga0137392_1133454013300011269Vadose Zone SoilAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRTAEAKEAFTQAASVNSPYKALAQEKLGATAGPAGARKKAP*
Ga0137392_1147433323300011269Vadose Zone SoilLSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAAAKKS*
Ga0137391_1049008623300011270Vadose Zone SoilGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNLKSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFSDAASVNSPYKALAQEKLKALPGTAANKS*
Ga0137391_1049510733300011270Vadose Zone SoilWAQQKGLQKGLALSSLGQVNIEKKANAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRTAEAKEAFTQAASVNSPYKALAQEKLGATAGPAGARKKAP*
Ga0137391_1116904113300011270Vadose Zone SoilLKAAAPLVKPDDVSYAKNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKALAQDKLKAAAGAAPARRKKP*
Ga0137391_1128757913300011270Vadose Zone SoilYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKALAQEKLKGLAGAAPARRKPS*
Ga0137393_1065710523300011271Vadose Zone SoilNDAQAVDNFKTAAPLLKSDEVSYARNQYRLGFALLNLKRVPEAKAAFTEAASVNSPYKTPAQEKLKALPATGAKAAAKKS*
Ga0137393_1087246513300011271Vadose Zone SoilLSDYYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0137393_1093058223300011271Vadose Zone SoilGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYRGPAQEKLKAMAAPPRRKAS*
Ga0137393_1102521513300011271Vadose Zone SoilEQWKQQSALQKGLALSALGQINIQKKNNAQAVDNFKSAAPLLKSDDGGYARNQYRLGFALLNLKRVPEAKAAFTEAASVNSPYKALAQEKLKALPATAANKS*
Ga0137389_1011160313300012096Vadose Zone SoilKNDAQAVDNFKTAAPLLKSDEVSYARNQYRLGFALLNLKRVPEAKAAFTEAASVNSPYKTPAQEKLKALPATGAKAAAKKS*
Ga0137389_1053826833300012096Vadose Zone SoilQSALQKGLALSALGQVNIEKKNNAQAVDNLKSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS*
Ga0137389_1099484723300012096Vadose Zone SoilYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFAQAASVNSPYKALAQEKLKGLAGAAPARRKPS*
Ga0137388_1057258633300012189Vadose Zone SoilVTDEQWAQQKSLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKALAQEKLKASAGAAQARRKTP*
Ga0137388_1132210923300012189Vadose Zone SoilSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRTVEAKEAFTQAASVNSPYKGLAQEKLKASAGAAPARRKAP*
Ga0137388_1135807023300012189Vadose Zone SoilLALSSLGQINIRKKDDAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137388_1146633213300012189Vadose Zone SoilKKDNGQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKASAGAAPTRRKEP*
Ga0137388_1203450913300012189Vadose Zone SoilPDGATDEQWKQQSALQKGLALSALGQVNIQKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRLPEAKAAFTDAASVNSPYKSLALEKLKALPGTAANKS*
Ga0137364_1081778023300012198Vadose Zone SoilGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYRALAQEKLRASGAAPARRKTP*
Ga0137365_1011609113300012201Vadose Zone SoilAQAVTNLKAAGPLLKPDDGSYARNQYRLGFALLNLKRNAEAKDAFMQAASVNSPYKALAQDKLKTFAGAGTAQRKKP*
Ga0137365_1132605723300012201Vadose Zone SoilAQAVTNLKAAGPLLKPDDGSYARNQYRLGFALLNLKRNAEAKDAFTQAASVNSPYKALAQDKLKTFVGAGTAQRKKP*
Ga0137363_1009152933300012202Vadose Zone SoilVTDEQWARQKGLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLVKVDDASYGRNQYRLGFALANLKRTADAKEAFTQAASVNGPYKALAQDKLKAAAGPATARKKTP*
Ga0137363_1051881213300012202Vadose Zone SoilMTDEQWKTQSGLQKGLALSSLGQINIRKKDDAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPVRKKAS*
Ga0137363_1059509813300012202Vadose Zone SoilLADYYSEKGEQLDKAESYAKKSVSLLETAKKPEGVTDEQWKQQSGLQKGLALSALGQVNIQKKNNAQAVDNLRSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS*
Ga0137363_1109565823300012202Vadose Zone SoilEQLDKAESYAKKSVSLLETAKKPDGVTDEQWKQQNGLQKGLALSALGQVNIQKKNNAQAVDNFRSAAPLLKADDGGYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS*
Ga0137363_1179632913300012202Vadose Zone SoilALSVLETAKKPDEMTDEQGKTQSGLQKGLAVCSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137399_1030594513300012203Vadose Zone SoilKENAQAVENLRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASLNSPYKALAQEKLKAGAGAAQARRKTP*
Ga0137399_1080054413300012203Vadose Zone SoilTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137399_1172361113300012203Vadose Zone SoilALSALGQVNIQKKNNADAVETFKAGAPLLKSDEESYERNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKGLPATAAKKS*
Ga0137362_1015059613300012205Vadose Zone SoilLQKGLALSALGQVNIQKKNNAQAVDNFKSAAPLLKSDDGGYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKGLPTATAAKKS*
Ga0137362_1046491823300012205Vadose Zone SoilLLETAKKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAAAKKS*
Ga0137380_1059559913300012206Vadose Zone SoilEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALGSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFAQAASVNSPYKALAQEKLKGLTGATPARRKPS*
Ga0137380_1066193313300012206Vadose Zone SoilMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFAQAASVNSPYKVLAQEKLKGLTGAAPARRKPS*
Ga0137381_1044212823300012207Vadose Zone SoilQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNVEAKEAFTQAASVNSPYKALAQEKLRASAGAAPARRKTP*
Ga0137379_1170915623300012209Vadose Zone SoilSTLETVAKPESVTDEQWTLQKALQKGLALSSLGQVNIEKKENAQAVENLKAAAALVKPDDVSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKGLAQDKLKGLAAPTRRKTS*
Ga0137378_1052088613300012210Vadose Zone SoilQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYRALAQEKLRASGAAPARRKTP*
Ga0137377_1035302523300012211Vadose Zone SoilDNLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLRGLAAPARRKAS*
Ga0150985_12155662313300012212Avena Fatua RhizosphereVLDASKKPEGVSEEQWQAQNAIQKGLALSALGQINIQKKDNVRAVQNLRTAAPLLKSDEGSYARNQYRLGFALLNLKKTAEAKVAFTEAASVNSPYKVLAQEKLKSAPATTAGRN*
Ga0137370_1028074423300012285Vadose Zone SoilTDEQWAQQKSLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNMKKNAEAKEAFAQAASVNSPYKALAQEKLKASAGAAPARRKTP*
Ga0137386_1084428423300012351Vadose Zone SoilDKAETYAKKAISALETSAKPEGVTDEQWKQQKALQKGLALSSLGQINIEKKENAQAVDNLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLRGLAAPARRKAS*
Ga0137366_1105855413300012354Vadose Zone SoilEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFAQAASVNSPYKALAQEKLKGLTGATPARRKPS*
Ga0137369_1041200023300012355Vadose Zone SoilQINIEKKENAQAVDNLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQETLKGLAAPARRKAS*
Ga0137385_1034343323300012359Vadose Zone SoilSDYYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALGSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFAQAASVNSPYKALAQEKLKGLTGATPARRKPS*
Ga0137385_1084494113300012359Vadose Zone SoilVENLKAAAALVKPDDVSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKGLAQDKLKGLAAPTRRKTS*
Ga0137360_1015047943300012361Vadose Zone SoilLADYYSEKGEQLDKAESYAKKSVSLLETAKKPDGVTDEQWKQQNGLQKGLALSALGQVNIQKKNNAQAVDNFRSAAPLLKADDGGYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS*
Ga0137360_1040505013300012361Vadose Zone SoilVNIEKKDNAQAVENFRAAAALLNSDAGSYGRNQYRLGFALLNLKRTAEAREAFTQAASVNGPYKTLAQEKLKAISGGPAARRKTS*
Ga0137360_1071639223300012361Vadose Zone SoilGQVNMQKKNNAQAIQNFQAAAPLLKSSDVGYARNEYRLGFAFINLRKIPEAKAAFTQAASVNSPYKQPALDKIKALPASPPTRRKTS*
Ga0137360_1115543113300012361Vadose Zone SoilSSLGQVNIEKKDNAQAVTNLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKDAFAQAASVNSPYKALAQDKLKTFVGAATAQRKKP*
Ga0137361_1043527013300012362Vadose Zone SoilKKPDEMTDEQWKTQSGLQKGLALSSLGQINIRKKDDAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPVRKKAS*
Ga0137361_1053304833300012362Vadose Zone SoilQKGLALSALGQVNIQKKNNAQAVDNFRSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS*
Ga0137361_1100239013300012362Vadose Zone SoilKKPDEMTDEQWKTQSGLQKGLALSSLGQINIRKKDDAQAVMNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137361_1161218213300012362Vadose Zone SoilDKTDQLVKAETYAKRAIAALDTAAKPEGVTDEQWTQQKGLQKGLALSSLGQVNIEKKANAQAVENLKAAAPLLRPDDGSYARNQYRLGFALLNLKRTAEAKEAFTQAASVNSPYKALAQEKLGASARSTTARKKAP*
Ga0137358_1027856613300012582Vadose Zone SoilAQDPDNLGMLLLLVDYYSEKGEQLDKAESYAKKSVSLLDAAKKPDGVTDEQWKQQSALQKGLALSALGQVNIQKKNNAQAVENFRSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKGLPAATAANKP*
Ga0137358_1056968323300012582Vadose Zone SoilVVEYLKAAAPLLKPDDGSYARNQYRLGFALLNMKKNPEAKEAFTQAASVNSPYKALAQEKLKASAGTAPARRKTP*
Ga0137396_1033614623300012918Vadose Zone SoilESAKKPGEMTDDQWKQQSGLQKGLALSSLGQINIQKKDNAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAVPPRRKAS*
Ga0137396_1068741023300012918Vadose Zone SoilYAKKAISLLDSAAKPEGVTDEQWARQKSLQKGLALSSLGQVNIGKKDNAQAVENLKTAAPLLKPDDGSFARNQYRLGFALLNLKRTAEAREAFTQAASVNSPYKAVAQDKLKAMAAPARRKPS*
Ga0137396_1072491223300012918Vadose Zone SoilVQQKSLQKGLALSSVGQVNIERKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKKTAEAKEAFTQAASVNSPYKTLAQEKLKASVAPARRKAP*
Ga0137396_1124195113300012918Vadose Zone SoilKSVALLETAKKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS*
Ga0137394_1094695413300012922Vadose Zone SoilKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKGPAQEKLKAMAVPPRRKAS*
Ga0137394_1099836713300012922Vadose Zone SoilQSGLQKGLALSSLGQINIRKKDDAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137359_1099254123300012923Vadose Zone SoilADYYSEKGEQLDKAESYAKKSVSLLETAKKPDGVTDEQWKQQNGLQKGLALSALGQVNIQKKNNAQAVDNFRSAAPLLKADDGGYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS*
Ga0137359_1119392713300012923Vadose Zone SoilKGLALSSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKALAQEKLKASAGAAPARRKTP*
Ga0137419_1050644413300012925Vadose Zone SoilKGEQFDKAESYAKKSVSLLEVAKKPDAVTDEQWKQQSALQKGLALSALGQVNIQKKNNAQAVENFRSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAASKS*
Ga0137419_1094047113300012925Vadose Zone SoilGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137416_1039405223300012927Vadose Zone SoilVTDEQWAQQKSLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNMKKNAEAKEAFAQAASVNSPYKALAQEKLKASAGAAPARRKTP*
Ga0137416_1106460913300012927Vadose Zone SoilYYSEKGEQLDKAEGYAKKAVAVLQTAGRPESVTDEQWVQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKALAQEKLKASAGAAPARRKTP*
Ga0137416_1128645223300012927Vadose Zone SoilSSLGQINIQKKDNAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137416_1205777113300012927Vadose Zone SoilKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS*
Ga0153915_1114489123300012931Freshwater WetlandsVNIVNKKNAQAVENLKAAAPLLKPNPATYGRNQYRLGFALLNLQRIPEAKAALTEAASMPNPYQALAQEKLKSVAAAKPVRKKH*
Ga0134110_1061379913300012975Grasslands SoilELWVQQKALQKGLALRSLGQVNIQKKDNADAAENLKAAAPWLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYRALAQEKLKASAGGAAPARRKTP*
Ga0157378_1064233613300013297Miscanthus RhizosphereALSTLGQINLQKKNNSGAMQNFQAAAPLLKTNDTSYARNQYRLGFALLNLKKIPEAKAAFTEAASVNSPYKSYAQDKLKALPATTAARKKPS*
Ga0137411_131673253300015052Vadose Zone SoilQYSEKRQRASSDKFRTAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKGPAQEKLKAIAAPPRRKAS*
Ga0137405_126434813300015053Vadose Zone SoilPNNLGMLLLLLSDYYGEKGEQLDKAEAYAKKAMSVLETAKKPDEMTDDLWKQQSGLQKGLALSSLGQVSIHKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKGPAQEKLKAMATPVRRKAS*
Ga0137420_115460423300015054Vadose Zone SoilGEQLDKAEAYAKKAASVLETAKKPDEMTDEQWKTQSGLQKGLALSSLGQINIRKKDDAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137420_142168323300015054Vadose Zone SoilVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASINSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137418_1037785213300015241Vadose Zone SoilMQKKNNAQAIQNFQAAAPLLKSSDVGYARNEYRLGFAFINLRKIPEAKTAFTQAASVNSPYKQPALDKIKALPASPPTRRKAS*
Ga0134073_1016394013300015356Grasslands SoilVTDQQWTQQKGLQKGLALSSLGQINIEKKENAQAVDNLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS*
Ga0132257_10401997713300015373Arabidopsis RhizosphereDKAEGYAQKAVTLAGSAQKPGDVPEDQWKQQTALQKGLALSTLGQINLQKKNNSGAMQNFQAAAPLLKTNDTSYARNQYRLGFALLNLKKIPEAKAAFTEAASVNSPYKSYAQDKLKALPATTAARKKPS*
Ga0182032_1053052623300016357SoilYGEKGEQLDKAETYAKKAISTLERAAKPEGVSNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0187818_1000393413300017823Freshwater SedimentMTDEQWAQQKALQKGLALSALGQVNIEKKDNAQAAENLKAAAPLLKSDDVSYARNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYKALAQDKLKDMAKPVHHKPSS
Ga0187821_1010687013300017936Freshwater SedimentAVENLKAAGPLLKLDDASYARNQYRLGFALLNLKRNEEAKVAFTQAASVNSPYKGLATEKLKALAAPVKRKPS
Ga0187821_1014923323300017936Freshwater SedimentVNIQKKDNASAAENFKAAGPLLKADENSYGKNQYRLGFALLNLKKNAEAKEAFAQSASVNSPYRALAQAKLKTFDTAAAKKKS
Ga0187821_1015438113300017936Freshwater SedimentHAVENFEAAAPLLKPDEGSYGRNQYWLGFALLNLKRNAEAKEAFTQSASVNSAYKSLAQAKLKTFAGASRH
Ga0187819_1052934213300017943Freshwater SedimentQQKALQKGLALSALGQVNIEKKDNAQAAENLKAAAPLLKSDDVSYARNQYRLGFALLNLKKNPEAKEAFTQAASVSSPYKSLALDKLKDMAKPVHHKPSS
Ga0187781_1053589413300017972Tropical PeatlandKGEQLDKAEAYAKKVGPLLDTVQKPEGVSDDQWTKQKALQKGLALSALGQVNIQKKDNAQATENLKAAAPLLKSDDGSFARNEYRLGFAYLNLKKNAEAKDAFTQAASVNSPYKQLALDKLKAMATPAHHKAS
Ga0187804_1016556513300018006Freshwater SedimentEKKDNAQAAENFKAAAPLLKSDDGSYARNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYKQLALDKLKDMAKPVHHKAS
Ga0187805_1027688913300018007Freshwater SedimentDQWAKQKALQKGLALSALGQVNIQKKDNAQAAENFKAAAPLLKSDDGSYARNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYKQLALDKLKDMAKPVHHKAS
Ga0187788_1033342213300018032Tropical PeatlandQKGLALSALGQVNMEKKDNASAVQNLRAAAPLVQADAVSYARNQYRLGFALVNLKRMPEAKDAFTQAASVNSPYKALAQDKLKSIATAASKKGPS
Ga0066655_1028374223300018431Grasslands SoilVENLRTAAPLVKADDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNTPYKGLAQDKLEGLAAPARRKAS
Ga0210407_1084637013300020579SoilLALSALGQVNIEKKNNAQAVDNLKSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS
Ga0210403_1052523313300020580SoilAVQNLKAASQLLKPDEGSYGRNQYRLGFALLNLKRNAEAKDAFTQSASVNSPYKALAQAKLKTFAAAPKQKS
Ga0210399_1076419913300020581SoilLAKAETYAKKAISLLDTATKPEGATDEQLAQQKNLQKGLALSSLGQVNIEKKANTQAVDNLKAAAPLLKQDDASYGRNQYRLGFALLNLKRTAEAKEAFTQAASVNGPYKSLAQDKLKTASAPARH
Ga0210401_1052794233300020583SoilALSALGQVNIQKKNNAQAVDNLKAAAPLLKPNANSYARNQYRLGFALLNLKRVPEAKAALTDAASVNSPYKSLAQDKLKALSGATPAHAKS
Ga0210401_1095004823300020583SoilQQKNLQKGLALSSLGQVNIEKKANAQAVDNLKAAAPLLKQDDASYGRNQYRLGFALLNLKRTAEAKEAFTQAASVNGPYKSLAQDKLKTASAPARH
Ga0210406_1075346323300021168SoilDNFKAAGPLLKADENSYGKNQYRLGFALLNLKKNAEAKEAFTQSASVNSAYKALAQAKLKSFDTAAAAKKKS
Ga0210400_1037643113300021170SoilQVNIQKKNNAQAVENLKAASSLLKPDANGYARNQYRLGFALLNLKRVPEAKAALTDAASVNSPYKTLAQDKLKAMSGTTTARAKP
Ga0210400_1054273223300021170SoilALSSLGQVNIQKKDNAQAVENFRAAAPLLKSDDGAYGRNQYRLGFALVNLKKNAEAKDAFTQSASVNSAYKALAQAKLKSFETAGKTAAK
Ga0210400_1077144313300021170SoilLALSSLGQVNIERKDNALAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRTAEAKEAFTQAASVNSPYKALAQEKLRASAGAAPARRKAP
Ga0210400_1140635423300021170SoilDEQWKQQSGLQKGLALSSLGEVDIQKKDNVTAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNPEAKEAFTKAASVNSPYKGPAQEKLKGLEGASAAPARKKAS
Ga0210405_1036948813300021171SoilGLALSSLGQVNIEKKANAQAVDNLKAAAPLLKQDDASYGRNQYRLGFALLNLKRTAEAKEAFTQAASVNGPYKSLAQDKLKTASAPARH
Ga0210405_1072405613300021171SoilEKKDNEQAVQNLKAASQLLKPDEGSYGRNQYRLGFALLNLKRNAEAKDAFTQSASVNSPYKALAQAKLKTFAAAPKQKS
Ga0210405_1101306213300021171SoilWQWQVSLEKGLALSSLGQVSIQKKDNAQAAENFKAAAPLLKTDEGSYGRNQYRLGFALLNLKRNAEAKDAFAQSASVNSAYKTLAQAKLKTFDAAATKKKS
Ga0210405_1141939523300021171SoilKKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNLKSAAPLLKSDDGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS
Ga0210408_1080639013300021178SoilQVNIQKKDNTKAVENLKAAAPMLKSDVNSYARNQYRLGFALLNLKRMPEAKAAFTDAASVNSPYKGPAQEKLKSLTHATTARAKS
Ga0210388_1116789023300021181SoilVGMLLLVSDYYGEKGEQLDKAEAYAKKAIAVLDGAKKPDEMTDDQWKQQSGLQKGLALSSLGEVDIQKKDNATAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGPAQEKLKGLAGASAAPARKKAS
Ga0210389_1056820323300021404SoilKKNNAQAVDNLKAAAPLLKPDANSYARNQYRLGFALLNLKRVPEAKAALIDAASVNSPYKSLAQDKLKALSGATAAHAKS
Ga0210392_1038554013300021475SoilALSSLGQVNIQKKDNASAADNFKAAGPLLKADENSYGKNQYRLGFALLNLKKNAEAKEAFTQSASVNSAYKALAQAKLKSFDTAAAAKKKS
Ga0187846_1045376623300021476BiofilmKKAAALLDAAKKPENLTDDQWKQQTSIQKGLALSALGQINIQRKQNALAVDSLTKAGPLLKANAFIYARNQYRLGFAYLNLKKNPEAKQAFTDAASLDTPFKGPAQQKLAELSAAKTAPKKAS
Ga0210402_1128136913300021478SoilSGLQKGLALSSLGEVNIQKKDNATAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGPAQEKLKGLAGASAAPARKKAS
Ga0210402_1145088923300021478SoilLQKGLALSALGQVNIQKKDNAQAVQNFQAAAPLLKSDDGSYGRNQYRLGFALVNLKRDPEAKEAFTQAASVNSPYKALAQAKLKTFDAAKKKS
Ga0210409_1101263523300021559SoilNMEKKDNAQAIVNLRAAAPLLKSDAGSYARNQYRLGFALLNLKRIPEAKEALTQAASVNSPYKALAEEKLKAISGPAGARRKAS
Ga0242660_104015833300022531SoilNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKALALEKLKASSGAALARRKTP
Ga0242657_106300523300022722SoilLGQVNIQKKDNAKAVDNLKAAAPLLKSDDGSYARNQYRLGFALLNLKRNAEAKDAFTQAASVNSPYKGPAQDKLKAMATPAKRKAS
Ga0242665_1007896923300022724SoilVNIQKKDNAKAVENLKAAAPLLKSDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKGPAQDKLKAMATPAKRKAS
Ga0137417_138700923300024330Vadose Zone SoilMSSLGQVNIEKKENSQAVENLRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKASAGAAQARRKTP
Ga0247668_103006813300024331SoilFKSAAPLLKADEGSFGRNQYRLGFALLNLKKNAEAREAFTQAASVNSAYKGLAQAKLKSFEAAAKQKP
Ga0207699_1013929243300025906Corn, Switchgrass And Miscanthus RhizosphereDNPQAAENFKSAAPLLKADEGSFGRNQYRLGFALLNLKKNAEAREAFTQAASVNSAYKGLAQAKLKSFEAAAKQKP
Ga0207684_1131467913300025910Corn, Switchgrass And Miscanthus RhizosphereDNAQAVQNLKTAAPMLKSDANSFARNQYRLGFALLNLKRLPEAKAAFTDAASVNSPYKALAQDKLKSMAHATTTRAKS
Ga0209438_120367013300026285Grasslands SoilNIEKKENSQAVENLRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKASAGAAQARRKTP
Ga0209240_129726413300026304Grasslands SoilAMSSLGQVNIEKKENSQAVENLRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKASAGAAQARRKTP
Ga0209239_130394713300026310Grasslands SoilISTVESSVKPEGVTDQQWTQQKGLQKGLALSSLGQINIEKKENAQAVDNLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0209154_118244713300026317SoilEKKDNAQAVENLKAAAPLVKSDDVSYARNQYRLGFALLNLKRNPEAREAFTQAASVNSPYRALAQDKLKALAAPARRKAS
Ga0257167_106494313300026376SoilIEKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFAFLNMKRNAEAKEAFTQAASVNSPFKALAQEKLKASAGAAPARRKTP
Ga0257158_105977113300026515SoilLKAAAPLLKLDDGSYGRNQYRLGFALLNLKKTAEAKEAFTQAASVNGPYKALAQDKLKAAAAPARHKTP
Ga0209806_125689323300026529SoilVENLKAAAPLVKSDDVSYARNQYRLGFALLNLKRNPEAREAFTQAASVNSPYRALAQDKLKGLAAPARRKAS
Ga0209376_106800513300026540SoilKDNAQAVENLKAAAPLLKLDDGSYARNQYRLGFALLNLKRNAEAKEALTQAASVNSPYKGLAQEKLKASSGPAAARKKAP
Ga0209648_1034917613300026551Grasslands SoilEKKENSQAVENLRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKASAGAAQARRKTP
Ga0179587_1004826313300026557Vadose Zone SoilIQKKDNASAADNFKAAGPLLKADENSYGKNQYRLGFALLNLKKNAEAKEAFTQSASVNSAYKALAQAKLKSFDTAAAAKKKS
Ga0208365_106297923300027070Forest SoilEQWAQQKNLQKGLALSSLGQVNIEKKANAQAVDNLKAAAPLLKQEDASYGRNQYRLGFALLNLKRTAEAKEAFTQAASVNGPYKSLAQDKLKTATAPARH
Ga0208603_106017413300027109Forest SoilVQNFKAAGPLLKSDEGSYGRNQYRLGFALLNLKKNAEAKDAFTQSASVNSAYKALAQAKLKTFDAAAKKKT
Ga0209735_104808913300027562Forest SoilLSALGQVNIQKKNNADAVDNLKSAAPLLKPDEVSYARNQYRLGFAFLNLKRIPEAKAAFTDAASVNSPYKALAQEKLKTLPGTAAAKKS
Ga0209525_110550613300027575Forest SoilGQVNIQKKDNATAADNFKAAGPLLKTDENSYGKNQYRLGFALLNLKKNAEAKEAFTQSASVNSAYKALAQAKLKSFDTAAAKKKS
Ga0209116_108723323300027590Forest SoilNFKAATPLLKPDANSYARNQYRLGFALLNLKKVPEAKAALTDAASVDSAYKALAQEKLKAISGATAARAKS
Ga0209528_109224723300027610Forest SoilNIQKKNDAGALDNLKSAAPLLKPDEVSYARNQYRLGFAFLNLKRIPEAKAALTDAASVNSPYRALAQEKLKTLPGTTAAKKS
Ga0209076_122539113300027643Vadose Zone SoilSVLDTSAKPEGVTDEQWTQQKSLQKGLALSSMGQVNIGKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRAAEAKEAFTQAASVNSPYKALAQEKLKASAGATPARRKAP
Ga0209117_106205833300027645Forest SoilGLALSSLGQVNIQKKDNAQAVGNFRAAAPLLKADDGSYARNQYRLGFALLNLKRNAEAKEAFSQAASVNSPYKAPAQEKLKGLAGPGAAPAHRKPS
Ga0209388_104291843300027655Vadose Zone SoilKGLALSALGQVNIQKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS
Ga0208990_118162623300027663Forest SoilKQQSGLQKGLALSSLGQINIQKKDNAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS
Ga0209588_125304923300027671Vadose Zone SoilKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS
Ga0209689_100873013300027748SoilQAVDNLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0209772_1014286323300027768Bog Forest SoilQQISLQKGLALSSLGQVNIQKKDNAQAVENFRAAAPLLKSDDGAYGRNQYRLGFALVNLKKNAEAKDAFTQSASVNSAYKALAQAKLKTFETAGKTAAK
Ga0209074_1035034823300027787Agricultural SoilKDNTQAAESFKSAAPLLKPDEGSYGRNQYWLGFALLNLKKNAEAKEAFTEAASVNSAYKGLAQAKLKTFEAASRKKS
Ga0209283_1065908113300027875Vadose Zone SoilGQINIQKKNNAQAVDNFKSAAPLLKSDDGGYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAATKKS
Ga0209275_1032642323300027884SoilNAQAVDNFKAAAPLLKPDANSYARNQYRLGFALLNLKRVPEAKAALTDAASVNSPYKSLAQDKLKALSGATPAHAKS
Ga0209380_1014428233300027889SoilGLALSALGQVNIQKKNNAQAVDNLKAAAPLLKPNANSYARNQYRLGFALLNLKRVPEAKAALTDAASVNSPYKSLAQDKLKALSGATPAHAKS
Ga0209488_1054047513300027903Vadose Zone SoilLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYRGPAQEKLKAMAAPPRRKAS
Ga0209583_1008912943300027910WatershedsGLALSSLGQVGIERKDNAQAVENLKAAEPLLKPDDGSYARNQYRLGFAFLNLKKNAEAKEALTQAASVNSPYKQVALDKLKSLSAPPKRKAR
Ga0209698_1008963413300027911WatershedsPAVDNLTKAAPLLKPNNLAYARNQYRLGFAYANLKKSAEARQAFTDAASVESPYKGPAQEKLKALANAKPAGKSAAKKPQ
Ga0209698_1017192013300027911WatershedsSSLGQVEIEKKDNAKAVENLKIAGPLLKPDDGSYARNQYRLGFALLNLKKNAEAKAAFTQAASVNSPYKQVALDKLKSISAPSKRKAP
Ga0209698_1093314813300027911WatershedsSSLGQVEIEKKDNAKAVENLKIAGPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKQVALDKLKSMSAPSKRKAP
Ga0209526_1004930063300028047Forest SoilEMTDDQWKQQSQLQKGLALSSLGQIDIQKKDNAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPVRKKAS
Ga0209526_1075758413300028047Forest SoilQLDKAESYAKKSISLLDAAKKPEGVTDEQWKQQTSLQKGLALSALGQVNIQKKNDAGALDNLKSAAPLLKPDEVSYARNQYRLGFAFLNLKRIPEAKAALTDAASVNSPYRALAQEKLKTLPGTTAAKKS
Ga0209526_1092685613300028047Forest SoilDKAESYAKKSVSLLETAKKPDGVTDEQWKQQTGLQKGLALSALGQVNIQKKNNAQAVDNFRSAAPLLKADDGGYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTAANKS
Ga0257175_112889123300028673SoilDQWKQQSALQKGLALSSLGQVNIQKKDNAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS
Ga0308309_1075140113300028906SoilIQKKDNASAADNFKAAGPLLKADENSYGKNQYRLGFALLNLKKNAEAKEAFTQSASVNSPYKALAQAKLKSFDTAAAAKKKS
Ga0222749_1010636713300029636SoilKQQSGLQKGLALSALGQIDIQKKNNAQAVDEFKSAGPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKGLPGTPAAKKS
Ga0265753_101082413300030862SoilNAQAVENFRAAAPLLKSDDGAYGRNQYRLGFALVNLKKNAEAKDAFTQSASVNSAYKTLAQAKLKTFETAGKTAAK
Ga0265773_103557523300031018SoilGQVNIQKKDNAQAVENFRAAAPMLKSDDGAYGRNQYRLGFALLNLKKNAEAKDAFTQSASVNSAYKALAQAKLKTFETAGKTAAK
Ga0310686_10419807713300031708SoilFRAAAPLLKSDDGAYGRNQYRLGFALVNLKKNAEAKDAFTQSASVNSAYKALAQAKLKTFETAGKTAAK
Ga0310686_11318571913300031708SoilFRAAAPLLKSDDGAYGRNQYRLGFALVNLKKNAEAKDAFTQSASVNSAYKTLAQAKLKTFETAGKTAAK
Ga0307469_1013021113300031720Hardwood Forest SoilESYAKKSISLLDAGKKPEGVTDEQWKQQSALQKGLALSALGQVNIQKKNNADAVDNLKSAAPLLKPDEVSYARNQYRLGFAFLNLKRIPEAKAAFTDAASVNSPYKAPAQDKLKTLPGTAAAKKS
Ga0307469_1183208823300031720Hardwood Forest SoilANAQAVDNLKAAAPLLKQDDASYGRNQYRLGFALLNLKRTAEAKEAFTQAASVNGPYKSLAQDKLKTAAAPARH
Ga0307477_1048602113300031753Hardwood Forest SoilLLETAKKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS
Ga0307475_1057213813300031754Hardwood Forest SoilDKAEAYAKKSVALLETAKKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKSAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATATNKS
Ga0307475_1116938413300031754Hardwood Forest SoilKHDNVQAVDNLKTAAPLLKSDAGAYARNQYRLGFALLNLKKVQDAKAALTEAASVNSPYKALAQDKLKSLPTVSAGRAKP
Ga0307473_1096671713300031820Hardwood Forest SoilESSVKPEGVTDQQWTQQKGLQKGLALSSLGQINIEKKENAQAVDNLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0307478_1150456913300031823Hardwood Forest SoilQLDKAEAYAKKSAALLETAKKPDGVSDELWKQQSALQKGLALSALGQVNIEKKNNAQAVDNLKSAAPLLKADDGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS
Ga0310913_1058307623300031945SoilLERAAKPEGVSNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0318530_1021298813300031959SoilQQKGLQKGLALSSLGQINIEKKENAQAVDNLRAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAPPARRKAS
Ga0307479_1013275313300031962Hardwood Forest SoilPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKAPAQEKLKALPATAANKS
Ga0307479_1080447823300031962Hardwood Forest SoilNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS
Ga0307479_1105255623300031962Hardwood Forest SoilWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDDGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPATAANKS
Ga0307479_1177973413300031962Hardwood Forest SoilSLLDTAKKPEGVTDEQWKQQSGLQKGLALSALGQVNIQKKNNAQAVDNLRSAAPLLKADDGGYARNQYRLGFALLNLKRLPEAKAAFTDAASVNSPYKALAQEKLKGLPATAANKS
Ga0310911_1072054513300032035SoilEEQWTQQKGLQKGLALSSLGQINIEKKENAQAVDNLRAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAPPARRKAS
Ga0318533_1071292323300032059SoilEEQWTQQKGLQKGLALSSLGQINIEKKENAQAVDNLRAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAPPARRKVS
Ga0307472_10130114323300032205Hardwood Forest SoilKKPEGVADEQWKHQSSLQKGLALSALGQVNIQKKNNAEAVDNFKSAAPLLRPDEVSYARNQYRLGFALLNLKRVPEAKAALTDAASVNSPYKALAQEKLKSLPGTTASKKS
Ga0318519_1075622813300033290SoilAAKPEGVSNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0318519_1078873523300033290SoilQINIEKKENAQAVDNLRAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAPPARRKAS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.