NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F060804



Overview

Basic Information
Family ID F060804
Family Type Metagenome / Metatranscriptome
Number of Sequences 132
Average Sequence Length 174 residues
Representative Sequence MERMKLRNVAAGVAVLAVALGFASAGKAQEQTQELSRETTTPAPEQLRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVSVPGDHSVKSAPRKEGLIEGRSSTQ
Number of Associated Samples 99
Number of Associated Scaffolds 132

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 69.44 %
% of genes near scaffold ends (potentially truncated) 52.27 %
% of genes from short scaffolds (< 2000 bps) 45.45 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.46


Most Common Taxonomy
Group Bacteria (54.545 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (38.636 % of family members)
Environment Ontology (ENVO) Unclassified (37.879 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (56.818 % of family members)
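
The "% of family members" figures above can be turned back into member counts. The following is a minimal sketch, assuming each of those percentages is taken relative to the 132 sequences reported under Basic Information; the function name `members_from_percent` is hypothetical and not part of NMPFamsDB.

```python
# Assumption: "% of family members" is relative to the family size (132 sequences).
FAMILY_SIZE = 132


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage of family members into a whole-number count."""
    if not 0.0 <= percent <= 100.0:
        raise ValueError(f"percentage out of range: {percent}")
    # The page rounds percentages to three decimals, so round back to the nearest member.
    return round(percent / 100.0 * family_size)


if __name__ == "__main__":
    reported = {
        "Bacteria (NCBI taxonomy)": 54.545,
        "Vadose Zone Soil (GOLD)": 38.636,
        "Unclassified (ENVO)": 37.879,
        "Soil, non-saline (EMPO)": 56.818,
    }
    for label, percent in reported.items():
        print(f"{label}: {members_from_percent(percent)} of {FAMILY_SIZE}")
```

Under that assumption the four values correspond to 72, 51, 50 and 75 members respectively. Percentages in the Quality Assessment block may use a different denominator, so the sketch should not be applied to them without checking.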



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 1.96%, β-sheet: 26.47%, Coil/Unstructured: 71.57%

Predicted 3D Structure

pTM-score: 0.46

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
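
The low-confidence rule stated above (pTM < 0.7) is simple enough to express as a check. This is only an illustrative sketch: the 0.7 cutoff comes from this page, while the function name `is_low_confidence_model` is hypothetical and no claim is made about how NMPFamsDB implements the rule internally.

```python
# Cutoff quoted on this page: models with pTM below 0.7 are flagged as low confidence.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence_model(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a predicted 3D model falls below the pTM cutoff."""
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError(f"pTM-score must lie in [0, 1], got {ptm_score}")
    return ptm_score < cutoff


if __name__ == "__main__":
    # Family F060804 reports a pTM-score of 0.46.
    print(is_low_confidence_model(0.46))
```

For this family the check is true, which is consistent with the model not having been screened against SCOPe or PDB.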



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 132 Family Scaffolds
PF03167 | UDG | 37.12
PF13714 | PEP_mutase | 5.30
PF00072 | Response_reg | 0.76
PF00218 | IGPS | 0.76
PF04343 | DUF488 | 0.76
PF00459 | Inositol_P | 0.76

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 132 Family Scaffolds
COG0692 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 37.12
COG1573 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 37.12
COG3663 | G:T/U-mismatch repair DNA glycosylase | Replication, recombination and repair [L] | 37.12
COG0134 | Indole-3-glycerol phosphate synthase | Amino acid transport and metabolism [E] | 0.76
COG3189 | Uncharacterized conserved protein YeaO, DUF488 family | Function unknown [S] | 0.76



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 54.55 %
Unclassified | root | N/A | 45.45 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300005179|Ga0066684_10832600 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium URHE0068 | 607
3300005181|Ga0066678_10010657 | All Organisms → cellular organisms → Bacteria | 4469
3300005554|Ga0066661_10836244 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 538
3300005555|Ga0066692_10261453 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1094
3300005561|Ga0066699_10082055 | All Organisms → cellular organisms → Bacteria | 2082
3300005568|Ga0066703_10078496 | All Organisms → cellular organisms → Bacteria | 1905
3300005586|Ga0066691_10190258 | All Organisms → cellular organisms → Bacteria | 1192
3300006796|Ga0066665_10442521 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1073
3300009012|Ga0066710_101731979 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 949
3300009089|Ga0099828_10030320 | All Organisms → cellular organisms → Bacteria | 4348
3300009089|Ga0099828_11493576 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 596
3300009090|Ga0099827_10343601 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1269
3300010326|Ga0134065_10188286 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 741
3300011269|Ga0137392_10071073 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2678
3300011269|Ga0137392_10758540 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 802
3300011269|Ga0137392_11273176 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 594
3300011269|Ga0137392_11492797 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 535
3300011270|Ga0137391_10217491 | All Organisms → cellular organisms → Bacteria | 1660
3300011271|Ga0137393_10353539 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1255
3300011271|Ga0137393_10451129 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1102
3300012096|Ga0137389_10549495 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 992
3300012096|Ga0137389_10845899 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 785
3300012096|Ga0137389_11034063 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 704
3300012096|Ga0137389_11200972 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 650
3300012096|Ga0137389_11540508 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 561
3300012189|Ga0137388_10208881 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1762
3300012198|Ga0137364_10776568 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 724
3300012199|Ga0137383_10109110 | All Organisms → cellular organisms → Bacteria | 2013
3300012202|Ga0137363_10913048 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 745
3300012203|Ga0137399_10321590 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1281
3300012205|Ga0137362_10921520 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 746
3300012211|Ga0137377_10629662 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1009
3300012224|Ga0134028_1303158 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 529
3300012349|Ga0137387_10544648 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 842
3300012351|Ga0137386_10376029 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1023
3300012362|Ga0137361_11752973 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 540
3300012582|Ga0137358_10033442 | All Organisms → cellular organisms → Bacteria | 3383
3300012917|Ga0137395_10079967 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2130
3300012918|Ga0137396_10140448 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1753
3300012925|Ga0137419_10114528 | All Organisms → cellular organisms → Bacteria | 1892
3300012925|Ga0137419_10491283 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 973
3300012925|Ga0137419_11003180 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 692
3300012927|Ga0137416_10640947 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 929
3300012944|Ga0137410_11887394 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 529
3300012957|Ga0164303_10387371 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 858
3300014150|Ga0134081_10276874 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 595
3300014154|Ga0134075_10374868 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 626
3300015051|Ga0137414_1061577 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1517
3300015242|Ga0137412_10933087 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 626
3300015245|Ga0137409_10745210 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium URHE0068 | 814
3300018468|Ga0066662_10165849 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1695
3300018468|Ga0066662_10466592 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1137
3300020583|Ga0210401_10321920 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1407
3300021170|Ga0210400_10525054 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 977
3300021479|Ga0210410_10602865 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 975
3300024330|Ga0137417_1398780 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2183
3300026304|Ga0209240_1216104 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 581
3300026309|Ga0209055_1039791 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2086
3300026333|Ga0209158_1007189 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5770
3300026538|Ga0209056_10576250 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 569
3300026551|Ga0209648_10697593 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 554
3300026557|Ga0179587_10933877 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 572
3300027645|Ga0209117_1051015 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1225
3300027671|Ga0209588_1042667 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1466
3300027674|Ga0209118_1005162 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4990
3300027674|Ga0209118_1144223 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 658
3300027765|Ga0209073_10423667 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 549
3300027846|Ga0209180_10095180 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1693
3300027862|Ga0209701_10063018 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2359
3300027862|Ga0209701_10647772 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 553
3300031754|Ga0307475_10405780 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1095
3300031823|Ga0307478_11811815 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 502
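
Each scaffold identifier in the table above pairs a numeric prefix with a scaffold name, joined by a `|` character (for example `3300005179|Ga0066684_10832600`). The sketch below splits such an identifier into its two parts. It assumes the numeric prefix is the Taxon OID listed under Associated Samples, which matches the values on this page; the names `ScaffoldRef` and `parse_scaffold_ref` are hypothetical.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # assumed to be the sample's Taxon OID, e.g. "3300005179"
    scaffold_id: str  # scaffold name within that sample, e.g. "Ga0066684_10832600"


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split a '<taxon OID>|<scaffold ID>' string into its two components."""
    taxon_oid, separator, scaffold_id = text.strip().partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"not a '<taxon OID>|<scaffold ID>' identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


if __name__ == "__main__":
    ref = parse_scaffold_ref("3300005179|Ga0066684_10832600")
    print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the Taxon OID as a string avoids any accidental numeric reformatting when the value is later used to look up a sample.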




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 38.64%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 21.21%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 9.85%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.30%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 4.55%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.79%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.03%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.03%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.27%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.27%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.52%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.52%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.76%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.76%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.76%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.76%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG | Environmental
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012224 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012957 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG | Environmental
3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300015051 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300016404 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300017930 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026314 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes) | Environmental
3300026317 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026527 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027071 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM1H0_M2 (SPAdes) | Environmental
3300027645 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental
3300027765 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0066672_101467132 | 3300005167 | Soil | MNYLRVRAFFVGTSILAVVLTGASALCAQEPAQESSKQTAAQPPEPPRHRVDGERHGLPPESFAVVPGTKFLVSLQDELGTKGRQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSVPGDHSVKPVPNQE
Ga0066672_107245441 | 3300005167 | Soil | MNYLRVRAFFVGASILAVVLTGASAWCAQETAQESSKQTAEPPRHRVDGERHGLPLESFAVVPGTKFLVSLQDELGTKGTQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSVPGDHSVKPVPNQE
Ga0066673_101029631 | 3300005175 | Soil | MAGFADRPSDNPLYLPCCTSKCMFFNPLRCLEAASNGSPSALPSLLSISREETMERKTLRSVFVGFAILATALVLAGTCRAQDQSSETPREATAQAPEQPRRKADGERHGLPVETYAVAPGTKFLVRLEEELGTKGTQENAKFKVKTLEPLEAGSRIYLPPGAEILGHVSHVEPAGVAGRAKIWLTFDEIRTKFGSLPIVAEVVSVPGDHSVKSANTKEGLIEGRSSNQHDAAEAAAAG
Ga0066679_104920181 | 3300005176 | Soil | MNYLRVRAFFVGASILAVVLTGASAWCAQETAQESSKQTAEPPRHRVDGERHGLPLESFAVVPGTKFLVSLQDELGTKGTQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSV
Ga0066684_108326001 | 3300005179 | Soil | MEPRKLNSVSAGIAILTVTLLPGVVGVAQEPVDGPARPAPMQAADQLRQKADGERHGLPIETFAVAPGTKFLVRLEDELGTKGPKENEKFKVKTLEPLEAGSGIYLPSGAEICGHISHVEPAGVAGRAKLWLTFDEIRTTFGALPIVAEVVSVPGDHSVKPGPRQEGLIEGRSSNQHDAAEA
Ga0066678_100106571 | 3300005181 | Soil | MEPRKLNSVSAGIAILTVTLLPGVVGGAQEPVDGPARPAPMQAADQLRQKADGERHGLPIETFAVAPGTKFLVRLEDELGTKGPKENEKFKVKTLEPLEAGSGIYLPSGAEICGHISHVEPAGVAGRAKLWLTFDEIRTTFGALPIVAEVVSVPGDHSVKPGPRQEG
Ga0066675_114014071 | 3300005187 | Soil | TARALFASTTILAIVLSEAIALRAQEPSQEPNKETIPQASEPARYKVDGERHGLPPETFAVAPGTKFLVGLEDGLNTKGTQENSPFKVKTLEPLEAGSGFYLPPGAEIVGHVSRVEPAGVAGRAKLWLTFDEMHTKFGALPIVAEVVSVPGDHSVKTVPHQEGLIAGRTG
Ga0066388_1047028781 | 3300005332 | Tropical Forest Soil | MRHFGSKAFLFGMAAFVSSLGAGAALRTQEPSQEPSREAAQQAQSQELRHKADGERHGLPVETFAVTPGTKFLVKLEEELGTKGTQENSKFKVRTLEPLEAGSGIYLPPGAEIEGHISRVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVSVPGDHSVKPVPKQEGLIAGRSSTQQDAAAAAGAGAAIGAVKGVKDKD
Ga0066687_100840431 | 3300005454 | Soil | MNYLRVRAFFVGTSILAVVLTGASALCAQEPAQESSKQTAAQPPEPPRHRVDGERHGLPPESFAVVPGTKFLVSLQDELGTKGRQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSV
Ga0070732_106514371 | 3300005542 | Surface Soil | MNRPVVRSFFTIAPILAVALGGGIALRAQEPPQEPTKEVNAQAPDLRHRADGERHGLPLETFAVTPGTKFLVSLQEDLSTKGTQENSGFKVKTLEPLEAGSGFYLPSGAEIVGHISRVESAGVAGRAKLWLTFDEIHTKFGNLPIVAEVAGVPGDHSVKPVPEKEGLIQG
Ga0066661_108362441 | 3300005554 | Soil | MEPRKLNSVSAGIAILTVTLLPGVVGGAQEPVDGPARPAPMQAADQLRQKADGERHGLPIETFAVAPGTKFLVRLEDELGTKGPKENEKFKVKTLEPLEAGSGIYLPSGAEICGHISHVEPAGVAGRAKLWLTFDEIRTTFGALPIVAEVVSVPGDHSVKPGPR
Ga0066692_102614531 | 3300005555 | Soil | MEPRKLNSVSAGIAILTVTLLPGVVGGAQEPVDGPARPAPMQAADQLRQKADGERHGLPIETFAVAPGTKFLVRLEDELGTKGRKENEKFKVKTLEPLEAGSGIYLPSGAEICGHISHVEPAGVAGRAKLWLTFDEIRTTFGALPIVAEVVSVPGDHSVKPGPRQEGLIEGRSSNQHDAA
Ga0066699_100820553 | 3300005561 | Soil | MEPRKLNSVSAGIAILTVTLLPGVVGGAQEPVDGPARPAPMQAADQLRQKADGERHGLPIETFAVAPGTKFLVRLEDELGTKGPKENEKFKVKTLEPLEAGSGIYLPSGAEICGHISHVEPAGVAGRAKLWLTFDEIRTTFGALPIVAEVVSVPG
Ga0066699_101747211 | 3300005561 | Soil | MNYLRVRAFFVGASILAVVLTGASAWCAQETAQESSKQTAEPPRHRVDGERHGLPLESFAVVPGTKFLVSLQDELGTKGTQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSVPGDHSVKPVPNQEGLIQGR
Ga0066693_100267043 | 3300005566 | Soil | MNHPRTRTLFAATPILTFTLSAAITMLAQEPPQEPGKEPTAQAPEQPRHRADGERHGLPLEIFAVTPGTKFLVSLQDELSTKGTQENAVFKVKTLEPLEAGSGFYLPAGAEIVGHISRVEPARVAGHAKLWLTFDEIHTTFGDL
Ga0066703_100784963 | 3300005568 | Soil | MERRKLSGVFVRVTILVTALVLAGTCRAQDQSSETPREATTQAPDQQRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKIWLTFDEIHTKFGSLPIVAEVVSVPGDHSVKSANTKEGLIEGRSSNQHDAAEAAAAGAAIGA
Ga0066691_101902582 | 3300005586 | Soil | MERRKLSGVFVRVTILVTALVLAGTCRAQDQSSETPREATTQAPDQQRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKETQENAKFKVKTLEPLEAGSGVYLPPGAEIQGHISHVEPAGVAGRAKLWLTFDEIRTKFGSLPIVAEVVSVPGD
Ga0066691_106656061 | 3300005586 | Soil | MNCLRVRPLVVRTSILALVLRGAIALRAQEPAQESTKETTSQAPGQPRHRADGERHGLPLETFAVTPGTKFLVRLEDELGTKGTQENSKFKVKTLEPLEAGSGTYLPSGAEIEGHISRVEPAGVAGRAKLWLTFDEIHTRF
Ga0066903_1018851841 | 3300005764 | Tropical Forest Soil | MNLTVRSFVITQILAVVLGSAIVLRGQEPPPQEPAKETASQAPEQPRHRADGDRHGLPLETFAVTPGTKFLVSLEEDLSTKGTQENAVFKVKTLEPLEAGSGFYLPSGAEIVGHISRVEPAGVAGRAKLWLTFDEIRTKFGD
Ga0066651_107574991 | 3300006031 | Soil | MNHPRSRALFVATPVLTITLSAAIALLAQEPPQEPGKEPTAQAPEQPRHRADGERHGLPLEIFAVTPGTKFLVSLQDELSTKGTQENAVFKVKTLEPLEAGSGFYLPAGAEIVGHISRVEPARVAGHAKLWLTFDEIHTTFGDLPI
Ga0079222_101603931 | 3300006755 | Agricultural Soil | MNRLKFLSRFTIAPILVVALGGAIPLHAQEPPQEPSKEATVPAPDLRHRADGERHGLPLETFAVTPGTKFLVSLREDLSTKGTQENAVFKVKTLEPLEAGSGFYLPAGAEIVGHISRVESAGVAGRAKLWLTFDEIHTKFGDLPIVAEVSDVPGDHSVKPVP
Ga0066665_104425212 | 3300006796 | Soil | MERRKLSDITVRVTILATALSLAGTCGAQDQSSETPREAAAQEPEQPRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIQGHISHVEPAGVAGRAKLWLTFDEIRTKFGSLPIVAEVVSVPGDHSVKSANTKEGLIEGRSSNQHDAAEAAAAGAAIGAVKGVKDKN
Ga0066660_100907661 | 3300006800 | Soil | MNYLRVRAFFVGASILAVVLTGASAWCAQETAQESSKQTAEPPRHRVDGERHGLPLESFAVVPGTKFLVSLQDELGTKGTQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSVPGDHSVKPVPNQEG
Ga0066660_101031591 | 3300006800 | Soil | MNYLRVRAFFVGTSILAVVLTGASALCAQEPAQESSKQTAAQPPEPPRHRVDGERHGLPPESFAVVPGTKFLVSLQDELGTKGRQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSVPGDHSVKPVPNQEG
Ga0066660_106533621 | 3300006800 | Soil | MNHPRTRTLFAATPVLTFTLSAAITLLAQEPPQEPGKEPTAQAPEQPRHRADGERHGLPLEIFAVTPRTKFLVSLQDELSTKGTQENAVFKVKTLEPLEAGSGFYLPAGAEIVGHISRVEPAGVAGRAKLWLTFDEIHTKFGDLPIVAEVAGVPGDHSVKPVPNQEGVIAGRSSTQQDAAAAAAAG
Ga0079220_100612751 | 3300006806 | Agricultural Soil | MNRPVVRSFFTITPILAVALGGGIASRAQEPPQEPTKEVAAQGPDLRHRADGERHGLPLETFAVTPGTKFLVSLQEDLTSKGTQENATFKVKTLEPLEAGSGFYLPAGAEIVGHISRVESAGVAGRAKLWLTFDEIHTKFGNLPIVAEVASVPGDHSVKPVPKKEGLIQGRSSTQQD
Ga0079220_105540171 | 3300006806 | Agricultural Soil | MNCLRIRPFVISMSILAVGLSGAIALRAQEPTQQPNRETAQAPDLRHRADGERHGLPLETFAVTPGTKFLVSLQDELGTKGTQENSRFKVKTLEPLEAGSGFYLPSGAEIAGHISRVEPAGVAGRAKIWLTFDEIHTRFGDLPIVAEVVSVPGDHSV
Ga0075426_104030082 | 3300006903 | Populus Rhizosphere | MNHLTIRAFFVVPILAVVVGTAIASHAQEPPQEPSKETAAQSPEPPRHRADGELHGLPLETFAVTPGTKFLVSLQYDLSTKGTQENAVFKVKTLEPLEAGSGFYLPSGAEIIGHISRVEPAGVAGRAKLWLTFDEIHTKFGDLPIVAEVAGVPGDHSVKPVPNQEGLIAG
Ga0075436_10008599713300006914Populus RhizosphereMNYLKIRAFFVAVPLLALVLSSAMNIRAQEPPQEPSKEPTAQAPEPSRHRADGDRHGLPLETFAVTPGTKFLVRLQEELSSKGTQENAKFLVKTLEPLEAGSGFYLPSGAEIVGHISRVEPAGVAGRAKLWLTFDEIRTKFGDLPIVAEVASVPGDHSVRPVPNQEGVIAGRSS
Ga0079219_1175251013300006954Agricultural SoilMNCLRIRPFVISMSILAVGLSGAIALRAQEPAQQPSRETAQAPDLRHRADGERHGLPLETFAVTPGTKFLVSLQDELGTKGAQENSRFKVKTLEPLEAGSGFYLPSGAEIEGHISRVEPAGVAGRAKIWLTFDEIHTKFGDLPIVAEVVSVPGDH
Ga0066710_10173197923300009012Grasslands SoilMERRKLSGVFVRVTILATALGLAGTCRAQDQSSERPREATTQAPEQPRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDEIRTKFGSLPIVAEVVSVPG
Ga0099830_1049382523300009088Vadose Zone SoilMSYLRVRTFFITMPILAVVLSGAIALRAQEPAQESTKETTSQAPEQPRHRADGERHGLPLETFAVTPGTKFLVRLEEELGTKGTQENSTFKVKTLEPLEAGSGIYLPSGAEIEGHVSRVGPAGVAGRAKLWLTFDEIHTKFGDLPIVAEVVSVPGDHSVKPVPNQEGLIAGRSSTQQDAAAAAAAG
Ga0099828_1003032063300009089Vadose Zone SoilMERRKLSSVFASIAILAMMCASGVVCLAQETTQEPARETTRQAPEQLRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTRGTKENEKFKVKTLEPLEAGNGIYLPAGAEVRGHISHVESAGVAGRAKLWLTFDEIHTKFGSPPIVAEVVSVPGDHSVKPGGTQEGLIEGRSSNQHDATEAAAAGAAIGAV
Ga0099828_1149357613300009089Vadose Zone SoilYSREETMERRKLSSVFASIAILAMMCASGVVCLAQETTQEPARETTRQAPEQLRQKADGERHGLPVETYAVAPGTKFLVRLEDDLGTKATKENDRFKVRTLEPLEAGNGIYLPPGAEVRGHISHVEPAGVAGRAKIWLTFDEIHTKFGTLPITAEVVSVPGDHSVKPGGTQEGLIEGRSSNQHDATEAAAAGAAIGAV
Ga0099827_1034360113300009090Vadose Zone SoilVAAEVQSGLGQFRAEETMERPKIGTVTASVAILAVTLASGIVAAGQEAPQPPTRDAASPVPEQPRQKADGERHGLPLETYAVVPGTKFLVRLEDELSTRNNEENRKFKVKTLEPLEAGSGIYLPPGAEIHGHISRVEPAGVAGRAKLWLTFDEIRTKFGKLPIVAEVVSVPGDHSVKSGQAQEGL
Ga0126374_1173550413300009792Tropical Forest SoilSISSQLQPPLQVLFWRKGCIGHPDGQISHWRENMSYPRVRAFFVVTPILAIVLGGTCTLRAQEPPQEPTREPATQAPDQRHRADGERHGLPLETFAVTPGTKFLVRLEDELNSKGTQENAVFKVKTLEPLEAGSGIYLPSGAEIVGHISRVESAGVAGRAKLWLTFDEIHTK
Ga0126382_1218004813300010047Tropical Forest SoilAFLFGMAAFLSSLGAGTALRAQEPSQEPPREAAQQAQSQDLRHKADGDRHGLPIETFAVTPGTKFLVKLEEELGTKGTQENAKFKVRTLEPLEAGSGIYLPPGAEIQGHVSRVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVGVPGDHSVKPVPNQEGLIAGRSSTQQDAAAA
Ga0126373_1034612123300010048Tropical Forest SoilMNRLKVRSLLVITQILAVVLGSAIASRAQEPPQEPIRETTSQSPEPPRHRADGERHGLPLETFAVTPGTKFLVSLQDELGTKGTQENSRFKVKTLEPLEAGSGFYLPSGAEIEGHISRVEPAGVAGRAKIWLTFDEIHTRFGDLPIVAEVVSVPGDHS
Ga0134065_1012658113300010326Grasslands SoilMNHPRTRTLFAATPILTFTLSAAITMLAQEPPQEPGKEPTAQAPEQPRHRADGERHGMPLEIFAVTPGTKFLVSLQDELSTKGTQENAVFKVKTLEPLEAGSGFYLPAGAEIVGHISRVEPAGVAGRAKLWLTFDEIHTKFGDLPI
Ga0134065_1018828613300010326Grasslands SoilMERRKLSGVFVRVTILATALSLAGTCRAQDQSSETPREATTQAPDQQRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKLWLTFDEIRTKFGSLPIVAEVVSVPGDHSVKSANTKEGLIEGRSSNQHDAAEAAAAGAAIGAVKGVKDKNKKEAAEGAAF
Ga0134062_1014519723300010337Grasslands SoilMNGLKLRAIFIGAPMLVVALSGAISLSAEEPVEGPSNGRAAQGDDPSRHPADGERHGLPPETFAVTPGTKFLVRLEDELSTKGTQENSTFKVKTLEPLEAGSGFYLPSGAEIAGHISRVEAAGVAGRAKIWLTFDELHTKFGDL
Ga0126376_1081022813300010359Tropical Forest SoilMKNLGSKAAFIGLAMLLGSLSGGTASRAQGAAQEPTREAPTQSQEPRHKADGERHGLPVETFAVTPGTKFLVKLEDELSTKGTQENAKFKVKTVEPLEAGSGIYLPSGAEIEGHVSHIEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVIGVPGDHSVKPVPNQEGLIAGRSSTQQDAAAAAGAGAAI
Ga0126376_1127254313300010359Tropical Forest SoilMAAFVSLLGAVAALRAQEPPQEPSREPQAPEQQLRHKADGDRHGLPIETFAVTPGTKFLVKLEEELGTKGTQENTKFKVRTLEPLEAGSGIYLPPGAEIEGHVSRVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVGVPGDHSVKPVPNQEGLIAGRSSTQQGAAAAAGAGAAF
Ga0126376_1302213213300010359Tropical Forest SoilFVVTTIFAVVIGGAVTSRAQEPPQEPTREPAAQAPDQQRHRADGERHGLPLETFAVTPGTKFLVSLQDELNSKGTQENAVFKVKTLEPLEAGSGIYLPSGAEIVGHISRVESAGVAGRAKLWLTFDEIHTRFGDLPIVAEVAGIPGDHSVKPVPDKEGLIQGRSSTQQDAAAA
Ga0126372_1090306623300010360Tropical Forest SoilMQVAAQQMLKKKPLSIPWEENMKQRKLKATLVSTSAFLAVLGAGIASSAQETAPPTTREAAAQAPEPPRQKADGDRHGLPLETYAVVPGTKFLVRLEDELGTKGMQENAKFRVRTLEPLEAGSGIYLPSGAEIHGHISRVEPAGVAGRAKLWLTFDEIRTSFGVLPIVAEVVSVPGDHSVKPVPNQEGLIAGRSSTQ
Ga0126372_1108473513300010360Tropical Forest SoilMAAVVCSLCAVTALRAQEPSQEPSREAAQQAQSQELRHKADGERHGLPVETFAVTPGTKFLVKLEEELGTKGTQENSKFKVRTLEPLEAGSGIYLPPGAEIEGHISRVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVSVPGDHSVKPVPNQEGLIAGRSSTQQDAAAAAGAGAAIGAVKGVKDKDK
Ga0126377_1229182913300010362Tropical Forest SoilMNCLKLRAIFASAPVLVVALSGAISLRAQATGDRPSNEPAAQGTDQSRHRADGERHGLPLETFAVTPGTKFLVRLEDELSTKGTPENSTFRVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAQVAGRAKLWLTFDEIHTKFGVLPIVAEVASVPGDHSVKPS
Ga0126379_1069894323300010366Tropical Forest SoilMNHPKVRAFFVVTTIFAVVIGGVVTSRAQEPPQEPTREPAAQAPDQQRHRADGERHGLPLETFAVTPGTKFLVRLQDELNSKGTQENAVFKVKTLEPLEAGSGIYLPSGAEIVGHISRVESAGVAGRAKLWLTFDEIHTRFGDLPIVAEVAGIPGDHSVKPVPDKEGLIQGRSSTQQDAAAAAAAGAALGAVKGVKDK
Ga0126379_1344401313300010366Tropical Forest SoilAIVLGGTCTLRAQEPPQEPTREPATQAPDQRHRADGERHGLPLETFAVTPGTKFLVRLQDELNSKGTQENAVFKVKTLEPLEAGSGIYLPSGAEIVGHISRVESAGIAGRAKLWLTFDEIHTRFGDLPIVAEVAGIPGDHSVKPVPEKEGLIQGRSSTQQDAAAAAAAGAALGAVK
Ga0137392_1007107343300011269Vadose Zone SoilMERMKLSNVPAKIAILTMTLALGAACKAQEQSQEPSREATTQASEQPRQKADGGRHGLPIETYAVAPGTKFLVRLEEELGTRGTKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKMWLTFDEIHTKFGSLPIVAEIVSVPGDHSVKPGGTQEGLIEGRSSNQHDAAEAAAAGAAIGAVKGVKD
Ga0137392_1075854013300011269Vadose Zone SoilMERMKLRSVAAGVAILAGTFGLGATCKAQGQSPEPPREATIQAPEQQRQKADGERHGLPLETYAVAPGTKFLVRLEDELGTKDIKENEKFKVRTLEPLEAGSGIYLPAGAEVRGHISLVEPAGVAGRAKLWLTFDEIRTKFGSLPIVAEVVSVPGDHSVKSASTKEGLIE
Ga0137392_1127317613300011269Vadose Zone SoilMALGFGSVGKAQDQTQEPSRETTTPAPEQLRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTQENTKFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVSVPGDHSVKSAPRKE
Ga0137392_1149279713300011269Vadose Zone SoilSREETMERRKLRSVFASIAILAMMCASGVVCLAQETTQEPARETTRQASEQLRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTRGTKENEKFKVKTLEPLEAGNGIYLPAGAEVRGHISHVESAGVAGRAKLWLTFDEIRTKFGTLPIVAEVVSVPGDHSVKSGTTQEGLIAGRS
Ga0137391_1021749113300011270Vadose Zone SoilMERMKFRSVFASVAILTMALALGVACKAQEQSQEPSREATTPVPDQQRQKADGVRHGLPIETYAVAPGTKFLVRLEDELGTKGTKENEKFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKLWLTFDEIRTKFGTLPIVAEVVSVPGDHSVKPGTAREGLIEGRSSNQHDAAEAAAA
Ga0137393_1035353923300011271Vadose Zone SoilMERMKLSNVPAKIAILTMTLALGAACKAQEQSQEPSREATTQASEQPRQKADGGRHGLPIETYAVAPGTKFLVRLEEELGTRGTKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKMWLTFDEIHTKFGSLPIVAEIVSVPGDHSVKPGGTQEGLIEGRSSNQHDAAEAAAAGAAIGAV
Ga0137393_1045112913300011271Vadose Zone SoilMVILAVILASGIVAAAQEAPQEPTRDAASPVPEQPRQKADGERHGLPLETYAVVPGTKFLVRLEDELSTRNNEENRKFKVKTLEPLEAGSGIYLPPGAEIHGHISRVEPAGVAGRAKLWLTFDEIRTKFGKLPIVAEVVSVPGDHSVKSSQTQEGLIEGRSST*
Ga0137389_1054949523300012096Vadose Zone SoilMERKKLRSVFASIAVLTMTLAMGVACQAQEQTQEPIREATTPAPDQQRQKADGERHGLPLETYAVVPGTKFLVRLEDELGTRGTKENERFKVKTLEPLEAGSEIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTKFGTLPIVAEVVSVPGDHSVKPGTAKE
Ga0137389_1084589913300012096Vadose Zone SoilMERKKLSSVSASIAILTMTLALGVVGRAQDQTQERPREATTQAPEQQHQKADGERHGLPVETYAVAPGTKFLVRLEDELGTKGTRENEKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKIWLTYDEIHTKFGTLPITAEVVSVPGDHSVKSGATQEGLIEG
Ga0137389_1103406313300012096Vadose Zone SoilMCIRYLLSNSREETMNRRKLSSVFGSVAILTLSMALGDASKAQDQTQEQVREATSQSPEQQRLKADGERHGLPVETFAVAPGTKFLVRLEEELGTRGTKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKIWLTFDEIRTKFGTLPIVAEVVSVPGDHSVKPGGAQEGLIEGR
Ga0137389_1120097213300012096Vadose Zone SoilMFFNPLRQPQTARNGFLCAFPCFLSILEETMERKKLSIASASVAAFMMTLALGVVGRAQDQPQEPSREPATQTPEQRQKADGELHGLPIETYAVAPGTKLLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTKF
Ga0137389_1154050813300012096Vadose Zone SoilEETMERPKIGTVTASVAILAVTLASGIVAAAQEAPQEPTRDTASPVPEQPRQKADGERHGLPLETYAVVPGTKFLVRLEDELSTRNNEENRKFKVKTLEPLEAGSGIYLPPGAEIHGHISRVEPAGVAGRAKLWLTFDELRTKFGKLPIVAEVVSVPGDHSVKSSQTQEGLIEGRSSPQHDAAQAA
Ga0137388_1020888133300012189Vadose Zone SoilMERKKLRSVFASIAVLTMTLAMGVACQAQEQTQEPIREATTPAPDQQRQKADGERHGLPLETYAVVPGTKFLVRLEDELGTRGTKENERFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKIWLTFDEIHTKFGSLPIVAEVVSVPGDHSVKSGTTREGLIEGRSSKQHDAA
Ga0137364_1077656813300012198Vadose Zone SoilMFFNPLRCLEAASNGSPSALPSLLSISREETMERKTLRSVFVGFAILATALVLAGTCRAQDQSSETPREATAQAPEQPRRKADGERHGLPVETYAVAPGTKFLVRLEEELGTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEILGHVSHVEPAGVAGRAKIWLTFDEIRTKFGSLPIV
Ga0137383_1010911033300012199Vadose Zone SoilMAVLAMILISGIACAAQEPTQAPMQQPTAPMPEPLRQKADGERHGLPLETYAVAPGTKFLVRLEDELNTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDEIRTKFGVLPIVAEVVSVPGDHSVKPGPT
Ga0137383_1016263833300012199Vadose Zone SoilMNRLRVRVFFISTPILAVTLSAAIALRAQDPPQEASKETIAQAPETPRHRVDGERHGLPPETFAVAPGTKFLVKLEDDLSTKGTQENSTFRVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAAVAGRAKLWLTFDEIHTRFGDLP
Ga0137363_1091304813300012202Vadose Zone SoilMILSSGFPGFAQETAQEPERQAPVRAPDSPRQKADGERRGLPLETYAVAPGTKFLVRLEDELGTKGTQENAKFKVRTLEPLEAGSGIYLPSGAEIQGHISHVEPAGVAGRAKLWLTFDEIHTKFGRLPIVAEVVSVPGDHSVKSGTS
Ga0137399_1032159023300012203Vadose Zone SoilMERMKLRNVTAGVTVLALTLVWGSVSNAQEQTQEQSRETTPPAPEQLRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTKENERFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVSVPGDHSVKPLPQKEGLI
Ga0137399_1048782413300012203Vadose Zone SoilMAGFPDSTWGNSGCLSRGTSKCMFFNPLRQPQTARNGFLCAFPCFLSILEETMERKKLSIASASVAALMMTLALGVVGRAQDQTQEPVREATSQSPEQQRLKADGERHGLPVETFAVAPGTKFLVRLEDELGTKGTQENARFKVKTLEPLEAGSGIYLPPGAEIRGHISHLEPAGVAGRAKLWLTFDEIHTKFGSLPIVAEV
Ga0137362_1092152013300012205Vadose Zone SoilMFFNPLQRLRRPPDWPCACLPELLSISLEETMKRRKIGSVSGGIAILTTALALGIASKAQEQSQETSREASTQTSEQPRQKADGERHGLPIETYAVVPGTKFLVRLEDELGTKGTKENERFKVKTLEPLEAGNGIYLPAGAEVRGHISHIEPAGVAGRAKLWLTFDEIHTKFGDLPIVAEVVSVPGDHSVKTGPQKE
Ga0137380_1051855813300012206Vadose Zone SoilMAVLAMILTSGIACAAQEPTQAPMQQPTAPMPEPLRQKADGERHGLPLETYAVAPGTKFLVRLEDELNTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDDIRTKFGALPIVAEVVSVPGDHSVKPGPTQEGLIAGRSSTQHDAAEAA
Ga0137377_1062966223300012211Vadose Zone SoilMERMKLRNVAAGVAVLAVALGFASAGKAQEQTQELSRETTTPAPEQLRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVSVPGDHSVKSAPRKEGLIEGRSSTQ
Ga0137377_1083703013300012211Vadose Zone SoilMNHLRIRAFFVVPILAVVVGTAIASHAQEPPQEPSKETTAQTPEPPRHRADGELHGLPLETFAVTPGTKFLVSLQDDLSTKGTQENAPFKVKTLEPLEAGSGFYLPSGAEIIGHISRVEPAGVAGRAKLWLTFDEIRTKFGDLPIVAEVANVPGDHSVKPVPNQEGVIAGR
Ga0134028_130315813300012224Grasslands SoilNMEPRKLKSVSAGIAILTITLLPGVVGVAQEPVDGPERQAPMQPAADQRRQKADGERHGLPIETFAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTKFGDLPIVAEVVSVPGDHSVKSGPQKEGLIEGRSS
Ga0137387_1054464823300012349Vadose Zone SoilVLLSFSQEETMKRAQIGTVSAGMAVLAMILTAGIACAAQEPTQAPTQEPAAPMSEPPRQKADGERHGLPLETYAVAPGTKFLVRLEDELNTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDEIRTKFGVLPIVAEVVSVPGDH
Ga0137386_1037602913300012351Vadose Zone SoilMERKKVSIVSAGVAILTMTLGLGATGKAQDQSQEPPREATTQAPDQQRQKADGERHGLPMETYAVAPGTKFLVRLEEELGTRGTQENAKFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKMWLTFDEIRTKFGSLPIVAEVVSVPGDHSVKPAPRQEGLIEGR
Ga0137386_1105375413300012351Vadose Zone SoilSLSQEETMKRPQIGTVSAGMAVLAMILTAGIACTAQEPTQAPTQEPTAPMSEPPRQKADGERHGLPLETYAVAPGTKFLVRLEDELSTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDDIRTKFGVLPIVAEVVSVPGDHSVKPGPTQEGLIAGRSSTQHDAAEAAA
Ga0137361_1175297313300012362Vadose Zone SoilRKLSSLSASVPILTMTLALGVACRAQEQTQEPPRETTAPVPEQQRQKADGERHGLPIETYAVAPGTKFLVRLEEELGTRGTKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKIWLTFDEIRTKFGTLPIVAEVVSVPGDHSVKPGGAQEGLIEGRSSNQHDAAEAA
Ga0137358_1003344253300012582Vadose Zone SoilMQLSKFGNAFAVAASLGMTVLSGIPSFAQETAQEQERQATVRAPDSPRQKADGERRGLPLETYAVAPGTKFFVRLEDELGTKGTQENAKFKVRTLEPLEAGSGIYLPSGAEIQGHISHVEPAGVAGRAKLWLTFDEIHTKFGRLPIVAEVVSVPGDHSVKSGTSQEGLIEGRSSNQHDASQA
Ga0137395_1007996713300012917Vadose Zone SoilMEQRKLNSVSAGIAILTITLLPGVVGVAQEPVDGPARQAPMQTADQQRQKADGERHGLPIETFAVAPGTKFLVRLEDELGTRGPKENEKFKVKTLEPLEAGSGIYLPSGAEICGHISHVEPAGVAGRAKLWLTFDEIRTTFGALPIVAEVVGVPGDHSVKPGPRQEGLIEGRSSNQHD
Ga0137396_1014044813300012918Vadose Zone SoilMCIPSLLSNSREETMERRRFSSVFGSVAILTMTLALGVACGAQEQTPEASREATTQAPEQQRQKADGERHGLPIETYAVAPGTKFLVRLEEELGTKGTKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHVSHVEPAGVAGRAKMWLTFDEIHTKFGTLPIVAEVASVPGDHSVKPGGTQEGLIEGRSSNQHDAAEAAAAGAAIGAVKGVKDKNKKEA
Ga0137359_1030096023300012923Vadose Zone SoilVLLGFSQEETMKRPQIGTVSAGMAVLAMILTSGIACVAQEPTQTATQEPTAPMSEPPRQKADGERHGLPLETYAVAPGTKFLVRLEDELNTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDEIRTKFGVLPIVAEVVSVPGDHSVKPGPTPEGLIAGRSSTHHDAAEAAAAG
Ga0137419_1011452843300012925Vadose Zone SoilMTVLSGIPSFAQETAQEQERQAPVRAPDSPRQKADGERRGLPLETYAVAPGTKFLVRLEDELGTKGTQENAKFKVRTLEPLEAGSGIYLPSGAEIQGHISHVEPAGVAGRAKLWLTFDEIYTKFGRLPIVAEVVSVPGDHSVKSGTSQEGLIEGR
Ga0137419_1049128323300012925Vadose Zone SoilMYFFQYFAAPLACRSWLPKCIHHLRSISREEIVERKKLSSVAAGITILAMTLAPGFLCLAQETAQEPARETTGQAPELQRQKADGERRGLPLETYAVAPGTKFLVRLEDELGTRGAKENEKFKVKTLEPLEAGSGIYLPPGAEIRGHVSHVEPAGVAGRAKLWLTFDEIRTKFGSLPIVAEVVSVPGDHSVKSGTPQE
Ga0137419_1100318013300012925Vadose Zone SoilMCIPSLLSNSREETMERRRFSSVFGSVAILTMTLALGVACGAQEQTPEASREATTQAPEQQRQKADGERHGLPIETYAVAPGTKFLVRLEEELGTKGTKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHVSHVEPAGVAGRAKMWLTFDEIHTKFGTLPIVAEVASVPGDHSVKPGGTQEGLIEGRSSNQHDAAEAAAAGAAIGAVKGVKD
Ga0137416_1064094713300012927Vadose Zone SoilMFFNPLRQPQTARNGFLCAFPCFLSILEETMERKKLSIASASVAALMMTLALGVVGRAQDQTQEPVREATSQSPEQQRLKADGERHGLPVETFAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKIWLTFDEIHTKFGDLPIVAEV
Ga0137410_1188739413300012944Vadose Zone SoilAASIAILTMTLAPGFLCLAQETAQEPARETTGQAPEQQRQKADGERRGLPLETYAVAPGTKFLVRLEDELGTKGAKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHVSHVEPAGVAGRAKLWLTFDEIRTKFGRLPIVAEVVSVPGDHSVKSGTAQEGLIEGRSSNQHDAAEAAAA
Ga0164303_1038737123300012957SoilMKLSVCRNATVMAASLTMILLGGIPSTAKESAQEQERPAPAQAPEPTRQRADGERRGLPLEAYAVVPGTKFLVRLEDDLGTKGTQENAKFKVRTLEPLEAGSGIYLPAGAEIHGHISHVEPAGVAGRAKLWLTFDEIRTKFGNLPIVAEVVSVPGDHSVRPGPTREGLIEGRSSTQQDAAESAAAGA
Ga0134081_1027687413300014150Grasslands SoilSVSAGIAILLLPGVLGVAQEPVDGPARQAPMPAADQQRQKADGERHGLPMETFAVVPGTKFLVRLENELGTMGPKQNEKFKVKTLEPLEAGSGIYLPSGAEICGHISHVEPAGVAGRAKLWLTFDEIRTTFGALPIVAEVVSVPGDHSVKPGPRQEGLIEGRSSNQHDAAEAAAAGAAVGAVKGVKDKDKDKKEAAEG
Ga0134075_1037486813300014154Grasslands SoilMAVLAMILTAGIACTAQEPTQAPTQEPTAPMSEPPRQKADGERHGLPLETYAVAPGTKFLVRLEDELNTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDEIRTKFGVLPIVAEVVSVPGDHSVKPGPTQE
Ga0134078_1033576813300014157Grasslands SoilMNGLKLRAIFIGAPMLVVALSGAISLSAEEPVEGPSNGRAAQGDDPSRHPADGERHGLPPETFAVTPGTKFLVRLEDELSTKGTQENAVFKVKTLEPLEAGSGFYLPAGAEIVGHISRVEPARVAGHAKLWLTFDEVHTTFGDLPI
Ga0137414_106157723300015051Vadose Zone SoilMEQKRINSMSGGIAFLTMTLALGIACKAQERSQESSREAATQASEQPRQKADGERHGLPVETYAVVPGTKFLVRLEDELGTKGTKENERFKVKTLEPLEAGNGIYLPAGAEVRGHISHVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVDVPGD
Ga0137412_1093308713300015242Vadose Zone SoilMERMKLRNATAGVTVLALTLVWGSVSNAQEQTQEQSRETTTPAPEQLRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTKENERFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVSVPGDHSVKSLPQKEGLIE
Ga0137409_1074521013300015245Vadose Zone SoilMTVLSGIPSFAQETAQEQERQAPVRAPDSPRQKADGERRGLPLETYAVAPGTKFLVRLEDELGTKGTQENAKFKVRTLEPLEAGSGIYLPSGAEIQGHISHVEPAGVAGRAKLWLTFDEIYTKFGRLPIVAEVVSVPGDHSVKSGTSQEGLIEGRSSNQHDA
Ga0137403_1009751243300015264Vadose Zone SoilMKLLKFRDALAAAAILAMTPLSGISSFARGTAQEPERQAPVQTPEPQRQKADGERRGLPIEAYAVTPGTKFLVKLEDDLSTKATRENTRFKVKTLEPLEAGSGIYLPSGAEIHGHVSHVEPAGVAGRAKLWLTFDEIRTKFGKLPIVAEVVSVPGDHSVKSATT
Ga0182037_1004358513300016404SoilMNGMKLRTIFASAPMLFVALSAATSLRAQEPVERPSNEPATQGNDQPRHRADGERHGLPLETFAVTPGTKFLVRLEGELSTKGTPENSTFRVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAQVAGRAKLWLTFDEIHTRFGNLPI
Ga0182038_1064838733300016445SoilMNSMKLRTIFASAPMLFVALSAATSLRAQEPVERPSNEPAAQGNDQPRHRADGERHGLPLETFAVTPGTKFLVRLEGELSTKGTPENSTFRVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAQVAGRAKLWLTFDEIHTRFGNLPIVAEVASVPGDHSVKPSPNQE
Ga0187825_1034479313300017930Freshwater SedimentMHRRNVAVRLLASATFVALLAAGTALRAQDQSQEPSKDTPPQPSDQPRHRADGGRHGLPPETFAVTPGTKFLVRLEDDLGTKGAENARFKVRTLEPLEAGSGIYLPSGAEIVGHVSHVESAGMAGRAKLWLTFDEIRTSFGTLPIVAEVVSVPGDHSVKPVANQEGLIEGRSSTQQDAAQAAA
Ga0066662_1016584933300018468Grasslands SoilMERKNVSSVTMGVAILAATLSLGATCKAPEQPQETPREATAQAPDQPRQKADGEQHGLPIETYAVAPGTKFLVRLEDELGTKGTQENAKFRVKTLEPLEAGSGIYLPAGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTKFGDLPIVAEVVSVPGDHSVKSG
Ga0066662_1046659213300018468Grasslands SoilMERRKLSGVFVRVTILVTALVLAGTCRAQDQSSETPREATTQAPDQQRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKIWLTFDEIHTKFGSLPIVAEVVSVPGDHS
Ga0210401_1032192023300020583SoilMKRLEIGFASSRIVVMAMVLAIGIVAKAQELPAPQPEQRQQVADGKRRGLPIETYAVAPGTKFLVKLEDELGTRGTQENAKFKVKTLEPLEAGSGIYLPVGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTKFGVLPIVAEVVSVPGDHSVRSGPTKEGLIEGRSSTQQDAAESAAAG
Ga0210401_1140659813300020583SoilSVELLVMVAANIVIALSPVAAQEPPQENPAPAVQEQPPQRADGRRGDLPLEAYAVAPGTKFLVRLEDELDTKETRENRRFKVRTLEPLEAGSGIYLPTGAEIQGHVSRVESAGIAGRARLWLTFDDIHTKFGKLPIVAEVVSVPGDHSVKTGGPQREGLIEGRTSTQQSAAEAAAAGAAIGAVKG
Ga0210400_1052505423300021170SoilMKRLEIGFASSGMAIMAMVLAIGIVAKAQEPPAPQSEQRQQVADGKRRGLPIETYAVAPGTKFLVKLEDELGTRGTQENAKFKVKTLEPLEAGSGIYLPVGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTKFGVLPIVAEVVSVPGDHSVRSGPTKEGLIEGRSSTQQDAAESAAAGAAIGAVKGVKNKDKKEAAEG
Ga0210410_1060286513300021479SoilMKRTKLGGAFGCVVALTLMLALRITSSAQEQNQERSRESTPGAVDQQRQKADGERHGLPIETYAVAPGTKFLVRLEEELGTRATKENEKFKVKTLEPLEAGSGIYLPSGAEIQGHISHVEPAGVAGRAKLWLTFDEIHTKFGTLPIVAEVVSVPGDHSVKPGGTQEGLIEGRSSNQHDAAEAAAAGAAI
Ga0126371_1026070813300021560Tropical Forest SoilMNQLRVRAFFVIASILTATLSGAIRLRAQEPPQEPGKDTAAQTFEQPRHRADGERHGLPLETFAVTPGTKFLVSLQEELSSKGTQENAVFKVKTVEPLEAGSGFYLPSGAEIVGHISRVEPAGVAGRAKIWLTFDEIRTKFGDLPIVAEVASVPGDHSVKPVPEKEGLI
Ga0126371_1102507823300021560Tropical Forest SoilMNHSKVRAFFVVTTIFGVVFGGAVTSRAQELPPEPIRDPARQAPEQQRHRADGERHGLPLETFAVTPGTKFLVRLQDELNSKGTQENAVFKVKTLEPLEAGSGIYLPSGAEIVGHISRVESAGVAGRAKLWLTFDEIHTKFGDLPIVAEVAGVPGDHSVKPVPDQEGLIQGRSSTQQDAAAA
Ga0137417_139878043300024330Vadose Zone SoilMERRRFSSVFGSVAILTMTLALGVACGAQEQTPEASREATTQAPEQQRQKADGERHGLPIETYAVAPGTKFLVRLEEELGTKGTKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHVSHVEPAGVAGRAKMWLTFDEIHTKFGTLPIVAEVASVPGTTA
Ga0209240_121610413300026304Grasslands SoilYQLSNSREETMNRRKLSSVFGSVAILTLSMALGDTSKAQDQTQEPVREATSQSPEQQRLKADGERHGLPIETYAVAPGTKFLVRLEEELGTRGTKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKIWLTFDEIHTKFGTLPITAEVVSVPGDHSVKSGATQEGLIEGRSSNQHDAAEAA
Ga0209055_103979133300026309SoilMEPRKLNSVSAGIAILTVTLLPGVVGGAQEPVDGPARPAPMQAADQLRQKADGERHGLPIETFAVAPGTKFLVRLEDELGTKGPKENEKFKVKTLEPLEAGSGIYLPSGAEICGHISHVEPAGVAGRAKLWLTFDEIRTTFGALPIVAEVVSVPGDHSVKPGPRRKV
Ga0209268_1003394103300026314SoilMNCLGVRPFVVSTSILAIVLSGAIALRAQEPPKETTSQAPEQPRHRADGERHGLPLETFAVTPGTKFLVRLEAELGTKSTQENSKFRVKTLEPLEAGSGIYLPSGAEIVGHVSRVEPAGVAGRAKLWLTFDEIHTQFGDLPIVAEVSSVPGDHSVKPVPNQEGLIAGRSSTQQDAAA
Ga0209154_104067233300026317SoilMNYLRVRAFFVGTSILAVVLTGASALCAQEPAQESSKQTAAQPPEPPRHRVDGERHGLPPESFAVVPGTKFLVSLQDELGTKGRQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSVPGDHSVKPVPNQEGL
Ga0209158_100718983300026333SoilMERRKLSGVFVRVTILVTALVLAGTCRAQDQSSETPREATTQAPDQQRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKETQENAKFKVKTLEPLEAGSGVYLPPGAEIQGHISHVEPAGVAGRAKLWLTFDEIRTKFGSLPIVAEVVSVPGDHSVKPGNAKEGLIEGRSSNQHDAAEAAAAGAAIGAVKGVKDKNKKEAAEG
Ga0209059_110408313300026527SoilMNYLRVRAFFVGTSILAVVLTGASALCAQEPAQESSKQTAAQPPEPPRHRVDGERHGLPPESFAVVPGTKFLVSLQDELGTKGRQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSVPGDHSV
Ga0209059_110533313300026527SoilMNYLRVRAFFVGASILAVVLTGASAWCAQETAQESSKQTAEPPRHRVDGERHGLPLESFAVVPGTKFLVSLQDELGTKGTQENSTFKVKTLEPLEAGSGFYLPSGAEIEGHVSRVEPAGVAGRAKIWLTFDEMHTNFGNLPIVAEVVSVPGDHSV
Ga0209056_1057625013300026538SoilMEPRKLKSVSAGIAILTITLLPGVVGVSQEPVDGPARQAPTQAADQQRQKADGERHGLPIETFAVAPGTKFLVRLENELGTRDPKENEKFKVKTLEPLEAGSGIYLPSGADICGHISHVEPAGVAGRAKLWLTFDEIRTTFGALPIVAEVVSVPGDHSVKPGPRQEGLIEGRSS
Ga0209156_1049946613300026547SoilVLSEAIALRAQEPSQEPNKETIPQASEPARYKVDGERHGLPPETFAVAPGTKFLVGLEDGLNTKGTQENSPFKVKTLEPLEAGSGFYLPPGAEIVGHVSRVEPAGVAGRAKLWLTFDEMHTKFGALPIVAEVVSVPGDHSVKTVPHQEGLIAGRTGTQQDAAAAAAA
Ga0209648_1069759313300026551Grasslands SoilPRKLGSVSASIAILTMTLVPGFVCLAQETAQEPARETTAQAPEQQRQKADGERRGLPIETYAVTPGTKFLVKLEDELSTKGTPENGKFKVKTLEPLEAGSGIYLPAGAEIGGHVSRVEPAGVAGRAKLWLTFDEIRTKFGKLPIVAEVVSVPGDHSVKSGQEQAGLIEGRSSTQHDAAEAAAAG
Ga0179587_1093387713300026557Vadose Zone SoilKLSNVPAKIAILTMTLALGAACQAQEQSQEPSREATTQASEQPRQKADGERHGLPIETYAVAPGTKFLVRLEEELGTRGTKENEKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKMWLTFDEIHTKFGSLPIVAEIVSVPGDHSVKPGGTQEGLIEGRSSNQHDAAEAAAAGAAIGAVKG
Ga0209214_101655313300027071Forest SoilMNGLKLRAIFIGAPMLVVALSGAISLSAEEPVEGPSNGRAAQGDDPSRHPADGERHGLPLETFAVTPGTKFLVRLEDELSTKGTQENSTFKVKTLEPLEAGSGFYLPSGAEIAGHISRVEAAGVVGRAKIWLTFDELHTKFGDLP
Ga0209117_105101513300027645Forest SoilMERKKLSNVCASIAVLAMTLAPGFVCLAQEAAQEPARETTGQAPEQLRQKADGERRGLPLETYVVAPGTKFLVRLEDELGTKGARENEKFKVKTLEPLEAGSGIYLPPGAEIHGHVSHVEPAGVAGRAKLWLTFDEIRTKFGRLPIVAEVVSVPGDHSVKSGTAQEGLIEGRSSNQHDSAEAAAAGAAIGAVKGVKHKDKKE
Ga0209588_104266713300027671Vadose Zone SoilMKLSKSRDLLKAAATLAMILSSGFPGFAQETAQEPERQAPVRAPDSPRQKADGERRGLPLETYAVAPGTKFLVRLEDELGTKGTQENAKFKVRTLEPLEAGSGIYLPSGAEIQGHISHVEPAGVAGRAKLWLTFDEIHTKFGRLPIVAEVVSVPGDHSVKSGTSQEGLIEGRSSNQH
Ga0209118_100516213300027674Forest SoilMKRLKMGSTSAGTAILTMILASVVVCVAQEPAPETTQDSSAPKPQKADGERRGLPLETYAVAPGTKFLVRLEDVLNTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISRVEPAGVAGRAKLWLTFDEIHTKFGKLPIVAEVVSVPG
Ga0209118_114422313300027674Forest SoilMMLSRVRNALAATAILALIVPFGIPCFAQETVQEPERQASVQAPEPQRQKADGERRGLSIETYAVAPGTKFLVKLEDGLSTKGTQENAKFKVRTLEPLEAGSGIYLPPGAEIQGHISHVEPAGVAGRARLWLTFDEIHTKFGRLPIVAEVVSVPGDHSVKSGAKQEGLIEGRSSNQHDAAQA
Ga0209073_1022408513300027765Agricultural SoilMNRPVVRSFFTITPILAVALGGGIASRAQEPPQEPTKEVAAQGPDLRHRADGERHGLPLETFAVTPGTKFLVSLQEDLTSKGTQENATFKVKTLEPLEAGSGFYLPAGAEIVGHISRVESAGVAGRAKLWLTFDEIHTKFGDLPIVAEVSDVPGDHSVKPVPEKEGLIQGR
Ga0209073_1042366713300027765Agricultural SoilMERRKLSGVTVRVTLLATALGLAGTCRAQDQSSETPREATAQAPEQPRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIQGHISHVEPAGVAGRAKLWLTFDEIRTKFGSLPIVAEVVSVPGDH
Ga0209180_1009518033300027846Vadose Zone SoilMERMKLRNVTAAVTVLALTLVWGSVGNAQEQTQEQSRETPTPAPEQLRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTKGTKENERFKVKTLEPLEAGSGIYLPSGAEIHGHISHVEPAGVAGRAKLWLTFDEIHTRFGDLPIVAEVVSVPGDHSVKSLPQKEGLIEGRSS
Ga0209701_1006301843300027862Vadose Zone SoilMERRKLSSVFASIAILAMMCASGVVCLAQETTQEPARETTRQAPEQLRQKADGERHGLPIETYAVAPGTKFLVRLEDELGTRGTKENEKFKVKTLEPLEAGNGIYLPAGAEVRGHISHVEPAGVAGRAKLWLTFDEIHTKFGSLPIVAEVVSVPGDHSVKPGGTQEGLIEGRS
Ga0209701_1064777213300027862Vadose Zone SoilGVVCLAQETTQEPARETTRQAPEQLRQKADGERHGLPVETYAVAPGTKFLVRLEDDLGTKATKENDRFKVRTLEPLEAGNGIYLPPGAEVRGHISHVEPAGVAGRAKIWLTFDEIHTKFGTLPITAEVVSVPGDHSVKPGGTQEGLIEGRSSNQHDATEAAAAGAAIGAVKGVKDKNKKEAAEG
Ga0209590_1023491613300027882Vadose Zone SoilMMETADSKRSHVVQVSDETSKSLYFQSLVKGIARQSWLQKCSPVLLGFSQEETMKRPHIGTVSAGMAVLAMILTSGIAPAAQEPTQGATQESTGPMSEPPRQKADGERHGLPLETYAVAPGTKFLVRLEDELNTKGTQENAKFKVKTLEPLEAGSGIYLPPGAEIHGHISHVEPAGVAGRAKLWLTFDEIRTKFGVLPIVAEVVSVPGDHSVKPGPTQEGLIAGRSSTQHDAAEAAAAGAAIG
Ga0307475_1040578013300031754Hardwood Forest SoilMERKKLSNAPAGVAILTMTLALGVASRAQEQPQEPSRETTMQPSDQPRQKADGERHGLPVETYAVAPGTKFLVRLEEELGTRGTKENEKFKVKTLEPLDAGSGIYLPPGAEIHGHISHVEPAGVAGRAKMWLTFDEIHTKFGTLPIVAEVVSVPGDHSVKPAGTQEGLIEGR
Ga0307478_1181181513300031823Hardwood Forest SoilMEGKKLGSAFASVAILAGTLGLGAMCKAQDQSQEPPREATGQAPDQQRQKVDGERHGARLEEFAVVPGTKFLVRLEDELSSKETHENQRFKVRTLEPLEAGSGNYLPPGAEISGHVSRVEPAGIVGMAKIWLTFDEIHTKFGKLPIVAEV
Ga0307471_10220915313300032180Hardwood Forest SoilMNNLSVRAFFVRTSIFAFALGTAIASRAQEPPQEPSKETAAQAPEQPHHRADGERHGLPLETFAVTPGTKFLVSLQDDLSTKGAQENAVFKVKTLEPLEAGSGFYLPSGAEIVGHISRVEPAGVAGRAKLWLTFDEIRTKFGDLPIVAEVASVPGDHSVKPVPNQEGLIAGRSSTQQDAAAAAAAGAAI
Ga0306920_10305547413300032261SoilMNRPRVRASIVTSLLAVSLTGAIALRAQEPTQDPTKEASVQVPEPPRHRADGERHGLPLETFAVTPGTKFLVSLQDELGTKGTQENSRFKVKTLEPLEAGSGFYLPSGAEIEGHISRVEPAGVAGRAKIWLTFDEIHTRFGSLPIVAEVVSVPGDHSVKPVPNQEGLIQGHSSTQQD




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice