NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F104540

Metagenome / Metatranscriptome Family F104540

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104540
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 271 residues
Representative Sequence VKSIRGSSAVLFAILIIVGIIWFPLCVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISPEDRLRLTKLAVAQAGATVSLCELKNCTN
Number of Associated Samples 69
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 45.00 %
% of genes near scaffold ends (potentially truncated) 45.00 %
% of genes from short scaffolds (< 2000 bps) 60.00 %
Associated GOLD sequencing projects 55
AlphaFold2 3D model prediction Yes
3D model pTM-score0.38

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (73.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(65.000 % of family members)
Environment Ontology (ENVO) Unclassified
(59.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(65.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 15.41%    β-sheet: 24.66%    Coil/Unstructured: 59.93%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.38
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF03486HI0933_like 25.00
PF11897DUF3417 17.00
PF01590GAF 9.00
PF13185GAF_2 4.00
PF13620CarboxypepD_reg 2.00
PF00072Response_reg 1.00
PF08238Sel1 1.00
PF13302Acetyltransf_3 1.00
PF01292Ni_hydr_CYTB 1.00
PF08309LVIVD 1.00
PF01042Ribonuc_L-PSP 1.00
PF12848ABC_tran_Xtn 1.00
PF03480DctP 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG0493NADPH-dependent glutamate synthase beta chain or related oxidoreductaseAmino acid transport and metabolism [E] 50.00
COG06542-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductasesEnergy production and conversion [C] 50.00
COG0029Aspartate oxidaseCoenzyme transport and metabolism [H] 25.00
COG0446NADPH-dependent 2,4-dienoyl-CoA reductase, sulfur reductase, or a related oxidoreductaseLipid transport and metabolism [I] 25.00
COG0492Thioredoxin reductasePosttranslational modification, protein turnover, chaperones [O] 25.00
COG0644Dehydrogenase (flavoprotein)Energy production and conversion [C] 25.00
COG1053Succinate dehydrogenase/fumarate reductase, flavoprotein subunitEnergy production and conversion [C] 25.00
COG1249Dihydrolipoamide dehydrogenase (E3) component of pyruvate/2-oxoglutarate dehydrogenase complex or glutathione oxidoreductaseEnergy production and conversion [C] 25.00
COG2072Predicted flavoprotein CzcO associated with the cation diffusion facilitator CzcDInorganic ion transport and metabolism [P] 25.00
COG2081Predicted flavoprotein YhiNGeneral function prediction only [R] 25.00
COG2509FAD-dependent dehydrogenaseGeneral function prediction only [R] 25.00
COG3634Alkyl hydroperoxide reductase subunit AhpFDefense mechanisms [V] 25.00
COG0251Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 familyDefense mechanisms [V] 1.00
COG1969Ni,Fe-hydrogenase I cytochrome b subunitEnergy production and conversion [C] 1.00
COG2864Cytochrome b subunit of formate dehydrogenaseEnergy production and conversion [C] 1.00
COG3038Cytochrome b561Energy production and conversion [C] 1.00
COG3658Cytochrome b subunit of Ni2+-dependent hydrogenaseEnergy production and conversion [C] 1.00
COG4117Thiosulfate reductase cytochrome b subunitInorganic ion transport and metabolism [P] 1.00
COG5276Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domainFunction unknown [S] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms73.00 %
UnclassifiedrootN/A27.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005534|Ga0070735_10000730All Organisms → cellular organisms → Bacteria33968Open in IMG/M
3300007255|Ga0099791_10026857Not Available2504Open in IMG/M
3300007258|Ga0099793_10096740All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1362Open in IMG/M
3300007265|Ga0099794_10069247All Organisms → cellular organisms → Bacteria → Acidobacteria1726Open in IMG/M
3300007788|Ga0099795_10150704Not Available952Open in IMG/M
3300009143|Ga0099792_10029811All Organisms → cellular organisms → Bacteria2518Open in IMG/M
3300009143|Ga0099792_10220073Not Available1090Open in IMG/M
3300009143|Ga0099792_10292745Not Available963Open in IMG/M
3300010159|Ga0099796_10021775All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1979Open in IMG/M
3300011271|Ga0137393_10238555All Organisms → cellular organisms → Bacteria1541Open in IMG/M
3300012202|Ga0137363_10153094All Organisms → cellular organisms → Bacteria → Acidobacteria1813Open in IMG/M
3300012202|Ga0137363_10209425All Organisms → cellular organisms → Bacteria1568Open in IMG/M
3300012203|Ga0137399_10315816All Organisms → cellular organisms → Bacteria1293Open in IMG/M
3300012203|Ga0137399_10338447All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1248Open in IMG/M
3300012205|Ga0137362_10331921Not Available1317Open in IMG/M
3300012208|Ga0137376_10477333All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1082Open in IMG/M
3300012211|Ga0137377_10097073All Organisms → cellular organisms → Bacteria2787Open in IMG/M
3300012362|Ga0137361_10290859All Organisms → cellular organisms → Bacteria → Acidobacteria1494Open in IMG/M
3300012362|Ga0137361_10406102All Organisms → cellular organisms → Bacteria1251Open in IMG/M
3300012363|Ga0137390_10283275All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1641Open in IMG/M
3300012582|Ga0137358_10125239All Organisms → cellular organisms → Bacteria1747Open in IMG/M
3300012582|Ga0137358_10564625Not Available764Open in IMG/M
3300012683|Ga0137398_10026343Not Available3233Open in IMG/M
3300012685|Ga0137397_10279838Not Available1243Open in IMG/M
3300012917|Ga0137395_10318158Not Available1105Open in IMG/M
3300012917|Ga0137395_10499673Not Available876Open in IMG/M
3300012917|Ga0137395_10760733Not Available701Open in IMG/M
3300012918|Ga0137396_10394461All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1025Open in IMG/M
3300012922|Ga0137394_10176236All Organisms → cellular organisms → Bacteria1823Open in IMG/M
3300012924|Ga0137413_10059980All Organisms → cellular organisms → Bacteria2240Open in IMG/M
3300012924|Ga0137413_10303884Not Available1116Open in IMG/M
3300012925|Ga0137419_10092772All Organisms → cellular organisms → Bacteria2071Open in IMG/M
3300012925|Ga0137419_10110529All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1921Open in IMG/M
3300012927|Ga0137416_10752179Not Available859Open in IMG/M
3300012929|Ga0137404_10062515Not Available2893Open in IMG/M
3300012944|Ga0137410_10131950All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1886Open in IMG/M
3300012944|Ga0137410_10193973All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1570Open in IMG/M
3300015051|Ga0137414_1041821All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1767Open in IMG/M
3300015051|Ga0137414_1049733Not Available936Open in IMG/M
3300015052|Ga0137411_1008364All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2112Open in IMG/M
3300015054|Ga0137420_1211205All Organisms → cellular organisms → Bacteria4688Open in IMG/M
3300015054|Ga0137420_1397610All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1581Open in IMG/M
3300015241|Ga0137418_10014000All Organisms → cellular organisms → Bacteria7406Open in IMG/M
3300015241|Ga0137418_10209693All Organisms → cellular organisms → Bacteria1678Open in IMG/M
3300015241|Ga0137418_10299718All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1345Open in IMG/M
3300015242|Ga0137412_10012179All Organisms → cellular organisms → Bacteria6948Open in IMG/M
3300015242|Ga0137412_10117730All Organisms → cellular organisms → Bacteria2154Open in IMG/M
3300015245|Ga0137409_10030471All Organisms → cellular organisms → Bacteria5234Open in IMG/M
3300015245|Ga0137409_10055608All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3742Open in IMG/M
3300015245|Ga0137409_10301590All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1408Open in IMG/M
3300015264|Ga0137403_10064874All Organisms → cellular organisms → Bacteria3723Open in IMG/M
3300020170|Ga0179594_10143120Not Available881Open in IMG/M
3300020199|Ga0179592_10026019All Organisms → cellular organisms → Bacteria2625Open in IMG/M
3300020580|Ga0210403_10016986All Organisms → cellular organisms → Bacteria5820Open in IMG/M
3300021086|Ga0179596_10294152Not Available809Open in IMG/M
3300021088|Ga0210404_10162179Not Available1179Open in IMG/M
3300021168|Ga0210406_10215394All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1588Open in IMG/M
3300021170|Ga0210400_10002629All Organisms → cellular organisms → Bacteria16448Open in IMG/M
3300021170|Ga0210400_10026009All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4542Open in IMG/M
3300021170|Ga0210400_10027523All Organisms → cellular organisms → Bacteria4411Open in IMG/M
3300021170|Ga0210400_10039778All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3649Open in IMG/M
3300021170|Ga0210400_10123954All Organisms → cellular organisms → Bacteria2064Open in IMG/M
3300021363|Ga0193699_10057772All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1512Open in IMG/M
3300021479|Ga0210410_10000020All Organisms → cellular organisms → Bacteria295936Open in IMG/M
3300021479|Ga0210410_10110047Not Available2437Open in IMG/M
3300024330|Ga0137417_1137891All Organisms → cellular organisms → Bacteria5684Open in IMG/M
3300024330|Ga0137417_1173404All Organisms → cellular organisms → Bacteria1274Open in IMG/M
3300024330|Ga0137417_1247164All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2967Open in IMG/M
3300026341|Ga0257151_1014534Not Available823Open in IMG/M
3300026361|Ga0257176_1014409Not Available1083Open in IMG/M
3300026557|Ga0179587_10443696Not Available848Open in IMG/M
3300026557|Ga0179587_10605602Not Available721Open in IMG/M
3300027181|Ga0208997_1025773All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium844Open in IMG/M
3300027480|Ga0208993_1032492Not Available929Open in IMG/M
3300027512|Ga0209179_1007612All Organisms → cellular organisms → Bacteria1794Open in IMG/M
3300027587|Ga0209220_1028317All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1502Open in IMG/M
3300027616|Ga0209106_1007217All Organisms → cellular organisms → Bacteria2276Open in IMG/M
3300027633|Ga0208988_1002395All Organisms → cellular organisms → Bacteria4231Open in IMG/M
3300027643|Ga0209076_1010061All Organisms → cellular organisms → Bacteria2406Open in IMG/M
3300027645|Ga0209117_1051537All Organisms → cellular organisms → Bacteria → Acidobacteria1217Open in IMG/M
3300027645|Ga0209117_1056539All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1147Open in IMG/M
3300027660|Ga0209736_1003942All Organisms → cellular organisms → Bacteria4925Open in IMG/M
3300027660|Ga0209736_1036344All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1445Open in IMG/M
3300027663|Ga0208990_1003682Not Available4926Open in IMG/M
3300027669|Ga0208981_1002311All Organisms → cellular organisms → Bacteria4486Open in IMG/M
3300027678|Ga0209011_1000796All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae11510Open in IMG/M
3300027738|Ga0208989_10053748All Organisms → cellular organisms → Bacteria → Acidobacteria1390Open in IMG/M
3300027875|Ga0209283_10211054All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1290Open in IMG/M
3300027882|Ga0209590_10077001All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1936Open in IMG/M
3300027903|Ga0209488_10079782All Organisms → cellular organisms → Bacteria2434Open in IMG/M
3300027986|Ga0209168_10008277All Organisms → cellular organisms → Bacteria6392Open in IMG/M
3300028047|Ga0209526_10016797All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium5076Open in IMG/M
3300028047|Ga0209526_10063961All Organisms → cellular organisms → Bacteria2590Open in IMG/M
3300028047|Ga0209526_10227606All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1283Open in IMG/M
3300028536|Ga0137415_10071787All Organisms → cellular organisms → Bacteria3323Open in IMG/M
3300028536|Ga0137415_10521677Not Available996Open in IMG/M
3300030991|Ga0073994_10026138All Organisms → cellular organisms → Bacteria1530Open in IMG/M
3300031231|Ga0170824_115635948Not Available932Open in IMG/M
3300031962|Ga0307479_10005248All Organisms → cellular organisms → Bacteria11864Open in IMG/M
3300032180|Ga0307471_100221989All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1915Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil65.00%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil16.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil13.00%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.00%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015051Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021363Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3c2EnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026341Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-AEnvironmentalOpen in IMG/M
3300026361Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-BEnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027181Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027480Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027512Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027616Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027633Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027660Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027663Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027669Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027738Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027986Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300030991Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0070735_1000073083300005534Surface SoilVKSIRTSLGQFAIFALAVIIGFSSSAHAQVATGGQFKLSQSVHWGSSILPTGTFIYSIDNAAGATVVRVRQIGGNFTGLFMPQTETEGSDFDSRGIVIATVGEDKFVSSLRTEGRGPVFNFSLPNAETEVEHPGETVTRYLSISKDPALGYFTIFNPANEKISYTEAEHVYLAACETIEREFNRSSPIRPHFTVHLHSDENNLHYPDRDLRLSRWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCTN*
Ga0099791_1002685713300007255Vadose Zone SoilMSILLLTESCQATRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSIRGISAVLFAILIIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0099793_1009674023300007258Vadose Zone SoilMSILLLTESRQAIRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSNRGISAALFAILGIVGINWFPLCVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0099794_1006924723300007265Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGRGPVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCTN*
Ga0099795_1015070423300007788Vadose Zone SoilSGLFAILALARIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSGAGATFVRVRQIGGNFTGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGRGPVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSFPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDIVSPQDRSRLTKLAVSRAGATVSLCELKNCTNQN*
Ga0099792_1002981113300009143Vadose Zone SoilKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKSCTN*
Ga0099792_1022007313300009143Vadose Zone SoilSRQATRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGSSAVLFAILIIVGIIWFPICVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLRYPDRDLRLARWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKTCTN*
Ga0099792_1029274513300009143Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQAAASGQFKLSQSVRWGNSVLPTGTFIYSIDSGAGATFVRVRQIGGNFTGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGQGPVLNFSLPNAETEGAHPGATDTRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSFPIRPHLTVHLHSEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKL
Ga0099796_1002177513300010159Vadose Zone SoilRRLQQRFIQRRDDGVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSGAGATFVRVRQIGGNFTGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGRGPVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCTNQN*
Ga0137393_1023855523300011271Vadose Zone SoilVKSIHGISRLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGRGPVLNFSLPNAEAEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCM
Ga0137363_1015309423300012202Vadose Zone SoilVRYQTAPSPAKIYQRRLGVKSNRGISAALFAILGIVGINWFPIFVHAQASAGGEFRLSQSVHWGRAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLRYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137363_1020942523300012202Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSESRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKSCTN*
Ga0137399_1031581623300012203Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAEAEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEGEFNRSSPIRPHLTVHLHAEENNLHYPDRDLR
Ga0137399_1033844713300012203Vadose Zone SoilAPSPAKIYQRRLGVKSNRGISAALFAILGIVGINSFPIFVHAQASAGGEFRLPQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISPEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137362_1033192123300012205Vadose Zone SoilMSILLLTESCQATRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGSSAVLFAILIIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGDYSYSVESRSGMTVVQVHQMGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137376_1047733313300012208Vadose Zone SoilMSILLLTKSRQAIRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSNRGISAALFAILGIVGINWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGDYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVASLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGGEKISYGEAEKVYLAACETIEREFHQSAPIRPRLTVHLQANENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISREDRLRLAKLAVAQAGATVSLCELKNCTN*
Ga0137377_1009707323300012211Vadose Zone SoilMSILLLTKSRQAIRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSNRGISAALFAILGIVGINWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVATLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGGEKISYGEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISPEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137361_1029085913300012362Vadose Zone SoilMSILLLTKSRQAIRTAKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSIRGISTALFAILVIVGINWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQMGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137361_1040610213300012362Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGRGPVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYSTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKSCTN*
Ga0137390_1028327533300012363Vadose Zone SoilMVFAILAMAGITWLPCSVHAQAPAGGVFQLSQSVRWGNAVLPTGKFIYSVESGAGPIVVRVRQIGGSFTGSFLPQTIAEGSDSNSRGIVLERIGEDMIVTSLRTEGRGLMLNFSPPNAETEGPRPDATQTRYISISRDPALGYFTIYNPGGEKISYSEAEKVYLAACEAIEREFNRSTPIRPRLTVHLHSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISPEDRLRLTKLAVAQAGASVSLCELKDCTN*
Ga0137358_1012523923300012582Vadose Zone SoilVKSIHGISRLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGRGPVVNFSLPNAEAEGAHPGATETRYLSVSKDPTLGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKSCTN*
Ga0137358_1056462513300012582Vadose Zone SoilSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137398_1002634313300012683Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSFPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKSCTN*
Ga0137397_1027983823300012685Vadose Zone SoilMTRIDFVDWTFQQAPATVRMSILLLTESRQAIRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSIRGISTALFAILVIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNGEMGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137395_1031815813300012917Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCTN*
Ga0137395_1049967313300012917Vadose Zone SoilHGVKSNRGISGVVFAILAVTGWGGFPSSLHAQASAGGTFKLSQSVHWGSSVLPTGEYTYSVESAGWPNIVRVSQVGGSFTGVFLPRTTSQYGDSGSKGIVLARLGEEMIVSSFRVEERGLVLNFSPPNPDTEVVRPDATRTQYISITKDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFNRTTPIRPRLTVHLHSTENNLHYPDRDLRLARWDKSMFAEAVVELALHDMISPEDRKRLTKMAVAQAGATVSLCELKNCTN*
Ga0137395_1076073313300012917Vadose Zone SoilIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHHPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTQLAVAQ
Ga0137396_1039446123300012918Vadose Zone SoilAKIYQRRLGVKSIRGSSAVLFAILIIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137394_1017623623300012922Vadose Zone SoilMSILLLTESCQATRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSIRGISTALFAILVIVGIIWFPICVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137413_1005998023300012924Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSFPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELK
Ga0137413_1030388413300012924Vadose Zone SoilGKGASRSSPVIITSLTGPKPAHIVIKSSPRFPECAAERSNSTKFAMTRIDFVDWTFQQAPATVRMSILLLTESCQAIRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGSSAVLFAILIIVGIIWFPICVYAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELK
Ga0137419_1009277223300012925Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHD
Ga0137419_1011052913300012925Vadose Zone SoilLSTGEYTYSVESGSGTTVVRVQQKGGTFTGLFLPKAFSEGGDSGSRGIAVEQIGEEMFVTSLRVEERGLVLNFSPPHADTEVARPDATRTRYISVTRDPALGYFTIFNPGGEKISYSEAEKVYLAACETIEREFNRSTPIRPRLTVHLHSNENSLHYPDRDLRLARWDKNRFAEAVVELVLHDMISPGDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137416_1075217923300012927Vadose Zone SoilWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIQREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137404_1006251523300012929Vadose Zone SoilPAKIYQRRLGVKSNRGISAALFAILGIVGINWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRMTKLAVAQAGATVSLCELKNCTN*
Ga0137410_1013195013300012944Vadose Zone SoilMSILLLTKSRQAIRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSIRGISTALFAILGIVGINWFPICVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGIARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLR
Ga0137410_1019397323300012944Vadose Zone SoilLSTGEYTYSVESGSGTTVVRVQQKGGTFTGLFLPKAFSEGGDSGSRGIAVEQIGEEMFVTSLRVEERGLVLNFSPPHADTEVARPDATRTRYISVTKDPALGYFTIFNPGGEKISYSEAEKVYLAACETIEREFNRSTPIRPRLTVHLHSNENSLHYPDRDLRLARWDKNRFAEAVVELVLHDMISPGDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137414_104182123300015051Vadose Zone SoilMSILLLTESCQAIRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGSSAVLFAILIIVGIIWFPICVYAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137414_104973313300015051Vadose Zone SoilGNGASRSSPVIITSLTGPKPAHIAIKSSPRFPACAAAGRSNSTKFAMTRIDFVDWTFQQAPATVRMSILLLTESCQAIRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGSSAVLFAILIIVGIIWFPICVYAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQS
Ga0137411_100836423300015052Vadose Zone SoilYQRRLGVKSIRGISAVLFAILIIVGIIWFPLCVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137420_121120533300015054Vadose Zone SoilMSILLLTKSRQAIRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSIRGISTALFAILVIVGIIWFPICVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISITRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIQREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137420_139761023300015054Vadose Zone SoilQAIRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSIRGISTALFAILVIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137418_1001400043300015241Vadose Zone SoilMSILLLTKSRQAIRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSIRGISTALFAILVIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137418_1020969313300015241Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSL
Ga0137418_1029971823300015241Vadose Zone SoilMAFTILVIAGVIWFPRSVRGQSLAGGEFRLSQSVHWGSSALSTGEYTYSVESGSGTTVVRVQQKGGTFTGLFLPKAFSEGGDSGSRGIAVEQIGEEMFVTSLRVEERGLVLNFSPPGAGTEIVRPDATRTTYISITKDPALGYFTIFNPGGEKISYSEAEKVYLAACETIEREFNRSTPIRPRLTVHLHSNENSLHYPDRDLRLARWDKNRFAEAVVELVLHDMISPGDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137412_1001217923300015242Vadose Zone SoilMSILLLTESCQAIRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGSSAVLFAILIIVGIIWFPICVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137412_1011773023300015242Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSFPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNC
Ga0137409_1003047133300015245Vadose Zone SoilMSILLLTESCQATRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSNRGISAALFAILGIVGINWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0137409_1005560813300015245Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKSCTN*
Ga0137409_1030159023300015245Vadose Zone SoilLSTGEYTYSVESGSGTTVVRVQQKGGTFTGLFLPKAFSEGGDSGSRGIAVEQIGEEMFVTSLRVEERGLVLNFSPPHADTEVARPDATRTRYISVTRDPALGYFTIFNPGGEKISYSEAEKVYLAACETIEREFNRSTPIRPRLTVHLHSNENSLHYPDRDLRLARWDKNRFAEAVVELVLHDMISPGDRLRLTKLAVAQAGATVLVGRPQKRHRASTPPQRRLRSTSRQR*
Ga0137403_1006487423300015264Vadose Zone SoilMSILLLTESRQAIRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGSSAVLFAILIIVGIIWFPICVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN*
Ga0179594_1014312013300020170Vadose Zone SoilCVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVRQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0179592_1002601923300020199Vadose Zone SoilMTRIDFVDWTFQQAPATVRMSILLLRKSRQAIRTSKFDLRAPIAPIECVRNQTAPSPAKIYQRRLGVKSNRGISAALFAILGIVGINWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVA
Ga0210403_1001698623300020580SoilVKLIRSTLGVFAILALAGITWTPCSLHAQSFAGGEFSLSQSVRWGNAVLPTGKFIYSVENSTGPTVVRVHQIGGDFSGLFLPQTKSEDSDSNPRGIVLSRLGEEMFVTSLRVEERGLVLNFSPPSAETDVPHPGATQTRYVSVTKDPALGYFTIFNPRNEKISYSEAEKVYLAACQTIEREFNRPTPIRPRLTVLLGSVENNLHYPNRELRLARWDKNRFAEAVVELVLHDMISAEDRMRLIKLAVAQAGASVSLCELKDCAN
Ga0179596_1029415213300021086Vadose Zone SoilAILVIVGIIWFPICVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0210404_1016217923300021088SoilVNSIRGISGTLAILALAGIIGFSSSAHAQVSAGGQFKLSQRVRWGNSILPTGTFIYSIESAAGATLVRVRQIGGSFTGLFMPQTESDASSSDSRGIVLATVGEDTFVTALRTEGRGPVLNFSPPNAETEGMHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACQTIEREFNRSAPIRPHLTVHLHSEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQERLRLTKLAVSQAGATVSLCELKNCTN
Ga0210406_1021539413300021168SoilISAVVIAILVVVEMIGFPFSVHAQASAGGQFKLSQSVRWGSAVLPTGEYTYSVESAGWPNIVRVSQVGGNFTGIFLPRTSSLDGDPGSKGIVLARIGEQMFVSSLRVEERGLVLNFLPPNADTEVARPDATRTRYISVTKDPALGNFTIFNPGGEKISYSEAEKVYLAACETIEREFNRSTPIRPRLTVHLHSSENNLHYPDRDLRLARWNKSMFAEAVVELALHDMISAEDRQRLTKLAVAQAGATVSLCELKDCAN
Ga0210400_1000262933300021170SoilVKLIRNTSGVVFAILALSGITWAPCSLHAQSFAGGEFSLSQSVRWGNAVLPTGKFIYSVENGVGPTVVRVRQIGGDFSGLFLPQTKSEDSDANPRGIVLARLGEEMFVTSLRVEERGLVLNFSPPSAETDVPHSGATQTRYVSVTKDPALGYFTIFNPRNEKISYSEAEKVYLAACQTIEREFNRPTPIRPRLTVLLGSVENNLHYPNRELRLARWDKNRFAEAVVELVLHDMISAEDRMRLIKLAVAQAGATVSLCELKECTN
Ga0210400_1002600923300021170SoilVNPFRGMSFVFAILALAGIIGFSSSAHAQVSAGGQFKLSQRVRWGNSILPTGTFIYSIESAAGATLVRVRQIGGSFTGLFMPQTESDASSSDSRGIVLATVGEDTFVTALRTEGRGPVLNFSPPNAETEGMHPGATETRYLSVSKDPALGYFTIFNPANEKIAYDEAEKVYLAACETIEREFNRSSPIRPHFTVHLHSDENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVARAGATVSLCQLKTCPN
Ga0210400_1002752323300021170SoilVHATLDLAVSSKDLLQEAGVKLIRSTLGVFAILALAGITWTPCSLHAQSFAGGEFSLSQSVRWGNAVLPTGKFIYSVENSTGPTVVRVHQIGGDFSGLFLPQTKSEDSDSNPRGIVLSRLGEEMFVTSLRVEERGLVLNFSPPSAETDVPHPGATQTRYVSVTKDPALGYFTIFNPRNEKISYSEAEKVYLAACQTIEREFNRPTPIRPRLTVLLGSVENNLHYPNRELRLARWDKNRFAEAVVELVLHDMISAEDRMRLIKLAVAQAGASVSLCELKDCAN
Ga0210400_1003977833300021170SoilVKRSRSRSIILAILILAGSAGLASSARAQMSEAGQFSLSQSIHWGNTILPTGRFTYSIDTAAGATVVRVRQIGGNFTGLFLPQTESEGSGSNSRGIVLAKLGGDIFVTSLRTEGRGLVLNFSPPNAEPDAAHPGATVTRYLSASRDPALGYFTIFNPANEKISYAEAEKVYLAACQTIEREFNRPDPIRPHLTVHLHSDENNLHYPDRDLRLSRWDKDRFAEAVVELVLHDMVSPQERLRLTKLAVSQAEATVGLCELKTCTN
Ga0210400_1012395413300021170SoilVKSIGNISYIYAILAVAGMIGFSSSARAQIAASGQFKLSQSVRWGNSVLPTGTFVYSIDSAAGATVVRVRQVGGNFTGLFMPQTQSEGSDSDSTGIVLATVGEDTFVTALRTEGRGPVFNFSLPNGQAEGAHPGATETRYLSESKDPALGYFTVFNPANEKISYTEAEKVYLAACETVEREFNRPIPIRPHFTVHLHSEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAG
Ga0193699_1005777223300021363SoilGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATFVRVRQIGGNFTGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGRGPVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCTN
Ga0210410_100000202723300021479SoilVKSIRGISGVVFAIVVMAGWGGFPSSLHAQASVGGEFRLSQSVHWGRSVLPTGEYTYSVESAGWPNIVRVSQVGGSFTGVFLPRTTSQDGDSSSKGIVLARIGEEMFVTSLRVEERGLVLNFLPPHADTEVARPEATRTRYISVTKDPALGYFTIFNPGGEKISYSEAEKVYVAACETIEREFNRSNPIRPRLTVHLHSTENNLHYPDRDLRLVRWDKNKFAEAVVELVLHDMISPEDRQRLIKMAVAQAGATVSLCELKNCTN
Ga0210410_1011004723300021479SoilGMIGFSSSARAQIAASGQFKLSQSVRWGNSVLPTGTFVYSIDSAAGATVVRVRQVGGNFTGLFMPQTQSEGSDSDSTGIVLATVGEDTFVTALRTEGRGPVFNFSLPNGEAEATHPGATETRYLSESKDPALGYFTVFNPANEKISYTEAEKVYLAACETIEREFNRPIPIRPHFTVHLHSEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVGLCELKNCSN
Ga0137417_113789123300024330Vadose Zone SoilVKSNRGISAALFAILGIVGINWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISITRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0137417_117340413300024330Vadose Zone SoilMTRIDFVDWTFQQAPATVRMSILLLTESCQATRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGSSAVLFAILIIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISITRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLRYPDRDLRLARWDKNRFAEAVVEQLVLHDMISSEDRLRLTKLAVAQAGATVSLCDLKNC
Ga0137417_124716423300024330Vadose Zone SoilVKSNRGISAALFAILGIVGINWFPIFVHAQASAGGEFRLSQSVHWGRAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0257151_101453413300026341SoilECVRYQTAPSPAKIYQRRLGVKSIRGISTALFAILVIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISPEDRLRLTKLAVAQAGAT
Ga0257176_101440913300026361SoilPACAAERSNSTKFAMTRIDFVDWTFQQAPATVRMSILLLTESRQAIRTSKFDLRAPIAPIECVRYQTAPSPAKIYQRRLGVKSIRGISTALFAILVIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAEAGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISPEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0179587_1044369613300026557Vadose Zone SoilAIRTLKFDLRAPIAPIECVRFQTAPSPAKFYQRRLGVKSIRGISAALFAILGIVGINWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKSRFAEAVVELVLHDMISSEDRLRLTKLA
Ga0179587_1060560213300026557Vadose Zone SoilAPIECVRLKTAPSPANIYLRRHRVKSIRSIIVVVYAIVAVVGMIGFPFSVHAQASAGGQFRLSQSVHWGSAVLPTGEYTYSVESAGWPNIVRVSQVGGSFKGVFLPRTTSQDGDSASKGIVLARLGEEMFVSSLRVEERGLVLNFSPPNADTEVVRPDATRTQYISITKDPALGYFTIYNPGGEKISYSEAEKVYLAACETIEKEFNRTTPIRPRLTVHLHSNENNLHYPDRDLRLSRWD
Ga0208997_102577313300027181Forest SoilFAILIIVGIIWFPLCVHAQASAGGEFRLSQSVHWGSAVLPTGDYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGGRGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0208993_103249213300027480Forest SoilWPKASAHRYQIEPALPRVRRRAIQFHKVCYDADRFCRLDIPASPCDSEDVNSFADLLLTESCQATRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGISTALFAILVIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGDYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNGETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLRYPDRD
Ga0209179_100761223300027512Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQAAASGQFKLSQSVRWGNSVLPTGTFIYSIDSGAGATFVRVRQIGGNFTGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSFPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCTNQN
Ga0209220_102831723300027587Forest SoilPCSVHAQAPAGGEFSLSQSVHWGNTVLPTGKYMYSVESAGWPTVVRVNQIGGSFTGVFLPQTSLQGGDSGSRGIVLARIGEEMFVTSLRVKERGLVLNFSAPNVEMEVPRSDSTQPHYITVSKDQALGFFTIFNPGNEKISFAEAEKVYVAACETIEREFNLSTPIRPRLTVHLHSNENNLHYPDRDLRLAKWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVTKAGGIVSLCELKDCT
Ga0209106_100721723300027616Forest SoilVKSIRGSSAVLFAILIIVGIIWFPLFVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNGETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLRYPDRDLRLARWDKNRFAEAVVELVLHDMISPEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0208988_100239523300027633Forest SoilVKSIRGSSAVLFAILIIVGIIWFPLCVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQAHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLRYPDRDLRLARWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0209076_101006123300027643Vadose Zone SoilVKSNRGISAALFAILGIVGINWFPLCVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0209117_105153723300027645Forest SoilPCSVHAQAPAGGEFSLSQSVHWGNTVLPTGKYMYSVESAGWPTVVRVNQIGGSFTGVFLPQTSLQGGDSGSRGIVLARIGEEMFVTSLRVEERGLVLNFSAPTVEMEVARSDSTQGHNITVSNDPALGFFTIFNPSNEKISYPEAEKVYLAACETIEREFNLSTPIRPRLTVHLHTNENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISAEDRLRLTKLAVTKAGGIVSLCELKDCT
Ga0209117_105653913300027645Forest SoilVKSIRSISIIVFAIWAVTGITCFPWPVRAQAPAGGEFTLSQSVHWASAVLPTGKYMYSVESGSGPTVVRVRQIGGSFTGLFLPKTLSEAGDSRSRGIVLVRMGEEMFVSSLHLKEHGLVLNFSAPNVEMEVPRLDSTQAHYITVSKDQALGFFTIFNPGNEKISYAEAEKVYLAACETIEREFNLSTPIRPRLTVHLHSNENNLHYPDHDLRLAKWDKNRFAEAVVELMLHDMISSEDRLRLTKLAVTKAGGTVSLCELKDCTN
Ga0209736_100394223300027660Forest SoilMTGFSSSARAQRSEGGEFRLSQSIHWGNTILPTGRFTYSIDTAAGATVVRVRQIGGNFTGLFLPQTESEGGGSDPRGIVLGKVGEDTFVASLRTEDRGLVLNFSPPNAEIDAVHPGATVTRYLSVSRDPALGYFTIFNPANEKIPYTEAEKVYLAACQTIEREFNRSAPIRPHLTVHLHSEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQERLRLTKLAVSQAGATVSMCELKNCTN
Ga0209736_103634423300027660Forest SoilAILAMAGITGFPCSVHAQAPAGGEFSLSQSVHWGNTVLPTGKYMYSVESSGWPTVVRVSQIGGSFTGIFLPQTTLQGGDSGSRGIVLARIGEEMFVTSLRVEERGLVLNFSAPTVEMEVARSDSTQGHNITVSNDPALGFFTIFNPSNEKISYPEAEKVYLAACETIEREFNLSTPIRPRLTVHLHTNENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISAEDRLRLTKLAVTKAGGIVSLCELKDCTN
Ga0208990_100368223300027663Forest SoilVKSIRGISIALFAILVIVGIIWFPLCVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLRYPDRDLRLARWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0208981_100231133300027669Forest SoilVKSIRGSSAVLFAILIIVGIIWFPLCVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISPEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0209011_100079673300027678Forest SoilVALIRRILGVVFAILAMAGITGFPCSVHAQAPAGGEFSLSQSVHWGNTVLPTGKYMYSVESAGWPTVVRVNQIGGSFTGVFLPQTSLQGGDSGSRGIVLARIGEEMFVTSLRVKERGLVLNFSAPNVEMEVPRSDSTQPHYITVSKDQALGFFTIFNPGNEKISFAEAEKVYVAACETIEREFNLSTPIRPRLTVHLHTNENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISAEDRLRLTKLAVTKAGGIVSLCELKDCTN
Ga0208989_1005374813300027738Forest SoilAVLFAILIIVGIIWFPLFVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLRYPDRDLRLARWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKNCTN
Ga0209283_1021105413300027875Vadose Zone SoilVKSIRSISGMVFAILAMAGITWLPCSVDAQAPAGGVFQLSQSVRWGNAVLPTGKFIYSVESGAGPIVVRVRQIGGSFTGSFLPQTIAEGSDSNSRGIVLERIGEDMVVTSLRTEGRGLVLNFSPPNAETEGPRPDATQTRYISISRDPALGYFTIYNPGGEKISYSEAEKVYLAACEAIEREFNRSTPIRPRLTVHLHSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISPEDRLRLTQLAVAQAGASVSLCELKDCT
Ga0209590_1007700133300027882Vadose Zone SoilVKSIRSISGMVFAILAMAGITWLPCSVDAQAPAGGVFQLSQSVRWGNAVLPTGKFIYSVESGAGPIVVRVRQIGGSFTGSFLPQTIAEGSDSNSRGIVLERIGEDMVVTSLRTEGRGLVLNFSPPNEETGGPRPDATQTRYISISRDPALGYFTIYNPGGEKISYSEAEKVYLAACEAIEREFNRSTPIRPRLTVHLHSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISPEDRLRLTKLAVAQAEASVSLCELKDCTN
Ga0209488_1007978223300027903Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGENTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKSCTN
Ga0209168_1000827733300027986Surface SoilVKSIRTSLGQFAIFALAVIIGFSSSAHAQVATGGQFKLSQSVHWGSSILPTGTFIYSIDNAAGATVVRVRQIGGNFTGLFMPQTETEGSDFDSRGIVIATVGEDKFVSSLRTEGRGPVFNFSLPNAETEVEHPGETVTRYLSISKDPALGYFTIFNPANEKISYTEAEHVYLAACETIEREFNRSSPIRPHFTVHLHSDENNLHYPDRDLRLSRWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCTN
Ga0209526_1001679723300028047Forest SoilVIRIRGISIIIAILTLAGMTGFSSSARAQRSEGGEFRLSQSIHWGNTILPTGRFTYSIDTAAGTTVVRVRQIGGNFTGLFLPQTESEGGGSDPRGIVLGKVGEDTFVASLRTEDRGLVLNFSPPNAEIDAVHPGATVTRYLSVSRDPTLGYFTIFNPANEKIPYTEAEKVYLAACQTIEREFNRSAPIRPHLTVHLHSEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQERLRLTKLAVSQAGATVSLCELKNCTN
Ga0209526_1006396123300028047Forest SoilVALIRRILGVVFATLAMAGITGFPCSVHAQAPAGGEFSLSQSVHWGNTVLPTGKYMYSVESAGWPTVVRVNQIGGSFTGVFLPQTSLQGGDSGSRGIVLARIGEEMFVTSLRVKERGLVLNFSAPNVEMEVPRSDSTQPHYITVSKDQALGFFTIFNPGNEKISFAEAEKVYVAACETIEREFNLSTPIRPRLTVHLHTNENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISAEDRLRLTKLAVTKAGGIVSLCELKDCTN
Ga0209526_1022760623300028047Forest SoilVNSIRGISGTLAILALAGIIGFSSSAHAQVSAGGQFKLSQRVRWGNSILPTGTFIYSIESAAGATLVRVRQIGGNFTGLFIPQTESDASSSDSRGIVLATVGEDTFVTALRTEGRGPVLNFSPPNAETEGMHPGATETRYLSVSKDPALGYFTIFNPANEKIAYDEAEKVYLAACETIEREFNRSSPIRPHFTVHLHSDENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVARAGATVSLCQLKTCPN
Ga0137415_1007178723300028536Vadose Zone SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATVVRVRQLGGNFSGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTADRGTVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKSCTN
Ga0137415_1052167713300028536Vadose Zone SoilSTKFAMTRIDFVDWTFQQAPATVRMSILLLTESCQATRTSKFDLRAPIAPIECVRFQTAPSPAKFYHRRLGVKSIRGSSAVLFAILIIVGIIWFPICVHAQSSAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVARSDATQTRYISVTRDPALGYFTIFNPGSEKISYSEAEKVYLAACETIEREFHQSAPIRPRLTVHLQSNENNLHYPDRDLRLSRWDKNRFAEAVVELVLHDMISSEDRLRLTKLAVAQAGATVSLCELKN
Ga0073994_1002613813300030991SoilMSIHLLTESRQAIRTSKFDLRAPIAPIECVRYQTAPSPAKFYHRRLGVKSIRGSSTALFAILVIVGIIWFPIFVHAQASAGGEFRLSQSVHWGSAVLPTGEYLYSVESRSGMTVVQVHQIGGSFAGVFLPRTFSESGDSGSRGIALARIGEEMFVTSLRVGERGLVLNFSPPNAETGVERSDATQTRYISITKDPAVGYFTIFNPGGEKISYGEAEKVYLAACETIEREFHQSAPIRPRLTVHLQANENNLHYPDRDLRLARWDKNRFAEAVVELVLHDMISPE
Ga0170824_11563594813300031231Forest SoilVKSIHGISRLFAILALTGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDSAAGATFVRVRQIGGNFTVLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGRGPVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSSPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCTN
Ga0307479_1000524843300031962Hardwood Forest SoilMKSIRGISGVVFAILAMTGWGGIASSLHAQASAGGTFKLTQSVHWGSSVLPTGEYTYSVESAGWPNIVRVSQVGGSFTGVFLPRTTSQDGDLGSKGIVLARIGEEMFVTSLRVEERGLVLNFSPPNADTEVLRPDATRTRYISINKDPALGYFTIYNPGGEKISYSEAEKVYLAACETIEKEFNRSTPIRPRLTVHLHSNENNLRYPDRDLRLSRWDKNRFAEAVVELVLHDMISPEDRQRLTKMAVAQAGATVNLCELKNCTN
Ga0307471_10022198923300032180Hardwood Forest SoilVKSIHGISGLFAILALAGIIGFSSSAHAQVAASGQFKLSQSVRWGNSVLPTGTFIYSIDGAAGATFVRVRQIGGNFTGLFMPQTESEGSSSDSRGIVLSTVGEDTFVTALRTEGRGPVLNFSLPNAETEGAHPGATETRYLSVSKDPALGYFTIFNPANEKISYTEAEKVYLAACETIEREFNRSAPIRPHLTVHLHAEENNLHYPDRDLRLARWDKDRFAEAVVELVLHDMVSPQDRSRLTKLAVSRAGATVSLCELKNCTN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.