NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F052631



Overview

Basic Information
Family ID F052631
Family Type Metagenome
Number of Sequences 142
Average Sequence Length 151 residues
Representative Sequence MARNNKKVHHPARRRALAVLARMRSRGESLSQAARLEHTTARTVRKLVGKQLKRGPSGRYSATHGDTLRRDLSVLGFDGFEPVVVRSSKQAQLASEHLIAVGRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLRELADAGLVKLDALYREHRGARQEK
Number of Associated Samples 111
Number of Associated Scaffolds 142

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 48.57 %
% of genes near scaffold ends (potentially truncated) 39.44 %
% of genes from short scaffolds (< 2000 bps) 64.08 %
Associated GOLD sequencing projects 101
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.53


Most Common Taxonomy
Group Unclassified (61.972 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(21.127 % of family members)
Environment Ontology (ENVO) Unclassified
(26.761 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(41.549 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 43.01%, β-sheet: 12.90%, Coil/Unstructured: 44.09%

Predicted 3D Structure

pTM-score: 0.53

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 142 Family Scaffolds
PF08281 | Sigma70_r4_2 | 9.86
PF01867 | Cas_Cas1 | 5.63
PF12728 | HTH_17 | 2.11
PF08843 | AbiEii | 1.41
PF00027 | cNMP_binding | 0.70
PF06415 | iPGM_N | 0.70
PF13620 | CarboxypepD_reg | 0.70
PF01339 | CheB_methylest | 0.70
PF03848 | TehB | 0.70
PF04297 | UPF0122 | 0.70
PF05496 | RuvB_N | 0.70
PF02954 | HTH_8 | 0.70
PF05235 | CHAD | 0.70
PF04967 | HTH_10 | 0.70
PF07693 | KAP_NTPase | 0.70
PF07510 | DUF1524 | 0.70
PF00069 | Pkinase | 0.70
PF12651 | RHH_3 | 0.70
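
The frequencies in the Pfam list are percentages of the 142 family scaffolds, so they can be turned back into approximate scaffold counts. The following is a minimal sketch, assuming each percentage was computed over all 142 scaffolds and rounded to two decimals; the function name `pct_to_count` and the dictionary layout are illustrative and not part of NMPFamsDB.

```python
# Hypothetical helper: recover an approximate scaffold count from a
# percentage reported over a known total (here, 142 family scaffolds).
def pct_to_count(pct: float, total: int) -> int:
    """Return the nearest whole count for `pct` percent of `total`."""
    return round(pct * total / 100)


FAMILY_SCAFFOLDS = 142  # from the column header "% Frequency in 142 Family Scaffolds"

# A few (name, percentage) pairs copied from the Pfam list.
pfam_freq = {
    "Sigma70_r4_2": 9.86,
    "Cas_Cas1": 5.63,
    "HTH_17": 2.11,
    "AbiEii": 1.41,
    "cNMP_binding": 0.70,
}

counts = {name: pct_to_count(pct, FAMILY_SCAFFOLDS) for name, pct in pfam_freq.items()}
for name, n in counts.items():
    print(f"{name}: about {n} scaffold(s)")
```

Under this assumption a frequency of 0.70 corresponds to a single scaffold, so most of the neighboring domains in the list were observed only once.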

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 142 Family Scaffolds
COG1518 | CRISPR-Cas system-associated integrase Cas1 | Defense mechanisms [V] | 5.63
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 2.82
COG2201 | Chemotaxis response regulator CheB, contains REC and protein-glutamate methylesterase domains | Signal transduction mechanisms [T] | 1.41
COG2253 | Predicted nucleotidyltransferase component of viral defense system | Defense mechanisms [V] | 1.41
COG0696 | Phosphoglycerate mutase (BPG-independent), AlkP superfamily | Carbohydrate transport and metabolism [G] | 0.70
COG2255 | Holliday junction resolvasome RuvABC, ATP-dependent DNA helicase subunit RuvB | Replication, recombination and repair [L] | 0.70
COG2739 | Predicted DNA-binding protein YlxM, UPF0122 family | Transcription [K] | 0.70
COG3025 | Inorganic triphosphatase YgiF, contains CYTH and CHAD domains | Inorganic ion transport and metabolism [P] | 0.70
COG4928 | Predicted P-loop ATPase, KAP-like | General function prediction only [R] | 0.70
COG5607 | CHAD domain, binds inorganic polyphosphates | Function unknown [S] | 0.70



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 61.97 %
All Organisms | root | All Organisms | 38.03 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300001356|JGI12269J14319_10110155 | Not Available | 1312
3300002908|JGI25382J43887_10087051 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1669
3300003152|Ga0052254_1137728 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1106
3300004080|Ga0062385_10023273 | Not Available | 2373
3300005186|Ga0066676_10741235 | Not Available | 668
3300005327|Ga0070658_10103456 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2355
3300005332|Ga0066388_100933501 | Not Available | 1441
3300005339|Ga0070660_100070968 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2719
3300005445|Ga0070708_100113250 | Not Available | 2495
3300005454|Ga0066687_10658428 | Not Available | 622
3300005555|Ga0066692_10064592 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2079
3300005563|Ga0068855_100241687 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2017
3300006354|Ga0075021_10497158 | Not Available | 772
3300009088|Ga0099830_10993917 | Not Available | 695
3300009088|Ga0099830_11030921 | Not Available | 682
3300009089|Ga0099828_10519565 | Not Available | 1073
3300009090|Ga0099827_11389194 | Not Available | 611
3300009548|Ga0116107_1032258 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1927
3300009549|Ga0116137_1000927 | All Organisms → cellular organisms → Bacteria | 21440
3300010048|Ga0126373_10002823 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 12842
3300010048|Ga0126373_10862349 | Not Available | 969
3300010339|Ga0074046_10021290 | Not Available | 4544
3300010361|Ga0126378_10626599 | Not Available | 1189
3300010366|Ga0126379_11470586 | Not Available | 787
3300010366|Ga0126379_13376048 | Not Available | 535
3300010371|Ga0134125_11108797 | Not Available | 866
3300010376|Ga0126381_100339788 | All Organisms → cellular organisms → Bacteria | 2072
3300010376|Ga0126381_101704214 | Not Available | 910
3300010398|Ga0126383_10032088 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Brevundimonas | 4181
3300010398|Ga0126383_10655434 | Not Available | 1125
3300012199|Ga0137383_10039567 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3343
3300012199|Ga0137383_10182877 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1536
3300012202|Ga0137363_10035457 | Not Available | 3477
3300012202|Ga0137363_11811806 | Not Available | 503
3300012363|Ga0137390_10724360 | Not Available | 956
3300012922|Ga0137394_10002114 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 14615
3300012927|Ga0137416_10204004 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1588
3300012931|Ga0153915_10287278 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1833
3300012944|Ga0137410_11422560 | Not Available | 603
3300012971|Ga0126369_10093554 | Not Available | 2701
3300013104|Ga0157370_10481750 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1140
3300014162|Ga0181538_10174654 | Not Available | 1219
3300014501|Ga0182024_10001246 | All Organisms → cellular organisms → Bacteria | 65994
3300015054|Ga0137420_1365011 | Not Available | 1223
3300015241|Ga0137418_10265140 | All Organisms → cellular organisms → Bacteria | 1450
3300016319|Ga0182033_11858458 | Not Available | 547
3300016357|Ga0182032_10001708 | All Organisms → cellular organisms → Bacteria | 9995
3300016357|Ga0182032_10476397 | Not Available | 1023
3300016387|Ga0182040_10806879 | Not Available | 774
3300016422|Ga0182039_10691204 | Not Available | 899
3300017929|Ga0187849_1000635 | All Organisms → cellular organisms → Bacteria | 45500
3300017938|Ga0187854_10243694 | Not Available | 782
3300017955|Ga0187817_10038056 | Not Available | 2936
3300018007|Ga0187805_10317593 | Not Available | 717
3300018020|Ga0187861_10059708 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1949
3300018088|Ga0187771_10056307 | Not Available | 3076
3300018088|Ga0187771_10936076 | Not Available | 735
3300018468|Ga0066662_10747347 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 941
3300020579|Ga0210407_10002713 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 15001
3300020579|Ga0210407_10251174 | Not Available | 1378
3300020580|Ga0210403_10921636 | Not Available | 688
3300020581|Ga0210399_10676228 | Not Available | 850
3300020581|Ga0210399_11039047 | Not Available | 658
3300021046|Ga0215015_10150876 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2836
3300021046|Ga0215015_10333387 | Not Available | 536
3300021168|Ga0210406_10192158 | Not Available | 1696
3300021171|Ga0210405_10002138 | All Organisms → cellular organisms → Bacteria | 20903
3300021407|Ga0210383_10033496 | All Organisms → cellular organisms → Bacteria | 4294
3300021420|Ga0210394_10009213 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 9996
3300021420|Ga0210394_10144575 | Not Available | 2064
3300021479|Ga0210410_10339533 | Not Available | 1347
3300025469|Ga0208687_1057401 | Not Available | 926
3300025501|Ga0208563_1050799 | Not Available | 890
3300025913|Ga0207695_10025223 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 6661
3300025919|Ga0207657_10095233 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2478
3300025934|Ga0207686_10359140 | Not Available | 1099
3300025949|Ga0207667_10197074 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2066
3300026142|Ga0207698_10540955 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1140
3300026334|Ga0209377_1175274 | Not Available | 763
3300026557|Ga0179587_10104659 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1717
3300026557|Ga0179587_10179588 | All Organisms → Viruses → Predicted Viral | 1332
3300027011|Ga0207740_1031701 | Not Available | 645
3300027516|Ga0207761_1089799 | Not Available | 598
3300027703|Ga0207862_1156523 | All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota → DHVE2 group → Aciduliprofundum → Aciduliprofundum boonei | 681
3300027875|Ga0209283_10183550 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1390
3300027875|Ga0209283_10406171 | Not Available | 887
3300027894|Ga0209068_10360801 | Not Available | 824
3300027905|Ga0209415_10002080 | All Organisms → cellular organisms → Bacteria | 37888
3300027911|Ga0209698_10040558 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 4169
3300028047|Ga0209526_10411473 | Not Available | 896
3300028536|Ga0137415_10180089 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1939
3300031545|Ga0318541_10041144 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2339
3300031564|Ga0318573_10253003 | Not Available | 939
3300031573|Ga0310915_10395161 | Not Available | 982
3300031576|Ga0247727_10100255 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3046
3300031576|Ga0247727_10290812 | Not Available | 1396
3300031668|Ga0318542_10653486 | Not Available | 549
3300031670|Ga0307374_10008229 | All Organisms → cellular organisms → Bacteria | 15642
3300031670|Ga0307374_10077249 | Not Available | 3100
3300031670|Ga0307374_10223278 | Not Available | 1307
3300031670|Ga0307374_10481466 | Not Available | 675
3300031671|Ga0307372_10007563 | All Organisms → cellular organisms → Bacteria | 15524
3300031671|Ga0307372_10384887 | Not Available | 727
3300031671|Ga0307372_10571827 | Not Available | 514
3300031672|Ga0307373_10025164 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → unclassified Bryobacterales → Bryobacterales bacterium | 7301
3300031672|Ga0307373_10026991 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Falkowbacteria → Candidatus Falkowbacteria bacterium RIFOXYD2_FULL_34_120 | 6923
3300031679|Ga0318561_10370683 | Not Available | 786
3300031682|Ga0318560_10251853 | Not Available | 948
3300031708|Ga0310686_116479137 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 6943
3300031708|Ga0310686_118015417 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2443
3300031719|Ga0306917_11512740 | Not Available | 516
3300031736|Ga0318501_10287745 | Not Available | 875
3300031771|Ga0318546_10012498 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 4593
3300031771|Ga0318546_10523258 | Not Available | 833
3300031819|Ga0318568_10146572 | Not Available | 1441
3300031821|Ga0318567_10396439 | Not Available | 782
3300031879|Ga0306919_10693637 | Not Available | 784
3300031896|Ga0318551_10589961 | Not Available | 641
3300031912|Ga0306921_10207684 | Not Available | 2295
3300031912|Ga0306921_10845301 | Not Available | 1043
3300031947|Ga0310909_11432628 | Not Available | 552
3300031954|Ga0306926_11235069 | Not Available | 876
3300031954|Ga0306926_11739382 | Not Available | 710
3300031959|Ga0318530_10275205 | Not Available | 695
3300031962|Ga0307479_10019275 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 6437
3300032001|Ga0306922_11604592 | Not Available | 647
3300032001|Ga0306922_11844354 | Not Available | 594
3300032035|Ga0310911_10717484 | Not Available | 579
3300032035|Ga0310911_10862358 | Not Available | 523
3300032041|Ga0318549_10149448 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1039
3300032059|Ga0318533_10846197 | Not Available | 671
3300032261|Ga0306920_101378879 | Not Available | 1011
3300032261|Ga0306920_101704201 | Not Available | 893
3300032770|Ga0335085_10002183 | All Organisms → cellular organisms → Bacteria | 37504
3300032770|Ga0335085_10010222 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 14045
3300032782|Ga0335082_10012314 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 9286
3300032783|Ga0335079_10064556 | Not Available | 4170
3300032805|Ga0335078_10307934 | Not Available | 2126
3300032805|Ga0335078_10666847 | Not Available | 1297
3300032896|Ga0335075_10002396 | All Organisms → cellular organisms → Bacteria | 30221
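
Each scaffold identifier in the list above joins a numeric prefix and a scaffold name with a `|` character, and the prefixes appear to match the Taxon OIDs listed under Associated Samples (for example, 3300001356). The following is a minimal sketch of splitting an identifier so that it can be matched to its sample; the names `ScaffoldRef` and `parse_scaffold_id` are illustrative and not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two halves of a scaffold identifier."""
    taxon_oid: str      # numeric prefix, assumed to be the IMG/M Taxon OID
    scaffold_name: str  # scaffold name within that metagenome


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split 'TAXON_OID|SCAFFOLD_NAME' into its two parts."""
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    # Reject anything that does not look like '<digits>|<name>'.
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = parse_scaffold_id("3300002908|JGI25382J43887_10087051")
print(ref.taxon_oid, ref.scaffold_name)
```

Keeping the Taxon OID as a string avoids any dependence on integer formatting when it is later used as a lookup key.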




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 21.13%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 13.38%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 10.56%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 7.04%
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 6.34%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 5.63%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 2.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 2.82%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.82%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 2.11%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 2.11%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 2.11%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 2.11%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.41%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 1.41%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.41%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.41%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 1.41%
Sediment | Environmental → Aquatic → Freshwater → Lentic → Sediment → Sediment | 0.70%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.70%
Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 0.70%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.70%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.70%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.70%
Bog Forest Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil | 0.70%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 0.70%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.70%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.70%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.70%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.70%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.70%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001356 | Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300003152 | Freshwater sediment microbial communities from Loktak Lake, India | Environmental
3300004080 | Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005327 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C1-3 metaG | Host-Associated
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005339 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG | Host-Associated
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005563 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 | Host-Associated
3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009548 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_6_100 | Environmental
3300009549 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_20_100 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010339 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM3 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300013104 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C3-5 metaG | Host-Associated
3300014162 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin23_30_metaG | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300016319 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H | Environmental
3300016357 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 | Environmental
3300016387 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300017929 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_4_100 | Environmental
3300017938 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_6_150 | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300018007 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5 | Environmental
3300018020 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_10_100 | Environmental
3300018088 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_10_MG | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300025469 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_6_100 (SPAdes) | Environmental
3300025501 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_150 (SPAdes) | Environmental
3300025913 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes) | Host-Associated
3300025919 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes) | Host-Associated
3300025934 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes) | Host-Associated
3300025949 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 (SPAdes) | Host-Associated
3300026142 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 (SPAdes) | Host-Associated
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027011 | Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 29 (SPAdes) | Environmental
3300027516 | Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 34 (SPAdes) | Environmental
3300027703 | Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 81 (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300027905 | Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes) | Environmental
3300027911 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300031545 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26 | Environmental
3300031564 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f21 | Environmental
3300031573 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111 | Environmental
3300031576 | Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25 | Environmental
3300031668 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f23 | Environmental
3300031670 | Soil microbial communities from Risofladan, Vaasa, Finland - OX-3 | Environmental
3300031671 | Soil microbial communities from Risofladan, Vaasa, Finland - OX-1 | Environmental
3300031672 | Soil microbial communities from Risofladan, Vaasa, Finland - OX-2 | Environmental
3300031679 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23 | Environmental
3300031682 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f22 | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031719 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2) | Environmental
3300031736 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f21 | Environmental
3300031771 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19 | Environmental
3300031819 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f21 | Environmental
3300031821 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f20 | Environmental
3300031879 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2) | Environmental
3300031896 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f19 | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300031947 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000H | Environmental
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300031959Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f24EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032035Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF170EnvironmentalOpen in IMG/M
3300032041Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f22EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300032782Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1EnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300032805Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2EnvironmentalOpen in IMG/M
3300032829Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3EnvironmentalOpen in IMG/M
3300032896Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.4EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12269J14319_1011015523300001356Peatlands SoilMRTKRKKVNFAVRKRALAVLARMRNRGESLAEAARHVHTTSRSVRKHAGKQLKRGRSGRYSVKQSDALRRDLNVLTHDGYVAVSVRSSRQAHRASEHLIAVGRFLRTGDREWLKPFVGKRVGGVELLTDPERLHILADADLVKLDALYRNSPVGGRKK*
JGI25382J43887_1008705123300002908Grasslands SoilMSRINKKVDHPARLRALSVLARMRSRGESLSQAARLEHTTPRTVLKLVGRQLKRGTSGRYSATRGDTLRRDLSVLASEGYVAVAVRSSRQAQLASEHLIAVGRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLRELADAGLVKLDALYRENRSGRQEK*
Ga0052254_113772813300003152SedimentQQFLAIFLGISILRNMKRRLHNTAAKQRAIAVLARMRSRGESLSEAARHERTTPRTVRRILPKQLRRGSSGRYVASRGDRLQREISVLSFDGYVHVVVRSSKQAQLASAHLIAINRYLRTGDERWLKPFIGKHVGGVELLTDPDRLQILADAGLVKLDALYRNNPGGGHEK*
Ga0062385_1002327323300004080Bog Forest SoilMPKKVSRLARSRALAVLARMRSRGESLSEAARLERTTPRTVRKIVGKHLKRSASRRYSATRGDRLRRDLSVLGFDGYEPVVVRSSKQAHLAADHLVAVGRFLRTGDTEWLKPFIGKRVGGVELLTDPDRLHDLADADLVKLDALYRDHRGGTRKSV*
Ga0066676_1074123513300005186SoilMKRKRFTNSARQRALAVLARMRSRGESLSQAARLEHTTPRTVLQIVAKQFKRSSSGHYTARKSDTLRRDLTVLGFDGYIPVVVRSSKQAQLASAHLVAINRFLRTGDKEWLKPFIGKRVGGVELLTDQDRLQVL
Ga0070658_1010345623300005327Corn RhizosphereMRKREKTDSPARLRAFAVLARMRNRGESLTQAARLEHTTPNVVRKLVRQQLKRDVSGRYSATRSDTLRRDLNVLGYEGYEPVAVRSSKKAQVAADHLVAVGRFLRTGDKEWLKPFVGRRLGRVELLTDTDRLQTIANADLIKLDALYRDHRRAR*
Ga0066388_10093350123300005332Tropical Forest SoilMLGVSDRTNILRKMRNKFQNPARIRALAVLSRIRNRGETPTIAAALEHTTLRTVRKYVGKQLRRGRTGRYTATRSDTLRRDITVLGSDGYVPTVVRSSKQAQLASAHLIAVNRFLRPGGDENWLKLFIGKRVGGVELLTDPQRLREFAEADLVKLDGLYRNNRGGTER*
Ga0070660_10007096833300005339Corn RhizosphereMRKREKTDSPARLRAFAVLARMRNRGESLTQAARLEHTTPNVVRKLVRQQLKRDVSGRYSATRSDTLRRDLNVLGYEGYEPVAVRSSKKAQVAADHLVAVGRFLRTGDKEWLKPFVGRRLGRVELLTDTDRLQTIANANLIKLDALYRDHRRAR*
Ga0070708_10011325033300005445Corn, Switchgrass And Miscanthus RhizosphereMATNANHKLPMKANKFQSPARLCALSVLARMRSRGESLTQAARLEHTTSRTVRKLIGQQLKRGSSGRYKPTRGDTLRRDLSVLGFDGYEPVVVRSSKQARLAAEHLIAVGRFLRTGDPERLKPFVGKRVGGVELLTDPGRLRELGDAGLVKLDALYREHRGGRQEK*
Ga0066687_1065842813300005454SoilMKREQFQNTNRLHALAVLARMRSRGQSLSEAAAFEHTTTRTVKKYVGKQLRQGTSGRYFATPGDTLRRDLNALGVDGYQPVVVRSSNQAQLASAHLIAVNRYLRTGDIEWLKPFIGKRVGGVELLTDPERLHILGDADLVKLDALYRSNPGGGR
Ga0066692_1006459223300005555SoilLEDTVDTRLTFSAYLLNKYEQGPSPKAANTRLSLLANKCEYWNGPDGASILPGMKRNQFQSPARLRALAVLARMRSRGESLSQAARLEHTTPRTVRKLVGKQLKRGPSGRYSATRGDTLRRDLSVLGFDGFEPVIVRSSKQAHLASEHLIAVGRFLRTGDAEWLKPFVGKRVGGVELLTDPDRLHELADAGLVKLDGLYREHRGARQEK*
Ga0068855_10024168733300005563Corn RhizosphereMRKREKTDSPARLRAFAVLARMRNRGESLTQAARLEHTTPNVVRKLVRQQLKRDVSGRYSATRSDTLRRDLNVLGYEGYEPVVVRSSKQAQVAADHLVAVGRFLRTGDKEWLKPFVGRRLGRVELLTDTDRLQTIANADLIKLDALYRDHRRAR*
Ga0075021_1049715823300006354WatershedsMVPHIPHGWGLRVKVITKRSMRKKRQKTDTTARRRAFAVLAGMRNRGESLTQAARQEHTTPRMVRKYVGTQLRRGPSRHYSATRSDTLRRDLNVLGFEGYEPVVVRSSKQAQVASKHLVAVGRFLRTGDKEWLKPFVGKRVGGIELLTDTDRLQMIANADLIKLDALYREHQRAR*
Ga0099830_1099391723300009088Vadose Zone SoilVKVQGEKAMARNNKKVHHPARRRALAVLARMRSRGESLSQAARLEHTTARTVRKLVGKQLKRGPSGRYSATHGDTLRRDLSVLGFDGFEPVVVHSSKQAQLASEHLIAVGRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLRELADAGLVKLDALYREHRGARQEK*
Ga0099830_1103092113300009088Vadose Zone SoilMRSRGESLSQAARLEHTTPQTVRKIVGKQLKRARTSGRYSATRGDTLRRDLSVLGFDGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDTEWLKPFIGKRVGDVELLTDPDRLHELADAGLVKLDALYREHRGGTEKPV*
Ga0099828_1051956513300009089Vadose Zone SoilMKRKQFQSPKRLRALAVLARMRSHGESLSQAARLEHTTPRTVRKTVGKQLKRGSSGRYSATRGDMLRRDLSVLGFDGYEPVVVRSSKQAQLASAHLVAVNRFLRTGDEEWLKPFIGKRVGGIELLTDPDRLHVLADADLVKLDALYREHRGGTEKSV*
Ga0099827_1138919413300009090Vadose Zone SoilQAARLEHTTPRTVRKLVGKQLKRSGSGRYSATSGDTLRRDLNVLGSEGYVPVPVRSSKQAQLDSEHLIAVGRFIRTGDTEWLKPFVGKRVGGVELLTDPDRLHELADAGLVKLDALYRENRGGRQDR*
Ga0116107_103225823300009548PeatlandMPRKNRKADHPARLRVLAVLARMRRGESLSHAAPVEHTTPSTVRKLVGKQLKRGLSGRYSATLGDTLRRDLNVLGFDGYVPVAVRSSKQAQLASEHLVAVARFLRTGDTEWLKPFVGKRVGGVELLTDPDRLGVLADAGLVKLDALYREHRVGGQEK*
Ga0116137_100092793300009549PeatlandMPRKNRKADHPARLRVLAVLARMRRGESLSHAAPVEHTTPSTVRKLVGKQLKRGLSGRYSATLGDTLRRDLNVLGFDGYVPVAVRSSKQAQLASEHLVAVARFLRTGDTEWLKPFVGKRVGGVELLTDPDRLGVLADAGLVKLDALYREHRGGGQEK*
Ga0126373_1000282363300010048Tropical Forest SoilMTGNAVEFISGFPKAAKMSKKHKNRTSRSVQAHNRAMHVLARMRRGESLSQAARAEHTTPSTVRKVVGKQLKRHASGHYRATRGDTLRRDLNVLGYDGYVPVTVHSSKQARLASEHLVAAGRFLRSGNTELLKPFVGKRVGGVELLTDPDRLQILADADLVKLDALYRQNRGQGQGE*
Ga0126373_1086234923300010048Tropical Forest SoilMPKKIRRWHQLARQRALAVLARMRRGESLSQAARLEHTTPRTVLKLVGKQLTRGPTEIYVATFGDTLRRDLNVLGFDRYQPVTVYSSQQAHLASEHLIAVSRFLRTWDTNWLKPFIGKRVGGVELLTDPDRLHELADAGLVKLDGLYRQNRGQGQGE*
Ga0074046_1002129033300010339Bog Forest SoilMRSRGESLSHAARNEGTTPRAVRKEVGNQLTRGPTGRYVATSGDTLRRDLTVLGFDGYLPAVVRSWKQAQLASAHLVAVDRYLRTGDTKWLKPFIGKRVGGVELLTDPDRLQILADADLVKLDALYRGNSGASWEK*
Ga0126378_1062659913300010361Tropical Forest SoilSPAKLRSLAVLARVRRGEPLSRGARDERTTVRTVRKHVGKQLRRDSSGRYRATRGDTLRRDINVLGYDGYEPVVVHSSKQAHLASQHLIAVARFLRPPGDRELLRPFIGKRVGGVELLTDPDRLEILGDAGLVKLDGLYRSNRGSREASE*
Ga0126379_1147058613300010366Tropical Forest SoilMRSRGESLARASRNEHTTPRAVRKEVGKQLTRGPSGRYVATSGDTLKRDLNVLGFDGYEPAVVRSSKQARLAAEHLVAVGRFLRTGDAAWLKPFVGKRVGGVELLTDPDRLQVLADADLVKLDGLYRQNRGQRQGK*
Ga0126379_1337604813300010366Tropical Forest SoilRGESLSQAAANERTTPRAVRKVVGKQLIRGPTGRYVATSGDTLRRDLNVLGFDGYEPVVVRSSKQAHLAAEHLVGVARFVRTGDTHWLKPFIGKRVGGVELLTDPDRLQILADADLVKLDALYRSNPGGRQEK*
Ga0134125_1110879713300010371Terrestrial SoilMRKREKTDSPARLRAFAVLARMRNRGESLTQAARLEHTTPNVVRKLVRQQLKRDVSGRYSATRSDTLRRDLNVLGYEGYEPVAVRSSKKAQVAADHLVAVGRFLRTGDKEWLKPFVGRRVGGVELLTDTDGLQMIANADLIKLDALYRDHRRAR*
Ga0126381_10033978833300010376Tropical Forest SoilMPKKVRRLHQLARQRALAVLARMRRGEPLTQAARLEHTTPRTVLKFIGKQLTRGPTEAYVATSGDTLRRDLNVLGFDGYEPVTVHSSKQAHLASEHLIAVGRFLRPPGDPELLKPFVGKRVGGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGQGE*
Ga0126381_10170421413300010376Tropical Forest SoilKLRSLAVLARVRRGEPLSRGARDERTTVRTVRKHVGKQLRRDSSGRYRATRGDTLRRDINVLGYDGYEPVVVHSSKQAHLASQHLIAVARFLRPPGDRELLRPFIGKRVGGVELLTDPDRLEILGDAGLVKLDGLYRSNRGSREASE*
Ga0126383_1003208843300010398Tropical Forest SoilMRSHGESLSRAALNERTTPRSVRKDVGNQLTRGPTGRYIATSGDTLRRDLNVLGFDGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDTEWLKPFTGKRVGGVELLTDPDRLRILADVDLVKLDALYRNNPGGGRQK*
Ga0126383_1065543423300010398Tropical Forest SoilMRRGESLWQAARNEHTTPRAVRKIVGKQLIRGPTGRYIATSSDTLRRDLNVLGFDGYEPAVVRSSKQAHLGAGHLVAVGRFLRTGDTEWLKPFIGKRVGGVELLTDPDRLRILADADLVKLDALYRNNPGRREK*
Ga0137383_1003956733300012199Vadose Zone SoilMTRNNKKVRSLTRQRALAVLARMRSRGESLSQATRLEHTTLRTVRKVVGKQLKRGISGRYSATRGDTLRRDLSVLGFDGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDTEWLKPFIGKRVGGVELLTDPDRLQVLADADLVKLDALYRNNRGGSQDK*
Ga0137383_1018287723300012199Vadose Zone SoilMPKDSKKARRLTRLRALGVLARMRSRGESLSQAAQLEHTTPRTVRKFVAKQLKRDASGHYRATSGDTLRRDLNILGLDGYEPAVVSSWIQAQLAAEHLVAVNRFLRTGDIEWLKPFVGKRVGGVELLTDPDRLQILADADLVKLDALYRNNPGGGREK*
Ga0137363_1003545723300012202Vadose Zone SoilMKRKRYTNSARQRALAVLARMRSRGESLSEAARHERTTPRTVHRIFPKQLKRGISGRYIASRGDRIRREISVLSFDGYLPVAVRSSKQAQLASAHLIAINRYLRTGDERWLKPFIGKRVGGVELLTDTDRIQILGDADLVKLDGLYRNNRGNREASG*
Ga0137363_1181180613300012202Vadose Zone SoilVRRGESLSQAARNERTTTRTARKYVGKQLRRDSSGRYRATRGDTLRRDINVLGYDGYEPVVVHSSKQVHLASQHLIAVARFLRPPGDRESLRPFIGKRVGGVELLTDPDRLEILGDAGLVKLDGLYRHLRGSREKQA*
Ga0137390_1072436013300012363Vadose Zone SoilMRSRGESLTKAARLERTTPRTVRKIVGKQLRRGASRRYSATRGDTLRRDLSVLGFDGYESVVVRSSKQAHLAADHLVAVGRFLRTGDTEWLKPFIGKRVGGVELLTDPDRLHDLADADLVKLDALYRDHRGGTEKSV*
Ga0137394_1000211443300012922Vadose Zone SoilMRSRGESLSQAARNERTTPRTVRQEVGNQLTRGPTGRYVVTSGDTLRRDLNVLGFDGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDAEWLKPFVERRVGGVELLTNPDRLQILADADLVKLDALYRNNPGRGREK*
Ga0137416_1020400423300012927Vadose Zone SoilMPKASKKVNRVTRLRTLAVLARMRSRGESLSQAARLEHTTPRTVLKIVGKQLKRGASGRYSATRGDTLRRDLSVLGFEGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDTEWLKPFIGKRLDGVELLTDPDRLHILADADLVKLDALYRDQRGGREGSV*
Ga0153915_1028727813300012931Freshwater WetlandsMKRKQSQSPARLRALSVLARVRRGESLSRAARLEHTTPRSVRKVVGKQLKRGTSGRYSATRGDTLRRDLSVLGFDGFEPVVVRSSKKAHVASEHLIAVGRFLKTGDTEWLKPFVGKRVGGVELLTDPDRLDVLGDAGLVKLDALYREHRGRTEDSQ*
Ga0137410_1142256013300012944Vadose Zone SoilMKVLKAMRKNKKMQDPARRRALAVLARMRSRGESLSKAARLEHTTPRTVRKLAGKQLKRGNSGHYSATRGDTLRRDLSVLSSEGYVAVAVRSSKQAQLASEHLIAVDRFIKTDDSEWLKPFVGKRVGGVELLTDPDRLNELADAGLVKLDALYREHRSGRQE*
Ga0126369_1009355413300012971Tropical Forest SoilMRSRGESLSKAARNERTTPRTVHRIFPKQLKRGSSGRYIASHGDRLRREISVLSFDGYVPVAVRSSKQAQLASAHLIAVNRFLRTGDENWLKPFIGKRVGGVELLTDPDRLQILGDADLVKLDGLYRNHRGNIGSSE*
Ga0157370_1048175013300013104Corn RhizosphereMRKREKTDSPARLRAFAVLARMRNRGESLTQAARLEHTTPNVVRKLVRQQLKRDVSGRYSATRSDTLRRDLNVLGYEGYEPVVVRSSKQAQVAADHLVAVGRFLRTGDKEWLKPFVGRRVGGVELLTDTDGLQMIANADLIKLDALYRDHRRAR*
Ga0181538_1017465423300014162BogVKGDNKIMAKNIYRPARRRALAVLARMRSRGESLSQAARLEHTTARTVRKFVGKQLRRRASGHYSATRSDTLRRDLTVLGFDGFQPVVARSWKQAQLASEHLIAVNRFQRTGDPEWLKPFVGKRVGGVELLTDAERILVFSDADLIKLDALYRENRGTTE*
Ga0182024_1000124683300014501PermafrostMRNRGESLSEAAHLEHTTTATVHKLVGKQLRREASGRYSAAPNDTLRRDLSVLGSEGYASVTARSSKQAQLASEHLIAVGRFLRTGDDEWLKPFIGKHVGGVELLTDSNRLRELADAGLVKLDALYRTNRAGRDGT*
Ga0137420_136501113300015054Vadose Zone SoilLEGADRYGWEEQKQEGESLVRLRALAVLARMRSRGDSLSQAARLEHTTSRTVRKFVGKQLKRGTSRRYSATTGDTLRRDLSVLSFDGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDTEWLKPFIGKRVGGIELLTDPDRLHILADAGLVKLDALY
Ga0137418_1026514023300015241Vadose Zone SoilVLARVRRGESFSQAARNERTTIRTARKYVGKQLRRGSSGRYRATRGDTLRRDINVLGYDGYEPVVVHSSKQAHFASQHLIAVARFLRPPGDPELLKPFIGKRVGGVELLTDPDRLQILADADLVKLDGLYRNNRGGTER*
Ga0182033_1185845813300016319SoilMSGKSKEVNSLTRQRALAALARMRSRGESLSQAARIEHTTPKTVLKIVGKQLKRGTSRHYTATRGDTLRRDLNVLGFDGYQPVVVHSWKQAQLAAEHLVAVNRFLRTGDSEWLKPFVGKRVGGVELLTDPQRLREFAEADLLKLDGLYRD
Ga0182032_1000170853300016357SoilVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0182032_1047639713300016357SoilRTMHTKNSNNKHNAARLRALAVLARMRSRAESLTQAARTERTTPRTVRRIIGKQLKRNASGHYSATSGDTLRRDLNVLGFDGYEPAVVRSWKRAQLAAEHLVAVNRFLRTGDSEWLKPFVGKRVGGVELLTDPQRLREFAEADLLKLDGLYRDNRGGTER
Ga0182040_1080687913300016387SoilRISPAGRRGNRQAVARRQLTRQRSLAVLAGVRRGKSLSQAASDEHTTTRTVLKYVGKELKRDSSGHYRPTGSDTLRRDLNVLGFDGYEPVVVRSSQQAHLASEHLIALGRFLRTGDTDQLKPFVGKRVGGVELLTDPDRLQILADADLVRLDGLYRQNRGQGRGD
Ga0182039_1069120423300016422SoilMHTKNSNNKHNAARLRALAVLARMRSRAESLTQAARTERTTPRTVRRIIGKQLKRNASGHYSATSGDTLRRDLNVLGFDGYEPAVVRSWKRAQLAAEHLVAVNRFLRTGDSEWLKPFVGKRVGGVELLTDPERLRILADADLVKLDGLYRHNRAGRQEG
Ga0187849_1000635333300017929PeatlandMPRKNRKADHPARLRVLAVLARMRRGESLSHAAPVEHTTPSTVRKLVGKQLKRGLSGRYSATLGDTLRRDLNVLGFDGYVPVAVRSSKQAQLASEHLVAVARFLRTGDTEWLKPFVGKRVGGVELLTDPDRLGVLADAGLVKLDALYREHRGGGQEK
Ga0187854_1024369413300017938PeatlandMPRKNRKADHPARLRVLAVLARMRRGESLSHAAPVEHTTPSTVRKLVGKQLKRGLSGRYSATLGDTLRRDLNVLGFDGYVPVAVRSSKQAQLASEHLVAVARFLRTGDTEWLKPFVGKRVGGVELLTDPDRLGVLADAGLVKLDA
Ga0187817_1003805633300017955Freshwater SedimentSRAERTTTRTVRKWVRNQLKCNTSGSYSVTAGDTLKRELNVLGFHGYESVVVRSFKQAHLASEHLIAISRFLRTGDARLLRPFRGKRVGGVRLLTDPDRIREFAEAGLVKLDSLYRNGQGGGKRR
Ga0187805_1031759313300018007Freshwater SedimentMRSRRESLSQASRLEHTTPRTVRIMVGKQLKRGTSGRYAATSGDTLRRDLVVLGFDGYEPAVVRSSKQAQLASEHLIAIGRFLRTGDTEWLKPFIGKRVGGVELLTDSDRLQVLGESGFKLDSLYRQQRGGKGEN
Ga0187861_1005970813300018020PeatlandMPRKNRKADHPARLRVLAVLARMRRGESLSHAAPVEHTTPSTVRKLVGKQLKRGLSGRYSATLGDTLRRDLNVLGFDGYVPVAVRSSKQAQLASEHLVAVARFLRTGDTEWLKPFVGKRVGGVELLTDPDRL
Ga0187771_1005630733300018088Tropical PeatlandMVTNANHKLSMEPKQFQSPARLRALSVLARMRSRGESLSHAARLEHTTPRTVHRRVGSALIRDPRTGRFSAKRGDTFRRDLNVLSFDGYVPVTVRSWKQAQLAAEHLIAVNRSLRTGDGEWLKPFVRKRVGGVQLLTDPDRLGVFADADLIKLDALYREHRSARQEK
Ga0187771_1093607623300018088Tropical PeatlandSQAARLVHTSPRTVHKLVGKQLKRGRSGRYSATRGDTLRRDLSVLGFDGFEPAVVRSWKQAQVASEHLIAVARFLRTGDTEWLKPFVGKSVGGVELVTDHDRLRELADAGLVRLDNLYRQNRGGRQEK
Ga0066662_1074734723300018468Grasslands SoilLEDTVDTRLTFSAYLLNKYEQGPSPKAANTRLSLLANKCEYWNGPDGASILPGMKRNQFQSPARLRALAVLARMRSRGESLSQAARLEHTTPRTVRKLVGKQLKRGPSGRYSATRGDTLRRDLSVLGFDGFEPVIVRSSKQAHLASEHLIAVGRFLRTGDAEWLKPFVGKRVGGVELLTDPDRLHELADAGLVKLDGLYREHRGARQ
Ga0210407_10002713133300020579SoilMAKKRTATVRLKRQQALAVLARMRSRGESLSQAARLERTTPRTVRKIVGKQLKRGASGRYSATRGDTLRRDLNVLGFDGYEPVVVRSSKQAQLASAHLVAVNRFLRTGDEEWLKPFIGKRVGGIELLTDPDRLQVLADADLVKLDALYRQHRGGTEKSV
Ga0210407_1025117423300020579SoilMSDRINILRIMKRKRFTFSARQRVLAALARMRSRGESLSQAARLEHTTSRTVLRIIPKQFKRSSSGRFTATRSDTLRRDLTVLGFDGYVPVVVRSSKHTQLASAHLVAVNRFLRTGDKEWLKPFIGKRVGGVELLSDPDRIQILGEADLVKLDGLYRSHRGDTGRSE
Ga0210403_1092163613300020580SoilMIDMPKNNKKVQSRARRRALAVLARMRSSGESLSEAARSEHTTPRTVRKELPKQFKRGPSGRYSATAADTLQRFLNVLGFDGYVPVTVRSSKQAQLASDHLIAVGKFLGLIPSPLAGDTELLKPFVGKRVGGVQLLTDPDRLRELAEAGLIKLDALYRNNRAGRHEK
Ga0210399_1067622813300020581SoilMPKASKRVNRVARLRALAVLARMRSRGESLSQATRHEHTTPRTVRKNLGKQLKRDASGRYSATRGDTLRRDLSVLGFDGYSPVVVRSSRQAHLAADHLVAVDRFLRTGDTEWLKPFIGKRVGGVELLTDPDRL
Ga0210399_1103904713300020581SoilMPKDSKKVNRVARSRALAVLARMRSRGESLSQAARLERTTPRTVRKIVGKQLKRGASRRYSATRGDTLRRDLSVLGFDGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDTEWLKPFIGKRVGGVVLLTDLGRLRELADAD
Ga0215015_1015087613300021046SoilMTRNNKKVRSLTRQRALAVLARMRSRGESLSQATRLEHTTLRTVRKVVGKQLKRGTSGRYSATRGDTLRRDLSVLGFDGYEPVVVRSWKQAQLASEHLVAVGRFLRTGDTEWLKPFIGKRVGGVELLTDPDRLHELANAGLVKLDALYREHRGGRETVSYTH
Ga0215015_1033338713300021046SoilMTKSKKLQSPARRRALAVLARMRRGESLSEAARHEHTSPHTVRKELPKQLKRSLSGRYLATLADTLRRDLSVLGFDGYVPVVVRSSKQAQLASEHLIAVSRFLRTGDTEWLKPFVGKQVGGVELLTDPDRLRELAEAGPVSYTHL
Ga0210406_1019215823300021168SoilMAKKNSKSHLLSRQRSLAVLARVRRGEPLSRAARSERTTSRTVRKHVGKQFKRDSSGRYRATRGDTLRRDLNVLGFDGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDSEWLKPFTGKRVGGVELLTDQDRLRVLADADLVKLDGLYRNHRGHSEASK
Ga0210405_1000213893300021171SoilMPKKATSLARSRAIAVLARMRSRGETLSQAARNERTTPRTVRKHVGKQLKRDSSRRYRATRGDTLRRDLSVLGFDGYEPVVVRSSRQAHLAAEHLVAVGRFLRTGDAEWLKPFVGKRVGGIELLTDLDRIGVLADADLVKLDGLYRQNRGGTEK
Ga0210383_1003349623300021407SoilMRTKNSKNKQNMTRIRALSVLARMRSRAESLSQAAHAEHTTPRTVRRIIGKQLKRNSSGHYLATRGDTLRRDLNVFGFDGYEPVVVRSSKQASLAAEHLVAVGRFLRTGDAEWLKPFIGKRVGAIELLTDPDRLRIFADADLVKLDALYRSNPGGGRDK
Ga0210394_1000921373300021420SoilMRNRDESLSQAARGEHTTPRTVRKLLGRQLKRGTSGHYSATRGDTLRRDLNVLGYDAYEPVSVRSSKQAQLASEHLIAVNRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLHELADAGLVKLDALYRGNRGGTEKSS
Ga0210394_1014457513300021420SoilARLERTTPRTVVRIIPKQFKRSSSGRYTATRGDTLRRDLTVLGFDGYVPVAVRSSRHAQLASAHLVAVNRFLRTGDEEWLKPFIGKRVGGVELLTDQDRLQVLADADLVKLDGLYRNQRGRSEASE
Ga0210410_1033953313300021479SoilMSKKNGKSSFARSRALAVLARMRSRGESLSQAARFERTTVKSVRRFIGKQLRRSDTGRYVATSGDTLRRDLNVLGFDGYQPAVVRSSKQAHIAADHLIAVGRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLYELADADLVRLDSLYRQNRPTK
Ga0208687_105740123300025469PeatlandLRRKKTMPRKNRKADHPARLRVLAVLARMRRGESLSHAAPVEHTTPSTVRKLVGKQLKRGLSGRYSATLGDTLRRDLNVLGFDGYVPVAVRSSKQAQLASEHLVAVARFLRTGDTEWLKPFVGKRVGGVELLTDPDRL
Ga0208563_105079933300025501PeatlandRMRRGESLSHAAPVEHTTPSTVRKLVGKQLKRGLSGRYSATLGDTLRRDLNVLGFDGYVPVAVRSSKQAQLASEHLVAVARFLRTGDTEWLKPFVGKRVGGVELLTDPDRLGVLADAGLVKLDALYREHRGGGQEK
Ga0207695_1002522383300025913Corn RhizosphereMRKREKTDSPARLRAFAVLARMRNRGESLTQAARLEHTTPNVVRKLVRQQLKRDVSGRYSATRSDTLRRDLNVLGYEGYEPVVVRSSKQAQVAADHLVAVGRFLRTGDKEWLKPFVGRRLGRVELLTDTDRLQTIANADLIKLDALYRDHRRAR
Ga0207657_1009523323300025919Corn RhizosphereMRKREKTDSPARLRAFAVLARMRNRGESLTQAARLEHTTPNVVRKLVRQQLKRDVSGRYSATRSDTLRRDLNVLGYEGYEPVAVRSSKKAQVAADHLVAVGRFLRTGDKEWLKPFVGRRLGRVELLTDTDRLQTIANADLIKLDALYRDHRRAR
Ga0207686_1035914023300025934Miscanthus RhizosphereMRQKRKKAGYASRLRAFAVLARMRNRGESLTQAARQEHTTPAIVRQHVGRQLRRGAAGRYTATRSDTLRRDLTVLGSEGYEPVVVRSSKRAQVASEHLVAVGRFLRTGDKEWLNPFVGKRVGGVELLTDPDRLQRIANADLMKLDGLYRDHRRAR
Ga0207667_1019707433300025949Corn RhizosphereMRKREKTDSPARLRAFAVLARMRNRGESLTQAARLEHTTPNVVRKLVRQQLKRDVSGRYSATRSDTLRRDLNVLGYEGYEPVVVRSSKQAQVAADHLVAVGRFLRTGDKEWLKPFVGRRLGRVELLTDTDRLQTIANANLIKLDALYRDHRRAR
Ga0207698_1054095513300026142Corn RhizosphereLTQAARLEHTTPNVVRKLVRQQLKRDVSGRYSATRSDTLRRDLNVLGYEGYEPVVVRSSKQAQVAADHLVAVGRFLRTGDKEWLKPFVGRRLGRVELLTDTDRLQTIANADLIKLDALYRDHRRAR
Ga0209377_117527423300026334SoilAVLARMRSRGESLSQAARLEHTTPRTVRKLVGKQLKRGPSGRYSATRGDTLRRDLSVLGFDGFEPVIVRSSKQAHLASEHLIAVGRFLRTGDAEWLKPFVGKRVGGVELLTDPDRLHELADAGLVKLDGLYREHRGARQEK
Ga0179587_1010465933300026557Vadose Zone SoilMRSRKESLSEASRHEHTTPRTVHRIFPKQLKRGSSGRYVASRGDRLRREISVLSFDGYVPVVVRSSKQAQLASAHLIAVNRFLRTGDERFLKPFIGKRVGGVELLTDTDRIQVLGDADLVKLDGLYRSNRGNREASE
Ga0179587_1017958833300026557Vadose Zone SoilERKVDHPARSRAISVLARMRSRGESLSQAARLEHTTPRTVRRLVGRQIKRGSSGRYSATSGDTLRRDLSVLGSDGYVPVTVRSSKQAQLASEHLIAVGRFLKTGDTEWLKPFVGKRVGGVELLTDPDRLHELDDAGLIKSDGLYRENRGVRQEQ
Ga0207740_103170123300027011Tropical Forest SoilQLERLRALAVLARMRRGESLSQAARLEHTRPRTVLKLIRKQLTRGPTGRYIATSGDTLRRDLNVLGFDGYEPVTVHSSKQAHLASEHLIAVNRFLRNGDEEWLKPFVGKRVGGVELLTDPDRLGVLADAGLVKLDGLYRQSHGERREK
Ga0207761_108979923300027516Tropical Forest SoilMRSRGETLSYAAHFEHTTVRTVRRHVGSALKLNPHTGRYTAKRGDTFRRDVNVLGADGYVPVTVRSSNQARLASQHLIAVNRFLRPPGDAELLAPFVGKRIGGVELLTDPDLLSMFADAGLVKLDGLYRHNRGTNGDSV
Ga0207862_115652313300027703Tropical Forest SoilMPKKIRRLHQLARQRALAVLARMRRGESLSQAARLEHTRPRTVLKLIRKQLTRGPTGRYIATSGDTLRRDLNVLGFDGYEPVTVHSSKQAHLASEHLIAVNRFLRNGDEEWLKPFVGKRVGGVELLTDPDRLGVLADAGL
Ga0209283_1018355023300027875Vadose Zone SoilMARNNKKVHHPARRRALAVLARMRSRGESLSQAARLEHTTARTVRKLVGKQLKRGPSGRYSATHGDTLRRDLSVLGFDGFEPVVVRSSKQAQLASEHLIAVGRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLRELADAGLVKLDALYREHRGARQEK
Ga0209283_1040617123300027875Vadose Zone SoilMKRKQFQSPKRLRALAVLARMRSHGESLSQAARLEHTTPRTVRKTVGKQLKRGSSGRYSATRGDMLRRDLSVLGFDGYEPVVVRSSKQAQLASAHLVAVNRFLRTGDEEWLKPFIGKRVGGIELLTDPDRLHVLADADLVKLDALYREHRGGTEKSV
Ga0209068_1036080123300027894WatershedsMRNRGESLTQAARQEHTTPRMVRKYVGTQLRRGPSRHYSATRSDTLRRDLNVLGFEGYEPVVVRSSKQAQVASKHLVAVGRFLRTGDKEWLKPFVGKRVGGIELLTDTDRLQMIANADLIKLDALYREHQRAR
Ga0209415_1000208023300027905Peatlands SoilMRTKRKKVNFAVRKRALAVLARMRNRGESLAEAARHVHTTSRSVRKHAGKQLKRGRSGRYSVKQSDALRRDLNVLTHDGYVAVSVRSSRQAHRASEHLIAVGRFLRTGDREWLKPFVGKRVGGVELLTDPERLHILADADLVKLDALYRNSPVGGRKK
Ga0209698_1004055833300027911WatershedsMMQRKRRRSQSAARGRALAVLARMRSRGESLSEAARNEKTTARTVRRHVGSAMIRDPRTGHFAAKSGDTFRRDINVLGYDGYVPLSVRSSKQAQLASEHLIATGRFIRTGDTEWLKPFIGKRVGRVELLTDPDRLREFADAGLVKLDALYRQNHGGRREK
Ga0209526_1041147323300028047Forest SoilESLAQAARAEHTTPHTVRKIVCKQLKRRPSGRYTVTSFDTFRRDLSVLSWEGYIAVSVRSSKQAQIASAHLIAIIRFLIDGDTEWLKPFVGKQVGGVELLTDPDRLHELADAGLIKLDALYRSNRAGRPEE
Ga0137415_1018008923300028536Vadose Zone SoilMPKASKKVNRVTRLRTLAVLARMRSRGESLSQAARLEHTTPRTVLKIVGKQLKRGASGRYSATRGDTLRRDLSVLGFEGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDTEWLKPFIGKRLDGVELLTDPDRLHILADADLVKLDALYRDQRGGREGSV
Ga0318541_1004114413300031545SoilVLAVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0318573_1025300323300031564SoilGNRQALTRHQLTRQRVLAVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0310915_1039516123300031573SoilMHTKNSNNKHNAARLRALAVLARMRSRAESLTQAARTERTTPRTVRRIIGKQLKRNASGHYSATSGDTLRRDLNVLGFDGYEPAVVRSWKRAQLAAEHLVAVNRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLRTLAEADLVKLDGLYRQTRGQGREK
Ga0247727_1010025523300031576BiofilmMHRKLRKAQSAARLRALAALARMRRRGESLSRAARLEHTTPRTVRKVVGKQLKRGPSGRYSATRGDTLRRDLSVLGFDGFEPVVVRSSKQAQLASEHLVAVGRFLRTGDTEWLKPFVGKRVGGAELLTDPGRLRELADAGLVKLDALYREHRGARQEK
Ga0247727_1029081233300031576BiofilmMPRKNKKVDHAARLRALAVLARMRSRAESLSQAARLEHTTPRTVRKVVGKQLKRGASGRYSATRGDTLRRDLSVLGFDGFEPVVVRSSKQAHLASEHLVAVGRFLRTGDTEWLKPFVGKRVGGVELLTDPGRLRELADAGLVKLDALYREHRGGTEKP
Ga0318542_1065348613300031668SoilSLRRQLALAVLARMRSRGESLSQAARNEHTTPRAVRKHVGKQLTRARTGRYIATSGDTLRRELNVLGFDGYEPAVVRSSKQAHLAAEHLVAVGRFLRTGDAEWLKPFAGKRVGHVELLTDPDRLQILANADLVKLDGLYRNNPGGGREK
Ga0307374_10008229103300031670SoilMPRKIKVVNHPARPRALRVLARMRNRGEVLTQAARSEHTTPRTVVRIVGKQLRRSTSGRYSATSGDTLRRDINVFGAAGYVPVTVRSSKQPQLASDHLIAVGRFLRTGDAEWLKPFVGKRVGGVELLTDPDRIREFADADLIKLDSLYRQQRGGKYGN
Ga0307374_1007724923300031670SoilVLRKIKVVNHPARPRALAVLARMRNRGEVLTQAARNEHTTPRTVVRIVGKQLRRSASGRYSATSGDTLRRDINVFSREGYVPVTVRSSKQPQLASEHLIAVSRFLRTGDTEWLKPFVGKRVGGVELLTDPDRIHEFADADLIKLDSLYRQQRGGKQEN
Ga0307374_1022327823300031670SoilMPKKKKTTSQAARRRVLSVLARMRRRGESLSEAARLERTTPRTVRKLIGKQFRRSASGRYSATSGDTLRREINVLGVDGYVPVTVRSSKQAQLASEHLIAVGRFLRTGDTEWLRPFIAKRASSVELLTNPDRLEILGDAGFKLDSLYRQQRSGN
Ga0307374_1048146623300031670SoilVLTQAARSEHTTPRTVVRIVGKQLRRSTSGRYSATSGDTLRRDINVFSREGYVPVTVRSSKQPQLASEHLIAVDRFLRTRDTEWLRPFIGKRVGGVELLTDPDRLREFDDAGLIKLDSLYRQQRGGKHEN
Ga0307372_1000756353300031671SoilMPRKIKVVNHPARPRALRVLARMRNRGEVLTQAARSEHTTPRTVVRIVGKQLRRSTSGRYSATSGDTLRRDINVFSREGYVPVTVRSSKQPQLASEHLIAVDRFLRTRDTEWLRPFIGKRVGGVELLTDPDRLREFDDAGLIKLDSLYRQQRGGKHEN
Ga0307372_1038488713300031671SoilTPRTVVRIVGKQLRRSTSGRYSATSGDTLRRDINVFGAAGYVPVTVRSSKQPQLASDHLIAVGRFLRTGDAEWLKPFVGKRVGGVELLTDPDRIREFADADLIKLDSLYRQQRGGKYGN
Ga0307372_1057182713300031671SoilMPKKKKTTSQAARRRVLSVLARMRRRGESLSEAARLERTTPRTVRKLIGKQFRRSASGRYSATSGDTLRREINVLGVDGYVPVTVRSSKQAQLASEHLIAVGRFLRTGDTEWLRPFIAKRASSVELLTNPDRLEILGDAGFKLD
Ga0307373_1002516463300031672SoilMTKKLKIRFQSRARLRALSVLARMRSRGESLSTAARLERTTPRTVRIVLGKQLRRSASGRYSATSGDTLRRDINVFSREGYVPVTVRSSKQPQLASDHLIAVGRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLHEFADADLIKLDSLYRQQRGGKQEN
Ga0307373_1002699113300031672SoilARSEHTTPRTVVRIVGKQLRRSTSGRYSATSGDTLRRDINVFSREGYVPVTVRSSKQPQLASEHLIAVDRFLRTRDTEWLRPFIGKRVGGVELLTDPDRLREFDDAGLIKLDSLYRQQRGGKHEN
Ga0318561_1037068313300031679SoilMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0318560_1025185333300031682SoilLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0310686_11647913753300031708SoilMPTKRKKVNFAAMKRALAVLARMRHRGESLSEAARHEHTTLRSVRKHARKQLKRGRSGRYSVKRSDALRRDLNVLTHDGYAAVSVRSSRQAHRASEHLVAVGRFLKTGDREWLKPFVGKRVGGVELLTEPERLHILADADLVKLDALYRNSPVGGRNK
Ga0310686_11801541723300031708SoilMASKRRRRPSLKRQRAVAVLARMRSRGESLSQAASLEHTTARSVRQEVGKQLKRGPSGRYVATAGDTIRRDLNVLGFDGYEPVATRSWKQAQLAAEHLVAVGRFLRTGDTEWLKPFIGKRVGGIELLTNPDRIGMLADADLVKLDGLYRQNRGGTEK
Ga0306917_1151274013300031719SoilQAARTERTTPRTVRRIIGKQLKRNASGHYSATSGDTLRRDLNVLGFDGYEPAVVRSWKRAQLAAEHLVAVNRFLRTGDSEWLKPFVGKRVGGVELLTDPQRLREFAEADLLKLDGLYRDNRGGTER
Ga0318501_1028774513300031736SoilLSISHAGRTGNRQALTRHQLTRQRVLAVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0318546_1001249813300031771SoilHQLTRQRVLAVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0318546_1052325823300031771SoilMHTKNSNNKHNAARLRALAVLARMRSRAESLTQAARTERTTPRTVRRIIGKQLKRNASGHYSATSGDTLRRDLNVLGFDGYEPAVVRSWKRAQLAAEHLVAVNRFLRTGDSEWLKPFVGKRVGGVELLTDPQRLREFAEADLLKLDGLYRDNRGGTER
Ga0318568_1014657213300031819SoilSHAGRTGNRQALTRHQLTRQRVLAVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0318567_1039643923300031821SoilHAGRTGNRQALTRHQLTRQRVLAVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0306919_1069363713300031879SoilMWPLRVNAVVRNQGENVMTKNVSRPARSRALSVLARMRSRGESLSDAARSEGTTPRTVRKIVGKQLRRDKSGHYRATQGDTLRRDLNVLGFDGYEPVVVRSSRQAHLAAEHLVAVNRFLRTGDTEWLKPFAGKRSGGVELLTDPERLQILADADLVKLDGLYRQNRGQGQGN
Ga0318551_1058996113300031896SoilLTRHQLTRQRVLAVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0306921_1020768443300031912SoilMTKNVSRPARSRALSVLARMRSRGESLSDAARSEGTTPRTVRKIVGKQLRRDKSGHYRATQGDTLRRDLNVLGFDGYEPVVVRSSRQAHLAAEHLVAVNRFLRTGDTEWLKPFAGKRSGGVELLTDPERLQILADADLVKLDGLYRQNRGQGQGN
Ga0306921_1084530113300031912SoilMSGKSKEVNSLTRQRALAALARMRSRGESLSQAARIEHTTPKTVLKIVGKQLKRGTSRHYTATRGDTLRRDLNVLGFDGYQPVVVHSWKQAQLAAEHLVAVNRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLRTLAEADLVKLDGLYRQTRGQGREK
Ga0310909_1143262813300031947SoilMILRNMKRKRNQFQNPARLRALAVLARIRNRGESPTEAAALEHTTLRTVRKYVGKQLKRGPKGRYTATRSDTLRRDITVLGWDGYLPVVVRSSKQAQLASAHLVAVNRFLSPPGDLEWLKPFVGKRVGGVELLTDPERLRLFAEADL
Ga0306926_1123506913300031954SoilMWPLRVNAVVRNQGENVMTKNVSRPARSRALSVLARMRSRGESLSDAARSEGTTPRTVRKIVGKQLRRDKSGHYRATQGDTLRRDLNVLGFDGYEPVVVRSSRQAHLAAEHLVAVNRFLRTGDTEWLKPFAGKRSGGVELLTDPERLQILADADLVKLDGLYRQNRGQ
Ga0306926_1173938213300031954SoilMRTKNSKKKDNAARLRALAVLARMRSRAESLSQAARNERTTPGTVRKIVGKQLKRSASGQYRATRGDTLRRDLNVLGVDGYEPVVVRSSKQAHLAAEHLVAVGRFLRTGDTEWLKPFVGKRIGGIELLTDPDRLQILADADLVKVDGLYRRHRGDVGSSE
Ga0318530_1027520523300031959SoilKLTNLSISHAGRTGNRQALTRHQLTRQRVLAVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0307479_1001927563300031962Hardwood Forest SoilMTKKNSKSHHLSRQRSLAVLARVRRGESLSQATRNERTTRRTVRKHIGKQFKRDSSGRYRATRGDTLRRDLSVLGFDGYEPVVTRSSKHAHIAAEHLIAVGRFLRSGDTKWLKPFIGKRIGGVELLTDPERLEVLADADLVKLDALYRDHRGATEKSV
Ga0306922_1160459213300032001SoilGKSKEVNSLTRQRALAALARMRSRGESLSQAARIEHTTPKTVLKIVGKQLKRGTSRHYTATRGDTLRRDLNVLGFDGYQPVVVHSWKQAQLAAEHLVAVNRFLRTGDTEWLKPFVGKRVGGVELLTDPDRLRTLAEADLVKLDGLYRQTRGQGREK
Ga0306922_1184435413300032001SoilLRNMKRKRNQFQNPARLRALAVLARIRNRGESPTEAAALEHTTLRTVRKYVGKQLKRGPKGRYTATRSDTLRRDITVLGWDGYLPVVVRSSKQAQLASAHLVAVNRFLSPPGDLEWLKPFVGKRVGGVELLTDPERLRLFAEADLVKLDSLYRNNRGSNGDSV
Ga0310911_1071748413300032035SoilKKDNAARLRALAVLARMRSRAESLSQAARNERTTPGTVRKIVGKQLKRSASGQYRATRGDTLRRDLNVLGVDGYEPVVVRSSKQAHLAAEHHVAVGRFLRTGDTEWLKPFVGKRIGGIELLTDPDRLQILADADLVKVDGLYRRHRGDVGSSE
Ga0310911_1086235813300032035SoilMRHKQFQSPARLRALSVLARMRSRGETLTQAALAEHTTPRTVLKVVGKLLKRGASGRYSATRSDTFRRDISVLGYDGYVPVSVRSSKQARFASEHLIAVGRFLRPPGDPELLKPFVGKRVAGVELLTDPDRLGVLADAGLVKLDGLYRQNRGRGREK
Ga0318549_1014944823300032041SoilVLAVLARMRRGESLSQAARLEHTRPRTVLNLIGKQLKRGPTGHYVATSADTLRRDLNVLGFDGYEPVTVYSSKQAHLASEHLIAVGRFLRTGNFEWLKPFVGKRVALTDPDRLGVLADAGLVKLDGLYRQNRGQGLEK
Ga0318533_1084619723300032059SoilPRTVRKIVGKQLRRDKSGHYRATQGDTLRRDLNVLGFDGYEPVVVRSSRQAHLAAEHLVAVNRFLRTGDTEWLKPFAGKRSGGVELLTDPERLQILADADLVKLDGLYRQNRGQGQGN
Ga0306920_10137887923300032261SoilVSDRIHILRTMKDKNRFTNSARRRALAVLARVRSRGESLSQAARLERTTPQTVLKIIPKQFKRSSSGRYTATRGDTLRRDLSVLGFEGYVPVVVRSSDQAQVASAHLIAANRFLRTGDREWLKPFVRKRVGGVELLTDPDRLQILADADLVKLDGLYRQNRSQGQEN
Ga0306920_10170420113300032261SoilGGKCAYLEVPNGVSNLSRMKRKPSTNSVTLRDFAVLARMRRGESLSQAARNERTTPRSVRKHVGKQLTRGPTGRYVATSGDTLRRDLNVLGFDGYEPVVVRSSKQAHLAAEHLVAVNRFLRTGTTEWLKPFIGKRVGGVELLTDPDRLQILAAADLVKLDALYRTNPGGRREK
Ga0335085_10002183303300032770SoilMPRKIKRVSHLTRQRALAVLARMRSGGESLSQAARNERTTPLTVRKIVGKQLKRAPSGHYSASRGDTLRRDINVLGYEGYEPVVTRSSKQAQLASAHLVAASRYLRTGDTEWLKPFIGKRVGGVELLTDPDRLQILADADLVKLDGLYRTNRGGTEK
Ga0335085_1001022283300032770SoilMLDVSDRICILRNMKRKQFQNPARLRALAVLARVRSRGESPTEAAALEHTTLRTVRRYIGKQLRRGPTGRYTATRSDSLRRDLTVLGFDGYVSVVVRSSKQAQLASAHLVAVNRFLRPPGDSEWLKPFVGKRVGGVELLTDPERLQILADADQVKLDGLYRHNPAGREG
Ga0335082_1001231483300032782SoilMKRKQSQNPARLRALAVLARMRSRGESLTEAAALEHTTPRAVRKYIGKQLTRGPTGRYLATSGDMLRRDLAVLGFDGYEPAVVRSWKQAQLASAHLVAVNRYLKTGDTKWLKPFIGKRVGGVKLLTNPGRLQILADADLVKLDALYRNNSAGGREK
Ga0335079_1006455623300032783SoilMAKNIYRPARRRALAVLARMRRGESLSNATRAEHTTARTVRKLVGKQLRRGASGRYSATRGDTLRRDLTVLGFDGFEPVSVRSWKQAQLASEHLIAVNRFLRTGDPEWLKPFVGKRVGGVELLTNPERLYQFADADLIKLDSLYRDQRGTREEG
Ga0335078_1030793413300032805SoilVKGDKKVMAKNIYRPARRRALAVLARMRRGESLSNATRAEHTTARTVRKLVGKQLRRGASGRYSATRGDTLRRDLTVLGFDGFEPVSVRSWKQAQLASEHLIAVNRFLRTGDPEWLKPFVGKRVGGVELLTNPERLYQFADADLIKLDSLYRDQRGTREEG
Ga0335078_1066684713300032805SoilMATEKQFRSPARLRALRVLSRMPRGESLSQAARLEHTTARTVAKVLGRQLRRSASGRYSATEGDTLRRDLSVLGSEGYVPVAVRSSKQAQVASEHLVAVARYLRTGDSAWLRPFVGMRVGGVELLTDPDRLHELASAGLVQLDNLYRQNRGGGREKSA
Ga0335070_1198942413300032829SoilMKRKQFQNSARLRALAVLARIRTRRESPTEAAALEHTTLRTVRKHVGKHLRRGPTGRYTATRSDTLRRDLSVLSFDGYVPVVVRSWNQAQLASAHLIAVNRFLRTGDEDWIRPFIGKRVGGVELLTDLERLRE
Ga0335075_10002396143300032896SoilMRERNAKPQNAQESQGRERALAVLARMRSRGESLSKAARALRTTPRTVRKLVGSQLRRSASGRYSPTSSDRLKREIFVFGNDGYEPVTVHSSKRAQLASEHLIAINRFLRTGDTEWLKPFQQKRISGVELLTDPDRIREFAEADLVKLDGLYRDQRGQGYRK
Ga0310914_1059770723300033289SoilMHTKNSNNKHNAARLRALAVLARMRSRAESLTQAARTERTTPRTVRRIIGKQLKRNASGHYSATSGDTLRRDLNVLGFDGYEPAVVRSWKRAQLAAEHLVAVNRFLRTGDSEWLKPFVGKRVGGVELLTDPQRL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"