NMPFamsDB


A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome / Metatranscriptome Family F094458



Overview

Basic Information
Family ID F094458
Family Type Metagenome / Metatranscriptome
Number of Sequences 106
Average Sequence Length 121 residues
Representative Sequence MTTRLMRPLAFMGLLALSPSTSQAQEPVGATALRIQWESDQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGVEWVARPEAP
Number of Associated Samples 94
Number of Associated Scaffolds 106
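
The values above can be sanity-checked against the representative sequence itself. The following Python sketch is illustrative only and is not part of NMPFamsDB; the helper name `summarize_sequence` is hypothetical. It reports the sequence length and the most frequent residues using only the standard library.

```python
from collections import Counter

# Representative sequence of family F094458, copied verbatim from the record above.
REPRESENTATIVE = "MTTRLMRPLAFMGLLALSPSTSQAQEPVGATALRIQWESDQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGVEWVARPEAP"


def summarize_sequence(seq: str, top: int = 5):
    """Return (length, most common residues) for a protein sequence.

    Hypothetical helper for illustration; not an NMPFamsDB API.
    """
    seq = seq.strip().upper()
    counts = Counter(seq)
    return len(seq), counts.most_common(top)


if __name__ == "__main__":
    length, common = summarize_sequence(REPRESENTATIVE)
    print(f"length: {length} residues")
    for residue, n in common:
        print(f"{residue}: {n} ({100 * n / length:.1f} %)")
```

The representative is a single family member, so its length need not equal the reported family average exactly.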

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 76.42 %
% of genes near scaffold ends (potentially truncated) 23.58 %
% of genes from short scaffolds (< 2000 bps) 73.58 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.70
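
The percentages in this block are easier to interpret as gene counts. The sketch below assumes that each percentage is a fraction of the 106 family sequences reported under Basic Information; that denominator is an assumption, and `pct_to_count` is a hypothetical helper rather than anything provided by the database.

```python
FAMILY_SIZE = 106  # "Number of Sequences" from Basic Information (assumed denominator)


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage back to the nearest whole number of genes."""
    return round(pct / 100 * total)


# Percentages copied from the Quality Assessment block above.
quality = {
    "valid RBS motifs": 76.42,
    "near scaffold ends (potentially truncated)": 23.58,
    "from short scaffolds (< 2000 bps)": 73.58,
}

if __name__ == "__main__":
    for label, pct in quality.items():
        print(f"{label}: about {pct_to_count(pct)} of {FAMILY_SIZE} genes")
```

Under that assumption the three figures correspond to roughly 81, 25, and 78 genes, respectively.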

Hidden Markov Model
(HMM logo rendered with Skylign; not shown)

Most Common Taxonomy
Group Bacteria (68.868 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (17.924 % of family members)
Environment Ontology (ENVO) Unclassified (30.189 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (37.736 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 43.05%, Coil/Unstructured: 56.95%

Predicted 3D Structure

(Interactive structure viewer, PDBe Molstar; not shown)
Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.70

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
b.1.4.1: beta-Galactosidase/glucuronidase domain | d4cu7a2 | 4cu7 | 0.79591
b.1.4.1: beta-Galactosidase/glucuronidase domain | d2je8a1 | 2je8 | 0.73259
b.1.29.4: Complement C3 MG3-like | d2a73a3 | 2a73 | 0.72047
b.2.2.2: Cellulose-binding domain family III | d4b9fa_ | 4b9f | 0.7182
b.2.2.0: automated matches | d4b9pa_ | 4b9p | 0.71134
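
The structural matches can be filtered and grouped programmatically. The sketch below uses the common rule of thumb that a TM-score above 0.5 suggests a shared fold; that threshold is a general convention and not a cutoff documented by NMPFamsDB, and the helper names are hypothetical. SCOP identifiers have the form class.fold.superfamily.family, so the first two fields give the fold.

```python
# (SCOP family ID, SCOPe domain, representative PDB, TM-score), copied from the table above.
MATCHES = [
    ("b.1.4.1", "d4cu7a2", "4cu7", 0.79591),
    ("b.1.4.1", "d2je8a1", "2je8", 0.73259),
    ("b.1.29.4", "d2a73a3", "2a73", 0.72047),
    ("b.2.2.2", "d4b9fa_", "4b9f", 0.7182),
    ("b.2.2.0", "d4b9pa_", "4b9p", 0.71134),
]


def fold_level_matches(matches, threshold=0.5):
    """Keep matches whose TM-score exceeds the threshold, best first."""
    kept = [m for m in matches if m[3] > threshold]
    return sorted(kept, key=lambda m: m[3], reverse=True)


def scop_folds(matches):
    """Reduce SCOP family IDs such as 'b.1.4.1' to their fold, e.g. 'b.1'."""
    return sorted({".".join(m[0].split(".")[:2]) for m in matches})


if __name__ == "__main__":
    for family, domain, pdb, score in fold_level_matches(MATCHES):
        print(f"{domain} ({pdb}) {family} TM-score={score}")
    print("folds:", ", ".join(scop_folds(MATCHES)))
```

All five matches fall in SCOP class b (all-beta proteins), which is consistent with the predicted secondary structure reported above (no α-helix, 43.05% β-sheet).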



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 106 Family Scaffolds
PF04392 | ABC_sub_bind | 23.58
PF00999 | Na_H_Exchanger | 5.66
PF00072 | Response_reg | 2.83
PF01641 | SelR | 2.83
PF03992 | ABM | 2.83
PF02954 | HTH_8 | 1.89
PF00085 | Thioredoxin | 1.89
PF00196 | GerE | 0.94
PF09084 | NMT1 | 0.94
PF02720 | DUF222 | 0.94
PF02371 | Transposase_20 | 0.94
PF01717 | Meth_synt_2 | 0.94
PF13714 | PEP_mutase | 0.94
PF00589 | Phage_integrase | 0.94
PF00763 | THF_DHG_CYH | 0.94
PF07769 | PsiF_repeat | 0.94
PF04519 | Bactofilin | 0.94
PF01207 | Dus | 0.94
PF08734 | GYD | 0.94
PF06779 | MFS_4 | 0.94
PF00565 | SNase | 0.94
PF08334 | T2SSG | 0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 106 Family Scaffolds
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 23.58
COG0025 | NhaP-type Na+/H+ or K+/H+ antiporter | Inorganic ion transport and metabolism [P] | 5.66
COG0475 | Kef-type K+ transport system, membrane component KefB | Inorganic ion transport and metabolism [P] | 5.66
COG3004 | Na+/H+ antiporter NhaA | Energy production and conversion [C] | 5.66
COG3263 | NhaP-type Na+/H+ and K+/H+ antiporter with C-terminal TrkAC and CorC domains | Energy production and conversion [C] | 5.66
COG4651 | Predicted Kef-type K+ transport protein, K+/H+ antiporter domain | Inorganic ion transport and metabolism [P] | 5.66
COG0229 | Peptide methionine sulfoxide reductase MsrB | Posttranslational modification, protein turnover, chaperones [O] | 2.83
COG0042 | tRNA-dihydrouridine synthase | Translation, ribosomal structure and biogenesis [J] | 0.94
COG0190 | 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase | Coenzyme transport and metabolism [H] | 0.94
COG0620 | Methionine synthase II (cobalamin-independent) | Amino acid transport and metabolism [E] | 0.94
COG0715 | ABC-type nitrate/sulfonate/bicarbonate transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 0.94
COG1664 | Cytoskeletal protein CcmA, bactofilin family | Cytoskeleton [Z] | 0.94
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.94
COG4274 | Uncharacterized conserved protein, contains GYD domain | Function unknown [S] | 0.94
COG4521 | ABC-type taurine transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 0.94



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 68.87 %
Unclassified | root | N/A | 31.13 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300003319|soilL2_10059512 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2355
3300003324|soilH2_10039421 | All Organisms → cellular organisms → Bacteria | 5609
3300004156|Ga0062589_100615064 | Not Available | 946
3300004480|Ga0062592_100918309 | Not Available | 791
3300004643|Ga0062591_100850149 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 849
3300005176|Ga0066679_10664971 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 678
3300005295|Ga0065707_10309545 | Not Available | 987
3300005406|Ga0070703_10613595 | Not Available | 503
3300005440|Ga0070705_100463060 | All Organisms → cellular organisms → Bacteria | 954
3300005445|Ga0070708_101278617 | Not Available | 686
3300005467|Ga0070706_100173007 | All Organisms → cellular organisms → Bacteria | 2016
3300005536|Ga0070697_100740177 | All Organisms → cellular organisms → Bacteria | 868
3300005546|Ga0070696_100563103 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 914
3300005549|Ga0070704_100067033 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2590
3300006041|Ga0075023_100016560 | Not Available | 1990
3300006163|Ga0070715_10394883 | All Organisms → cellular organisms → Bacteria | 767
3300006358|Ga0068871_101520463 | Not Available | 633
3300006755|Ga0079222_11370236 | Not Available | 652
3300006804|Ga0079221_10618291 | All Organisms → cellular organisms → Bacteria | 735
3300006806|Ga0079220_10074795 | All Organisms → cellular organisms → Bacteria | 1681
3300006852|Ga0075433_10041346 | All Organisms → cellular organisms → Bacteria | 3994
3300006954|Ga0079219_11699588 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 585
3300007076|Ga0075435_100158567 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1905
3300009147|Ga0114129_10009833 | All Organisms → cellular organisms → Bacteria | 13640
3300009147|Ga0114129_10083869 | All Organisms → cellular organisms → Bacteria | 4425
3300009147|Ga0114129_10529688 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1535
3300009148|Ga0105243_12638985 | Not Available | 542
3300009162|Ga0075423_10063324 | All Organisms → cellular organisms → Bacteria | 3830
3300010397|Ga0134124_11118872 | Not Available | 805
3300010399|Ga0134127_10744139 | Not Available | 1024
3300011120|Ga0150983_12710186 | All Organisms → cellular organisms → Bacteria | 832
3300012202|Ga0137363_10885418 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 758
3300012203|Ga0137399_11301377 | Not Available | 611
3300012922|Ga0137394_10091132 | All Organisms → cellular organisms → Bacteria | 2559
3300012922|Ga0137394_10545351 | Not Available | 983
3300012925|Ga0137419_11033104 | Not Available | 682
3300012927|Ga0137416_10384781 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1185
3300012929|Ga0137404_10320268 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1350
3300012929|Ga0137404_10572012 | All Organisms → cellular organisms → Bacteria | 1014
3300012930|Ga0137407_10071964 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2882
3300012931|Ga0153915_11357061 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 831
3300012931|Ga0153915_13458040 | Not Available | 511
3300012944|Ga0137410_10301043 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1270
3300014149|Ga0181613_1117958 | All Organisms → cellular organisms → Bacteria | 759
3300014968|Ga0157379_10086719 | All Organisms → cellular organisms → Bacteria | 2807
3300015245|Ga0137409_10391260 | Not Available | 1205
3300015264|Ga0137403_10228336 | Not Available | 1781
3300015264|Ga0137403_10414092 | All Organisms → cellular organisms → Bacteria | 1226
3300017792|Ga0163161_11265090 | Not Available | 640
3300017927|Ga0187824_10015147 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2247
3300017930|Ga0187825_10012534 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2839
3300017966|Ga0187776_10003188 | All Organisms → cellular organisms → Bacteria | 8381
3300017994|Ga0187822_10074633 | All Organisms → cellular organisms → Bacteria | 995
3300018000|Ga0184604_10191183 | All Organisms → cellular organisms → Bacteria | 697
3300018027|Ga0184605_10143992 | All Organisms → cellular organisms → Bacteria | 1070
3300018051|Ga0184620_10248082 | Not Available | 597
3300018054|Ga0184621_10047596 | All Organisms → cellular organisms → Bacteria | 1423
3300018061|Ga0184619_10192704 | All Organisms → cellular organisms → Bacteria | 936
3300018066|Ga0184617_1112464 | Not Available | 771
3300018071|Ga0184618_10092479 | All Organisms → cellular organisms → Bacteria | 1174
3300018071|Ga0184618_10280212 | Not Available | 708
3300018422|Ga0190265_10156580 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 2237
3300018422|Ga0190265_12715485 | All Organisms → cellular organisms → Bacteria | 591
3300018429|Ga0190272_10319268 | All Organisms → cellular organisms → Bacteria | 1216
3300019458|Ga0187892_10096226 | All Organisms → cellular organisms → Bacteria | 1785
3300019878|Ga0193715_1022617 | All Organisms → cellular organisms → Bacteria | 1365
3300019879|Ga0193723_1007549 | Not Available | 3580
3300019881|Ga0193707_1047323 | All Organisms → cellular organisms → Bacteria | 1376
3300019882|Ga0193713_1012284 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 2585
3300019883|Ga0193725_1099043 | All Organisms → cellular organisms → Bacteria | 691
3300019886|Ga0193727_1150582 | Not Available | 633
3300020004|Ga0193755_1019958 | All Organisms → cellular organisms → Bacteria | 2213
3300020004|Ga0193755_1224118 | Not Available | 521
3300020579|Ga0210407_10023475 | All Organisms → cellular organisms → Bacteria | 4583
3300021078|Ga0210381_10148962 | All Organisms → cellular organisms → Bacteria | 793
3300021168|Ga0210406_10828238 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 702
3300021432|Ga0210384_10078978 | All Organisms → cellular organisms → Bacteria | 2958
3300025324|Ga0209640_10294626 | Not Available | 1357
3300025885|Ga0207653_10294836 | Not Available | 628
3300025906|Ga0207699_10102336 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1820
3300025910|Ga0207684_10016294 | All Organisms → cellular organisms → Bacteria | 6382
3300025923|Ga0207681_11566160 | Not Available | 552
3300026320|Ga0209131_1014050 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 4922
3300027181|Ga0208997_1068783 | Not Available | 530
3300027651|Ga0209217_1022482 | All Organisms → cellular organisms → Bacteria | 2013
3300027725|Ga0209178_1421710 | Not Available | 510
3300027727|Ga0209328_10211100 | Not Available | 585
3300027765|Ga0209073_10321657 | All Organisms → cellular organisms → Bacteria | 618
3300028047|Ga0209526_10629841 | All Organisms → cellular organisms → Bacteria | 683
3300028536|Ga0137415_10271727 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1501
3300028771|Ga0307320_10279518 | All Organisms → cellular organisms → Bacteria | 661
3300028784|Ga0307282_10012811 | All Organisms → cellular organisms → Bacteria | 3397
3300028792|Ga0307504_10216641 | Not Available | 686
3300028828|Ga0307312_10743224 | All Organisms → cellular organisms → Bacteria | 650
3300030006|Ga0299907_10396913 | All Organisms → cellular organisms → Bacteria | 1109
3300030619|Ga0268386_10199047 | Not Available | 1501
3300031114|Ga0308187_10498260 | All Organisms → cellular organisms → Bacteria | 501
(restricted) 3300031150|Ga0255311_1012215 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1732
3300031720|Ga0307469_10004189 | All Organisms → cellular organisms → Bacteria | 6228
3300031720|Ga0307469_10066024 | All Organisms → cellular organisms → Bacteria | 2369
3300031740|Ga0307468_100276458 | Not Available | 1201
3300031820|Ga0307473_10320950 | All Organisms → cellular organisms → Bacteria | 983
3300032180|Ga0307471_100066402 | All Organisms → cellular organisms → Bacteria | 3083
3300032180|Ga0307471_100089875 | All Organisms → cellular organisms → Bacteria | 2741
3300032180|Ga0307471_102371537 | All Organisms → cellular organisms → Bacteria | 670
3300032421|Ga0310812_10099079 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1196

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
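
Each scaffold identifier above has the form `<number>|<scaffold name>`. The numeric prefix appears to be the IMG/M Taxon OID of the sample the scaffold came from, since the same numbers are listed under Associated Samples; this is inferred from the values in this record, not from documentation. The sketch below, with the hypothetical helper `split_scaffold_id`, splits identifiers and counts scaffolds per sample, which shows how 106 scaffolds can come from 94 samples.

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str):
    """Split 'TAXON_OID|SCAFFOLD_NAME' into (taxon_oid, scaffold_name).

    A leading '(restricted)' marker, as used in this record, is dropped.
    Hypothetical helper for illustration; not an NMPFamsDB or IMG/M API.
    """
    cleaned = scaffold_id.replace("(restricted)", "").strip()
    taxon_oid, _, scaffold_name = cleaned.partition("|")
    return taxon_oid, scaffold_name


# A few identifiers copied from the Associated Scaffolds table.
scaffolds = [
    "3300003319|soilL2_10059512",
    "3300009147|Ga0114129_10009833",
    "3300009147|Ga0114129_10083869",
    "3300009147|Ga0114129_10529688",
    "(restricted) 3300031150|Ga0255311_1012215",
]

# Number of family scaffolds contributed by each sample.
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffolds)

if __name__ == "__main__":
    for taxon_oid, n in per_sample.most_common():
        print(taxon_oid, n)
```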




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 17.92%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 13.21%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 10.38%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 7.55%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.60%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 5.66%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 5.66%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.72%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 2.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.83%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 1.89%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.89%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 1.89%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.94%
Anoxic, Neutral-Ph, Fe/Si-Rich Hot Spring Water | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Neutral → Anoxic, Neutral-Ph, Fe/Si-Rich Hot Spring Water | 0.94%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.94%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.94%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.94%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.94%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.94%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.94%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 0.94%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 0.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.94%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.94%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.94%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300003319 | Sugarcane bulk soil Sample L2 | Environmental
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental
3300006041 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 | Environmental
3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental
3300006358 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 | Host-Associated
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300014149 | In situ water column microbial community from the vent pool of Chocolate Pots hot spring, Yellowstone National Park, Wyoming, USA - CP Vent Pool | Environmental
3300014968 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaG | Host-Associated
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300017792 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaG | Host-Associated
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental
3300017930 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5 | Environmental
3300017966 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MG | Environmental
3300017994 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2 | Environmental
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018027 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex | Environmental
3300018051 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1 | Environmental
3300018054 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1 | Environmental
3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental
3300018066 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300019458 | Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaG | Environmental
3300019878 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2m2 | Environmental
3300019879 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2 | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental
3300019882 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a2 | Environmental
3300019883 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2 | Environmental
3300019886 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300021078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redo | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300025324 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes) | Environmental
3300025885 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes) | Environmental
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025923 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes) | Host-Associated
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental
3300027181 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M1 (SPAdes) | Environmental
3300027651 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) | Environmental
3300027725 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes) | Environmental
3300027727 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M1 (SPAdes) | Environmental
3300027765 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028771 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369 | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300030619 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq) | Environmental
3300031114 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome) | Environmental
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032421 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3 | Environmental

Geographical Distribution
(Interactive map; powered by OpenStreetMap)




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
soilL2_100595124 | 3300003319 | Sugarcane Root And Bulk Soil | MRALRYLASFGLLALSASASQAQQSAGATPLRIVWESAAPAHGLQAVCGRVFNDGPVDARRVRVRVEGLNERGEVTSRRDGDVLGQVSSRGIGRFCLTMAAGATSYRVTIVLAEWAAAPESP*
soilH2_100394219 | 3300003324 | Sugarcane Root And Bulk Soil | MRALRYLASFGLLALSASASQAQQSAGATPLRIVWESDAPAHGLQAVCGRVFNDGPVDARRVRVRVEGLNERGEVTSRRDGDVLGQVSSRGIGRFCLTMAAGATSYRVTIVLAEWAAAPESP*
Ga0062589_1006150642 | 3300004156 | Soil | MTTRLMRSLAFMGLLTLSPWTSQAQEAAGLRMQWEVDPPASGLQTVCGRVFNDRPVEARRVRIRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGAATYRATIVGVEWVAGPESP
Ga0062592_1009183092 | 3300004480 | Soil | MTTRLMRSLAFMGLLTLSPWTSQAQEAAGLRMQWEVDPPASGLQTVCGRVFNDRPVEARRVRVRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGAAAYRVTIVGVEWVAGPESP
Ga0062591_1008501491 | 3300004643 | Soil | RLMRSLAFMGLLTLSPWTSQAQEAAGLRMQWEVDPPASGLQTVCGRVFNDRPVEARRVRIRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGAATYRATIVGVEWVAGPESP*
Ga0066679_106649711 | 3300005176 | Soil | MDRRVFISAFMGLLALFSSTSQAQERVGASGLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGAADYRVSILNAEWAVAPESP*
Ga0065707_103095451 | 3300005295 | Switchgrass Rhizosphere | VTTRLMRHLAFMGLLALYPSTGQAQEPVGATALRIQWEPAQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGAVTGRRDGDVGQVWSRSIGRFCLTMSAGAASYRVTIVGAEWVAGPEAP*
Ga0070703_106135951 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | SPAQERVGASPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVPSKGLGRFCLTMAAGAADYRVTIINAEWVVTPEAP*
Ga0070705_1004630601 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MQHLAFMCLLALSSSTSQAQERVGASPPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSKGLGRFCLTMAAGAADYRVTIVNAEWVVTPEA
Ga0070708_1012786171 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MRSMRHLAFMGLLTLFSSPSPAQERVGASPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVPSKGLGRFCLTMAAGAADYRVTIINAEWVVTPEAP*
Ga0070706_1001730073 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MRHLAFMGLLTLFSSPSPAQERVGASPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVPSKGLGRFCLTMAAGAADYRVTIINAEWVVTPEAP*
Ga0070697_1007401772 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MNRRDARMRSMRHLAFMGLLTLFSSPSPAQERVGASPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVPSKGLGRFCLTMAAGAADYRVTIINAEWVVTPEAP*
Ga0070696_1005631032 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | MQHLAFVCLLALSSSTSQAQERVGASPPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSKGLGRFCLTMAAGAADYRVTIVNAEWVVTPEAP
Ga0070704_1000670333 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | VTTRLMRHLAFMGLLALYPLTSQAQEPVGATALRIQWEPAQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPEAP*
Ga0075023_1000165602 | 3300006041 | Watersheds | VTTRLMRHLAFMSLLALYPSTSQAQAPVGATALRIQWEPDQPVSGLQAVCGRVFNEGPVDARHVRVRVEGLDERGGVTGRRDGDVGQVWFRSIGRFCLTMSAGAATYRVTIVGADWVAGPEAP*
Ga0070715_103948832 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | MDRRVFISAFMGLLALFSSTSQAQERVGASGLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGAADYRVTIVNAEWVVTPEAP*
Ga0068871_1015204632 | 3300006358 | Miscanthus Rhizosphere | VTTRLMRHLAFMGLLALYPSTSQAQAPVGVTALRIQWESDQPVSGLQAVCGRVFNEGPVDARHVRVRVEGLDERGGVTGRRDGDVGRVWSRSIGRFCLTMPAGAATYRVTIVGADWVAGPEAP*
Ga0079222_113702362 | 3300006755 | Agricultural Soil | MRSMRHLAFMSLLAVLSSTTQAQERVGGSPLRIAWELDAPAQGFQAVCGRVINDGPVDAKRVRVRVEGVDERGGVITRRDGDVLGQVSSKGLGRFCITMAAGAADYRVTILNAEWVVAPESP*
Ga0079221_106182912 | 3300006804 | Agricultural Soil | MRALRYLASFGLLVLSASASQGQQSAGATPLRVIWESDPPAYGLQAVCGRVFNDGPVDARRVRVRVEGLNERGDVTSRRDGDVLGQVSSRGIGRFCLTMAAGAASYRVTIVLAEWAAAPESP*
Ga0079220_100747952 | 3300006806 | Agricultural Soil | MRHVAFVGLLAVSPSTGQAQEPVGATALRVRWEADAPAYGLQAVCGRVFNDGPVDARRIRVRVEGLDERGAVTARRDGDIVGQVSSRGIGRFCLTMSAGAVTYRVTIVGAEWVAGPEAP*
Ga0075433_100413463 | 3300006852 | Populus Rhizosphere | MDRRAFISAFMGLLALCSSTSQAQDRVGASPLRIAWEADAPAQGFQAVCGRVFNDGSVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGPANYRVTILNAEWAVAPETP*
Ga0079219_116995882 | 3300006954 | Agricultural Soil | MRSMRHLAFMSLLAVLSSTTQAQERVGGSPLRIAWELDAPAQGFQAVCGRVINDGPVDAKRVRVRVEGVDERGGVITRRDGDVLGQVSSKGLGRFCITMAAGAADYRVTILNAEWVVAPESR*
Ga0075435_1001585671 | 3300007076 | Populus Rhizosphere | SAFMGLLALCSSTSQAQDRVGASPLRIAWEADAPAQGFQAVCGRVFNDGSVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGPANYRVTILNAEWAVAPETP*
Ga0114129_1000983311 | 3300009147 | Populus Rhizosphere | MRHLAFMSLLAVLSSTTQAQERVGGSPLRIAWELDAPAQGFQAVCGRVINDGPVDAKRVRVRVEGVDERGGVITRRDGDVLGQVSSKGLGRFCITMAAGAADYRVTILNAEWVVAPESP*
Ga0114129_100838693 | 3300009147 | Populus Rhizosphere | MDRRALISAFMGLLALCSSTSQAQDRVGASPLRIAWEADAPAQGFQAVCGRVFNDGSVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGPANYRVTILNAEWAVAPETP*
Ga0114129_105296883 | 3300009147 | Populus Rhizosphere | MHVTMRLMRHLAFMGLLALYPSTSQAQEPVGATALRIQWESAQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPEAP*
Ga0105243_126389851 | 3300009148 | Miscanthus Rhizosphere | MRSLAFMGLLTLSPWTSQAQEAAGLRMQWEVDPPASGLQTVCGRVFNDRPVEARRVRIRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGAATYRATIVGV
Ga0075423_100633241 | 3300009162 | Populus Rhizosphere | MDRRAFISAFMGLLALCSSTSQAQDRVGASPLRIAWEADAPAQGFQAVCGRVFNDGSVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGPANYRVTILNAEWA
Ga0134124_111188722 | 3300010397 | Terrestrial Soil | MRSLAFMGLLTLSPWTSQAQEAAGLRMQWEVDPPASGLQTVCGRVFNDRPVEARRVRVRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGAATYRATIVGVEWVAGPESP*
Ga0134127_107441392 | 3300010399 | Terrestrial Soil | MGLLTLSPWTSQAQEAAGLRMQWEVDPPASGLQTVCGRVFNDRPVEARRVRVRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGAATYRATIVGVEWVAGPESP*
Ga0150983_127101862 | 3300011120 | Forest Soil | MDRRVFISAFMGLLALFSSTSQAQERVGASGLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGAADYRVTILNAEWAVAPESP*
Ga0137363_108854181 | 3300012202 | Vadose Zone Soil | RVFISAFMGLLALFSSTSQAQERVGASGLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGAADYRVSILNAEWAVAPESP
Ga0137399_113013771 | 3300012203 | Vadose Zone Soil | SQAQEPVGATALRIHWEVDPPVHGLQTVCGRVFNDGPVDARRVRVRVEGLDERGGVTQRRDGDVGQVSSRSIGRFCLTMSAGAATHRVTIVGVEWVAMPEAP*
Ga0137394_100911322 | 3300012922 | Vadose Zone Soil | MTTRLMRPLAFMGLLALSPSTSQAQEPVGATALRIHWEVDPPVHGLQTVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGADWVAGPQAP*
Ga0137394_105453512 | 3300012922 | Vadose Zone Soil | VTDARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDQPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVSSRSLGRFCLTMSAGAATYRVTIVGAEWVAGPEAP*
Ga0137419_110331042 | 3300012925 | Vadose Zone Soil | MITRLMRPLAFMGLLALSPSTSQAQEPVGATALRIHWEVDPPVHGLQTVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVCSRSIGRFCLTMSAGAATHRVTIVGADWVAGPQAP*
Ga0137416_103847812 | 3300012927 | Vadose Zone Soil | MTDARMTMRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDQPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTQRRDGDVGQVSSRSIGRFCLTMSAGAATHRVTIVGVEWVARPEAP*
Ga0137404_103202682 | 3300012929 | Vadose Zone Soil | VTMRLMRHLAFMGLLALYPSTSQAQEPVGATALRIQWEPAQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGAVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPEAP*
Ga0137404_105720122 | 3300012929 | Vadose Zone Soil | MRLLAFMGLLTLSPWTGQAQEATGLRMQWEVDPPVSGLQTVCGRVFNDRPVDAKRVRVRVEGLDERGDVTRRREGDVGLVWSKGIGRFCLTTSAGAATYRVTIVGVEWVAGPESP*
Ga0137407_100719643 | 3300012930 | Vadose Zone Soil | MHVTMRLMRHLAFMGLLALYPSTSQAQEPVGATALRIQWEPAQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGAVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPEAP*
Ga0153915_113570611 | 3300012931 | Freshwater Wetlands | LGTAVLLASLPSTSRAQAPVDASALRVRWEADPPASGLQAVCGRVFNDSRTDARRVRVRVEGLDERAGVVTRRDGDVLGQVSTRGLGRFCLTMAAGAASYRVTIIGVEWAAAPESP*
Ga0153915_134580401 | 3300012931 | Freshwater Wetlands | RRVRLGQLGSDARMTTRLMRRLAFVGLLTLSPWTSRAQEPVGATGLRILWEVDPPAQGLQTVCGRVVNDRAVDARRVRVRVEGLDERGAVTGSRDGDVVGQVSSRGIGRFCLTMSAGAVTYRVTIVGVEWAGPEAP*
Ga0137410_103010432 | 3300012944 | Vadose Zone Soil | MTTRLMRPLAFMGLLALSPSTSQAQEPVGATALRIHWEVDPPVHGLQTVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRGIGRFCLTMSAGAATHRVTIVGADWVAGPQAP*
Ga0181613_11179581 | 3300014149 | Anoxic, Neutral-Ph, Fe/Si-Rich Hot Spring Water | STSQAQEPAGATTLRIQWEVDQPVSGLQTVCGRAFNDGPVDASRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGVEWAARPEAP*
Ga0157379_100867194 | 3300014968 | Switchgrass Rhizosphere | MTTRLMRSLAFMGLLTLSPWTSQAQEAAGLRMQWEVDPPASGLQTVCGRVFNDRPVEARRVRVRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGA
Ga0137409_103912602 | 3300015245 | Vadose Zone Soil | MTTRLMRPLAFMGLLALSPSTSQAQEPVGATALRIHWEVDPPVHGLQTVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRGIGRFCLTMSAGAATHRVTIVGADWVAGP
Ga0137403_102283362 | 3300015264 | Vadose Zone Soil | VTMRLMRHLAFMGLLALYPSTSQAQEPVGATALRIQWEPAQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGADWVAGPQAP*
Ga0137403_104140921 | 3300015264 | Vadose Zone Soil | MRPLAFMGLLALSPSTSQAQEPVGATALRIHWEVDPPVSGLQTVCGRVFNDRPVDAKRVRVRVEGLDERGDVTRRREGDVGLVWSKGIGRFCLTTSAGATTYRVTIVGVEWVAGPESP*
Ga0163161_112650902 | 3300017792 | Switchgrass Rhizosphere | LMRSLAFMGLLTLSPWTSQAQEAAGLRMQWEVDPPASGLQTVCGRVFNDRPVEARRVRIRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGAATYRATIVGVEWVAGPESP
Ga0187824_100151472 | 3300017927 | Freshwater Sediment | MRHVAFVGLLAVSPSTGQAQEPVGATALRVRWEADAPAYGLQAVCGRVFNDGPVDARRIRVRVEGLDERGVVTARRDGDVVGQIFSRGVGRFCLTMSAGAVTYRVTIVGAEWVVGPEAP
Ga0187825_100125343 | 3300017930 | Freshwater Sediment | MRHVAFVGLLAVSPSTGQAQEPVGATALRVRWEADAPAYGLQAVCGRVFNDGPVDARRIRVRVEGLDERGAVTARRDGDIVGQVSSRGIGRFCLTMSAGAVTYRVTIVGAEWVVGPEAP
Ga0187776_1000318810 | 3300017966 | Tropical Peatland | MTTRLPLAFMGLLALSPSTSQAQEPVGATAMRITWEADPPVSGLQAVCGRVVNDGPVDARGVRVRVEGLDERGGVIGRRDGDVGRVWSRSIGRFCLTMSAGAATYRVTIVGVDWVAGPEA
Ga0187822_100746331 | 3300017994 | Freshwater Sediment | MRHVAFMGLLALSSSTGQAQEPVGATALRVRWEADAPAYGLQAVCGRVFNDGPVDARRIRVRVEGLDERGAVTARRDGDIVGQVSSRGIGRFCLTMSAGAVTYRVTIVGAEWVVGPEAP
Ga0184604_101911832 | 3300018000 | Groundwater Sediment | MTNARMTTRLMRPLAFMSLLALCPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVLNDGPVDAKRVRVRVEGLDERGGVTGRRDGDVLGRVSSRGIGRFCLTMSAGAATHRVTIVGIEWVARPEAP
Ga0184605_101439922 | 3300018027 | Groundwater Sediment | MTNARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDPPVSGLQTVCGRVLNDGPMDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGADWVAGPEAP
Ga0184620_102480822 | 3300018051 | Groundwater Sediment | MTNARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPEAP
Ga0184621_100475963 | 3300018054 | Groundwater Sediment | MTNARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGAVTGRRDGDVGQVWSRSIGRFCLTMSAGATTHRVTIVGVEWVARPEAP
Ga0184619_101927041 | 3300018061 | Groundwater Sediment | MTNARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGATTHRVTIVGVEWVARPEAP
Ga0184617_11124642 | 3300018066 | Groundwater Sediment | MTMRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGVEWVARPEAP
Ga0184618_100924792 | 3300018071 | Groundwater Sediment | MHVTTRLMRHLAFMGLLALYPSTSQAQEPVGATALRIRWESDPPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGADWVAGPEAP
Ga0184618_102802122 | 3300018071 | Groundwater Sediment | DPAVPPAAHGPGDRMTNARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVLNDGPVDAKRVRVRVEGLDERGGVTGRRDGDVLGRVSSRGIGRFCLTMSAGAATHRVTIVGTDWVAGPEAP
Ga0190265_101565802 | 3300018422 | Soil | MPTRQIRHLAFVGLLALSPSTSQAQDPMGALRIQWEADQPVSGLQTVCGRVFNDGPVDARRVRVRVEALDERGGVTGRRDGDAGQVWSRGIGRFCLTMSARAASYRATIVGVDWVAGPEA
Ga0190265_127154851 | 3300018422 | Soil | AFLGTLAGGLLAAPMRMTTRLMRHLLIAFMGLLALSPSTSQAQEATDLRIRWEVDQPAHGLQTVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVIGQVTSRGGGLFCLTTSAGATTYRVTIVGVEWAARPEGP
Ga0190272_103192682 | 3300018429 | Soil | MTTRLIRHLAVMGLLALSPSTSQAQDSVGATALRIRWEADPPVSGLQTVCGRVFNDGPVDARRVRVRVEGMDERGGVTGRRDGDVGQVWSRNIGRFCLTMSAGAATYQVTIVGVEWVAGTQAP
Ga0187892_100962262 | 3300019458 | Bio-Ooze | MTTRLIRYVAFMGLLTLPPSTSQAQAPVEATSLRVRWEAAAPAHGLQTVCGRVLNDRTVGARRVRLRVEGLDAGGGVTGRRDADLPGQVSAQGIGLFCITMSAGAASYRVTVVGVDWAAEPEAP
Ga0193715_10226173 | 3300019878 | Soil | VIMTNARMTTRLMRPLAFMSLLALCPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGATTHRVTIVGVEWVARPEAP
Ga0193723_10075493 | 3300019879 | Soil | MTTRLIRHLAFMGVLALSPSTTQAQDSVGGTALRIRWEADPPVSGLQTVCGRVFNDALVDVRRVRVRVEGLDERGGVTTRRDGDVGQIWSRSAGRFCLAMSAGAATYRVTIVAADWVAEPQGP
Ga0193707_10473231 | 3300019881 | Soil | DPAVAPAAGGSGDRVTDARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDQPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPQAP
Ga0193713_10122842 | 3300019882 | Soil | MDARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDQPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVAAEWVAGPQAP
Ga0193725_10990432 | 3300019883 | Soil | MTDARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWESDQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGVVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGADWVAGPEAP
Ga0193727_11505822 | 3300019886 | Soil | FMSLLALSPSTSQAQEPVGATALRIQWEVDQPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGVEWVARPEAP
Ga0193755_10199583 | 3300020004 | Soil | MTTRLIRHLAFMGVLALSPSTTQAQDSVGGTALRIRWEADPPVSGLQTVCGRVFNDALVDVRRVRVRVEGLDERGGVTTRRDGDVGQIWSRSIGRFCLAMSAGAATYRVTIVAADWVAEPQGP
Ga0193755_12241181 | 3300020004 | Soil | GGSGDRMTDARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPQAP
Ga0210407_100234756 | 3300020579 | Soil | MDRRVFISAFMGLLALFSSTSQAQERVGASGLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGAADYRVTILNAEWAVAPESP
Ga0210381_101489622 | 3300021078 | Groundwater Sediment | MTNARMTTRLMRPLAFMSLLALCPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGATTHRVTIVGVEWVARPEAP
Ga0210406_108282382 | 3300021168 | Soil | FALAAGGSGDRVMDRRVFISAFMGLLALFSSTSQAQERVGASGLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGAADYRVTILNAEWVVAPESP
Ga0210384_100789783 | 3300021432 | Soil | MDRRVFISAFMGLLALFSWTSQAQERVGASGLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGAADYRVTILNAEWVVAPESP
Ga0209640_102946262 | 3300025324 | Soil | VPTRLIRHLAFMGFLALNPSTSQAQEPVGTTALRIRWEADQPVSGLQTVCGRVFNDGPVDARHVRVRVEGLDERGGVTGRRDGDVGRVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPQAP
Ga0207653_102948361 | 3300025885 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTRLMRSLAFMGLLTLSPWTSQAQEAAGLRMQWEVDPPASGLQTVCGRVFNDRPVEARRVRVRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGAATYRATIVGVEWVAGPESP
Ga0207699_101023362 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | MRSMQHLAFMCLLALSSSTSQAQERVGASPPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSKGLGRFCLTMAAGAADYRVTIVNAEWVVTPEAP
Ga0207684_100162941 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MRHLAFMGLLTLFSSPSPAQERVGASPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVPSKGLGRFCLTMAAGAADYRVTIINAEWVVTPEAP
Ga0207681_115661601 | 3300025923 | Switchgrass Rhizosphere | LSPWTSQAQEAAGLRMQWEVDPPASGLPTVCGRVFNDRPVEARRVRVRVEGLDERGDVTRRRDGDVGLIWSKGIGRFCLAMSAGAAAYRVTIVGVEWVAGPESP
Ga0209131_10140505 | 3300026320 | Grasslands Soil | MTTRLMRLLAFMGLLTLSPWTGQAQEATGLRMQWEVDPPVSGLQTVCGRVFNDRPVDAKRVRVRVEGLDERGDVTRRREGDVGLVWSKGIGRFCLTTSAGAATYRVTIVGVEWVAGPESP
Ga0208997_10687831 | 3300027181 | Forest Soil | MTTRLMRPLAFMGLLALSPSTSQAQEPVGATALRIQWESDQPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGVEWVARPEAP
Ga0209217_10224823 | 3300027651 | Forest Soil | MRHLAFTSLLALLSSTSQAQERVGASPLRIAWESDPPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRREGDVLGQVSSKGLGRFCLTMAAGAATFRVTIVNAEWVVTPEAP
Ga0209178_14217101 | 3300027725 | Agricultural Soil | QERVGGSPLRIAWELDAPAQGFQAVCGRVINDGPVDAKRVRVRVEGVDERGGVITRRDGDVLGQVSSKGLGRFCITMAAGAADYRVTILNAEWVVAPESP
Ga0209328_102111001 | 3300027727 | Forest Soil | MRSMRHLAFTSLLALLSSTSQAQERVGASPLRIAWESDPPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRREGDVLGQVSSKGLGRFCLTMAAGAATFRVTIVNAEWVVTPEAP
Ga0209073_103216571 | 3300027765 | Agricultural Soil | MRALRYLASFGLLVLSASASQGQQSAGATPLRVIWESDPPAYGLQAVCGRVFNDGPVDARRVRVRVEGLNERGDVTSRRDGDVLGQVSSRGIGRFCLTMAAGAASYRVTIVLAEWAAAPESP
Ga0209526_106298412 | 3300028047 | Forest Soil | MSRRYARMRSMRHLAFTSLLALLSSTSQAQERVGASPLRIAWESDPPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRREGDVLGQVSSKGLGRFCLTMAAGAATFRVTIVNAEWVVTPEAP
Ga0137415_102717272 | 3300028536 | Vadose Zone Soil | VTDARMTTRLMRPLAFMSLLALSPSTSQAQEPVGATALRIQWEVDQPVSGLQAVCGRVLNDGPVDARRVRVRVEGLDERGGVTQRRDGDVGQVSSRSIGRFCLTMSAGAATHRVTIVGVEWVAMPEAP
Ga0307320_102795181 | 3300028771 | Soil | MSLLALSPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGIEWVARPEAP
Ga0307282_100128112 | 3300028784 | Soil | MTNARMTTRLMRPLAFMSLLALCPSTSQAQEPAGATALRIQWEVDPPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGIEWVARPEAP
Ga0307504_102166411 | 3300028792 | Soil | MTTRLIRHLAFMGVLALSPSTSQAQDSVGGTALRILWEADPPVSGLQTVCGRVFNDALVDARRVRVRVEGLDERGGVTARREGDVGQIWSRSAGRFCLAMSAGAATYRVTIVAADWVAAPQSP
Ga0307312_107432242 | 3300028828 | Soil | TSQAQEPAGATALRIQWEVDPPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATHRVTIVGIEWVARPEAP
Ga0299907_103969132 | 3300030006 | Soil | MTTRLMRPLAFMGLLALSPSTSQAQEPVGATALRIQWEADPPAHGLQTVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGEVLGQVWSRGIGLFCLTTSAGAATHRVTIVGVEWVARPEAP
Ga0268386_101990472 | 3300030619 | Soil | VGATALRIRWEVDQPVSGLQTVCGRVFNDGPVDARHVWVRVEGLDERGGVTGRRDGDVGRVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPQAP
Ga0308187_104982601 | 3300031114 | Soil | AGASSGVMHVTMRLMRHLAFMGLLALYPSTSQAQEPVGATALRIQWEVDPPVSGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTGRRDGDVGQVWSRSIGRFCLTMSAGAATYRVTIVGAEWVAGPQSP
(restricted) Ga0255311_10122152 | 3300031150 | Sandy Soil | MTTRVMRPLAFMGLLTFSPWTSQAQEATALRIQWEVDPPASGLQTVCGRVFNDAAVDARRVRVRVEGLDERGDVTRRRDGDVGLVWSKGIGRFCLTTSAGAATYRVAIVGVEWVAGPEAP
Ga0307469_1000418911 | 3300031720 | Hardwood Forest Soil | MQHLAFMCLLALSSSTSQAQERVGASPPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSKGLGRFWLTMAAGAADYRVTIVNAEWVVAPEAP
Ga0307469_100660242 | 3300031720 | Hardwood Forest Soil | MDRRLFISAFMGLLALFSSTSQAQERVGASGLRIAWEVDAPAQGFRAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSRGLGRFCLTMAAGAADYRVTILNAEWAVAPESP
Ga0307468_1002764582 | 3300031740 | Hardwood Forest Soil | MTTRLIRHLAFMGVLALSPSTSQAQDSVGGTALRIQWEVDPPVSGLQTVCGRVLNDAPVAARRVRVRVEGLDERGGVTTRREGDVGQIWSRSIGRFCLAMSAGAATYRVTIVAADWVADPQGP
Ga0307473_103209501 | 3300031820 | Hardwood Forest Soil | MRHLAFMGLLALFSSPSPAQERVGASPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSKGLGRFCLTMAAGAADYRVTIINAEWVVTPEAP
Ga0307471_1000664023 | 3300032180 | Hardwood Forest Soil | MTTRLIRHLAFMAVLALSPSTSQAQDSVGGTALRIQWEVDPPVSGLQTVCGRVLNDAPVAARRVRVRVEGLDERGGVTTRREGDVGQIWSRSIGRFCLAMSAGAATYRVTIVAADWVADPQGP
Ga0307471_1000898752 | 3300032180 | Hardwood Forest Soil | MDRRDARMRSMRHLAFMGLLALFSSTSQAQERVGASALRIAWESDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGSAITRREGDVLGQVSSKGLGRFCLTMAAGAANYRVTIVNAEWVVPPEAP
Ga0307471_1023715371 | 3300032180 | Hardwood Forest Soil | MNGRDARMRSMRHLAFMGLLALFSSPSPAQERVGASPLRIAWELDAPAQGFQAVCGRVFNDGPVDARRVRVRVEGLDERGGVITRRDGDVLGQVSSKGLGRFCLTMAAGAADYRVT
Ga0310812_100990791 | 3300032421 | Soil | MSTRRMRHVAFMGLLALSPSTGQAQEPVGASALRIRWEVDAPAYGLQAVCGRVFNDGPVDARRVRVRVEGLDERGGVTARRDGDVVGQVSSRGIGRFCLTMSAGAAAYRVTIVGAEWVAAPEAP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice