NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F080441


Overview

Basic Information
Family ID F080441
Family Type Metagenome / Metatranscriptome
Number of Sequences 115
Average Sequence Length 114 residues
Representative Sequence MSQATMRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVVALDDYRSDAATVGMAVVSACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRAARRS
Number of Associated Samples 92
Number of Associated Scaffolds 115
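
As a quick sanity check, the representative sequence can be compared against the family's reported average length of 114 residues. The following is a minimal Python sketch; the helper name `summarize_sequence` is hypothetical and not part of NMPFamsDB.

```python
from collections import Counter

# Representative sequence of family F080441, copied from Basic Information.
REPRESENTATIVE = "MSQATMRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVVALDDYRSDAATVGMAVVSACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRAARRS"

AVERAGE_LENGTH = 114  # "Average Sequence Length" reported for the family


def summarize_sequence(seq: str) -> dict:
    """Return the length, offset from the family average, and top residue counts."""
    counts = Counter(seq)
    return {
        "length": len(seq),
        "delta_vs_average": len(seq) - AVERAGE_LENGTH,
        "most_common": counts.most_common(3),
    }


summary = summarize_sequence(REPRESENTATIVE)
print(summary)
```

A small positive or negative `delta_vs_average` is expected, since the representative is a single member of a 115-sequence family.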

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 75.65 %
% of genes near scaffold ends (potentially truncated) 25.22 %
% of genes from short scaffolds (< 2000 bps) 74.78 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.70

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
[Figure: Hidden Markov Model logo (Skylign)]

Most Common Taxonomy
Group Bacteria (60.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(29.565 % of family members)
Environment Ontology (ENVO) Unclassified
(40.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(39.130 % of family members)
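
Because these percentages are expressed as fractions of the family's 115 members, they can be converted back into approximate member counts. The sketch below assumes simple rounding to the nearest whole member; the function name `percent_to_count` is hypothetical.

```python
FAMILY_SIZE = 115  # "Number of Sequences" from Basic Information


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage back to the nearest whole member count."""
    return round(percent * total / 100)


# Percentages as reported in the taxonomy and ecosystem summaries.
reported = {
    "Bacteria (NCBI taxonomy)": 60.000,
    "Vadose Zone Soil (GOLD)": 29.565,
    "Unclassified (ENVO)": 40.000,
    "Soil, non-saline (EMPO)": 39.130,
}

for label, pct in reported.items():
    print(f"{label}: about {percent_to_count(pct)} of {FAMILY_SIZE} members")
```

The same conversion applies to other "% of family" figures on this page, provided they use the same denominator.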




Structure & Topology

Predicted Topology & Secondary Structure

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 57.64%, β-sheet: 0.00%, Coil/Unstructured: 42.36%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.70
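
The four pLDDT ranges listed above can be used to bucket per-residue confidence values taken from a model file. The following Python sketch is illustrative only: the function names are hypothetical, the example scores are made up, and treating each upper bound as inclusive (so that a value such as 50.5 falls in the 51-70 bin) is an assumption.

```python
# Upper bounds of the pLDDT ranges listed for the structure: 0-50, 51-70, 71-90, 91-100.
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def plddt_bin(score: float) -> str:
    """Return the label of the pLDDT range that contains the score."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    for upper, label in PLDDT_BINS:
        if score <= upper:  # assumption: upper bounds are inclusive
            return label
    raise AssertionError("unreachable for scores within 0-100")


def bin_counts(scores):
    """Count how many residues fall into each pLDDT range."""
    counts = {label: 0 for _, label in PLDDT_BINS}
    for score in scores:
        counts[plddt_bin(score)] += 1
    return counts


example_scores = [34.2, 55.0, 70.0, 88.9, 95.1]  # hypothetical values, not from this model
print(bin_counts(example_scores))
```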

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 115 Family Scaffolds
PF00144 | Beta-lactamase | 7.83
PF13561 | adh_short_C2 | 6.09
PF01494 | FAD_binding_3 | 3.48
PF08238 | Sel1 | 2.61
PF13458 | Peripla_BP_6 | 2.61
PF00550 | PP-binding | 1.74
PF04542 | Sigma70_r2 | 1.74
PF00866 | Ring_hydroxyl_B | 1.74
PF10604 | Polyketide_cyc2 | 1.74
PF00817 | IMS | 1.74
PF00106 | adh_short | 0.87
PF00472 | RF-1 | 0.87
PF00496 | SBP_bac_5 | 0.87
PF12680 | SnoaL_2 | 0.87
PF01738 | DLH | 0.87
PF01636 | APH | 0.87
PF00805 | Pentapeptide | 0.87
PF01471 | PG_binding_1 | 0.87
PF00355 | Rieske | 0.87
PF05988 | DUF899 | 0.87
PF13460 | NAD_binding_10 | 0.87
PF04909 | Amidohydro_2 | 0.87
PF00884 | Sulfatase | 0.87
PF07040 | DUF1326 | 0.87

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 115 Family Scaffolds
COG1680 | CubicO group peptidase, beta-lactamase class C family | Defense mechanisms [V] | 7.83
COG1686 | D-alanyl-D-alanine carboxypeptidase | Cell wall/membrane/envelope biogenesis [M] | 7.83
COG2367 | Beta-lactamase class A | Defense mechanisms [V] | 7.83
COG0654 | 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases | Energy production and conversion [C] | 6.96
COG0578 | Glycerol-3-phosphate dehydrogenase | Energy production and conversion [C] | 3.48
COG0644 | Dehydrogenase (flavoprotein) | Energy production and conversion [C] | 3.48
COG0665 | Glycine/D-amino acid oxidase (deaminating) | Amino acid transport and metabolism [E] | 3.48
COG0389 | Nucleotidyltransferase/DNA polymerase DinP involved in DNA repair | Replication, recombination and repair [L] | 1.74
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 1.74
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 1.74
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 1.74
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 1.74
COG5517 | 3-phenylpropionate/cinnamic acid dioxygenase, small subunit | Secondary metabolites biosynthesis, transport and catabolism [Q] | 1.74
COG0216 | Protein chain release factor RF1 | Translation, ribosomal structure and biogenesis [J] | 0.87
COG1186 | Protein chain release factor PrfB | Translation, ribosomal structure and biogenesis [J] | 0.87
COG1357 | Uncharacterized conserved protein YjbI, contains pentapeptide repeats | Function unknown [S] | 0.87
COG4312 | Predicted dithiol-disulfide oxidoreductase, DUF899 family | General function prediction only [R] | 0.87
COG5588 | Uncharacterized conserved protein, DUF1326 domain | Function unknown [S] | 0.87



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 60.00 %
Unclassified | root | N/A | 40.00 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000364|INPhiseqgaiiFebDRAFT_105391332 | Not Available | 645
3300001661|JGI12053J15887_10003264 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 8144
3300001661|JGI12053J15887_10011697 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4821
3300001661|JGI12053J15887_10282050 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 817
3300003203|JGI25406J46586_10183173 | Not Available | 611
3300004092|Ga0062389_100409471 | Not Available | 1477
3300004092|Ga0062389_100749055 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1153
3300004093|Ga0065178_100635 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium URHD0017 | 26655
3300004633|Ga0066395_10047831 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium | 1879
3300005332|Ga0066388_106089121 | Not Available | 609
3300005332|Ga0066388_106882235 | Not Available | 572
3300005445|Ga0070708_100506514 | Not Available | 1139
3300005467|Ga0070706_100247966 | All Organisms → cellular organisms → Bacteria | 1663
3300005467|Ga0070706_100751090 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 904
3300005468|Ga0070707_101895944 | Not Available | 563
3300005764|Ga0066903_103134507 | All Organisms → cellular organisms → Bacteria | 895
3300006050|Ga0075028_100302639 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 892
3300006172|Ga0075018_10523759 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium URHD0017 | 621
3300006176|Ga0070765_100291467 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → Burkholderia cepacia complex → Burkholderia ubonensis | 1506
3300006176|Ga0070765_101437470 | Not Available | 649
3300006176|Ga0070765_102259126 | Not Available | 507
3300006572|Ga0074051_11026904 | Not Available | 875
3300006577|Ga0074050_10688099 | Not Available | 502
3300006642|Ga0075521_10485297 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium URHD0017 | 605
3300007255|Ga0099791_10145666 | All Organisms → cellular organisms → Bacteria | 1105
3300007788|Ga0099795_10334141 | Not Available | 674
3300008886|Ga0115930_1005825 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium URHD0017 | 5607
3300009038|Ga0099829_10404264 | All Organisms → cellular organisms → Bacteria | 1129
3300009038|Ga0099829_10812645 | Not Available | 776
3300009143|Ga0099792_10502359 | Not Available | 759
3300009695|Ga0123337_10183742 | Not Available | 1142
3300009792|Ga0126374_10073323 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium | 1841
3300010043|Ga0126380_10054833 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2175
3300010046|Ga0126384_11052263 | All Organisms → cellular organisms → Bacteria | 744
3300010047|Ga0126382_10959847 | Not Available | 745
3300010048|Ga0126373_10765241 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 1026
3300010359|Ga0126376_11895216 | Not Available | 636
3300010362|Ga0126377_10774346 | All Organisms → cellular organisms → Bacteria | 1017
3300010362|Ga0126377_11329212 | All Organisms → cellular organisms → Bacteria | 791
3300010362|Ga0126377_11569194 | All Organisms → cellular organisms → Bacteria | 732
3300010867|Ga0126347_1060199 | Not Available | 512
3300010880|Ga0126350_10499374 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → Burkholderia cepacia complex → Burkholderia ubonensis | 1381
3300011269|Ga0137392_10166153 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → Burkholderia cepacia complex → Burkholderia ubonensis | 1790
3300011269|Ga0137392_11143310 | Not Available | 636
3300011270|Ga0137391_10067319 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3072
3300011270|Ga0137391_10226699 | Not Available | 1622
3300012200|Ga0137382_10998700 | Not Available | 601
3300012205|Ga0137362_10421161 | Not Available | 1156
3300012363|Ga0137390_10108241 | All Organisms → cellular organisms → Bacteria | 2749
3300012683|Ga0137398_10725975 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Vineibacter → Vineibacter terrae | 691
3300012685|Ga0137397_10013189 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 5783
3300012922|Ga0137394_10102856 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2408
3300012923|Ga0137359_11019597 | Not Available | 710
3300012924|Ga0137413_10256429 | Not Available | 1203
3300012924|Ga0137413_10729070 | Not Available | 755
3300012925|Ga0137419_10149303 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1685
3300012925|Ga0137419_10418231 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1050
3300012944|Ga0137410_10315710 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1241
3300012944|Ga0137410_10779788 | Not Available | 801
3300012971|Ga0126369_10797953 | All Organisms → cellular organisms → Bacteria | 1026
3300015241|Ga0137418_10155063 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2015
3300015242|Ga0137412_10007204 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 8929
3300015245|Ga0137409_10972295 | Not Available | 686
3300015371|Ga0132258_13669647 | All Organisms → cellular organisms → Bacteria | 1048
3300015372|Ga0132256_100270679 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1774
3300018429|Ga0190272_12638740 | Not Available | 550
3300018476|Ga0190274_11677760 | Not Available | 729
3300019887|Ga0193729_1003999 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Salinarimonadaceae → Salinarimonas → Salinarimonas rosea | 7179
3300019889|Ga0193743_1076659 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Vineibacter → Vineibacter terrae | 1329
3300019890|Ga0193728_1032320 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2599
3300019890|Ga0193728_1055258 | Not Available | 1921
3300019890|Ga0193728_1236723 | Not Available | 741
3300020021|Ga0193726_1019221 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae | 3496
3300020022|Ga0193733_1083999 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Vineibacter → Vineibacter terrae | 893
3300020027|Ga0193752_1027967 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → Burkholderia cepacia complex → Burkholderia ubonensis | 2608
3300020034|Ga0193753_10086100 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia | 1594
3300020579|Ga0210407_10017623 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 5297
3300020581|Ga0210399_10040087 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3751
3300021088|Ga0210404_10005854 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 4964
3300021168|Ga0210406_10091939 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2584
3300021168|Ga0210406_10292123 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Vineibacter → Vineibacter terrae | 1328
3300021178|Ga0210408_10012632 | All Organisms → cellular organisms → Bacteria | 6938
3300021363|Ga0193699_10304097 | Not Available | 665
3300021403|Ga0210397_10417245 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1006
3300021403|Ga0210397_10898402 | Not Available | 686
3300021475|Ga0210392_10230035 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1306
3300022561|Ga0212090_10504698 | Not Available | 829
3300022756|Ga0222622_10032680 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2792
3300026551|Ga0209648_10249127 | All Organisms → cellular organisms → Bacteria | 1317
3300026557|Ga0179587_10072381 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Vineibacter → Vineibacter terrae | 2032
3300026557|Ga0179587_10081381 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1927
3300026557|Ga0179587_10808146 | Not Available | 618
3300026825|Ga0209909_100437 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium URHD0017 | 26723
3300027480|Ga0208993_1003182 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2693
3300027655|Ga0209388_1164094 | Not Available | 624
3300027669|Ga0208981_1013991 | All Organisms → cellular organisms → Bacteria | 2037
3300027669|Ga0208981_1030428 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1379
3300027671|Ga0209588_1119153 | Not Available | 845
3300027738|Ga0208989_10003767 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5040
3300027846|Ga0209180_10136911 | All Organisms → cellular organisms → Bacteria | 1408
3300027855|Ga0209693_10001943 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9072
3300027862|Ga0209701_10618324 | Not Available | 571
3300027903|Ga0209488_10233576 | Not Available | 1381
3300028047|Ga0209526_10590827 | Not Available | 712
3300028536|Ga0137415_10169312 | Not Available | 2015
3300028906|Ga0308309_11235735 | Not Available | 643
3300031170|Ga0307498_10228300 | Not Available | 664
3300031231|Ga0170824_121993826 | Not Available | 633
3300031231|Ga0170824_124900211 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1923
3300031366|Ga0307506_10428900 | Not Available | 547
3300031446|Ga0170820_14086917 | Not Available | 657
(restricted) 3300031825|Ga0255338_1034105 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2259
3300031902|Ga0302322_100879682 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1073
3300032008|Ga0318562_10448917 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Reyranellaceae → Reyranella → Reyranella soli | 749
3300032089|Ga0318525_10444738 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 664
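
The Quality Assessment section reports the share of genes that come from short scaffolds (< 2000 bps). A rough way to approximate such a figure from scaffold lengths is sketched below in Python. This is an illustration under stated assumptions: the function name is hypothetical, only the first five scaffold lengths from the table are used, and counting scaffolds is treated as a stand-in for counting genes, which may not match how NMPFamsDB computes its metric.

```python
SHORT_SCAFFOLD_BP = 2000  # threshold quoted in the Quality Assessment section


def percent_short(lengths, threshold=SHORT_SCAFFOLD_BP):
    """Return the percentage of scaffolds strictly shorter than the threshold."""
    if not lengths:
        raise ValueError("at least one scaffold length is required")
    short = sum(1 for n in lengths if n < threshold)
    return 100.0 * short / len(lengths)


# Lengths of the first five scaffolds listed for this family (illustrative subset).
sample_lengths = [645, 8144, 4821, 817, 611]
print(f"{percent_short(sample_lengths):.2f} % of the sampled scaffolds are short")
```

Running the same function over all 115 scaffold lengths would give a figure directly comparable to the reported value.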

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 29.57%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 12.17%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 9.57%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 8.70%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 6.96%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 4.35%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.48%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.48%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 2.61%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.74%
Glacier Valley | Environmental → Aquatic → Freshwater → Ice → Glacier → Glacier Valley | 1.74%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.74%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.74%
Cyanobacterial | Host-Associated → Microbial → Bacteria → Unclassified → Unclassified → Cyanobacterial | 1.74%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 1.74%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.87%
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 0.87%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.87%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.87%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.87%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 0.87%
Micrasterias Crux-Melitensis (Mzch 98) Associated | Host-Associated → Microbial → Bacteria → Unclassified → Unclassified → Micrasterias Crux-Melitensis (Mzch 98) Associated | 0.87%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.87%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.87%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.87%
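
Since each habitat is given as a full GOLD ecosystem path, the distribution can be rolled up to a coarser level by truncating the path and summing percentages. The Python sketch below shows one way to do this; the function name `share_by_level` is hypothetical, and only four rows from the habitat list are included for illustration.

```python
from collections import defaultdict


def share_by_level(rows, depth=2):
    """Sum percentages of habitats that share the first `depth` path levels."""
    totals = defaultdict(float)
    for path, percent in rows:
        levels = [part.strip() for part in path.split("→")]
        totals[" → ".join(levels[:depth])] += percent
    return dict(totals)


# (GOLD ecosystem path, % of family members); a subset of the habitat list.
rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 29.57),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 12.17),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil", 1.74),
    ("Host-Associated → Microbial → Bacteria → Unclassified → Unclassified → Cyanobacterial", 1.74),
]

for level, total in share_by_level(rows).items():
    print(f"{level}: {total:.2f} %")
```

Setting `depth=1` would instead separate Environmental from Host-Associated habitats.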




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental
3300003203 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S4T2R2 | Host-Associated
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004093 | Cyanobacterial communities from the Joint Genome Institute, California, USA - FECB-22 (version 2) | Host-Associated
3300004633 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006172 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006572 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHPB (Metagenome Metatranscriptome) | Environmental
3300006577 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHPA (Metagenome Metatranscriptome) | Environmental
3300006642 | Arctic peat soil microbial communities from the Barrow Environmental Observatory site, Barrow, Alaska, USA - NGEE PermafrostAB12-D | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300008886 | Microbial communities associated with unicellular green alga Micrasterias crux-melitensis, Germany - (MZCH: 98) | Host-Associated
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009695 | Glacier valley bacterial and archeal communities from Borup Fiord, Nunavut, Canada, to study Microbial Dark Matter (Phase II) - frozenSSSS metaG | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010867 | Boreal forest soil eukaryotic communities from Alaska, USA - C3-5 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300010880 | Boreal forest soil eukaryotic communities from Alaska, USA - C5-1 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300018476 | Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 531 T | Environmental
3300019887 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2 | Environmental
3300019889 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c2 | Environmental
3300019890 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c1 | Environmental
3300020021 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1 | Environmental
3300020022 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s2 | Environmental
3300020027 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c1 | Environmental
3300020034 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c2 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021363 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3c2 | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300022561 | Borup_combined assembly | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300026825 | Cyanobacterial communities from the Joint Genome Institute, California, USA - FECB-22 (SPAdes) | Host-Associated
3300027480 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M3 (SPAdes) | Environmental
3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental
3300027669 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027738 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027855 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300031170Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 12_SEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031366Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 25_SEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031825 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - MeOH1_35cm_T4_195EnvironmentalOpen in IMG/M
3300031902Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_2EnvironmentalOpen in IMG/M
3300032008Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f18EnvironmentalOpen in IMG/M
3300032089Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f23EnvironmentalOpen in IMG/M


Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of their features below requires a license from the corresponding author(s) of the respective datasets.

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10539133213300000364SoilMRRRYASLIPFAVLCGCAVVNTAPPPAEVARLRDVRDDCLMRNAIALDDGRSDAATVGRAVVAACGRENAAIVSSIAGPDGFREGEIRRQVEQNSQEAATQYVLSHRAARSVRR*
JGI12053J15887_1000326443300001661Forest SoilMSQATMRRGSLXIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVAALDDYRSDAATVAMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLNATTQAVLAHRAARRS*
JGI12053J15887_1001169743300001661Forest SoilMRLHPLLIPLVMLSGCAVVNTAPPPGQVVQLRDVRDACLMRNAVTLDDNRSDAATVGSAAVAACQQENAALVAAIAGPDGFRQGEIRRQIEQNSQQAATQYVLSHRAAMSVRR*
JGI12053J15887_1028205013300001661Forest SoilMSQAKMRSASLLIPLVTLCGCAVVNSAPPPAEVAQLRNVRDACLSRNVVALDDYRSDAASVGMAVVAACRQENAALVASIAGPDGFRQGEISRQIEQNSRDAATQMVLSHRAAQRS*
JGI25406J46586_1018317323300003203Tabebuia Heterophylla RhizosphereMRLVVHRRIRSTLSRCSLLILLAPLGGCAVAYGTASPAEVTRLRDARDACLAQYAIRLDDYRSDAATVGRAVVTACSQENAALISAIAGPDGYRGSEISRQILQNSQEAATQY
Ga0062389_10040947143300004092Bog Forest SoilMSQAKMRLESLLIPLVMLCGCAVVNSAPPPAEVAQLRDVRDACLARNVVALDDYRSDAATVGMAVVAACRQENAAIVAAIAGPDGFRQGEISRQIEQDSRNAATQMVLSRRAAQRS*
Ga0062389_10074905523300004092Bog Forest SoilMSQATKRLGFLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDACLARNVAALDDYRSDAATIGMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRAARRS*
Ga0065178_10063593300004093CyanobacterialMVRSRRLLPMLALGGCAVVNSAPSSQEVVQLRDVRDQCLMQNAIRLDDGRSDPQAIAASVVAACQSENQALITAIAGPDGFRQSEIARQIQQNSQQAATQYVLQVRAARTRS*
Ga0066395_1004783123300004633Tropical Forest SoilMRIHPLLLLFLSLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVQACGRENAALVSAIAGPDGYRESEIQRQIDQNSQQAATQYVLSHRAAMSVRR*
Ga0066388_10608912113300005332Tropical Forest SoilMMRTQPLLLLVLSLGGCAPFYSAPPAYEVRELRDIRDQCLMRNAIRLDDYRSDAATIGAAAVQACSRENAALVSAIAGPDGFRQSEIQRQIDQNSQQAATQYVLSHRAAASVRR*
Ga0066388_10688223513300005332Tropical Forest SoilVTMRPHSLLLLLLSLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVAACGRENAALVSAIAGPDGYRESEIQRQIDQNSQQAATQYVLSH
Ga0070708_10050651413300005445Corn, Switchgrass And Miscanthus RhizosphereMLQATMRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNVRDACLARNVVALDDYRSDAATIGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRAAQRS*
Ga0070706_10024796633300005467Corn, Switchgrass And Miscanthus RhizosphereMLQATMRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNVRDACLARNVVALDDYRSDAATIGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRAGQRS*
Ga0070706_10075109023300005467Corn, Switchgrass And Miscanthus RhizosphereLGGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDPAAVGRTVVAACGRENAAVVSSIAGPDGFRAGEIRRQVEQNSQEAATQYVLSHRAARSVRR*
Ga0070707_10189594413300005468Corn, Switchgrass And Miscanthus RhizosphereMKRYASLIPFVVLCGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDPAAVGRTVVAACGRENAAVVSSIAGPDGFRAGEIRRQVEQNSQEAATQYVLSHRAARSVRR*
Ga0066903_10313450723300005764Tropical Forest SoilKRLSGFALLLALGGCAVVNTAPPPSEVVRLRDVRDQCLMQNAVRLDDGRSDAATVGRAVVAACQPENAAIISAIAGPDGFRQSEIARQVDQNSQQAATQYVLSHRAAMSVRR*
Ga0075028_10030263913300006050WatershedsMSQATMRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDARDACLARNVAALDDYRSDAATVAMAVVSACRYENQALVSAIAGPDGFRQSEISRQIEQNSLDATTQAVLAHRAARRS*
Ga0075018_1052375913300006172WatershedsMSQATMRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDARDACLARNVAALDDYRSDAATVAMAVVSACRYENQALVSAIAGPDGFRQSEISRQVEQNSLDATTQAVLAHRAARRS*
Ga0070765_10029146713300006176SoilMSQATKRLGSLLIPFVTLCGCAVVNSAPPPEEVVQLRNVRDACLARNVVALDDYRSDAATVGMAVVAACRQENGALVVSIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRAAQRS*
Ga0070765_10143747013300006176SoilMSQATMRLGALLIPLVTLCGCAVVNSAPPPEEVVQLRNVRDACLARNVVALDDYRSDAATVGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRAAQRS*
Ga0070765_10225912613300006176SoilCGCAVVNSAPPPQEVVQLRNVRDACLARNVAALDDYRSDAATVAMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRAARRS*
Ga0074051_1102690413300006572SoilRMLQAKLIPKSLLIPLVTLCGCAVVNSAPPPEQVRQLRDVRDACLARNVVALDDYRSDAATIGMAVVAACHYENAALVQAIAGPDGFRQGEISRQIEQNSLDAATQAVVMHRAQRRS*
Ga0074050_1068809913300006577SoilGRREAPFPCRPHSRSAGTIDFLEFPRESGARPNPATRARTERMLQAKLIPKSLLIPLVTLCGCAVVNSAPPPEQVRQLRDVRDACLARNVVALDDYRSDAATIGMAVVAACRYENAALVDAIAGPDGFRRSEIARQIDQNSLDAATQSVVMHRAQRRS*
Ga0075521_1048529713300006642Arctic Peat SoilMSQATMRIGFLLAPFVALSGCAVVNSAPPAQEVVQLREARDACLARNVAALDDYRSDAATIGMAVVAACRYENQALVSAIAGPDGFRQSEISRQIQQNSLDAATQSVVAHRASRRS
Ga0099791_1014566623300007255Vadose Zone SoilMRTLPLLIPLVMLGGCAAVYSAPSSAEVRQLRDVRDACLMQNAVTLDNDRSDAATVGRAVVAACRRENAAIVAAIAGPDGFRESEISRQIDQNSQEAATQYVLSHRAAMSVRR*
Ga0099795_1033414123300007788Vadose Zone SoilAALCGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDATAIGRAVVAACGRENGAVVSSIAGPDGFREGEIRRQVEQNSQEAATQYVLSHRAARSVRR*
Ga0115930_100582593300008886Micrasterias Crux-Melitensis (Mzch 98) AssociatedMLALGGCAVVNSAPSSQEVVQLRDVRDQCLMQNAIRLDDGRSDPQAIAASVMAACQSENQALITAIAGPDGFRQSEIARQIQQNSQQAATQYVLQVRAARTRS*
Ga0099829_1040426423300009038Vadose Zone SoilMRTLPLLIPLVMLGGCAAVYSAPSSAEVRQLRDVRDACLMQNAVTLDNGRSDAATVGRAVVAACRRENAAIVAAIAGPDGFRESEISRQIDQNSQEAATQYVLSHRAAMSVRR*
Ga0099829_1081264513300009038Vadose Zone SoilMSQATMRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVVALDDYRSDAATVGMAVVSACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRAARRS*
Ga0099792_1050235923300009143Vadose Zone SoilMRRRYASLIPFAALCGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDATAIGRAVVAACGRENGAVVSSIAGPDGFREGEIRRQVEQNSQEAA
Ga0123337_1018374223300009695Glacier ValleyMNRLLRLLPVLALAGCAAAYSAPSSNEIVQLRNQRDACLMRNAVALDDGRSDPRAIATSVVGYCQPENNQIIAAIAGPDDFRRSEITRQVQQDSLQAATNYVLSHRAARARS*
Ga0126374_1007332323300009792Tropical Forest SoilMRPHSLLLLLLSLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVAACGRENAALVSAIAGPDGYRESEIQRQIDQNSQQAATQYVLSHRAAMSVRR*
Ga0126380_1005483323300010043Tropical Forest SoilMRIHLLLLLFLSLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVQACGRENAALVSAIAGPDGYRESEIQRQINQNSQQAATQYVLSHRAAMSVRR*
Ga0126384_1105226323300010046Tropical Forest SoilAETMRPYCLLLLLLSLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVAACGRENAALVSAIAGPDGYRESEIRRQIDQNSQQAATQYVLSHRAAMSVRR*
Ga0126382_1095984723300010047Tropical Forest SoilMRPHSLLLLLLLLPLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVAACGRENAALVSAIAGPDGYRESEIRRQIDQNSQQAATQYVLSHRAAMSVRR*
Ga0126373_1076524123300010048Tropical Forest SoilMRIHLLLFLSLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVQACGRENAALVSAIAGPDGYRESEIQRQIDQNSQQAATQYVLSHRAAMSVRR*
Ga0126376_1189521623300010359Tropical Forest SoilMRIHLLLLLFLSLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVQACGRENAALVSAIAGPDGYRESEIQRQIDQNSQQAATQYVLSHRAAMSVRR*
Ga0126377_1077434623300010362Tropical Forest SoilLLSIFMIRFWRLLPMLALGGCAVVNSAPSSQEVVQLRDVRDQCLMQNAIRLDDGRSDARAVASAVVAACQSENAAIISAIAGPDGFRQSEIARQVQQDSQQAATQYVLQVRAARARR*
Ga0126377_1132921223300010362Tropical Forest SoilMRPHSLLLLLLPLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVAACGRENAALVSAIAGPDGYRESEIRRQIDQNSQQAATQYVLSHRAAMSVRR*
Ga0126377_1156919423300010362Tropical Forest SoilMTMRTHPLLLVLLSLGGCAVVNSAPPAYEVRELRDIRDQCLMRNAIRLDDYRSDAATVGAAVVQACSRENAALVSAIAGPDGFRQSEIQRQIDQN
Ga0126347_106019913300010867Boreal Forest SoilMSQAAKRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDACLARNVAALDDYRSDAATIGMAVVAACRYENQALVSAIAGPDGFRQSEISRQIEQNSLDAATQSVLAHRAARR*
Ga0126350_1049937413300010880Boreal Forest SoilMSQATKTLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDACLARNVAALDDYRSDAATIGTAVVAACRYENQALVSAIAGPDGFRQSDISRQIEQNSLDAATQSVLAHRAARRS*
Ga0137392_1016615313300011269Vadose Zone SoilMSQATMRLGSLLIPLAILCGCAVVNSAPPPQEVVQLRNVRDACLARNVVALDDYRSDAATIGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRAAQRS*
Ga0137392_1114331013300011269Vadose Zone SoilMRLKSLLIPLVTLCGCAVVNSAPPPAEVAQLRDVRDACLARNVVALDDYRSDAATVGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRATRRS*
Ga0137391_1006731923300011270Vadose Zone SoilMLQATMRLGSLLIPLAILCGCAVVNSAPPPQEVVQLRNVRDACLARNVVTLDDYRSDAATIGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRAAQRS*
Ga0137391_1022669923300011270Vadose Zone SoilMSQATMRLESLLIPLVTLCGCAVVNSAPPPAEVAQLRDVRDACLARNVVALDDYRSDAATVGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRATRRS*
Ga0137382_1099870013300012200Vadose Zone SoilIPFAALCGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDAAAIGRAVVAACGRENAAVVSSIAGPDGFRKGEIRRQVEQNSQEAATQYVLSHRAARSVRR*
Ga0137362_1042116113300012205Vadose Zone SoilMRTLPLLIPLVMLGGCAAVYSAPSSAEVRQLRDVRDACLMQNAVTLDNGRSDAATAGRAVVAACRRENAAIVAAIAGPDGFRESEISRQIDQNSQEAATQYVLSHRAAMSVRR*
Ga0137390_1010824113300012363Vadose Zone SoilMRLESLLIPLVTLCGCAVVNSAPPPAEVAQLRDVRDACLARNVVALDDYRSDAATVGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRATRRS*
Ga0137398_1072597533300012683Vadose Zone SoilAVVNSAPPAAEVVQLRDVRDACLARNGAALDDYRSDAATVALAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQAVLAHRAARRS*
Ga0137397_1001318943300012685Vadose Zone SoilMRRRYASLIPFAVLCGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDATAIGRAVVAACGRENAAVVSSIAGPDGFREGEIRRQVEQNSQDAATQYVLSHRAARSVRR*
Ga0137394_1010285643300012922Vadose Zone SoilMRRRYASLIPFAALCGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDATAIGRAVVAACGRENAAVVSSIAGPDGFREGEIRRQVEQNSQDAATQYVLSHRAARSVRR*
Ga0137359_1101959713300012923Vadose Zone SoilMRRRYASLIPFAALCGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDATAIGRAVVAACGRENAAVVSSIAGPDGFREGEIRRQVEQNSQEAATQYVLSHRAARSVRR*
Ga0137413_1025642923300012924Vadose Zone SoilMSQAKMRPASLLIPLATLCGCAVVNSAPPPAEVAQLRNVRDACLSRNVVALDDYRSDAASVGMAVVAACRQENAALVASIAGPDGFRQGEISRQIEQNSRDAATQAVVMHRSQRRS*
Ga0137413_1072907013300012924Vadose Zone SoilTQARAIGPVVAAEDASAGPSPRGVDFLIFPREVVAMQDQQRARARRMLQATKRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDACLARNVAALDDYRSDAATIGRAVVSACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRATRRS*
Ga0137419_1014930323300012925Vadose Zone SoilMSQAKMRPASLLIPLVTLCGCAVVNSAPPPAEVAQLRNVRDACLSRNVVALDDYRSDAASVGMAVVAACRQENAALVASIAGPDGFRQGEISRQIEQNSRDAATQMVLSHRAARRS*
Ga0137419_1041823113300012925Vadose Zone SoilMLSGCAVVNTAPPPGQVVQLRDVRDACLMRNAVTLDDNRSDAATVGSAAVAACQQENAALVAAIAGPDGFRQGEIRRQIEQNSQQAATQYVLSHRAAMSVRR*
Ga0137410_1031571033300012944Vadose Zone SoilMSRRYASLLPFAVLCGCAVVNAAPPPAEVARLRDVRDACLMRNAITLDDGRSDATAIGRAVVAACGRENAAVVSSIAGPDGFREGEIRRQVEQNSQEAATQYVLSHRAARSVRR*
Ga0137410_1077978813300012944Vadose Zone SoilMSQAKMRLASLLIPLVALCGCAVVNSAPPPEQVRQLRDVRDACLARNVVALDDYRSDAATVGMAVVSACRYENAALVSAIAGPDGFRQGEISRQIEQNSRDAATQAIVMHRSQRRS*
Ga0126369_1079795313300012971Tropical Forest SoilMRIHPLLLLFLSLGGCAAVYSAPPAYEVRQLRDIRDQCLMQNAVRLDDYRSDAATVGAAVVQACGRENAALVSAIAGPDGYRESEIQRQIDQNSQQAATQYVLSHR
Ga0137418_1015506343300015241Vadose Zone SoilMRPASLLIPLVTLCGCAVVNSAPPPAEVAQLRNVRDACLARNVVALDDYRSDAASVGMAVVAACRQENAALVASIAGPDGFRQGEISRQIEQNSRDAATQMVLSHRAARR
Ga0137412_1000720493300015242Vadose Zone SoilLLIPFVTLCGCAVVNSAPPAAEVVQLRDVRDACLARNLVALDDYRTDAATIGRAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDATTQAVLAHRAARRS*
Ga0137409_1097229523300015245Vadose Zone SoilMRLASLLIPLVALCGCAVVNSAPPPEQVRQLRDVRDACLARNVVALDDYRSDAATVGMAVVSACRYENAALVSAIAGPDGFRQGEISRQIEQNSRDAATQAIVMHRSQRRS*
Ga0132258_1366964723300015371Arabidopsis RhizosphereMQGQGRLLSLFMVQFWRLLPVAALGGCAVVNSAPSSQEVVQLRDVRDQCLMQNAIRLDDGRSDPRAIASSVVAACQRENAAIITAIAGPDGFRQSEIARQVQQDSQQAATQYVLQVRAARTRS*
Ga0132256_10027067933300015372Arabidopsis RhizosphereMAINWRERFRAFVIHFLVTLAVGGCAVAYSAPSSQEVVQLRDVRDQCLMQNAIRLDDGRSDPRAIASSVVAACQRENAAIITAIAGPDGFRQSEIARQVQQDSQQAATQYVLQVRAARARR*
Ga0190272_1263874013300018429SoilMMPSVMKPYWRLLPVLAVAGCAVAYSAPSSEEVKRLRDNRDACLMHNAVRLDDYRSDAAAVASAVVAACQRENAILISAIAGPDGFRQSEIARQIEQNSQQAATQYVLSHRAARRTS
Ga0190274_1167776013300018476SoilSVMMPNVVKPHWRLLPVLAVAGCAVAYSAPSSEEVKRLRDNRDACLMQNAVRLDDYRSDPAAVASAVVAACQRENAILISAIAGPDGFRQSEIARQIEQNSQQAATQYVLSHRAARRTS
Ga0193729_100399973300019887SoilMQDRQRARTMRMSQVTKRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDACLARNVAALDDYRSDAATIGRAVVAACRYENQALVSAIAGPDGFRQGEISRQI
Ga0193743_107665923300019889SoilMSGLATRARTRRMSQATMRRGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVAALDDYRSDAATVAMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLNATTQAVLAHRAARRS
Ga0193728_103232043300019890SoilMSQATMRRGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNVRDACLARNVAALDDYRSDAATVAMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDATTQAVLAHRAARRS
Ga0193728_105525823300019890SoilMQDRQRARTMRMSQVTKRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDACLARNVAALDDYRSDAATIGRAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRAARRS
Ga0193728_123672313300019890SoilMSQAKMRLALLLVPLVTLCGCAVVNSAPPPDQVRQLRDVRDACLARNVVALDDYRSDAATIGMAVVSACRYENAALVDAIAGPDGFRRGEIARQIDQNSRDAATQAVVLHRAQRRS
Ga0193726_101922143300020021SoilMSQATMRRGSLLIPFVTLCGCAVVNSAPPSQEVVQLRDVRDACLARNVAALDDYRTDAATIGRAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRAARRS
Ga0193733_108399933300020022SoilMSQATMRRGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVAALDDYRTDAATIGRAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDATTQAVLAHRAARRS
Ga0193752_102796743300020027SoilMSQATMRRGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVAALDDYRTDAATIGRAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRAARRS
Ga0193753_1008610023300020034SoilMSEPATRARTRRMSQATKRVGSLLIPLVTLCGCAVVNSAPPPQEVVQLRNVRDACLARNVVALDDYRSDAATVAMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDATTQAVLAHRAARRS
Ga0210407_1001762353300020579SoilMSQATMRLASLLIPLVTLCGCAVVNSAPPPEEVVQLRNARDACLARNVAALDDYRSDAATIGMAVVAACRYENQALVASIAGPDGFRQGEISRQIEQNSLDAATQAVLAHRATRRS
Ga0210399_1004008753300020581SoilMSQATRRLASLLIPLVTLCGCAVVNSAPPPEEVVQLRNARDACLARNVAALDDYRSDAATIGMAVVAACRYENQALVASIAGPDGFRQGEISRQIEQNSLDAATQAVLAHRATRRS
Ga0210404_1000585433300021088SoilVTLCGCAVVNSAPPPEEVVQLRNARDACLARNVAALDDYRSDAATIGIAVVAACRYENQALVASIAGPDGFRQGEISRQIEQNSLDAATQAVLAHRATRRS
Ga0210406_1009193923300021168SoilMSQATMRLASLLIPLVTLCGCAVVNSAPPPEEVVQLRNARDACLARNVAALDDYRSDAATIGIAVVAACRYENQALVASIAGPDGFRQGEISRQIEQNSLDAATQAVLAHRATRRS
Ga0210406_1029212313300021168SoilMSQATMRRGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNVRDACLARNVAALDDYRSDAATVAMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRAARRS
Ga0210408_10012632103300021178SoilMSQATMRLGALLIPLVTLCGCAVVNSAPPPEEVVQLRNVRDACLARNVVALDDYRSDAATVGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRAAQRS
Ga0193699_1030409713300021363SoilVDFLIFPREVVAMQDRQRARTMRMSQVTKRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDACLARNVAALDDYRSDAATIGRAVVSACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDATTQAVLAHRAARRS
Ga0210397_1041724523300021403SoilMSQMTKRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDTCLARNVAALDDYRSDAATIGMAVVAACRYENQALVSAIAGPDGFRQSEISRQIEQNSLDAATQSVVAHRAARR
Ga0210397_1089840223300021403SoilMSQATIRIGFLLIKFVTLSGCAVVNSAPPAQEVVQLREARDACLARNVAALDDYRSDAATIGMAVVAACRYENQALVSAIAGPDGFRQSEISRQIQQNSLDAATQAVVAHRASRRS
Ga0210392_1023003523300021475SoilMSQAAKRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDACLARNVAALDDYRSDAATIGMAVVAACRYENQALVSAIAGPDGFRQSEISRQIEQNSLDAATQSVLAHRAARR
Ga0212090_1050469823300022561Glacier ValleyMNRLLRLLPVLALAGCAAAYSAPSSNEIVQLRNQRDACLMRNAVALDDGRSDPRAIATSVVGYCQPENNQIIAAIAGPDDFRRSEITRQVQQDSLQAATNYVLSHRAARARS
Ga0222622_1003268023300022756Groundwater SedimentMSQAKMRLASLLIPLGMLGGCAVVNSAPPPDQVRQLRDVRDACLARNVVALDDYRSDAATIGMAVVSACRYENAALVDAIAGPDGFRRGEIARQIDQNSRDAATQAVVMHRSQRRS
Ga0209648_1024912713300026551Grasslands SoilRRTAHWSPVTIDFLEFLREAVVTPKPATRARTGRMSQATMRRGSLLISFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVVALDDYRSDAATIGMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQAVLAHRAARRS
Ga0179587_1007238133300026557Vadose Zone SoilMSQATMRRGSLLIPFVTLCGCAVVNSAPPAAEVVQLRDVRDACLARHLVAPDYYPTDAATIGRAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDATTQAVLAHRAARRS
Ga0179587_1008138123300026557Vadose Zone SoilMSQAKMRPASLLMPLVTLCGCAVVNSAPPPAEVAQLRNVRDACLSRNVVALDDYRSDAASVGMAVVAACRQENAALVASIAGPDGFRQGEISRQIEQNSRDAATQMVLSHLAARRS
Ga0179587_1080814623300026557Vadose Zone SoilMRRRYASLIPFAALCGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDATAIGRAVVAACGRENAAVVSSIAGPDGFREGEIRRQVEQNSQEAATQYVLSHRAARSVRR
Ga0209909_100437253300026825CyanobacterialMLALGGCAVVNSAPSSQEVVQLRDVRDQCLMQNAIRLDDGRSDPQAIAASVVAACQSENQALITAIAGPDGFRQSEIARQIQQNSQQAATQYVLQVRAARTRS
Ga0208993_100318223300027480Forest SoilMSQATMRRGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVAALDDYRSDAATVAMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLNATTQAVLAHRAARRS
Ga0209388_116409413300027655Vadose Zone SoilLIPLVMLGGCAAVYSAPSSAEVRQLRDVRDACLMQNAVTLDNDRSDAATVGRAVVAACRRENAAIVAAIAGPDGFRESEISRQIDQNSQEAATQYVLSHRAAMSVRR
Ga0208981_101399123300027669Forest SoilMSGPATRARARRMSQATMRRGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVAALDDYRSDAATVAMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLNATTQAVLAHRAARRS
Ga0208981_103042813300027669Forest SoilMRLHPLLIPLVMLSGCAVVNTAPPPGQVVQLRDVRDACLMRNAVTLDDNRSDAATVGSAAVAACQQENAALVAAIAGPDGFRQGEIRRQIEQNSQQAATQYVLSHRAAMSVRR
Ga0209588_111915323300027671Vadose Zone SoilMRTLPLLIPLVMLGGCAAVYSAPSSAEVRQLRDVRDACLMQNAVTLDNDRSDAATVGRAVVAACRRENAAIVAAIAGPDGFRESEISRQIDQNSQEAATQYVLSHRAAMSVRR
Ga0208989_1000376753300027738Forest SoilMSQATMRRGSLLIPFVTLCGCAVVNSAPPPQEVVQLRDVRDACLARNVAALDDYRSDAATIAMAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLNATTQAVLAHRAARRS
Ga0209180_1013691123300027846Vadose Zone SoilLIPLVMLGGCAAVYSAPSSAEVRQLRDVRDACLMQNAVTLDNGRSDAATVGRAVVAACRRENAAIVAAIAGPDGFRESEISRQIDQNSQEAATQYVLSHRAAMSVRR
Ga0209693_1000194353300027855SoilMSQATKRLGSLLIPFVTLCGCAVVNSAPPPEEVVQLRNVRDACLARNVVALDDYRSDAATVGMAVVAACRQENGALVVSIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRAAQRS
Ga0209701_1061832413300027862Vadose Zone SoilLVMLGGCAAVYSAPSSAEVRQLRDVRDACLMQNAVTLDNGRSDAATVGRAVVAACRRENAAIVAAIAGPDGFRESEISRQIDQNSQEAATQYVLSHRAAMSVRR
Ga0209488_1023357633300027903Vadose Zone SoilMRRRYASLIPFAALCGCAVVNTAPPPAEVARLRDVRDACLMRNAITLDDGRSDATAIGRAVVAACGRENAAVVSSIAGPDGFREGEIRRQVEQNS
Ga0209526_1059082723300028047Forest SoilMSQATKRLGSLLIPFVTLCGCAVVNGAPPPQEVVQLRDVRDACLARNVVALDDYRSDAATIGRAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDTTTQAVLAHRAARRS
Ga0137415_1016931233300028536Vadose Zone SoilMSQAKMRLVSLLIPLVTLCGCAVVNSAPPPAEVRQLRDVRDACLARNVVALDDYRSDAASVGMAVVSACRYENAALVSAIAGPDGFRQSEIARQIDQNSRDAATQMVLSHRAAQRS
Ga0308309_1123573513300028906SoilMSQATMRLGALLIPLVTLCGCAVVNSAPPPEEVVQLRNVRDACLARNVVALDDYRSDAATVGMAVVAACRQENAALVASIAGPDGFRQSEIARQIDQNSRDAATQMVL
Ga0307498_1022830013300031170SoilMLQAKLMAKSLLIPLVTLCGCAVVTSAPPPEQVRQLRDVRDACLARNVVALDDYRSDAATIGMAVVAACRYENAALVQAIAGPDGFRQGEISRQIEQNSQDAATQAVVMHRAQRRSSQAA
Ga0170824_12199382613300031231Forest SoilMLQAKLMAKSLLIPLVTLCGCAVVTSAPPPEQVRQLRDVRDACLARNVVALDDYRSDAATIGMAVVAACRYENAALVQAIAGPDGFRQSEISRQIEQNSQDAATQAV
Ga0170824_12490021123300031231Forest SoilMQDRQRARTMRMSQVTKRLGSLLIPFVTLCGCAVVNSAPPPQEVVQLRNARDACLARNVAALDDYRSDAATIGRAVVSACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDAATQSVLAHRATRRS
Ga0307506_1042890013300031366SoilVANSAPPPDQVRQLRDARDACLSRNVVALDDYRSDAATIGAAVVSACRYENQVLIDTIAGPDGYRRSEIARQIDQNSRDAATQMVVSHRAAQRRSS
Ga0170820_1408691723300031446Forest SoilMLQAKLMAKSLLIPLVTLCGCAVVTSAPPPEQVRQLRDVRDACLARNVVALDDYRSDAATIGMAVVAACRYENAALVQAIAGPDGFRQSEISRQIEQNSQDAATQAVVMHRAQRRSSQAA
(restricted) Ga0255338_103410533300031825Sandy SoilMKRRYASLIPFAALCGCAVVNTAPPPAEVARLRDVRDACLMRNAVTLDDGRSDPATVGRAVVAACGRENAALVSAIAGPDGFREGEIRRQIEQNSQEAATQYVLSHRAARSVRR
Ga0302322_10087968213300031902FenQAMMRLAPLSILFVTLCGCAVANSAPPSQEVVQLRNARDSCLARNVAALDDYRSDAATIAGAVVAACRYENQALVSAIAGPDGFRQGEISRQIEQNSLDATTQAILARRSARRS
Ga0318562_1044891713300032008SoilMNRYACLMPLAALGACAVVNTAPPPAEVARLRDVRDACLMHNAVTLDDGRSDPATIGRMVVAACQPENAAIVASIAGPDGFQQSEIARQIDQNSQQAATQYVLSHRAAVSVRS
Ga0318525_1044473823300032089SoilLMPLAALGACAVVNTAPPPAEVARLRDVRDACLMHNAVTLDDGRSDPATIGRMVVAACQPENAAIVASIAGPDGFQQSEIARQIDQNSQQAATQYVLSHRAAVSVRS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice