NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F098819


Overview

Basic Information
Family ID: F098819
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 103
Average Sequence Length: 86 residues
Representative Sequence: NEIGWRLAPLDDPGLPEFIAQRALPDPATARCELCDSRPRQPLSTALIVDGPEGLPVPFLICAQCRRTLNELHALLESAGRARSA
Number of Associated Samples: 74
Number of Associated Scaffolds: 103

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 47.57 %
% of genes near scaffold ends (potentially truncated): 26.21 %
% of genes from short scaffolds (< 2000 bps): 89.32 %
Associated GOLD sequencing projects: 67
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.77

Hidden Markov Model: sequence logo (Skylign)

Most Common Taxonomy
Group: Bacteria (66.990 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (31.068 % of family members)
Environment Ontology (ENVO): Unclassified (31.068 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (49.515 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 28.32%, β-sheet 15.04%, coil/unstructured 56.64%

Predicted 3D Structure

pTM-score: 0.77

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
b.55.1.0: automated matches | d5d3xa1 | 5d3x | 0.54992
b.40.4.3: Single strand DNA-binding domain, SSB | d2pi2a_ | 2pi2 | 0.54429
b.121.4.8: Tetraviridae-like VP | d1ohfa_ | 1ohf | 0.5374
a.119.1.0: automated matches | d3v98a2 | 3v98 | 0.5207
b.55.1.0: automated matches | d3dxea_ | 3dxe | 0.51889



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 103 Family Scaffolds
PF12840 | HTH_20 | 8.74
PF01425 | Amidase | 8.74
PF13229 | Beta_helix | 2.91
PF12697 | Abhydrolase_6 | 2.91
PF08281 | Sigma70_r4_2 | 1.94
PF06724 | DUF1206 | 1.94
PF01436 | NHL | 0.97
PF14698 | ASL_C2 | 0.97
PF00793 | DAHP_synth_1 | 0.97
PF00903 | Glyoxalase | 0.97
PF02566 | OsmC | 0.97
PF01717 | Meth_synt_2 | 0.97
PF12848 | ABC_tran_Xtn | 0.97
PF01408 | GFO_IDH_MocA | 0.97
PF00296 | Bac_luciferase | 0.97
PF01022 | HTH_5 | 0.97
PF12681 | Glyoxalase_2 | 0.97
PF01370 | Epimerase | 0.97
PF00561 | Abhydrolase_1 | 0.97
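
The frequency column above is expressed as a percentage of the 103 family scaffolds, so each value can be converted back to an approximate scaffold count. The following is a minimal sketch under the assumption that the percentages are rounded to two decimals; `percent_to_count` is a hypothetical helper name and is not part of NMPFamsDB.

```python
def percent_to_count(percent: float, total: int = 103) -> int:
    """Convert a rounded percentage of `total` back to the nearest whole count."""
    return round(percent / 100 * total)


# A few (name, % frequency) pairs copied from the Pfam table above.
pfam_frequencies = [
    ("HTH_20", 8.74),
    ("Beta_helix", 2.91),
    ("Sigma70_r4_2", 1.94),
    ("NHL", 0.97),
]

for name, pct in pfam_frequencies:
    print(name, percent_to_count(pct))
```

Under this reading, 8.74 % corresponds to 9 scaffolds and 0.97 % to a single scaffold.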

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 103 Family Scaffolds
COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 8.74
COG0620 | Methionine synthase II (cobalamin-independent) | Amino acid transport and metabolism [E] | 0.97
COG1764 | Organic hydroperoxide reductase OsmC/OhrA | Defense mechanisms [V] | 0.97
COG1765 | Uncharacterized OsmC-related protein | General function prediction only [R] | 0.97
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 0.97



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 66.99 %
Unclassified | root | N/A | 33.01 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300005171|Ga0066677_10094723 | Not Available | 1572
3300005174|Ga0066680_10125539 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1591
3300005177|Ga0066690_10152651 | Not Available | 1519
3300005178|Ga0066688_10184305 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1320
3300005178|Ga0066688_10303088 | Not Available | 1031
3300005179|Ga0066684_10743502 | All Organisms → cellular organisms → Bacteria | 653
3300005187|Ga0066675_11125864 | Not Available | 585
3300005187|Ga0066675_11406195 | Not Available | 511
3300005445|Ga0070708_100995029 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 786
3300005445|Ga0070708_101079310 | All Organisms → cellular organisms → Bacteria | 752
3300005447|Ga0066689_10245056 | All Organisms → cellular organisms → Bacteria | 1101
3300005447|Ga0066689_10553524 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 727
3300005467|Ga0070706_100333883 | All Organisms → cellular organisms → Bacteria | 1413
3300005468|Ga0070707_100054226 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3844
3300005468|Ga0070707_101177011 | Not Available | 733
3300005468|Ga0070707_101335863 | Not Available | 683
3300005471|Ga0070698_100135417 | All Organisms → cellular organisms → Bacteria | 2417
3300005518|Ga0070699_101000145 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 767
3300005536|Ga0070697_100251343 | All Organisms → cellular organisms → Bacteria | 1512
3300005537|Ga0070730_10148813 | All Organisms → cellular organisms → Bacteria | 1590
3300005537|Ga0070730_10197915 | All Organisms → cellular organisms → Bacteria | 1342
3300005542|Ga0070732_10003316 | All Organisms → cellular organisms → Bacteria | 8624
3300005542|Ga0070732_10413902 | All Organisms → cellular organisms → Bacteria | 814
3300005552|Ga0066701_10262471 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1069
3300005559|Ga0066700_10218738 | Not Available | 1322
3300005561|Ga0066699_10353161 | All Organisms → cellular organisms → Bacteria | 1051
3300005566|Ga0066693_10060489 | All Organisms → cellular organisms → Bacteria | 1289
3300005575|Ga0066702_10974720 | Not Available | 507
3300005576|Ga0066708_10656365 | Not Available | 668
3300005598|Ga0066706_10322543 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales | 1220
3300005764|Ga0066903_100101307 | All Organisms → cellular organisms → Bacteria | 3893
3300005764|Ga0066903_101009199 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Thermomicrobia → Sphaerobacteridae → Sphaerobacterales → Sphaerobacterineae → Sphaerobacteraceae → Sphaerobacter → Sphaerobacter thermophilus | 1522
3300005764|Ga0066903_101629001 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1225
3300005764|Ga0066903_107580319 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 559
3300006028|Ga0070717_11087569 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 728
3300006031|Ga0066651_10253281 | All Organisms → cellular organisms → Bacteria | 934
3300006854|Ga0075425_101125266 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 894
3300006914|Ga0075436_100343299 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1075
3300009012|Ga0066710_100167850 | All Organisms → cellular organisms → Bacteria | 3081
3300009012|Ga0066710_101697764 | All Organisms → cellular organisms → Bacteria | 961
3300009089|Ga0099828_11613286 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 571
3300009090|Ga0099827_10460851 | All Organisms → cellular organisms → Bacteria | 1090
3300009137|Ga0066709_100649997 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Catenulisporales → Catenulisporaceae → Catenulispora → Catenulispora acidiphila → Catenulispora acidiphila DSM 44928 | 1509
3300009137|Ga0066709_104519668 | Not Available | 508
3300010360|Ga0126372_12019920 | Not Available | 623
3300010366|Ga0126379_11348093 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 820
3300010371|Ga0134125_12997110 | Not Available | 512
3300010376|Ga0126381_104389948 | Not Available | 545
3300011269|Ga0137392_11323951 | Not Available | 579
3300011270|Ga0137391_10132401 | All Organisms → cellular organisms → Bacteria | 2168
3300011271|Ga0137393_11314528 | All Organisms → cellular organisms → Bacteria | 612
3300011271|Ga0137393_11393113 | Not Available | 590
3300012096|Ga0137389_10615324 | Not Available | 933
3300012189|Ga0137388_10122725 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 2264
3300012189|Ga0137388_11618155 | Not Available | 583
3300012203|Ga0137399_11276337 | All Organisms → cellular organisms → Bacteria | 618
3300012203|Ga0137399_11480554 | All Organisms → cellular organisms → Bacteria | 566
3300012206|Ga0137380_10952667 | All Organisms → cellular organisms → Bacteria | 735
3300012207|Ga0137381_10574878 | Not Available | 982
3300012207|Ga0137381_11482045 | Not Available | 570
3300012209|Ga0137379_10189898 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1967
3300012209|Ga0137379_10749203 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Catenulisporales → Catenulisporaceae → Catenulispora → Catenulispora acidiphila → Catenulispora acidiphila DSM 44928 | 882
3300012210|Ga0137378_10519596 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1098
3300012212|Ga0150985_101587619 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 506
3300012212|Ga0150985_105741349 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 930
3300012212|Ga0150985_121842487 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1172
3300012356|Ga0137371_10143663 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1870
3300012356|Ga0137371_10867179 | All Organisms → cellular organisms → Bacteria | 687
3300012357|Ga0137384_11029984 | Not Available | 661
3300012357|Ga0137384_11430659 | Not Available | 540
3300012362|Ga0137361_11197948 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae | 682
3300012363|Ga0137390_10051122 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4009
3300012363|Ga0137390_11105355 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 742
3300012363|Ga0137390_11396190 | Not Available | 645
3300012363|Ga0137390_11868454 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 530
3300012469|Ga0150984_101103727 | Not Available | 640
3300012532|Ga0137373_11147140 | All Organisms → cellular organisms → Bacteria | 552
3300012685|Ga0137397_10640407 | All Organisms → cellular organisms → Bacteria | 790
3300012918|Ga0137396_10809029 | Not Available | 689
3300012918|Ga0137396_11299103 | All Organisms → cellular organisms → Bacteria | 505
3300012925|Ga0137419_11315916 | Not Available | 608
3300012977|Ga0134087_10558066 | All Organisms → cellular organisms → Bacteria | 586
3300013772|Ga0120158_10486316 | Not Available | 547
3300015356|Ga0134073_10103899 | Not Available | 844
3300016422|Ga0182039_10932831 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 776
3300018468|Ga0066662_10229283 | All Organisms → cellular organisms → Bacteria | 1497
3300018468|Ga0066662_12853296 | All Organisms → cellular organisms → Bacteria | 513
3300021362|Ga0213882_10013506 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 3002
3300021384|Ga0213876_10003165 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 9476
3300021860|Ga0213851_1596840 | Not Available | 587
3300021861|Ga0213853_11577471 | Not Available | 600
3300025910|Ga0207684_10180252 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Catenulisporales → Catenulisporaceae → Catenulispora → Catenulispora acidiphila → Catenulispora acidiphila DSM 44928 | 1821
3300025910|Ga0207684_10351702 | All Organisms → cellular organisms → Bacteria | 1268
3300025922|Ga0207646_11156161 | Not Available | 680
3300026319|Ga0209647_1211684 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 663
3300026542|Ga0209805_1192509 | All Organisms → cellular organisms → Bacteria | 887
3300027748|Ga0209689_1121614 | Not Available | 1312
3300027842|Ga0209580_10041514 | Not Available | 2121
3300027857|Ga0209166_10117293 | All Organisms → cellular organisms → Bacteria | 1474
3300027857|Ga0209166_10430493 | Not Available | 683
3300027882|Ga0209590_10691131 | All Organisms → cellular organisms → Bacteria | 653
3300031954|Ga0306926_12391062 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 583
3300032261|Ga0306920_102911001 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 649
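
Each scaffold identifier in the table above consists of two parts joined by `|`, and the first part matches the Taxon OID values listed under Associated Samples. A minimal sketch of splitting identifiers and counting scaffolds per sample follows, assuming that layout holds for every row; `split_scaffold_id` is a hypothetical helper name, not an NMPFamsDB or IMG/M function.

```python
from collections import Counter
from typing import Tuple


def split_scaffold_id(identifier: str) -> Tuple[str, str]:
    """Split 'taxon_oid|scaffold_name' into its two parts."""
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return taxon_oid, scaffold_name


# Three identifiers copied from the table above.
identifiers = [
    "3300005171|Ga0066677_10094723",
    "3300005178|Ga0066688_10184305",
    "3300005178|Ga0066688_10303088",
]

# Count how many family scaffolds come from each sample (taxon OID).
scaffolds_per_sample = Counter(split_scaffold_id(i)[0] for i in identifiers)
print(scaffolds_per_sample)
```

Grouping by the first part in this way shows which samples contribute more than one scaffold to the family.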




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 31.07%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 18.45%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 12.62%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 6.80%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.80%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.88%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.91%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 2.91%
Watersheds | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Watersheds | 1.94%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.94%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.94%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.97%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 0.97%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.97%
Exposed Rock | Environmental → Terrestrial → Rock-Dwelling (Subaerial Biofilms) → Unclassified → Unclassified → Exposed Rock | 0.97%
Plant Roots | Host-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots | 0.97%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.97%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300013772 | Permafrost microbial communities from Nunavut, Canada - A10_80_0.25M | Environmental
3300015356 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300021362 | Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R09 | Environmental
3300021384 | Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9 | Host-Associated
3300021860 | Metatranscriptome of freshwater sediment microbial communities from post-fracked creek in Pennsylvania, United States - ABR_2014 (Metagenome Metatranscriptome) | Environmental
3300021861 | Metatranscriptome of freshwater sediment microbial communities from post-fracked creek in Pennsylvania, United States - ABR_2016 (Metagenome Metatranscriptome) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026319 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes) | Environmental
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental
3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0066677_100947232 | 3300005171 | Soil | VNHGWRVAPLDDPGLPAFIARGALPDAATARCELCETRPKQPLLWVLVVDGPDGLPVPFLICSQCRRTLHQLHTMLEAAGGSPTSG*
Ga0066680_101255392 | 3300005174 | Soil | LLFRVSNEIRWRLAPLDDPGLPEFIAHRALPDPATARCELCDSRPRQPLSTVLIVDGPEGLPVPFLICAQCRRTLDELHALLESAGREQSA*
Ga0066690_101526511 | 3300005177 | Soil | NEIRWRLAPLDDPGLPEFIAHRALPHPATARCELCDSRPKQPLSTILIVDGPEGLPVPFLICAQCRRTLDELHALLESAGREQSA*
Ga0066688_101843053 | 3300005178 | Soil | VSNEIRWRLAPLDDPGLPEFIAHRALPDPATARCELCDSRPRQPLSTVLIVDGPEGLPVPFLICAQCRRTLDELHALLESAGREQSA*
Ga0066688_103030881 | 3300005178 | Soil | RWRLAPLDDAGLPEFIANRALPDPATARCELCDARPRQPLSTMLIVDGPDGLPVPFLICAQCRRTLDELHALLESAGRQQSA*
Ga0066684_107435021 | 3300005179 | Soil | VAEIGWRLAPIDDPGLPEFIAQRALPDAVTARCELCDARPRQPLSTILIVDGPDGLPVPFLICAQCRRTLDELHALLEAASARSAGL*
Ga0066675_111258642 | 3300005187 | Soil | MTEPAKMSGMTGLAGIAEIPWRLAPLDDPGMPSFLTQRALPDAATARCELCDSRPRQPLSTVLIVNGPEGLPVPFLVCNQCRRTLDELHALLESAHRLSSAE*
Ga0066675_114061951 | 3300005187 | Soil | VAEIGWRLAPIDDPGLPEFIAQRALPDAATARCELCDARPRQPLSTILIVDGPDGLPVPFLICAQCRRTLDELHALLEAASARSAGL*
Ga0070708_1009950291 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MGANPQISQIAWRLVPLDDPGLPEFIAQRALPEPATARCELCDTRPRQPLSTILIVDGPDGLPVPFLICAACRRTLDELHALLEAAGRASE*
Ga0070708_1010793102 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VTEFGWRLAPLDEPGLPEFVARGALPDPATTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICAQCRRTLSQLRTMLEAASGRSSP*
Ga0066689_102450561 | 3300005447 | Soil | VSNEIRWRLAPLDDPGLPEFIAHRALPDPATARCELCDSRPRQPLSTVLIVDGPEGLPVPFLICAQCRRTLD
Ga0066689_105535242 | 3300005447 | Soil | MNNEIRWRLAPLDDAGLPEFIANRALPDPATARCELCDARPRQPLSTMLIVDGPDGLPVPFLICAQCRRTLDELHALLESAGRQQSA*
Ga0070706_1003338832 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VAEFGWRLAPLDEPGLPEFVARGALPDPTTTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICAHCRRALNQLRTMLEAASSSDYV*
Ga0070707_1000542266 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VTEFGWRLAPLDEPGLPEFVARGALPDPATTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICAQCRRTLSQLRTMLESASGRSSP*
Ga0070707_1011770112 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VAEFGWRLAPLDEPELPEFVTRGALPDPATTRCELCDQRPRQPLSTVLVVDGPDGLPVPFLICAACRRTLSQLRTMLEAAAAT*
Ga0070707_1013358631 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VAEFGWRLAPLDEPGLPEFVARGALPDPTTTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICAHCRRALNQLRTMLEAASSSDNV*
Ga0070698_1001354172 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | VTEFGWRLAPLDEPGLPEFVARGALPDPTTTRCELCEERPRQPLSTVLVVDGPDGLPVPFLICARCRRTLSQLRNMLEAATGTASS*
Ga0070699_1010001452 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | VTDFGWRLAPLDEPGLPEFVARGALPDPATTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICAQCRRTLSQLRNMLVAASTRADT*
Ga0070697_1002513432 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | VTEFGWRLAPLDEPGLPEFVARGALPDPATTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICAQCRRTLSQLRNMLAAATSRADT*
Ga0070730_101488133 | 3300005537 | Surface Soil | MANPGWRVAPVDDPGLPSFIVRGALPDAATGRCELCDDRPKQPMSAVLIIDGPDGLPVPFLICVHCRRSLHQLHSMLESARVTNG*
Ga0070730_101979152 | 3300005537 | Surface Soil | MAEFGWRIAPLDDPGIPTVIAQRALPDPVGARCELCETRPRQPLTTVLVVDGPDGLPVPFLICTQCRRTLSQLYAMLQSASRPAPD*
Ga0070732_100033163 | 3300005542 | Surface Soil | MTRADSLREEIPWRLVAIDDPGLPTFITQRALPDPATSRCELCDARPRQPLSTVLIVDGPEGLPVPFLICNQCRRTLDELHALLESAGRVSPQG*
Ga0070732_104139021 | 3300005542 | Surface Soil | MINPLPEIPWRVAPLDDPGLPAFITQRALPDPNTSRCELCDSRPRQPLSTILIVDGPDGLPVPFLICNQCRRTLDELHALLESAHRPHVTE*
Ga0066701_102624712 | 3300005552 | Soil | MVEIGWRLAPLDDPGVPGFIARGALPDPAAARCELCERRPSQPLSTVLVVDGPDGLPVPFLVCAQCRRTLDALHTMLESARRG*
Ga0066700_102187382 | 3300005559 | Soil | VSNEIRWRLAPLDDPGLPEFIAHRALPHPATARCELCDSRPKQPLSTILIVDGPEGLPVPFLICAQCRRTLDELHALLESAGREQSA*
Ga0066699_103531611 | 3300005561 | Soil | VADIGWRLAPIDDPGLPEFIAQRALPDAATARCELCDTRPRQPLSTILIVDGPDGLPVPFLVCSQCRRTLDELHALLEAASARSAGL*
Ga0066693_100604892 | 3300005566 | Soil | MSDIAWRVVPLDDPGLPRFIAQRALPEPATARCKLCDSRPRQPLSTVLVVDGPDGLPVPFLICAQCRRTLDELHALLESASRTSNSA*
Ga0066702_109747201 | 3300005575 | Soil | VAKIGWRLAPIDDPGLPEFIAQRALPDAATARCELCDTRPRQPLSTILIVDGPDGLPVPFLVCSQCRRTLDELHALLEAASAR
Ga0066708_106563651 | 3300005576 | Soil | MADIGWRLAPIDDPGLPEFIAQRALPDPATARCELCDTRPRQPLSTILIVDGPDGLPVPFLVCSQCRRTLDELHALLEAASARSAGL*
Ga0066706_103225431 | 3300005598 | Soil | VTEFGWRLAPLDEPGLPEFVTRGALPDPATTRCELCEDRPRQPLSTVLVVEGPDGLPVPFLICAPCRRTLSQLRNMLEAAGRPPSG*
Ga0066903_1001013074 | 3300005764 | Tropical Forest Soil | VTEIGWRLAPLDDPGLPEFIAQRALPDAATARCELCDNRPRQPLSTILIVDGPEGLPVPFLICVGCRRTLNELRALLEASGRASSASAL*
Ga0066903_1010091992 | 3300005764 | Tropical Forest Soil | VVPLDDPGLPAFITQRALPDPGTARCELCDSRPRQPLSTVLIVDGPEGLPVPFLVCQQCRRTLDELHALLESARR*
Ga0066903_1016290012 | 3300005764 | Tropical Forest Soil | MSEIPWRVAPLDDPGMPAFIAQRALPDPATARCELCDSRPRQPLSTVLIVEGPEELPVPFLICSQCRRTLDELHALLESASRSSPAL*
Ga0066903_1075803191 | 3300005764 | Tropical Forest Soil | TPWRRAGGHNHWHMSEIPWRVAPLDDPGMPAFIAQRALPDPVTARCELCDSRPRQPLSTVLIVEGPEELPVPFLICSQCRRTLDELHALLESANRSSPAL*
Ga0070717_110875692 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | VTDIPWRVAPRDDPGLPAFIAQRALPDPTTARCELCDARPRQPLSTVLVVDGPDGLPVPFLICNQCRRTLDELHALLESAGRSAPAE*
Ga0066651_102532811 | 3300006031 | Soil | VNHGWRVAPLDDPGLPAFIARGALPDAATARCELCETRPKQPLLWVLVVDGPDGLPVPFLICSQCRRTLHQLHTMLEAAGGSATS
Ga0075425_1011252661 | 3300006854 | Populus Rhizosphere | NEIGWRLAPLDDPGLPEFIAQRALPDPATARCELCDSRPRQPLSTALIVDGPEGLPVPFLICAQCRRTLNELHALLESAGRARSA*
Ga0075436_1003432992 | 3300006914 | Populus Rhizosphere | MTDIPWRVAPLDDPGLPAFIARRALPDPATARCELCDSRPRAPLSTVLIVDGPDGLPVPFLICSQCRRTLDELHALLESAGRSSNAD*
Ga0066710_1001678501 | 3300009012 | Grasslands Soil | MNNEIRWRLAPLDDAGLPEFIANRALPDPATARCELCDARPRQPLSTMLIVDGPDGLPVPFLICAQCRRTLDELHALLESAGRQQSA
Ga0066710_1016977642 | 3300009012 | Grasslands Soil | MNHGWRVAPLDDPGLPVFIARGALPDAATARCELCETRPKQPLSWVLVVDGPDGLPVPFLICSQCRRTLHQLHTMLESAGGSATSG
Ga0099828_116132862 | 3300009089 | Vadose Zone Soil | MADLGWRVAPLDEPGLPGFITQGALPDTASARCELCETRPRQPLSSVLVVDGPDGLPVPFLICAQCRRTLHQLHTMLEAAASSQGA*
Ga0099827_104608512 | 3300009090 | Vadose Zone Soil | MAETGWRLAPLDDPGLPEFIAQRALPDAATARCELCDTRPKQPLSTILIVDGPEGLPVPFLVCTGCRRTLNELRALLEASARAGSA*
Ga0066709_10064999723300009137Grasslands SoilVTEFGWRLAPLDEPGLPEFLTRGALPDPSTTRCELCEERARQPLSTVLVVDGPDGLPVPFLICAACRRTLAQLRTMLEAASRQAQA*
Ga0066709_10451966823300009137Grasslands SoilMTEPAKISGMTGLAGIAEIPWRLAPLDDPGMPSFLTQRALPDAATARCELCDSRPRQPLSTVLIVNGPEGLPVPFLVCNQCRRTLDELHALLESAHRLSSAE*
Ga0126372_1201992013300010360Tropical Forest SoilMAEIPWRLAPLDDPGLPEFIAQRALPDAASARCELCDRRPRQPMSTVLIVDGPHGLPVPFLICAECRRTL
Ga0126379_1134809323300010366Tropical Forest SoilMTMTDIPWRVAPLDDPGLPVFITQRALPDPNTARCELCDSRPRAPLSTVLIVEGPDGLPVPFLVCNQCRRTLDELHALLESAGRSTIA*
Ga0134125_1299711013300010371Terrestrial SoilSNEQLTDIPWRLAPLDDPGLPAFITQRALPDPASARCELCDARPRQPLSTILVIDGPEGLPVPFLICNQCRRTLDELHALLESAGQSQTD*
Ga0126381_10438994823300010376Tropical Forest SoilMTDIPWRVAPLDDPGLPAFIAQRALPNPATARCELCDSRPRQPLSTILIVDGPEGLPVPFLICSQCRRTLDELHALLESASQSSAAI*
Ga0137392_1132395113300011269Vadose Zone SoilVAEIGWRLAPLDEPGLPEFVARGALPDPTTTRCELCEDRPRLPLSTVLVVDGPDGLPVPFLICAHCRRALNQLRTMLEAASSSDGV*
Ga0137391_1013240133300011270Vadose Zone SoilVTEFGWRLAPLDEPGLPEFLARGALPDPATTPCELCEVRPRQPLSTVLVVDGPDGLPVPFLICVRCRRTLTQLRNMLESASVAPK*
Ga0137393_1131452813300011271Vadose Zone SoilVTEFGWRLAPLDEPGLPEFLARGALPDPATTACELCEERPRQPLSTVLVVDGPDGLPVPFLICVRCRRTLTQLRNML
Ga0137393_1139311323300011271Vadose Zone SoilVAEFGWRLAPLDEPGLPEFVARGALPDPTTTRCELCEDRPRRPLSTVLVVDGPDGLPVPFLICAHCRRALNQLRT
Ga0137389_1061532423300012096Vadose Zone SoilVTEFGWRLAPLDEPGLPEFVARGALPDPATTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICAHCRRALNQLRTMLEAASSSDGV*
Ga0137388_1012272513300012189Vadose Zone SoilVAEFGWRLAPLDEPGLPEFVARGALPDPTTTRCELCEDRPRRPLSTVLVVDGPDGLPVPFLICAHCRRALNQLRTMLEAASSSDGL*
Ga0137388_1161815513300012189Vadose Zone SoilVTELGWRLAPLDEPGLPEFVARGALPDPATTRCELCEDRPKQPLSTVLVVDGPDGLPVPFLICAPCRRTLYQLRNMLEAAHSTGAT*
Ga0137399_1127633713300012203Vadose Zone SoilVTEFGWRLAPLDEPGLPEFVTRGALPDPATTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICTRCRRTLAQLRNMLEAASVQKNM*
Ga0137399_1148055423300012203Vadose Zone SoilVTEFGWRLAPLDEPGLPEFLARGALPDPASTPCELCEERPRQPLSTVLVVDGPDGLPVPFLICARCRRTLAQLRNMLESASV
Ga0137380_1095266713300012206Vadose Zone SoilVTEFGWRLAPLDEPGLPEFVTRGALPDPATTRCELCEDRPRQPLSTVLVVEGPDGLPVPFLICAPCRRTLSQLRNML
Ga0137381_1057487823300012207Vadose Zone SoilMAEAIPWRLVPLDASGLPVPSQRSLPDPIAGMCELCSRRPKQPLSTVLVIDGPDGLPVPFLICAPCRRTLSQLRTMLEAASKKADT*
Ga0137381_1148204513300012207Vadose Zone SoilMSEIGWRLAPLDDPGVPGFIARGALPDPAAARCELCERRPNQPLSTVLVVDGPDGLPVPFLVCAQCRRTLDALHTMLESARRG*
Ga0137379_1018989823300012209Vadose Zone SoilMVMSEIGWRLAPLDDPGVPGFIARGALPDPAAARCELCERRPNQPLSTVLVVDGPDGLPVPFLVCAQCRRTLDALHTMLESARRG*
Ga0137379_1074920323300012209Vadose Zone SoilVTEFGWRLAPLDEPGLPEFVTRGALPDPATTRCELCEDRPRQPLSTVLVVEGPDGLPVPFLICALCRRTLSQLRTMLEAASKKADT*
Ga0137378_1051959623300012210Vadose Zone SoilVTEFGWRLAPLDEPGLPEFVTRGALPDPATTRCELCEDRPRQPLSTVLVVEGPDGLPVPFLICAPCRRTLSQLRTMLEAASKQADT*
Ga0150985_10158761923300012212Avena Fatua RhizosphereDDPGLPAFLTQRALPNPATARCELCDARPRQPLSTVLIVDGPEGLPVPFLVCNQCRRTLDELHALLESAIRAPTQM*
Ga0150985_10574134933300012212Avena Fatua RhizosphereVTDIPWRVAPRDDPGLPAFIAQRALPDPVSARCELCDSRPRQPLSTVLIVDGPEGLPVPFLICNQCRRTLDELHALLESAGRAASAE*
Ga0150985_12184248713300012212Avena Fatua RhizosphereIPWRVAPLDDPGLPAFLTQRALPDAATARCELCDARPRQPLSTVLVVDGPDDLPVPFLVCNQCRRTLDELRALLESARRSSTKI*
Ga0137371_1014366323300012356Vadose Zone SoilMLDIGWRVVPLDDPVLRAFIAQRAVPDPATARCELCDSRPRQPLSTVLIVDGPDGLPVPFLICAQCRRTLDELHALLQSASRSSNSA*
Ga0137371_1086717923300012356Vadose Zone SoilVAEIGWRLAPLDDPGLPEFIAQRALPDAATARCELCDTRPKQPLSTILIVDGPEGLPVPFLVCAGCRRTLNELRALLEASARAGSA*
Ga0137384_1102998423300012357Vadose Zone SoilMLDIGWRVVPLDDPGLPAFIAQRAVPDPATARCELCDSRPRQPLSTVLIVDGPDGLPVPFLICAQCRRTLDELHALLQSASRSSNSA*
Ga0137384_1143065923300012357Vadose Zone SoilMAEISWRLAPLDDPGLPEFIAQRALPNPNTARCELCDDRPRQPLSTVLIVDGPEGLPVPFLVCARCRRTLDELHVLLETAARTSREE*
Ga0137361_1119794813300012362Vadose Zone SoilVSNNEIRWRLAPMDDPGLPEFIAHRALPNPATARCELCDSRPRQPLSTVLIVDGPEGLPVPFLICAQCRRTLDELHALLESAGREQSA*
Ga0137390_1005112253300012363Vadose Zone SoilVTEFGWRLAPLDEPGLPEFLARGALPDPATTACELCEERPRQPLSTVLVVDGPDGLPVPFLICVRCRRTLTQLRNMLESASVAPK*
Ga0137390_1110535523300012363Vadose Zone SoilMLDRAEIPWRLALLDDPGLPEFIAQRALPDASTARCELCDNRPRQPLSTVLIVDGPDGLPVPFLICATCRRTLDELHALLEAAGRAP*
Ga0137390_1139619013300012363Vadose Zone SoilMADLGWRVAPLDEPGLPGFITQGALPDTASARCELCETRPRQPLSSVLVVDGPDGLPVPFLICAQCRQTLHQLHTMLEAAASSQGA*
Ga0137390_1186845423300012363Vadose Zone SoilMAEISWRLAPLDDPGLPAFITQRALPDAATARCELCDVRARQPLSTVLIVDGPDGLPVPFLICVGCRRTLDELHALLE
Ga0150984_10110372723300012469Avena Fatua RhizospherePWRVAGLDDPGLPAFITKRALPDPSTARCELCDARPRARQPLSTVLIVDGPDGLPVPFLACHQCRRTLDELHALLESAGRASQQL*
Ga0137373_1114714013300012532Vadose Zone SoilVTEFGWRLAPLDEPGLPEFVARGALPDPATTRCELCEDRPRQPLSTVLIVDGPDGLPVPFLICAQCRRTLSQLRTMLEAASGRSSA*
Ga0137397_1064040723300012685Vadose Zone SoilVTEFGWRLAPLDEPGLPEFVTRGALPDPATTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICTRCRRTLAQLRNMLEVASVHKNT*
Ga0137396_1080902923300012918Vadose Zone SoilLAPLDEPGLPEFLARGALPDPASTPCELCEERPRQPLSTVLVVDGPDGLPVPFLICARCRRTLAQLRNMLESASVTPT*
Ga0137396_1129910313300012918Vadose Zone SoilVTPFGWRLAPLDEPGLPEFVARGALPDPATTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICTRCRRTLAQLRNMLEAASVHKNT*
Ga0137419_1131591613300012925Vadose Zone SoilVTEFGWRLAPLDEPGLPEFVARGALPDPATTRCELCEERPRQPLSTVLVVDGPDGLPVPFLICARCRRTLAQLGNMLEAARVAPM*
Ga0134087_1055806623300012977Grasslands SoilVAEIGWRLAPIDDPGLPEFIAQRALPDAATARCELCDTRPRQPLSTILIVDGPDGLPVPFLICAQCRRTLDELHA
Ga0120158_1048631613300013772PermafrostTGLTVSEIGWRLAPLDDPGVPGFIARGALPDATAARCELCERRPIQPLSTVLIVDGPNGLPVPFLICAQCRRTLDQLHAMLESAGHPQAD*
Ga0134073_1010389923300015356Grasslands SoilMADIGWRLAPIDDPGLPEFIAQRALPDAATARCELCDTRPRQPLSTILIVDGPDGLPVPFLVCSQCRRTLDELHALLEAASARSAGL*
Ga0182039_1093283123300016422SoilPRWRACGYHYGSMTDIPWRVAPLDDPGLPVFITQRALPDPITARCELCDSRPRAPLSTVLIVDGPDGLPVPFLICSQCRRTLDELHALLESAGRSTPTG
Ga0066662_1022928323300018468Grasslands SoilVSNEIRWRLAPLDDPGLPEFIAHRALPDPATARCELCDSRPRQPLSTVLIVDGPEGLPVPFLICAQCRRTLDELHALLESAGREQSA
Ga0066662_1285329613300018468Grasslands SoilVADIGWRLAPIDDPGLPEFIAQRALPDAATARCELCDTRPRQPLSTILIVDGPDGLPVPFLICSQCRRTLDELHALLEAARARSAGL
Ga0213882_1001350623300021362Exposed RockVTDIPWRVAPLDDPGLPAFIARRALPDPATARCELCDSRPRIPLSTVLIVDGPDGLPVPFLICNQCRRTLDELHALLESAGRQPNDE
Ga0213876_1000316543300021384Plant RootsVAPLDDPGLPAFIKQRALPDPATARCELCDARPRQPLSTVLIVDGPDGLPVPFLVCNQCRRTLDELHALLESADRSSPAPQSW
Ga0213851_159684023300021860WatershedsMQPGPEIPWRLAPLDDPGLPAFIASGALPDPATARCELCDARRGPPLATVLIVNGPDGLPVPFLICHQCRRTLDQLHAMLE
Ga0213853_1157747113300021861WatershedsMQPGPEIPWRLAPLDDPGLPAFIASGALPDPATARCELCDTRRGPPLSTVLIVNGPDGWPVPFLICHQCRRTLDQLHAMLEAAGRPH
Ga0207684_1018025223300025910Corn, Switchgrass And Miscanthus RhizosphereVTEFGWRLAPLDEPGLPEFVARGALPDPATTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICAQCRRTLSQLRTMLEAASGRSSP
Ga0207684_1035170223300025910Corn, Switchgrass And Miscanthus RhizosphereVAEFGWRLAPLDEPGLPEFVARGALPDPTTTRCELCEDRPRQPLSTVLVVDGPDGLPVPFLICAHCRRALNQLRTMLEAASSSDYV
Ga0207646_1115616113300025922Corn, Switchgrass And Miscanthus RhizosphereVAEFGWRLAPLDEPELPEFVTRGALPDPATTRCELCDERPRQPLSTVLVVDGPDGLPVPFLICAACRRTLSQLRTMLEAAAAT
Ga0209647_121168413300026319Grasslands SoilVTEFGWRLAPLDEPGLPEFLARGALPDPATTPCELCEERPRQPLTTVLVVDGPDGLPVPFLICARCRRTLAQLRNMLESASVAPK
Ga0209805_119250913300026542SoilVADIGWRLAPIDDPGLPEFIAQRALPDAATARCELCDTRPRQPLSTILIVDGPDGLPVPFLVCSQCRRTLDELHALLEAASARSAGL
Ga0209689_112161423300027748SoilVSNEIRWRLAPLDDPGLPEFIAHRALPHPATARCELCDSRPKQPLSTILIVDGPEGLPVPFLICAQCRRTLDELHALLESAGREQSA
Ga0209580_1004151433300027842Surface SoilMTRADSLREEIPWRLVAIDDPGLPTFITQRALPDPATSRCELCDARPRQPLSTVLIVDGPEGLPVPFLICNQCRRTLDELHALLESAGRVSPQG
Ga0209166_1011729323300027857Surface SoilMANPGWRVAPVDDPGLPSFIVRGALPDAATGRCELCDDRPKQPMSAVLIIDGPDGLPVPFLICVHCRRSLHQLHSMLESARVTNG
Ga0209166_1043049323300027857Surface SoilNERQPPKPVDHRGMAEFGWRIAPLDDPGIPTVIAQRALPDPVGARCELCETRPRQPLTTVLVVDGPDGLPVPFLICTQCRRTLSQLYAMLQSASRPAPD
Ga0209590_1069113123300027882Vadose Zone SoilMAETGWRLAPLDDPGLPEFIAQRALPDAATARCELCDTRPKQPLSTILIVDGPEGLPVPFLVCTGCRRTLNELRALLEASARAGSA
Ga0306926_1239106213300031954SoilMTMTDIPWRVAPLDDPGLPVFIAQRALPDPNTARCELCDSRPRAPLSTVLIVEGPDGLPVPFLVCSQCRRTLDELHALLESANRATPA
Ga0306920_10291100123300032261SoilMTDIPWRVAPLDDPGLPVFITQRALPDPITARCELCDSRPRAPLSTVLIVDGPDGLPVPFLICSQCRRTLDELHALLESAGRSTPTG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"