NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F073335



Overview

Basic Information
Family ID F073335
Family Type Metagenome
Number of Sequences 120
Average Sequence Length 92 residues
Representative Sequence MSDALPVLDLPDGLPPWLAAFVLEAQRETDTLRENGAEQAAAARVALLRKLITAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRAN
Number of Associated Samples 87
Number of Associated Scaffolds 120

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 64.17 %
% of genes near scaffold ends (potentially truncated) 92.50 %
% of genes from short scaffolds (< 2000 bps) 92.50 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.64
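The short-scaffold percentage above can be recomputed from the Length column of the Associated Scaffolds table. The following is a minimal sketch, assuming the lengths have been collected into a plain list; the function name `percent_short` and the five-value example list are illustrative and not part of NMPFamsDB. In the full table, 111 of the 120 listed lengths fall below 2000 bp, which is consistent with the 92.50 % reported here.

```python
def percent_short(lengths, cutoff=2000):
    """Return the percentage of scaffolds shorter than `cutoff` bp."""
    if not lengths:
        raise ValueError("no scaffold lengths given")
    short = sum(1 for n in lengths if n < cutoff)
    return 100.0 * short / len(lengths)


# Illustrative subset of lengths taken from the Associated Scaffolds table.
example_lengths = [1799, 597, 629, 3971, 4966]
print(f"{percent_short(example_lengths):.2f} %")  # 3 of 5 are below 2000 bp
```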


Most Common Taxonomy
Group Bacteria (99.167 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (45.000 % of family members)
Environment Ontology (ENVO) Unclassified (47.500 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (48.333 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 54.10%, β-sheet: 0.00%, Coil/Unstructured: 45.90%

Predicted 3D Structure

pTM-score: 0.64

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 120 Family Scaffolds
PF12728 | HTH_17 | 9.17
PF15723 | MqsR_toxin | 1.67
PF00106 | adh_short | 0.83
PF13338 | AbiEi_4 | 0.83
PF01381 | HTH_3 | 0.83
PF13302 | Acetyltransf_3 | 0.83
PF04851 | ResIII | 0.83
PF13560 | HTH_31 | 0.83
PF00589 | Phage_integrase | 0.83
PF12161 | HsdM_N | 0.83
PF15731 | MqsA_antitoxin | 0.83




Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.17 %
Unclassified | root | N/A | 0.83 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002908|JGI25382J43887_10076931 | All Organisms → cellular organisms → Bacteria | 1799
3300002908|JGI25382J43887_10379824 | All Organisms → cellular organisms → Bacteria | 597
3300004013|Ga0055465_10220651 | All Organisms → cellular organisms → Bacteria | 629
3300005172|Ga0066683_10577438 | All Organisms → cellular organisms → Bacteria | 684
3300005172|Ga0066683_10708709 | All Organisms → cellular organisms → Bacteria | 596
3300005176|Ga0066679_10761215 | All Organisms → cellular organisms → Bacteria | 622
3300005178|Ga0066688_10094348 | All Organisms → cellular organisms → Bacteria | 1818
3300005186|Ga0066676_10878474 | All Organisms → cellular organisms → Bacteria | 603
3300005187|Ga0066675_10891934 | All Organisms → cellular organisms → Bacteria | 672
3300005549|Ga0070704_102166563 | All Organisms → cellular organisms → Bacteria | 517
3300005552|Ga0066701_10371740 | All Organisms → cellular organisms → Bacteria | 885
3300005552|Ga0066701_10456951 | All Organisms → cellular organisms → Bacteria | 793
3300005553|Ga0066695_10361674 | All Organisms → cellular organisms → Bacteria | 908
3300005553|Ga0066695_10639968 | All Organisms → cellular organisms → Bacteria | 632
3300005555|Ga0066692_10878513 | All Organisms → cellular organisms → Bacteria | 550
3300005576|Ga0066708_10282743 | All Organisms → cellular organisms → Bacteria | 1060
3300005598|Ga0066706_10146087 | All Organisms → cellular organisms → Bacteria | 1774
3300005598|Ga0066706_10877809 | All Organisms → cellular organisms → Bacteria | 701
3300006796|Ga0066665_10022708 | All Organisms → cellular organisms → Bacteria | 3971
3300006796|Ga0066665_11330784 | All Organisms → cellular organisms → Bacteria | 553
3300006796|Ga0066665_11423986 | All Organisms → cellular organisms → Bacteria | 537
3300006797|Ga0066659_11504759 | All Organisms → cellular organisms → Bacteria | 564
3300006903|Ga0075426_10603080 | All Organisms → cellular organisms → Bacteria | 820
3300006914|Ga0075436_100416495 | All Organisms → cellular organisms → Bacteria | 975
3300007076|Ga0075435_101522837 | All Organisms → cellular organisms → Bacteria | 587
3300007255|Ga0099791_10374629 | All Organisms → cellular organisms → Bacteria | 684
3300007255|Ga0099791_10691064 | All Organisms → cellular organisms → Bacteria | 501
3300007265|Ga0099794_10005834 | All Organisms → cellular organisms → Bacteria | 4966
3300009012|Ga0066710_102242005 | All Organisms → cellular organisms → Bacteria | 796
3300009012|Ga0066710_103088659 | All Organisms → cellular organisms → Bacteria | 644
3300009012|Ga0066710_103291268 | All Organisms → cellular organisms → Bacteria | 617
3300009012|Ga0066710_104593559 | All Organisms → cellular organisms → Bacteria | 516
3300009088|Ga0099830_10435173 | All Organisms → cellular organisms → Bacteria | 1064
3300009088|Ga0099830_10726076 | All Organisms → cellular organisms → Bacteria | 818
3300009088|Ga0099830_11018638 | All Organisms → cellular organisms → Bacteria | 686
3300009089|Ga0099828_10652747 | All Organisms → cellular organisms → Bacteria | 946
3300009597|Ga0105259_1180723 | All Organisms → cellular organisms → Bacteria | 514
3300010301|Ga0134070_10440382 | All Organisms → cellular organisms → Bacteria | 520
3300010304|Ga0134088_10211472 | All Organisms → cellular organisms → Bacteria | 930
3300010304|Ga0134088_10667308 | All Organisms → cellular organisms → Bacteria | 520
3300010320|Ga0134109_10059348 | All Organisms → cellular organisms → Bacteria | 1280
3300010320|Ga0134109_10277900 | All Organisms → cellular organisms → Bacteria | 639
3300010322|Ga0134084_10163627 | All Organisms → cellular organisms → Bacteria | 756
3300010322|Ga0134084_10273928 | All Organisms → cellular organisms → Bacteria | 618
3300010323|Ga0134086_10313725 | All Organisms → cellular organisms → Bacteria | 612
3300010333|Ga0134080_10047732 | All Organisms → cellular organisms → Bacteria | 1678
3300010333|Ga0134080_10517077 | All Organisms → cellular organisms → Bacteria | 570
3300011269|Ga0137392_10622780 | All Organisms → cellular organisms → Bacteria | 895
3300011271|Ga0137393_11219072 | All Organisms → cellular organisms → Bacteria | 639
3300011430|Ga0137423_1067032 | All Organisms → cellular organisms → Bacteria | 1065
3300012038|Ga0137431_1105525 | All Organisms → cellular organisms → Bacteria | 793
3300012041|Ga0137430_1051393 | All Organisms → cellular organisms → Bacteria | 1118
3300012189|Ga0137388_10151485 | All Organisms → cellular organisms → Bacteria | 2054
3300012189|Ga0137388_10922102 | All Organisms → cellular organisms → Bacteria | 807
3300012189|Ga0137388_11184766 | All Organisms → cellular organisms → Bacteria | 701
3300012200|Ga0137382_10310868 | All Organisms → cellular organisms → Bacteria | 1101
3300012201|Ga0137365_10414530 | All Organisms → cellular organisms → Bacteria | 993
3300012202|Ga0137363_10370258 | All Organisms → cellular organisms → Bacteria | 1189
3300012203|Ga0137399_10973502 | All Organisms → cellular organisms → Bacteria | 714
3300012204|Ga0137374_10320912 | All Organisms → cellular organisms → Bacteria | 1262
3300012204|Ga0137374_10606989 | All Organisms → cellular organisms → Bacteria | 834
3300012206|Ga0137380_10182633 | All Organisms → cellular organisms → Bacteria | 1907
3300012208|Ga0137376_10723527 | All Organisms → cellular organisms → Bacteria | 858
3300012209|Ga0137379_10409787 | All Organisms → cellular organisms → Bacteria | 1264
3300012209|Ga0137379_10410809 | All Organisms → cellular organisms → Bacteria | 1263
3300012210|Ga0137378_10106390 | All Organisms → cellular organisms → Bacteria | 2580
3300012211|Ga0137377_10623229 | All Organisms → cellular organisms → Bacteria | 1015
3300012349|Ga0137387_10533971 | All Organisms → cellular organisms → Bacteria | 851
3300012351|Ga0137386_11034552 | All Organisms → cellular organisms → Bacteria | 583
3300012355|Ga0137369_10304513 | All Organisms → cellular organisms → Bacteria | 1181
3300012355|Ga0137369_10533557 | All Organisms → cellular organisms → Bacteria | 824
3300012356|Ga0137371_10518202 | All Organisms → cellular organisms → Bacteria | 919
3300012356|Ga0137371_10591642 | All Organisms → cellular organisms → Bacteria | 853
3300012358|Ga0137368_10192814 | All Organisms → cellular organisms → Bacteria | 1458
3300012358|Ga0137368_10305434 | All Organisms → cellular organisms → Bacteria | 1074
3300012362|Ga0137361_10770381 | All Organisms → cellular organisms → Bacteria | 877
3300012363|Ga0137390_11311922 | All Organisms → cellular organisms → Bacteria | 669
3300012363|Ga0137390_11578105 | All Organisms → cellular organisms → Bacteria | 595
3300012683|Ga0137398_11153112 | All Organisms → cellular organisms → Bacteria | 531
3300012685|Ga0137397_10145128 | All Organisms → cellular organisms → Bacteria | 1757
3300012917|Ga0137395_10857941 | All Organisms → cellular organisms → Bacteria | 658
3300012917|Ga0137395_10979896 | All Organisms → cellular organisms → Bacteria | 607
3300012918|Ga0137396_10187051 | All Organisms → cellular organisms → Bacteria | 1519
3300012924|Ga0137413_10943166 | All Organisms → cellular organisms → Bacteria | 673
3300012925|Ga0137419_10094673 | All Organisms → cellular organisms → Bacteria | 2053
3300012925|Ga0137419_11985376 | All Organisms → cellular organisms → Bacteria | 500
3300012927|Ga0137416_10194348 | All Organisms → cellular organisms → Bacteria | 1624
3300012929|Ga0137404_11695642 | All Organisms → cellular organisms → Bacteria | 587
3300012930|Ga0137407_10383820 | All Organisms → cellular organisms → Bacteria | 1299
3300012930|Ga0137407_10681998 | All Organisms → cellular organisms → Bacteria | 967
3300012930|Ga0137407_11489538 | All Organisms → cellular organisms → Bacteria | 644
3300012944|Ga0137410_11379887 | All Organisms → cellular organisms → Bacteria | 612
3300012976|Ga0134076_10170783 | All Organisms → cellular organisms → Bacteria | 900
3300014154|Ga0134075_10200215 | All Organisms → cellular organisms → Bacteria | 859
3300014157|Ga0134078_10148533 | All Organisms → cellular organisms → Bacteria | 920
3300014157|Ga0134078_10357565 | All Organisms → cellular organisms → Bacteria | 644
3300014166|Ga0134079_10297783 | All Organisms → cellular organisms → Bacteria | 715
3300015054|Ga0137420_1440860 | All Organisms → cellular organisms → Bacteria | 1666
3300015241|Ga0137418_10108425 | All Organisms → cellular organisms → Bacteria | 2491
3300015245|Ga0137409_11032003 | All Organisms → cellular organisms → Bacteria | 660
3300017654|Ga0134069_1271465 | All Organisms → cellular organisms → Bacteria | 594
3300018079|Ga0184627_10339575 | All Organisms → cellular organisms → Bacteria | 786
3300018084|Ga0184629_10281639 | All Organisms → cellular organisms → Bacteria | 873
3300018433|Ga0066667_10039116 | All Organisms → cellular organisms → Bacteria | 2756
3300018433|Ga0066667_11755452 | All Organisms → cellular organisms → Bacteria | 559
3300019879|Ga0193723_1071659 | All Organisms → cellular organisms → Bacteria | 999
3300026296|Ga0209235_1079520 | All Organisms → cellular organisms → Bacteria | 1468
3300026310|Ga0209239_1199009 | All Organisms → cellular organisms → Bacteria | 724
3300026310|Ga0209239_1327561 | All Organisms → cellular organisms → Bacteria | 532
3300026313|Ga0209761_1052186 | All Organisms → cellular organisms → Bacteria | 2283
3300026328|Ga0209802_1161677 | All Organisms → cellular organisms → Bacteria | 929
3300026538|Ga0209056_10208361 | All Organisms → cellular organisms → Bacteria | 1423
3300026538|Ga0209056_10276310 | All Organisms → cellular organisms → Bacteria | 1177
3300027266|Ga0209215_1053924 | All Organisms → cellular organisms → Bacteria | 558
3300027875|Ga0209283_10380471 | All Organisms → cellular organisms → Bacteria | 922
3300028536|Ga0137415_10440349 | All Organisms → cellular organisms → Bacteria | 1109
3300028814|Ga0307302_10174437 | All Organisms → cellular organisms → Bacteria | 1045
3300032256|Ga0315271_10822585 | All Organisms → cellular organisms → Bacteria | 800
3300032770|Ga0335085_12163727 | Not Available | 560
3300033407|Ga0214472_10033080 | All Organisms → cellular organisms → Bacteria | 5304
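Each scaffold identifier above has the form `<Taxon OID>|<scaffold name>`, and the Taxon OID is the key used in the Associated Samples table. The sketch below shows one way to collect the distinct samples behind a list of scaffold identifiers; the helper name `sample_oids` is hypothetical, and the three identifiers are copied from the table for illustration only. Applied to all 120 scaffolds, the size of the returned set would be expected to correspond to the Number of Associated Samples reported in the Overview.

```python
def sample_oids(scaffold_ids):
    """Return the distinct Taxon OIDs found before the '|' in each ID."""
    oids = set()
    for sid in scaffold_ids:
        oid, sep, name = sid.partition("|")
        # Reject anything that is not "<digits>|<scaffold name>".
        if not sep or not oid.isdigit() or not name:
            raise ValueError(f"unexpected scaffold ID: {sid!r}")
        oids.add(oid)
    return oids


# Three identifiers copied from the table; two share the same sample.
example_ids = [
    "3300002908|JGI25382J43887_10076931",
    "3300002908|JGI25382J43887_10379824",
    "3300004013|Ga0055465_10220651",
]
print(sorted(sample_oids(example_ids)))
```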




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 45.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 17.50%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 13.33%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 10.00%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 3.33%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.50%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.67%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.67%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.83%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.83%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.83%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.83%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300004013 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D2 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009597 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299 | Environmental
3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015 | Environmental
3300010322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG | Environmental
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011430 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT600_2 | Environmental
3300012038 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT800_2 | Environmental
3300012041 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT754_2 | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018084 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300019879 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2 | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300027266 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM2H0_M2 (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028814 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183 | Environmental
3300032256 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_top | Environmental
3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental
3300033407 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25382J43887_100769311 | 3300002908 | Grasslands Soil | VPTSNIPDLPDGLPPWLAAFISEALRETDTLRENGADQAALARMALLKKLVAAAAAWLNAELDTVKAAGETGRCEETIRRAVRDGTIPDSRANRKD
JGI25382J43887_103798241 | 3300002908 | Grasslands Soil | MADPRLGLDLPDGLPPWLASFVLEAQRETDALRDNGAEQAAAVRLALLRKLITAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRANPKGRHRV
Ga0055465_102206511 | 3300004013 | Natural And Restored Wetlands | VPPSGLDLPPGLPPWLAAFITEAQRETDTLLENGAEQAAAARTALLRRLLDAARAWLDAEIDTHEAAQEKGVCEETIRRAVRDGRIPDRRANPRGRHQVRRGDLERVA
Ga0066683_105774382 | 3300005172 | Soil | MDLPIPEGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARVALLRKLIAAAEIYFDTELDTEEAASETGRCEETIRRAVRDGTIPDRRA
Ga0066683_107087092 | 3300005172 | Soil | MPDARPVVDLQDGLPPWLAAFVLEAQHETDTLRENGAEQAAVARLALLRKLVVTAQTYLDTELDAAEAALETGRCEETIRRAVRDGTIPDRRANPKGRHRVR
Ga0066679_107612151 | 3300005176 | Soil | VSASGSSTAFSLADLRLGLELPDGLPPWLAAFVLEAQRETDTLRENGAEQAAAARDALLKKLLTAAAAQLDEEIDIHEAAHEKGVCEETIRRAVRDGRIPDRRANPKGRHRVRRGDLNRV
Ga0066688_100943484 | 3300005178 | Soil | MRDSRSVGLDLPDDTPPWLAAFILEAQRETDTLRDNGAEQAAAARVALLRKLIAAAQLYFDTELDAAEAASETGRCEETIRRAVRDGTIPDRRANPKGRHRVRRGDLQK
Ga0066676_108784741 | 3300005186 | Soil | MRDARPVADLPDGFPPWLAAFVLEAQHETDTLRENGAEQAAVARLALLRKLVVTAQTYLDTELDATEAAVETGRCEETIRRAVRDGTI
Ga0066675_108919341 | 3300005187 | Soil | MRDARPVADLPDGFPPWLAAFVLEAQHETDTLRENGAEQAAVARLALLRKLVVTAQTYLDTQLDAAEAALETGRCE
Ga0070704_1021665631 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | MSGPLGLDLPKGTPPWLAAFVLEAQRETDTLRDNGAEQAAAARTTLLRKLVAAAQSYLDTELDAGEAALETGRCEETIRRAVRDGS
Ga0066701_103717401 | 3300005552 | Soil | MPSMDLPIPEGLPPWLAAFVLEAQRETDTLRENGAEQAASARLALLRKLVTSVAAHLDAEIDIHEAAREKGVCEE
Ga0066701_104569511 | 3300005552 | Soil | MSISLGLELPKGTPPWLAAFVLEAQRETDTLRENGAEQAAAARLAVLKKLLTAAAAQLDEEIDIHEAAREKGVCEE
Ga0066695_103616742 | 3300005553 | Soil | MPCSLGLELPKGTQSWLAAFVLEAQRETDTLRDNGAEQAAAARTALLKKLVAAAQSYFDTELDAAEAASETGRCQETIRRAVRDGTIPDRRANPK
Ga0066695_106399683 | 3300005553 | Soil | VLTSNIPDLPEGLPPWLAAFVLEARLETDTLRENGADQAASARMALLKKLLAAAAAWLNAELDTVTAAEETGRCEETIRRAVRDGTIPDSRTNRKDH
Ga0066692_108785131 | 3300005555 | Soil | MADLRPVLDLPDGTPPWLAAFVLEAQRESDTLRENGAEQAAAARVALLRKLIAAAQMYFDTELDAAEAASETGRCEETIRRAVRDGTI
Ga0066708_102827431 | 3300005576 | Soil | MDLPIPEGLPPWLAAFILEAQRETDTLRDNGAEQAAAARVALLRKLITAVAAQLDEEIDIHQAAREKGVCEETIRR
Ga0066706_101460873 | 3300005598 | Soil | MDLPLPDGLPPWLAAFVLEAQRETDTLQDNGAEQAAAARVALLRKLIASAQLYFDTELDAAEAASETGRCEETIRRAVRDGTIPDRRANPKGRHRVRRGDLQK
Ga0066706_108778092 | 3300005598 | Soil | MPVPRNVELDVPDGTPPWLAAFVLEAQRETDALRENGAEQAAAARLALLRKLISAGAAHLDAEIDIHEAAEEKGVCEETIRRAV
Ga0066665_100227085 | 3300006796 | Soil | MQSMDLPLPDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARVALLRKLIAAAQVYFDTELDAAEAASETGRCDETIRRAVRDGTIPDRRANPRGR
Ga0066665_113307841 | 3300006796 | Soil | MRDSRSVGLDLPDGIPPWLAAFVREAQRETDTLRENGAEQAAAARVALLRKLVAAAQIYFDTELDAAEAASETG
Ga0066665_114239861 | 3300006796 | Soil | MPDARPVVDLPDGLPPWLAGFVLEAQRETDTLRDNGAEQAAAARVALLRKLIAAAQVYFDTELDAAEAASETGRCEETIRRAVRDGTIPDRRANPKGRHRVRRG
Ga0066659_115047592 | 3300006797 | Soil | MADLRLGLDLPDGLPPWLAAFVLEAQRETDTLRENGAEQAAAARLALLRKLIAAAAAHLDAEIDIHEAAQEEGVCE
Ga0075426_106030801 | 3300006903 | Populus Rhizosphere | MPSMDLPIPQGLPPWLAAFIHEAQRETDTLRENGADQAAAARLALLRKLVAAVAAHLDAEIDIHEAAREKGVHEETIRRAVRDGRIPDRRANPK
Ga0075436_1004164953 | 3300006914 | Populus Rhizosphere | MSGALGFDLPDGLPPWLAAFVREALRETDTLRENGAEQAAAARLALLRKLAAAATAHLDAEIDIHEAAREKGVHEETIRRAVRDGRIPDRRANPKGRHRVRRGDL
Ga0075435_1015228371 | 3300007076 | Populus Rhizosphere | LPDALGLELPEGTPPWLAAFIHEAQRETDTLRENGADQAAAARLALLRKLVAAVAAHLDAEIDIHEAAREKGVHEE
Ga0099791_103746291 | 3300007255 | Vadose Zone Soil | MDLPLPDGLPPWLAAFALEAQRETDTLRENGAEQAAAARLALLKKLLAAAAAHLDAEIDIHEAAREKGVCEETIRRAIR
Ga0099791_106910642 | 3300007255 | Vadose Zone Soil | MADGRSVLDLPDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARVALLRKLITAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGTIPDRRPNPKGRHRVRRGDLQK
Ga0099794_100058347 | 3300007265 | Vadose Zone Soil | MSGSLGLDLPKSTPPWLAAFVLEAQRETDTLRENGAEQAASARLALLRKLVASVAAHLDTEIDIHEAAHEKGVCEETIRRA
Ga0066710_1022420053 | 3300009012 | Grasslands Soil | VPTSNIPDLPDGLPPWLAAFISEALRETDTLRENGADQAALARMALLKKLVAAAAAWLNAELDTVKAAGETGRCEETIRRAVRDGTIPDSRANRKDHIRVRRGDLQKLAA
Ga0066710_1030886592 | 3300009012 | Grasslands Soil | MPPARSVSLDPLDGIPPWLAAFVLDAQRETDTLRDNGAEQAAAARVALLRKLIAAAQVYFDTELDAAEAASETGRCEETIRRAVRDGTIPDR
Ga0066710_1032912681 | 3300009012 | Grasslands Soil | MPSMDLPIPEGLPPWLAAFVLEAQRETDTLRENGAEQAAAARLALLRKLLAAAAAHLDAEIDIHEAAEEKGVCEETIRRAVRDGRIPDRRANPKGRHRLRRGDLN
Ga0066710_1045935591 | 3300009012 | Grasslands Soil | MDLPLPDGLSPWLAAFVLEPQCGTATLRENGAEQAAAARVALLRKLITAVAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRANPKGRHRVRRGDLNRVAES
Ga0099830_104351733 | 3300009088 | Vadose Zone Soil | VPTSNIPDLPEGLPPWLAAFVLEARHETDTLRENGADQAASARMALLKKLVAAAAAWLNAELDTVTAAEETG
Ga0099830_107260762 | 3300009088 | Vadose Zone Soil | MDLPIPEGLPPWLAAFILEAQRETDTLRENGAEQAASARTALLKKLVAAAQSYFDTELDAGEAAIETGRCEETIRRAVRDGTIPDRRPNPKGRHRVRRGDLQKLAATP
Ga0099830_110186381 | 3300009088 | Vadose Zone Soil | MPVDESVRLETLSLPDGLPPWLAAFISEALRETDTLRENGADQAASARMAMLKKLVAAAAAWFNAELDTATAAEETGRCEETIRRAVRDGTISDSR
Ga0099828_106527473 | 3300009089 | Vadose Zone Soil | MSGALGLDLPDGLPPWLAAFLLEAQRETDTLRDNGAEQAAAARTALLKRLVAAAQSYRDTELDAGEAAVETGRCEETNRRAVRDGTIPDRRANPKG
Ga0105259_11807232 | 3300009597 | Soil | MADLRPVLDLPDGLPPWLAAFVLEAQRETDTLRDNGAEQAVAARVALLRKLITAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRANPKGRHRVR
Ga0134070_104403822 | 3300010301 | Grasslands Soil | MSDALPVLDLPDGLPPWLAAFVLEAQRETDTLRENGAEQAAAARVALLRKLITAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRAN
Ga0134088_102114721 | 3300010304 | Grasslands Soil | MSDARSVVDLPDGFPPWLAAFVLEAQHETDTLRENGAEQAAVARLALLRKLVVTAQTYLDTELDAAEAALETGRCEETIRRAVRDGTIPDRRANPKGRHRV
Ga0134088_1066730823300010304Grasslands SoilVPTSNIPDLPEGLPPWLAAFVLEARRETDTLRENGADQAASARMALLKKLLAAAAAWLNAELDTVTAAEETGRCEETIRRAVRDGTIPDS
Ga0134109_1005934843300010320Grasslands SoilPPWLAAFVVEAQRETDTLRDNGAEQAAAARTALLKKLVAAAQAYLDTELDASEAASETGRCEETIHRAVRDGTIPDRPANPVDAR*
Ga0134109_1027790023300010320Grasslands SoilMPGSLGLELPKGLPPWLAAFILEAQRETDTLRENGAEQAAAARDALLKKLLTAAAAQLDEEIDIHEAAREKGVHEETIRRAVRDGRIPDRRANPKGRHRIRRGDL
Ga0134084_1016362723300010322Grasslands SoilMPDARPVVDLQDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARVALLRKLIAAAQVYFDTELNTAEAASETGRCEETIRRAVRDGSIPD
Ga0134084_1027392823300010322Grasslands SoilMDLPLPDGLPPWLAAFVLEAQRETDSLRENGAEQAAAARLALLRKLVAAAAAHLEAEIDIHEAAREKGVCEETIRRAVR
Ga0134086_1031372513300010323Grasslands SoilMPDARPVVDLQDGLPPWLAAFVLEAQHETDTLRENGAEQAAVARLALLRKLVVTVQTYLDTELDTAEAALETGRCGETIRRAVRDGTIPDRRAKPKG
Ga0134080_1004773223300010333Grasslands SoilMADARSVLDLPDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARTALLKKLVAAAQSYFDTELDASEAASETGRCEETIHRAVRDGTTPDHRANPVDRPVTPARHR*
Ga0134080_1051707723300010333Grasslands SoilMPDARPVVDLPDGFPPWLVAFVLEAQHETDALRENGAEQAAVARLALLRKLVVTAQTYLDTELDAAEAALETGRCEETIR
Ga0137392_1062278013300011269Vadose Zone SoilMPDARPVPDLPDGLPPWLTAFVLEAQRETDTLRENGAEQAAVVRLALLRKLVVTAQTHLDTELDAAEAALETGRCEETIRRAVR
Ga0137393_1121907223300011271Vadose Zone SoilMSDSRPNALDLPDGTPPWLAAFVLEAQRETDTLRENGAEQAAAARLALLRKLLTAAVTQLDAEIDIHEAAREKGVCEETIRRAVRKGRIPDRR
Ga0137423_106703233300011430SoilMPGSLGLELPKGTPPWLAAFVLEAQRETDTLRDNGAEQAVAARVALLRKLVAAVAAHLDTEIDIHEAAQEKGVCEETIRRAVRDGRIPDRR
Ga0137431_110552523300012038SoilMPGSLGLELPKGTPPWLAAFVLEAQRETDTLRENGAEQAASARLALLRKLVAAVAAHLDTEIDIHEAAQEKGVCEETIRRAVRDGRIP
Ga0137430_105139313300012041SoilMPGSLGLELPKGTPPWLAAFVLEAQRETDTLRENGAEQAASARLALLRKLVAAVAAHLDTEIDIHEAAQEKGVCEETIRRAVRD
Ga0137388_1015148533300012189Vadose Zone SoilMPVVESVRLETPPLPDGLPPWLAAFISEALRETDTLRENGADQAASARMAMLKKLVAAAAAWLNAELDTATAAEETGRCEETIRRAVRDGTIPDSRANRKDHIRVR
Ga0137388_1092210213300012189Vadose Zone SoilMPVPRTVELDLPDGTPPWLAAFVLEAQRETDTLRENGAEQAAAARLALLRKLFAAAAAHLDAQIDIHEAAREKGVHEETIRRAV
Ga0137388_1118476613300012189Vadose Zone SoilVPTSNIPDLPEGLPPWLAAFVLEARRETDTLRENGADQAASARMALLKKLVAAAAAWLNAELDTVMAAEETGRCEETIRRAVRDGTIPDSRANRKD
Ga0137382_1031086823300012200Vadose Zone SoilMDLPIPEGLPPWLAAFVLEAQRETDTLRDNGAEQAASARTALLRKLVAAAQAYLDTELDAAEAASETGRCEETIRRAVRDGTIP
Ga0137365_1041453023300012201Vadose Zone SoilMDLPLPNGLPPWLTAFVLEAQRETDTLRENGAEQAASARLALLRKLVASVAAHLDAEIDIHEAAHEKGVCEETIRRAVRDGRIPDRRANP
Ga0137363_1037025843300012202Vadose Zone SoilMSDSRPIGLDLPDGLPPWLAAFVLEAQHETDTLRDNGAEQAAAARVALLRKLITAAAAQLDEEIDIHEAAHEKGVCEETIRRAVRDGRIPDRRANPKGRHR
Ga0137399_1097350213300012203Vadose Zone SoilMPDARPVPDLPDGLPPWLAAFALEAQRETDTLRDNGAEQAAAARVALLRKLIAAAQVYFDTELDAAEAASETGRCEETIRRAVRDGTLPDRRSNPTGRHR
Ga0137374_1032091233300012204Vadose Zone SoilMADLRPVLDLPDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARTALLKKLVAAAQSYFDAELDAGEAANETGRCEETIRRAVRDGTIPDRRANPVDRPVTPGSAPLGGPDR*
Ga0137374_1060698913300012204Vadose Zone SoilMSAALPVLDLPEGLPPWLAAFVLEAQRETDTLRDNGAEQASAARTALLKKLVAAAQSYLDTELDAGEAAIETGRCEETIRRAVRDGTVPDRRANPRGRHRARRAT
Ga0137380_1018263353300012206Vadose Zone SoilVPTSNITDLPEGLPPWLAAFVLEARRETDTLLENGADQAASARMALLKKLVAAAAAWLNAELDTATAAEETGRCEETIRRAVRDGT
Ga0137376_1072352723300012208Vadose Zone SoilMSDSRPIALDVPDGIPPWLAAFVLEAQRETDTLRENGAEQAASARLALLRKLVASVAAHLDAEIDIHEAAHEQGVCEETIRRAVR
Ga0137379_1040978713300012209Vadose Zone SoilMDLPIPEGLPPWLAAFVLEAQHETDTLRDNGAEQAAAARVALLRKLITAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDR
Ga0137379_1041080913300012209Vadose Zone SoilVPTSNITDLPEGLPPWLAAFVLEARRETDTLLENGADQAASARMALLKKLVAAAAAWLNAELDTATAAEETGRCEETIRRAVRDGTISDS
Ga0137378_1010639043300012210Vadose Zone SoilMDLPLPDGLPPWLAAFVLEAQRETDVLRDNGAEQAAAARLALLRKLIAAAQVYFDTELDAAAAASETGRCEETIRRAVRDGTIPDRRANPKGRH
Ga0137377_1062322923300012211Vadose Zone SoilMDLPLPDGLPPWLAAFVLEARRETDTLGENGAEQAAAARLALLRKLLAAAAAHLDAEIDIHEAAREKGVCEETIRRAVRGGRIPDRRANPKGRH
Ga0137387_1053397113300012349Vadose Zone SoilMADLRPVLDLPDGLPPWLAAFVLEAQRETDTLRENGAEQAAAARVALLRKLVAAAQIYFDTELDAAEAALETG
Ga0137386_1103455213300012351Vadose Zone SoilMDLPLPNGLPPWLTAFVLEAQRETYTLRENGAEQAAAARLALLKKLAAAATAHLDAEIDIHEAAREKGVHEETIRRA
Ga0137369_1030451313300012355Vadose Zone SoilMPVDKSLRLETLPLPDGLPPWLAAFIREALRETDTLRENGADQAASARMALLKRLVASAAAWLNAELDTATAAEETGRCEETIRRAVRDGAIPDKR
Ga0137369_1053355713300012355Vadose Zone SoilMSDSWPIALDSPDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARTALLKKFVAAAQSYLDTELDAGEAAIETGRCEETIRR
Ga0137371_1051820233300012356Vadose Zone SoilVSDSGRINLNLPDGLPPWLAAFVLEAQRETDTLRENGAEQAASARLALLRKLVASVAAHLDAEIDIHEAAQEKGVCEE
Ga0137371_1059164223300012356Vadose Zone SoilMPDLRLGLDLPDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARLALLRKLLTAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRTNPRAVTASAAAT*
Ga0137368_1019281423300012358Vadose Zone SoilMADLRPVLDLPDDLPPWLAAFVLEAQRETETLRDNGADQAAAARTALLKKLVAAAQAYFDTELDAGEAAIETGRCKETIRRAVRDGTVPDRRANPKGRHRARRASRKRVAPRDR*
Ga0137368_1030543413300012358Vadose Zone SoilMADLRLGLDLPDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARVALLRRLIAAAQIYPDAELDAGEAAIETGRCEETIRRAVRDGTVPDRRANPKGRHRARRASRKRVAPRDR*
Ga0137361_1077038113300012362Vadose Zone SoilMHDARPVPDLPDDFPPWLAAFVLEAQHETDTLRENGAEQAASARAALLRKLVAAAQAYLDAELDAGEAALETGRCEETIRRAVRDGTIPDRRANPKGRHRVRR
Ga0137390_1131192213300012363Vadose Zone SoilVDESSRLETFSLPDGLPSWLAAFINEALRETDTLRENGAGQAASARMALLKKLVAAAASWLNAELDTATAAEESGRCEETIRRAVRDGT
Ga0137390_1157810523300012363Vadose Zone SoilMADLGPGLDLPDGLPPWLAAFVREALRETDALRENGAEQAAAARLALLRKLAAAATAHLDAEIDIHEAAREKGVHEETIR
Ga0137398_1115311223300012683Vadose Zone SoilMPGVDSPLPDGLPPWLAAFVAEAQRETDVLRDNGAEQAAAARVALLRKLIAAAQLYFDTELDAAEAASETGRCEETIRRAVRDGTIPDRRANPKGRHRVRRGDLQK
Ga0137397_1014512813300012685Vadose Zone SoilMDLPIPEGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARTALLRKLVAAAQSYLDTELDAGEAAVETGRCEETIRR
Ga0137395_1085794123300012917Vadose Zone SoilMPDSRRIGLELPDGLPSWLAAFVLEAQHETETLRDNGAEQAAAARVALLRKLIAAAQVYFDTELDAAAAASETGRCEETIRRAVRDGTIPDRRANPKGRHRVRRGDL
Ga0137395_1097989613300012917Vadose Zone SoilLPDARPVPDLPDGLPPWLAAFILEAQRETDTLRDNGADQAAAARVALLRKLVAAAQIYLDTELDAAEAALETGRCEETI
Ga0137396_1018705113300012918Vadose Zone SoilMDLPIPEGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARTALLKKLVSAAQAYFDTELDAGEAALETGRCEETI
Ga0137413_1094316623300012924Vadose Zone SoilMDLPLPDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARTALLKKLVAAAQSYLDTELDAGEAAVETGRCEETIRRAVRDGTIPDRRANPK
Ga0137419_1009467313300012925Vadose Zone SoilMDLPIPEGLPPWLAAFVLEAQRETDTLRENGAEQAASARLALLRKLVASVAAHLDAEIDIHEAANEKGVCEETIRRAVRDGRIPDRRANPK
Ga0137419_1198537613300012925Vadose Zone SoilMADPRLGLDLPDGTPPWLAAFVLEAQRETDTLRDNGAEQAAAARVALLRKLLTAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRANPKG
Ga0137416_1019434833300012927Vadose Zone SoilMDLPIPEGLPPWLAAFVLEAQRETDTLRENGAEQAASARLALLRKLVASVAAHLDAEIDIHEAAQEKGVCEET
Ga0137404_1169564223300012929Vadose Zone SoilMPDARPVPDLPDGLPPWLVAFVLEAHRETDTLRDNGADQAAAARVALLRKLVAAAQIYFDTELDAAEAASETGRCEETI
Ga0137407_1038382013300012930Vadose Zone SoilMDLPLPDGLPPWLAAFVLEAQRETDTLRENGADQAAAARLALLRKLLAAAAAYLDAEIDIHEAAQEKGVCEETIRRAVRGGRIPDRRANPQGRHRVRRG
Ga0137407_1068199813300012930Vadose Zone SoilVPTSNIPDLPEGLPPWLAAFVLEARRETDTLRENGADQAASARMALLKKLLAAAGAWLNAELDTVTAAEETGRCEETIRRAVRDGTIPDSRTNRKDHIRLRR
Ga0137407_1148953823300012930Vadose Zone SoilMPDARPVPDLPDGLPPWLVAFVLEAHRETDTLRDNGADQAAAARVALLRKLVAAAQIYFDTELDAAEAASETGRCEE
Ga0137410_1137988723300012944Vadose Zone SoilMDIPLPDGLPPWLAGFVLEAQRETDTLRENGAEQGASARTALLKKLVSAAQSYFDAELDAGEAAIETGRCEETIRRAVRDGTIPDRRANPKGR
Ga0134076_1017078323300012976Grasslands SoilMDLPLPDGLPPWLAAFVLEAQHETNTLRENGAEQAAVARLALLRKLVVTAQTYLDTELDAAEAALETGRCEETIRRAVRDGT
Ga0134075_1020021513300014154Grasslands SoilVPTSNIPDLPEGLPPWLAAFVLEARRETDTLRENGADQAASARMALLKKLVAAAAAWLNTELDTVTAAEETGRCEETIRRAVRDG
Ga0134078_1014853313300014157Grasslands SoilMPGSLGLELPKGLPPWLAAFILEAQRETDTLRENGAEQAAAARDALLKKLLTAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRVTWGRYRDPNDFLFSGG
Ga0134078_1035756513300014157Grasslands SoilMPDARPVVDLQDGLPPWLAAFVLEAQHETDTLRENGAEQAAVARLALLRKLVVTAQTYLDTELDAAEAALETGRCEETIRRAV
Ga0134079_1029778313300014166Grasslands SoilMADLRLALDLPDGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARVALLRKLITAAATQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRANPKGRHRVRRGDLNRVA
Ga0137420_144086043300015054Vadose Zone SoilMPDARPVLHLPEGLPPWLAAFVLEAQRETDALRENGAEQAAAARLALLRKLITAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDRRANPKGRH
Ga0137418_1010842523300015241Vadose Zone SoilMDLPIPEGLPPWLAAFVLEAQRETDTLRDNGAEQAAAARTALLKKLVAAAQSYFDTELDAGEAAVETGRCEETIRRAVRDGTIPDRRANPKGRHRVRRSRRLRPTTKLAR*
Ga0137409_1103200323300015245Vadose Zone SoilMADLRPGLDLPEGLPPWLAAFVLEAQRETDTLRENGAEQAASARLALLRKLVASVAAHLDAQIDIHEAAHEKGVCEETIRR
Ga0134069_127146513300017654Grasslands SoilMPSIDLPIPDGLPPWLTAFVLEAQRETDTLRENGAEQAAAARTALLKKLIAAAQAYLDTELDAGEAALETGRCEETIRRAVRDGTIPDRRANPKGRHRVRRGDLRKL
Ga0184627_1033957513300018079Groundwater SedimentMSGSLGLDLPKGTPPWLAAFVLEAQRETDTLRDNGAEQAAAARTALLKKLVATARDYFDTELDAGEAALETGRCEETIRRAVRD
Ga0184629_1028163933300018084Groundwater SedimentMSGSLGLELPKGTPPWLAAFVLEAQRETDTLRENGAEQAGAARLALLRKLVTAAAAQLDEEIDIHEAAREKGVCEETIRRAVRDGRIPDR
Ga0066667_1003911613300018433Grasslands SoilMQSMDLPLPDGLPPWLAAFVLEAQRETDTLRENGAEQAAAARFALLRKLIAAAQIYFDTELDAAEAALETGRCEETIRRAVRD
Ga0066667_1175545223300018433Grasslands SoilMRDSRSVDLDLPDGTPPWLAAFVLEAQRETDTLRENGAEQAAAARLALLRKLVAAAAAHLDAEIDIHEAAQEEGVCEETIRRAVRDGRIPDRRANPRGRHRVRRG
Ga0193723_107165913300019879SoilMSGALGLELPQDTPAWLAAFVLEAQRETDTLRDNGAEQAAAARVALLRKLIAAAQTYFDTELDAAEAASETGRCEETIRRAVRDGTIPDRRANPKGRH
Ga0209235_107952013300026296Grasslands SoilMSISLGLELPKGTPPWLAAFVLEAQRETDALRENGAEQAAAARLALLRKLITASAAQLDEEIDIHEAAREKGVCEETIRRAVRDGL
Ga0209239_119900923300026310Grasslands SoilMDLPLPDGLPPWLAAFVLEAQHETDTLRENGAEQAAVARLALLRKLVVTAQTYLDTELDAAEAALETGRCEETIRRAVRDGTIPDRRA
Ga0209239_132756113300026310Grasslands SoilMPDARPVVDLPDGFPPWLAAFVLEAQRETDTLRDNGADQAAAARVALLRKLVAAAQIYLDTELDATEAASETGRCEETIRRAVRDGTIPDRRANP
Ga0209761_105218633300026313Grasslands SoilVPTSNIPDLPEGLPPWLAAFILEARRETDTLRENGADQAASARMALLKKLLAAAAAWLNAELDTVTAAEETGRCEETIRRAVRDGAIPDSRANP
Ga0209802_116167733300026328SoilMPDARSVSLDLPDGLPPWLAAFVLEAQRETDTLRENGAEQAAAARVALLRKLVAAAQIYLDTELDAAEAALETGRC
Ga0209056_1020836133300026538SoilVPTSNIPDLPEGLPPWLAAFVLEARRETDTLRENGADQAASARMALLKKLLAAAAAWLNAELDTVTAAEETGRCEETIRRAVRDGTIPDSRTN
Ga0209056_1027631013300026538SoilMRDSRSVDLDLPDGTPPWLAAFVLEAQRETDTLRENGAEQAAAARLALLKKLLTAASAQMDEEIDIHEAAREKGVCEETIRRAVREGRIPDRRANHKGRH
Ga0209215_105392423300027266Forest SoilMPDSFGLELPEGLSPWLAAFIHEAQRETDTLRENGADQAAAARLALLRKLVAAVAAHLDAEIDIHEAAREKGVHEETIRRAVRDGRIPDRRANPKGRHRVRRGDLN
Ga0209283_1038047123300027875Vadose Zone SoilMPSMDLPLPDGLPPWLTAFVVEAQRETDTLRDNGAEQAAAARVALLRKLVVAAQTYLNTELDAGEAALETGRCEETIRRAVRDGTIPD
Ga0137415_1044034933300028536Vadose Zone SoilMADARSVLDLPDGLPPWLAAFVLEAQRETDTLRENGAEQAAAARLALLKKLLTASSAQMDEEIDTHEAAREKGVCEETIRRAV
Ga0307302_1017443723300028814SoilMDLPIPDGLPSWLAAFVVEAQRETDTLRDNGADQAAAARTALLKKLIAAARAYLNAELDASEAASETGRCEETIRRAVRDGTIPDRPV
Ga0315271_1082258513300032256SedimentVHDERRISLSVPADLSPWLAAFITEALRETDTLRDNGADAAVAARDALLRKLVTAASTFLDATIDTHQAAREL
Ga0335085_1216372723300032770SoilMPAARAHGLNLPDGTPPWLVAFFDEALRETDTLRENGAEPTAAARLALLRRLVAAITAHLDHDLDVHEAARQLGVHEETIRRAVRK
Ga0214472_1003308043300033407SoilMSDSRPIGLDLPDGLPPWLAAFVLEAQRETDTLRENGAEQAAAARTALLKKLIAAAQAYLDTELDAGQAALETGRCEETIRRAVRDGRIPDRRANPKGRHRVRRGDFERVATRTVDRPVNPARHRYVRPGELCPVHHMHRFTI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice