NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F100811

Metagenome Family F100811

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F100811
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 131 residues
Representative Sequence MKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGP
Number of Associated Samples 83
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 52.94 %
% of genes near scaffold ends (potentially truncated) 37.25 %
% of genes from short scaffolds (< 2000 bps) 63.73 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(43.137 % of family members)
Environment Ontology (ENVO) Unclassified
(43.137 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.961 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 18.90%    β-sheet: 21.26%    Coil/Unstructured: 59.84%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 102 Family Scaffolds
PF00793DAHP_synth_1 16.67
PF00149Metallophos 11.76
PF03795YCII 11.76
PF07681DoxX 11.76
PF11937DUF3455 6.86
PF13202EF-hand_5 2.94
PF08281Sigma70_r4_2 1.96
PF02597ThiS 1.96
PF00582Usp 0.98
PF06628Catalase-rel 0.98
PF09836DUF2063 0.98
PF00005ABC_tran 0.98
PF04909Amidohydro_2 0.98
PF14748P5CR_dimer 0.98
PF09859Oxygenase-NA 0.98
PF12697Abhydrolase_6 0.98
PF00884Sulfatase 0.98
PF03466LysR_substrate 0.98
PF13185GAF_2 0.98
PF00300His_Phos_1 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 102 Family Scaffolds
COG2259Uncharacterized membrane protein YphA, DoxX/SURF4 familyFunction unknown [S] 11.76
COG2350YciI superfamily enzyme, includes 5-CHQ dehydrochlorinase, contains active-site pHisSecondary metabolites biosynthesis, transport and catabolism [Q] 11.76
COG4270Uncharacterized membrane proteinFunction unknown [S] 11.76
COG1977Molybdopterin synthase sulfur carrier subunit MoaDCoenzyme transport and metabolism [H] 1.96
COG2104Sulfur carrier protein ThiS (thiamine biosynthesis)Coenzyme transport and metabolism [H] 1.96
COG0753CatalaseInorganic ion transport and metabolism [P] 0.98


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001160|JGI12654J13325_1002748All Organisms → cellular organisms → Bacteria1080Open in IMG/M
3300005171|Ga0066677_10014716All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales3486Open in IMG/M
3300005174|Ga0066680_10250767All Organisms → cellular organisms → Bacteria1125Open in IMG/M
3300005175|Ga0066673_10014334All Organisms → cellular organisms → Bacteria3500Open in IMG/M
3300005178|Ga0066688_10316071All Organisms → cellular organisms → Bacteria1009Open in IMG/M
3300005179|Ga0066684_10608562All Organisms → cellular organisms → Bacteria734Open in IMG/M
3300005179|Ga0066684_10915397All Organisms → cellular organisms → Bacteria572Open in IMG/M
3300005180|Ga0066685_10513169All Organisms → cellular organisms → Bacteria829Open in IMG/M
3300005181|Ga0066678_10407550All Organisms → cellular organisms → Bacteria902Open in IMG/M
3300005181|Ga0066678_10865424All Organisms → cellular organisms → Bacteria593Open in IMG/M
3300005184|Ga0066671_10087855All Organisms → cellular organisms → Bacteria1692Open in IMG/M
3300005187|Ga0066675_10079037All Organisms → cellular organisms → Bacteria2116Open in IMG/M
3300005445|Ga0070708_100017726All Organisms → cellular organisms → Bacteria → Proteobacteria5946Open in IMG/M
3300005446|Ga0066686_10362353All Organisms → cellular organisms → Bacteria989Open in IMG/M
3300005447|Ga0066689_10351758All Organisms → cellular organisms → Bacteria919Open in IMG/M
3300005450|Ga0066682_10065720All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales2230Open in IMG/M
3300005552|Ga0066701_10279801All Organisms → cellular organisms → Bacteria1034Open in IMG/M
3300005556|Ga0066707_10013011All Organisms → cellular organisms → Bacteria → Proteobacteria4187Open in IMG/M
3300005561|Ga0066699_10003205All Organisms → cellular organisms → Bacteria → Proteobacteria6671Open in IMG/M
3300005561|Ga0066699_10332498All Organisms → cellular organisms → Bacteria1084Open in IMG/M
3300005569|Ga0066705_10007265All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales4997Open in IMG/M
3300006796|Ga0066665_10082922All Organisms → cellular organisms → Bacteria2310Open in IMG/M
3300006800|Ga0066660_10120622All Organisms → cellular organisms → Bacteria1895Open in IMG/M
3300007255|Ga0099791_10015432All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales3248Open in IMG/M
3300007255|Ga0099791_10056319All Organisms → cellular organisms → Bacteria1765Open in IMG/M
3300007258|Ga0099793_10018437All Organisms → cellular organisms → Bacteria → Proteobacteria2832Open in IMG/M
3300007258|Ga0099793_10056876All Organisms → cellular organisms → Bacteria → Proteobacteria1740Open in IMG/M
3300007265|Ga0099794_10268529All Organisms → cellular organisms → Bacteria881Open in IMG/M
3300007788|Ga0099795_10373421All Organisms → cellular organisms → Bacteria642Open in IMG/M
3300009012|Ga0066710_100215089All Organisms → cellular organisms → Bacteria2750Open in IMG/M
3300009012|Ga0066710_100437258All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales1959Open in IMG/M
3300009137|Ga0066709_100314933All Organisms → cellular organisms → Bacteria2133Open in IMG/M
3300009137|Ga0066709_100473778All Organisms → cellular organisms → Bacteria1757Open in IMG/M
3300009137|Ga0066709_100481560All Organisms → cellular organisms → Bacteria1743Open in IMG/M
3300009143|Ga0099792_10071220All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium1754Open in IMG/M
3300010321|Ga0134067_10078466All Organisms → cellular organisms → Bacteria1103Open in IMG/M
3300010335|Ga0134063_10326549All Organisms → cellular organisms → Bacteria742Open in IMG/M
3300010364|Ga0134066_10125635All Organisms → cellular organisms → Bacteria777Open in IMG/M
3300012202|Ga0137363_10125443All Organisms → cellular organisms → Bacteria1987Open in IMG/M
3300012202|Ga0137363_10869983All Organisms → cellular organisms → Bacteria765Open in IMG/M
3300012203|Ga0137399_10282568All Organisms → cellular organisms → Bacteria1368Open in IMG/M
3300012205|Ga0137362_10025311All Organisms → cellular organisms → Bacteria4637Open in IMG/M
3300012205|Ga0137362_10410632All Organisms → cellular organisms → Bacteria1173Open in IMG/M
3300012208|Ga0137376_10174642All Organisms → cellular organisms → Bacteria1854Open in IMG/M
3300012208|Ga0137376_10828445All Organisms → cellular organisms → Bacteria796Open in IMG/M
3300012211|Ga0137377_11293693All Organisms → cellular organisms → Bacteria659Open in IMG/M
3300012361|Ga0137360_10029600All Organisms → cellular organisms → Bacteria3771Open in IMG/M
3300012362|Ga0137361_10026414All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia4524Open in IMG/M
3300012582|Ga0137358_10094014All Organisms → cellular organisms → Bacteria2027Open in IMG/M
3300012582|Ga0137358_10332930All Organisms → cellular organisms → Bacteria1029Open in IMG/M
3300012685|Ga0137397_10524633All Organisms → cellular organisms → Bacteria882Open in IMG/M
3300012917|Ga0137395_10019425All Organisms → cellular organisms → Bacteria → Proteobacteria3920Open in IMG/M
3300012918|Ga0137396_10455682All Organisms → cellular organisms → Bacteria948Open in IMG/M
3300012922|Ga0137394_10014155All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales6302Open in IMG/M
3300012922|Ga0137394_10059408All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales3167Open in IMG/M
3300012925|Ga0137419_10134978All Organisms → cellular organisms → Bacteria1762Open in IMG/M
3300012930|Ga0137407_10000951All Organisms → cellular organisms → Bacteria18975Open in IMG/M
3300012930|Ga0137407_11294804All Organisms → cellular organisms → Bacteria692Open in IMG/M
3300012944|Ga0137410_10104259All Organisms → cellular organisms → Bacteria2107Open in IMG/M
3300012972|Ga0134077_10438697All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300012977|Ga0134087_10087074All Organisms → cellular organisms → Bacteria1289Open in IMG/M
3300014157|Ga0134078_10254632All Organisms → cellular organisms → Bacteria738Open in IMG/M
3300015053|Ga0137405_1151474All Organisms → cellular organisms → Bacteria2118Open in IMG/M
3300015053|Ga0137405_1172200All Organisms → cellular organisms → Bacteria1639Open in IMG/M
3300015054|Ga0137420_1125927All Organisms → cellular organisms → Bacteria1037Open in IMG/M
3300015054|Ga0137420_1293391All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria2857Open in IMG/M
3300015245|Ga0137409_10056083All Organisms → cellular organisms → Bacteria3723Open in IMG/M
3300015264|Ga0137403_10004082All Organisms → cellular organisms → Bacteria17271Open in IMG/M
3300015357|Ga0134072_10056571All Organisms → cellular organisms → Bacteria → Proteobacteria1105Open in IMG/M
3300018027|Ga0184605_10025807All Organisms → cellular organisms → Bacteria2392Open in IMG/M
3300018061|Ga0184619_10510678All Organisms → cellular organisms → Bacteria530Open in IMG/M
3300018431|Ga0066655_10523446All Organisms → cellular organisms → Bacteria788Open in IMG/M
3300018433|Ga0066667_10115316All Organisms → cellular organisms → Bacteria1823Open in IMG/M
3300018433|Ga0066667_10462406All Organisms → cellular organisms → Bacteria1038Open in IMG/M
3300018482|Ga0066669_10025552All Organisms → cellular organisms → Bacteria3394Open in IMG/M
3300019789|Ga0137408_1181532All Organisms → cellular organisms → Bacteria6477Open in IMG/M
3300019789|Ga0137408_1256782All Organisms → cellular organisms → Bacteria1022Open in IMG/M
3300020170|Ga0179594_10119450All Organisms → cellular organisms → Bacteria960Open in IMG/M
3300020199|Ga0179592_10066286All Organisms → cellular organisms → Bacteria → Proteobacteria1650Open in IMG/M
3300024330|Ga0137417_1197075All Organisms → cellular organisms → Bacteria → Proteobacteria1525Open in IMG/M
3300026312|Ga0209153_1187191All Organisms → cellular organisms → Bacteria738Open in IMG/M
3300026314|Ga0209268_1093873All Organisms → cellular organisms → Bacteria840Open in IMG/M
3300026523|Ga0209808_1011824All Organisms → cellular organisms → Bacteria4416Open in IMG/M
3300026524|Ga0209690_1071473All Organisms → cellular organisms → Bacteria → Proteobacteria1482Open in IMG/M
3300026536|Ga0209058_1055968All Organisms → cellular organisms → Bacteria2211Open in IMG/M
3300026542|Ga0209805_1130243All Organisms → cellular organisms → Bacteria1180Open in IMG/M
3300026547|Ga0209156_10157218All Organisms → cellular organisms → Bacteria1100Open in IMG/M
3300026550|Ga0209474_10435785All Organisms → cellular organisms → Bacteria670Open in IMG/M
3300026552|Ga0209577_10777951All Organisms → cellular organisms → Bacteria536Open in IMG/M
3300027521|Ga0209524_1005172All Organisms → cellular organisms → Bacteria2480Open in IMG/M
3300027603|Ga0209331_1109085All Organisms → cellular organisms → Bacteria673Open in IMG/M
3300027643|Ga0209076_1150213All Organisms → cellular organisms → Bacteria652Open in IMG/M
3300027651|Ga0209217_1000385All Organisms → cellular organisms → Bacteria11885Open in IMG/M
3300027655|Ga0209388_1125945All Organisms → cellular organisms → Bacteria729Open in IMG/M
3300027655|Ga0209388_1163465All Organisms → cellular organisms → Bacteria626Open in IMG/M
3300027663|Ga0208990_1009968All Organisms → cellular organisms → Bacteria → Proteobacteria3164Open in IMG/M
3300027678|Ga0209011_1021431All Organisms → cellular organisms → Bacteria2091Open in IMG/M
3300027903|Ga0209488_10005359All Organisms → cellular organisms → Bacteria → Proteobacteria9864Open in IMG/M
3300028536|Ga0137415_10005074All Organisms → cellular organisms → Bacteria → Proteobacteria13020Open in IMG/M
3300028828|Ga0307312_10589955All Organisms → cellular organisms → Bacteria735Open in IMG/M
3300031720|Ga0307469_10687349All Organisms → cellular organisms → Bacteria926Open in IMG/M
3300032180|Ga0307471_101025659All Organisms → cellular organisms → Bacteria → Proteobacteria992Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil43.14%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil27.45%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil8.82%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil6.86%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil5.88%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.96%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.96%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.98%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.98%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001160Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015357Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026312Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027521Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027603Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027663Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12654J13325_100274823300001160Forest SoilMKKWTFALLSIFACAGVSRASADDALSGPWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGTAPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIVGETMEGTFHDAEGAGSFRLQKQLAWDALPNGP*
Ga0066677_1001471623300005171SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP*
Ga0066680_1025076713300005174SoilMKRMMMTLLSLCACATLSRTSGNDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGVGSFQLEKQQDWDALQAGP*
Ga0066673_1001433423300005175SoilMKKWTMFAMLSIFACAGISRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFDGQIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGS*
Ga0066688_1031607123300005178SoilIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVQLGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQQDWDALQAGP*
Ga0066684_1060856223300005179SoilCACASVSRASAGDVLSGTWRGVVRKGALESVVYFEFSRSDAGYRGAYWGTAPLGLPAPLAGVEFGHSVRFEVPKVAVFDGEIAGDTMQGTFQDGDGAGSFRLEKQPDWEVPQT*
Ga0066684_1091539713300005179SoilMKKWTMFAMLSIFACAGVSRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFDGQITGETMEGTFHDAEGAGSFRLEKQLAWDALPNGS*
Ga0066685_1051316913300005180SoilDAMKKWTMFAMLSIFACAGISRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFEGQIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGS*
Ga0066678_1040755013300005181SoilVMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVEFGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP*
Ga0066678_1086542413300005181SoilASKDHGEVDSERCPVIPDRPRGNLSAASAIGSADGFLPMEDAMKKWTFALLSMFACAGVSRASADDALSGPWRGVVRKGMLENVVYFDFSRTDTGYRGNYWGAAPIGAPVSLSGIELGHSVRFEVPQMGVFEGEIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0066671_1008785523300005184SoilMKRMMIGLLSLCACAGVSRASANDALSGTWRGVVRKGAMESVVLFEFSRTGTGYRGIYWGTAPLGLPVPLAGVEFGHSVRFEVPKVAVFDGEIAGDTMEGTFDDGQGPGSFRLEKQPAWEIPET*
Ga0066675_1007903733300005187SoilMKKWTMFAMLSIFACAGISRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFDGQIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGS*
Ga0070708_10001772653300005445Corn, Switchgrass And Miscanthus RhizosphereMKKCTMIALLSVFACAGVSRASADDALSGPWHGVVRKGMLQNVVYLDFSRTDTGYRGNYWGTAPLGAPVPLSGIEFGHSVRFEVPRMGVFDGEIAGETMEGTFEDGNGSGSFRLEKQLAWDALPNGP*
Ga0066686_1036235323300005446SoilRSSGNDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGVGSFQLEKQQDWDALQAGP*
Ga0066689_1035175823300005447SoilMKKQMLIALLSVCACATLSRSSGNDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP*
Ga0066682_1006572013300005450SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQL
Ga0066701_1027980123300005552SoilMKRMMITLLSVCACATLSRSSGNDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGVGSFQLEKQQDWDALQAGP*
Ga0066707_1001301173300005556SoilMKRMMMTLLSLCACATLSRTSGNDALSGTWRGVVRKGLVESVVWFDFMRTDAGYRGNYWGMAPPGERVALTGVELGHSVRFEVPRMGVFEGEIAGGTMEGTFVDAQGAGSFQLAKQQDWDALQAGP*
Ga0066699_1000320573300005561SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVQLGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP*
Ga0066699_1033249823300005561SoilMKRMMIGLLSLCACASVSRASANDALSGTWRGVVRKGAMESVVLFEFSRTGTGYRGIYWGTAPLGLPVPLAGVEFGHSVRFEVPKVAVFDGEVAGDTMEGTFDDGQGPGSFRLEKQPAWEIPET*
Ga0066705_1000726573300005569SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDA
Ga0066665_1008292223300006796SoilMKRMMMTLLSLCACATLSRTSGNDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQQDWDALQAGP*
Ga0066660_1012062233300006800SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGVRVALTGVELGHSVRFEVPRMGVFEGEIAGEMMEGTFADAQGAGSFQLEKQPDWDALSAGP*
Ga0099791_1001543223300007255Vadose Zone SoilMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0099791_1005631923300007255Vadose Zone SoilLSVAPAIGFADGFLPVEDAMKKWTMFALLSMFACAGVSRASADDALSGSWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPVGAPMSLSGIEVGHSVRFEVPRMGVFEGEIVGETIEGTFHDAEGAGSFRLEKQLAWDALQNGP*
Ga0099793_1001843723300007258Vadose Zone SoilLSAASAIGVADGFLPMEDAMKKWTMFALLSTFACAGVSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGAPVSLSGIELGHSVRFEVPQMGVFDGEIAGETMEGTFHDAEGAGSFRLQKQIAWDALPNGP*
Ga0099793_1005687613300007258Vadose Zone SoilMKKWTMFSLLSIFACAGVSRASADDALSGPWHGVIRKGMLESIVYFDFSRTDTGYRGNYWGTIPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0099794_1026852913300007265Vadose Zone SoilLSVAPAIGFADGFLPVEDAMKKWTMFALLSIFACAGVSRASAEDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0099795_1037342123300007788Vadose Zone SoilSRASASDALSGPWRGVVRKGMLQNVVYFDFSRTDAGYRGNYWGTAPIGAPVSLSGIELGHSVRFEVPQMGVFDGEIAGETMEGTFHDAEGEGSFRLQKQIAWDALPNGP*
Ga0066710_10021508943300009012Grasslands SoilMKKWTMFAMLSIFACAGISRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSLRFEVPRMGVFDGQIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGS
Ga0066710_10043725833300009012Grasslands SoilMKRMMMTLLSLCACATLSRTSGNDALSGTWRGVVRKGLVESVVWFDFMRTDAGYRGNYWGMAPPGERVALTGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQQDWDALQAGP
Ga0066709_10031493323300009137Grasslands SoilLPKEDVMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVQLGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP*
Ga0066709_10047377833300009137Grasslands SoilLPKEDVMKRMMMTLLSLCACATLSRTSGNDALSGTWRGVVRKGLVESVVWFDFMRTDAGYRGNYWGMAPPGERVALTGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQ
Ga0066709_10048156033300009137Grasslands SoilMEDAMKKWTMFAMLSIFACAGISRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFDGQIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGS*
Ga0099792_1007122023300009143Vadose Zone SoilLSAASAIGVADGFLPMEDAMKKWTMFALLSTFACAGVSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGAPVSLSGIELSHSVRFEVPQMGVFDGEIAGETMEGTFHDAEGAGSFRLQKQIAWDALPNGP*
Ga0134067_1007846623300010321Grasslands SoilLSETFAIASADGFLPKEDVMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVQLGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP*
Ga0134063_1032654913300010335Grasslands SoilENGSAHSQQLDELTPRFHEALRLVHRGDSKRCLVIPGCRAREFVRDFRNRFRRRLLPKEDVMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP*
Ga0134066_1012563523300010364Grasslands SoilFGCAGVSRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFDGQIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGS*
Ga0137363_1012544313300012202Vadose Zone SoilMEDAMKKWTMFALLSTFAFAGVSRASSDDALSGPWHGVIRKGMLVNVVYFDFSRTDTGYRGNYWGTAPIGAPVSLSGIELGRSVRFEVPRMGVFDGEIAGETIEGTFHDAEGAGS
Ga0137363_1086998313300012202Vadose Zone SoilKWTMFALLSIFACAGMSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPMSLSGIELGHSVRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQIAWDALPNGP*
Ga0137399_1028256833300012203Vadose Zone SoilMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTIPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0137362_1002531123300012205Vadose Zone SoilMEDVMKKWTMFALLSIFACAGMSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPMSLSGIELGHSVRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQIAWDALPNGP*
Ga0137362_1041063223300012205Vadose Zone SoilLSAASAIGSADGFLPMEDAMKKWTMFALLSTFAFAGVSRASSDDALSGPWHGVIRKGMLVNVVYFDFSRTDTGYRGNYWGTAPIGAPVSLSGIELGRSVRFEVPRMGVFDGEIAGETIEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0137376_1017464233300012208Vadose Zone SoilMKPSGWSTAGTRSAVWLFPDVARGNLSATSVIASADGFLPKEDVMKKQMVIALLSVCACATLSRASGTDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGVRVALTGVELGHSVRFEVPRMGVFEGEIAGEMMEGTFADAQGAGSFQLEKQLDWDALQAGP*
Ga0137376_1082844523300012208Vadose Zone SoilIVLCMCTPFLGFAVVAKFSPSGLGALSSYSPDRPRGNLSAAPAIGSADGFLPMEDAMKKWTFALLSMFACAGVSRASADDALSGPWRGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFDGQIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGS*
Ga0137377_1129369313300012211Vadose Zone SoilVWLFPDAVRGNLSATSVIASADGFLPKEDVMKKQMLIALLSVCACATLSRASGTDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGVRVALTGVELGHSVRFEVPRMGVFEGEIAGEMMEGTFADAQGAGSFQ
Ga0137360_1002960023300012361Vadose Zone SoilLSAASAIGPADGFSPMEDVMKKWTMFALLSIFACAGMSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPMSLSGIELGHSVRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQIAWDALPNGP*
Ga0137361_1002641423300012362Vadose Zone SoilMEDVMKKWTMFALLSIFACAGMSRASADDPLSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPMSLSGIELGHSVRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQIAWDALPNGP*
Ga0137358_1009401423300012582Vadose Zone SoilMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIEVGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0137358_1033293013300012582Vadose Zone SoilMEDVMKKWTMFALLSIFACAGMSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPMSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFQDAEGAGSFRLEKRLAWDALPNGP*
Ga0137397_1052463313300012685Vadose Zone SoilLSAASAIASADGFLPMEDAMKKWTMFALLSTFACAGVSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPVSLSGIELGHSVRFEVPQMGVFDGAIAGETMEGTFHDAEGAGSFRLQKQIAWDALPNGP*
Ga0137395_1001942543300012917Vadose Zone SoilLSAASAIGVADGFLPMEDAMKKWTMFALLSTFACAGVSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPMSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFHDAEGAGSFRLQKQIAWDALPNGP*
Ga0137396_1045568223300012918Vadose Zone SoilLSVAPAIGFADGFLPVEDAMKKWTMFALLSMFACAGVSRASADDALSGSWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPVGAPMSLSGIEVGHSVRFEVPRMGVFEGEIVGETIEGTFHDAEGAGSF
Ga0137394_1001415553300012922Vadose Zone SoilLPVEDAMKKWTMFALLSMFACAGVSRASADDALSGSWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPVGAPMSLSGIEVGHSVRFEVPRMGVFEGEIVGETIEGTFHDAEGAGSFRLEKQLAWDALQNGP*
Ga0137394_1005940843300012922Vadose Zone SoilLSAASAIASADGFLPMEDAMKKWTMFALLSTFACAGVSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPVSLSGIELGHSVRFEVPQMGVFDGEIAGETMEGTFHDAEGAGSFRLQKQIAWDALPNGP*
Ga0137419_1013497813300012925Vadose Zone SoilLSRYSRTGRAGICRRLRQSDPPTASYLEVAMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0137407_10000951103300012930Vadose Zone SoilLSVAPAIGFADGFLPVEDAMKKWTMFALLSMFACAGVSRASADDALSGSWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPVGAPMSLSGIEVGHSVRFEVPRIGVFEGEIVGETIEGTFHDAEGAGSFRLEKQLAWDALQNGP*
Ga0137407_1129480423300012930Vadose Zone SoilMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTF
Ga0137410_1010425933300012944Vadose Zone SoilVASAIGSAEGFLPKEDAMVKWAMLALLSIFACAGVSRASADDALSGQWQGVVRKGMLESLVYFDFSRTDTGYRGNYWGRAPIGSPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMKGTFKDAEGAGSFRLEKQTAWDPLPNGP*
Ga0134077_1043869723300012972Grasslands SoilGNLSATFAIASADGFLPKEDVMKKQMLIALLSVCACATLSRSSGNDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTYVDAQGAGSFQLEKQQDWDALQAGP*
Ga0134087_1008707423300012977Grasslands SoilLSETFAIASADGFLPKEDVMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETME
Ga0134078_1025463213300014157Grasslands SoilRGDSKRCLVIPGRRAREFVRDFRNRSADGFLPKEDVMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP*
Ga0137405_115147423300015053Vadose Zone SoilMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLENVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFQDAEGAGSFRLEKRLAWDALPNGP*
Ga0137405_117220043300015053Vadose Zone SoilKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLENVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFQDAEGAGSFRLEKRLAWDALPNGP*
Ga0137420_112592723300015054Vadose Zone SoilSYLEVAMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0137420_129339113300015054Vadose Zone SoilMKKWTMFALLSIFACAGVSRASAEDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAE
Ga0137409_1005608353300015245Vadose Zone SoilVASAIGSAEGFLPKEDAMVKWAMLALLSIFACAGVSRASADDALSGQWQGVVRKGMLESLVYFDFSRTDTGYRGNYWGRAPIGSPVSLSGIELGHSVRFEVPRMGVFEGEIVGETIEGTFHDAEGAGSFRLEKQLAWDALQNGP*
Ga0137403_10004082113300015264Vadose Zone SoilMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGP*
Ga0134072_1005657123300015357Grasslands SoilLVHRRDSKRCLVIPGRRAREFVRDFRNRSADGFLPKEDVMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP*
Ga0184605_1002580733300018027Groundwater SedimentMKKQMVIALLSVCACATLSRSSGNDALSGTWRGVVRKGLVESVVWFDFMRTDAGYRGNYWGMAPPGERIALAGVELGHSVRFEVPHMGVFEGEIAGETMEGTFADAQGAGSFLLEKQLDWDALQAGP
Ga0184619_1051067813300018061Groundwater SedimentSTAGTRSAVGLFPSAVRGNLSATFAIASADGFLPKEDVMKKQMVIALLSVCACATLSRSSGNDALSGTWRGVVRKGLVESVVWFDFMRTDAGYRGNYWGMAPPGERIALAGVELGHSVRFEVPHMGVFEGEIAGETMEGTFADAQGAGSFLLEKQLDWDALQAGP
Ga0066655_1052344623300018431Grasslands SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP
Ga0066667_1011531633300018433Grasslands SoilMKKWTMFAMLSICACAGVSRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFDGQIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGS
Ga0066667_1046240623300018433Grasslands SoilMKRMMMTLLSLCACATLSRTSGNDALSGTWRGVVRKGLVESVVWFDFMRTDAGYRGNYWGMAPPGERVALTGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP
Ga0066669_1002555233300018482Grasslands SoilMKKWTMFAMLSLFACAGVSRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFDGQIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGS
Ga0137408_118153233300019789Vadose Zone SoilMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLENVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFQDAEGAGSFRLEKRLAWDALPNGP
Ga0137408_125678223300019789Vadose Zone SoilMKKWTMFALLSIFACAGMSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPMSLSGIELGHSVRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQIAWDALPNGP
Ga0179594_1011945023300020170Vadose Zone SoilASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGRAPIGTPMSLSGIELGHSVRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQIAWDALPNGP
Ga0179592_1006628623300020199Vadose Zone SoilMKKWTMFALLSTFACAGVSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGAPVSLSGIELGHSVRFEVPQMGVFDGEIAGETMEGTFHDAEGAGSFRLQKQIAWDALPNGP
Ga0137417_119707533300024330Vadose Zone SoilSRASADDALSGPWHGVIRKGMLESIVYFDFSRTDTGYRGNYWGTIPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP
Ga0209153_118719123300026312SoilMKRMMIGLLSLCACAGVSRASANDALSGTWRGVVRKGAMESVVLFEFSRTGTGYRGIYWGTAPLGLPVPLAGVEFGHSVRFEVPKVAVFDGEVAGDTMEGTFDDGQGPGSFRLEKQPAWEIPET
Ga0209268_109387323300026314SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTF
Ga0209808_101182463300026523SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVELAGVEIGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP
Ga0209690_107147323300026524SoilMKRMMITLLSVCACATLSRSSGNDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETMEGTFVDGQGAGSFQLAKQLDWDALQAGP
Ga0209058_105596823300026536SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVELGHSVRFEVPRMGVFEGEIAGETAKQLDWDALQAGP
Ga0209805_113024323300026542SoilMKKQMLIALLSVCACATLSRSSGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGERVALAGVQLGHSVRFEVPRMGVFEGEIAGETMEGTFVDAQGAGSFQLAKQLDWDALQAGP
Ga0209156_1015721823300026547SoilMKKWTMFAMLSIFACAGISRASADDVLSGPWHGVVRKGMLENVVYFDFSRTDTGYRGTYWGMAPTGAPVSLSGIELGHSVRFEVPRMGVFDGQIAGDTMEGTFHDAEGAGSF
Ga0209474_1043578523300026550SoilASGSRASANDALSGTWRGVLRKGALESVVILEFSRTEAGYRGTYWGTAPLGLPVPLAGVEFGHSLRFEVPKVAVFDGELAGDTMEGTFQDTDGAGSFRLEKQPDWEVPLT
Ga0209577_1077795113300026552SoilGDDALSGTWRGVVRKGLVESVVWFDFTRTDAGYRGNYWGMAPPGVRVALTGVELGHSVRFEVPRMGVFEGEIAGEMMEGTFADAQGAGSFQLEKQPDWDALSAGP
Ga0209524_100517213300027521Forest SoilMKKWTFALLSIFACAGVSRASADDALSGPWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGTAPIGAPVSLSGIELGHSVRFEVPRMAVFDGEIAGETMVGTFHDAEGAGSFRLQKQLAWDALPNGP
Ga0209331_110908523300027603Forest SoilMKKWTFALLSIFACAGVSRASADDALSGPWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGTAPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIVGETMEGTFHDAE
Ga0209076_115021323300027643Vadose Zone SoilASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGTIPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP
Ga0209217_100038543300027651Forest SoilMKKWTFALLSIFACAGVSRASADDALSGPWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGTAPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIVGETMEGTFHDAEGAGSFRLQKQLAWDALPNGP
Ga0209388_112594513300027655Vadose Zone SoilNLSVAPAIGFADGFLPVEDAMKKWTMFALLSMFACAGVSRASADDALSGSWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPVGAPMSLSGIEVGHSVRFEVPRMGVFEGEIVGETIEGTFHDAEGAGSFRLEKQLAWDALQNGP
Ga0209388_116346513300027655Vadose Zone SoilDPPTASYLEVAMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP
Ga0208990_100996823300027663Forest SoilMKKWTMFALLSTFACAGVSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGTPVSLSGIELGHSVRFEVPQMGVFDGEIAGETMEGTFHDAEGAGSFRLEKQLAWDALPNGP
Ga0209011_102143113300027678Forest SoilMKKWTFALLSIFACAGVSRASAADALSGPWHGVVRKGMLENVVYFDFSRTDTGYRGNYWGTAPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFHDAEGAGSFRLEKQLGWDALPNGP
Ga0209488_1000535953300027903Vadose Zone SoilMKKWTMFALLSTFACAGVSRASADDALSGPWQGVVRKGMLENVVYFDFSRTDTGYRGNYWGRAPIGAPVSLSGIELSHSVRFEVPQMGVFDGEIAGETMEGTFHDAEGAGSFRLQKQIAWDALPNGP
Ga0137415_1000507453300028536Vadose Zone SoilMKKWTMFALLSIFACAGVSRASADDALSGPWHGVIRKGMLESVVYFDFSRTDTGYRGNYWGTVPIGAPVSLSGIELGHPLRFEVPRMGVFDGEIAGDTMEGTFHDAEGAGSFRLEKQLAWDALPNGP
Ga0307312_1058995513300028828SoilMKKRMMVALLGAVCACAGLPRASDDALSGTWRGVVRKGALESVVVFQFSRMESGYRGNYWGMPPLTAPIPLTGIELGHSVRFEVPRVGVFHGEIGGEAIDGTFEDAQGPGSFHLEKQGPLDDGTLAV
Ga0307469_1068734923300031720Hardwood Forest SoilMMKWAMLALLSIFACAGVSRASADDALSGPWQGVVRKGMLESVVYFDFSRTDTGYRGNYWGRAPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFKDAEGAGSFRLEKKTAWDPLPNGP
Ga0307471_10102565923300032180Hardwood Forest SoilRGDPKGCFVIPDRLRGNLSVASAIGSADGFLPKEDAMMKWTMLALMSIFACAGVSRASADDALSGPWQGVVRKGMLESVVYFDFSRTDTGYRGNYWGRAPIGAPVSLSGIELGHSVRFEVPRMGVFDGEIAGETMEGTFKDAEGAGSFRLEKKTAWDPLPNGP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.