NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F095578

Overview

Basic Information
Family ID F095578
Family Type Metagenome
Number of Sequences 105
Average Sequence Length 121 residues
Representative Sequence MAATRTSQEIRVPLSFRIPLSQLEEIEEAVRKEVRKHRTDLLEVIWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDDVARLLTARAGKYGEPK
Number of Associated Samples 82
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 49.52 %
% of genes near scaffold ends (potentially truncated) 28.57 %
% of genes from short scaffolds (< 2000 bps) 61.90 %
Associated GOLD sequencing projects 71
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.27
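
The percentages above can be turned back into approximate gene counts. The sketch below assumes each percentage is taken over the 105 family sequences listed under Basic Information; the page does not state the denominator explicitly, so treat the counts as implied rather than reported. The dictionary labels and the helper name `implied_count` are illustrative.

```python
# Family size reported under Basic Information.
FAMILY_SIZE = 105

# Percentages as printed in the Quality Assessment block.
reported_pct = {
    "valid RBS motif": 49.52,
    "near scaffold end": 28.57,
    "short scaffold (< 2000 bp)": 61.90,
}


def implied_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Nearest whole number of genes corresponding to a percentage.

    Assumption: the percentage was computed over `total` family members.
    """
    return round(pct / 100 * total)


counts = {label: implied_count(pct) for label, pct in reported_pct.items()}

for label, n in counts.items():
    # Recompute the percentage from the integer count as a consistency check.
    print(f"{label}: {n} of {FAMILY_SIZE} genes ({n / FAMILY_SIZE:.2%})")
```

If the assumed denominator is right, each recomputed percentage should match the printed value to two decimal places.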


Most Common Taxonomy
Group Bacteria (86.667 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil
(10.476 % of family members)
Environment Ontology (ENVO) Unclassified
(26.667 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.476 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 56.55%, β-sheet: 0.00%, Coil/Unstructured: 43.45%

Predicted 3D Structure

pTM-score: 0.27

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
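
The rule in the note above amounts to a single threshold comparison. A minimal sketch, assuming the 0.7 cutoff is applied as a strict "less than" exactly as written; the function name `is_low_confidence` is hypothetical and not part of NMPFamsDB.

```python
# Cutoff quoted in the note: models with pTM < 0.7 are treated as low confidence.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a pTM-score falls below the cutoff."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError(f"pTM-score must lie in [0, 1], got {ptm}")
    return ptm < cutoff


# pTM-score reported for family F095578.
family_ptm = 0.27
print(is_low_confidence(family_ptm))
```

With the reported score of 0.27 the family falls on the low-confidence side, which is why no SCOPe or PDB screening is listed for it.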



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 105 Family Scaffolds
PF13483 | Lactamase_B_3 | 8.57
PF00106 | adh_short | 6.67
PF04542 | Sigma70_r2 | 2.86
PF13620 | CarboxypepD_reg | 2.86
PF01844 | HNH | 2.86
PF03703 | bPH_2 | 1.90
PF09989 | DUF2229 | 1.90
PF01926 | MMR_HSR1 | 1.90
PF00248 | Aldo_ket_red | 1.90
PF01712 | dNK | 1.90
PF01904 | DUF72 | 1.90
PF00593 | TonB_dep_Rec | 1.90
PF04545 | Sigma70_r4 | 1.90
PF01553 | Acyltransferase | 0.95
PF01625 | PMSR | 0.95
PF00753 | Lactamase_B | 0.95
PF13411 | MerR_1 | 0.95
PF01266 | DAO | 0.95
PF13231 | PMT_2 | 0.95
PF03787 | RAMPs | 0.95
PF08447 | PAS_3 | 0.95
PF10544 | T5orf172 | 0.95
PF13616 | Rotamase_3 | 0.95
PF16177 | ACAS_N | 0.95
PF04055 | Radical_SAM | 0.95
PF14824 | Sirohm_synth_M | 0.95
PF01713 | Smr | 0.95
PF09721 | Exosortase_EpsH | 0.95
PF01738 | DLH | 0.95
PF00535 | Glycos_transf_2 | 0.95
PF00486 | Trans_reg_C | 0.95
PF12706 | Lactamase_B_2 | 0.95
PF02223 | Thymidylate_kin | 0.95
PF03190 | Thioredox_DsbH | 0.95
PF13145 | Rotamase_2 | 0.95
PF09723 | Zn-ribbon_8 | 0.95

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 105 Family Scaffolds
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 2.86
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 2.86
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 2.86
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 2.86
COG1428 | Deoxyadenosine/deoxycytidine kinase | Nucleotide transport and metabolism [F] | 1.90
COG1801 | Sugar isomerase-related protein YecE, UPF0759/DUF72 family | General function prediction only [R] | 1.90
COG3402 | Uncharacterized membrane protein YdbS, contains bPH2 (bacterial pleckstrin homology) domain | Function unknown [S] | 1.90
COG3428 | Uncharacterized membrane protein YdbT, contains bPH2 (bacterial pleckstrin homology) domain | Function unknown [S] | 1.90
COG0125 | Thymidylate kinase | Nucleotide transport and metabolism [F] | 0.95
COG0225 | Peptide methionine sulfoxide reductase MsrA | Posttranslational modification, protein turnover, chaperones [O] | 0.95
COG1331 | Uncharacterized conserved protein YyaL, SSP411 family, contains thioredoxin and six-hairpin glycosidase-like domains | General function prediction only [R] | 0.95
COG1332 | CRISPR-Cas system type III CSM-effector complex subunit Csm5, RAMP superfamily Cas7 group | Defense mechanisms [V] | 0.95
COG1336 | CRISPR-Cas system type III CMR-effector complex subunit Cmr4, RAMP superfamily Cas7 group | Defense mechanisms [V] | 0.95
COG1337 | CRISPR-Cas system type III CSM-effector complex subunit Csm3, RAMP superfamily Cas7 group | Defense mechanisms [V] | 0.95
COG1367 | CRISPR-Cas system type III CMR-effector complex subunit Cmr1, RAMP superfamily Cas7 group | Defense mechanisms [V] | 0.95
COG1567 | CRISPR-Cas system type III CSM-effector complex subunit Csm4, RAMP superfamily Cas5 group | Defense mechanisms [V] | 0.95
COG1604 | CRISPR/Cas system CMR subunit Cmr6, Cas7 group, RAMP superfamily | Defense mechanisms [V] | 0.95



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 86.67 %
Unclassified | root | N/A | 13.33 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002124|C687J26631_10100423 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 982
3300002914|JGI25617J43924_10297019 | Not Available | 555
3300004019|Ga0055439_10084218 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 921
3300004025|Ga0055433_10178926 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 526
3300004080|Ga0062385_11003907 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 560
3300005174|Ga0066680_10011678 | All Organisms → cellular organisms → Bacteria | 4542
3300005176|Ga0066679_10043513 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2550
3300005179|Ga0066684_10673708 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 694
3300005524|Ga0070737_10002284 | All Organisms → cellular organisms → Bacteria | 23602
3300005534|Ga0070735_10138424 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1514
3300005534|Ga0070735_10479023 | All Organisms → cellular organisms → Bacteria | 742
3300005534|Ga0070735_10498760 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 725
3300005538|Ga0070731_10093255 | All Organisms → cellular organisms → Bacteria | 1995
3300005541|Ga0070733_10000696 | All Organisms → cellular organisms → Bacteria | 26492
3300005555|Ga0066692_10431410 | All Organisms → cellular organisms → Bacteria | 838
3300005557|Ga0066704_10304081 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1077
3300005561|Ga0066699_10060424 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2370
3300005575|Ga0066702_10394886 | All Organisms → cellular organisms → Bacteria | 844
3300005836|Ga0074470_10355969 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 824
3300006354|Ga0075021_10232355 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1131
3300006755|Ga0079222_10974383 | Not Available | 724
3300006800|Ga0066660_10136475 | All Organisms → cellular organisms → Bacteria | 1798
3300006804|Ga0079221_10010601 | All Organisms → cellular organisms → Bacteria | 3521
3300006804|Ga0079221_10011733 | All Organisms → cellular organisms → Bacteria | 3368
3300006804|Ga0079221_10123479 | All Organisms → cellular organisms → Bacteria | 1312
3300006804|Ga0079221_10743896 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 692
3300006806|Ga0079220_10002417 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6054
3300006806|Ga0079220_10029045 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2428
3300006954|Ga0079219_10287386 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1008
3300009038|Ga0099829_10053758 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2994
3300009088|Ga0099830_11138742 | Not Available | 647
3300009093|Ga0105240_10005234 | All Organisms → cellular organisms → Bacteria | 19389
3300010361|Ga0126378_10562319 | Not Available | 1255
3300010371|Ga0134125_10056793 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4351
3300010373|Ga0134128_10427121 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1479
3300010396|Ga0134126_10004232 | All Organisms → cellular organisms → Bacteria | 17971
3300010396|Ga0134126_10106460 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3441
3300010396|Ga0134126_12297645 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 588
3300010398|Ga0126383_10186570 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1977
3300010400|Ga0134122_10682264 | All Organisms → cellular organisms → Bacteria | 962
3300012202|Ga0137363_11418411 | Not Available | 585
3300012211|Ga0137377_11099141 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 725
3300012349|Ga0137387_10663132 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 755
3300012927|Ga0137416_10496150 | Not Available | 1051
3300012927|Ga0137416_11907700 | Not Available | 544
3300012931|Ga0153915_10229750 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 2047
3300012964|Ga0153916_11041786 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 898
3300013307|Ga0157372_11076600 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 930
3300017821|Ga0187812_1077777 | Not Available | 1096
3300017822|Ga0187802_10117168 | Not Available | 1007
3300017970|Ga0187783_10033996 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3779
3300018063|Ga0184637_10583327 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 636
3300018074|Ga0184640_10455757 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 568
3300018077|Ga0184633_10217599 | Not Available | 985
3300018079|Ga0184627_10190733 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1084
3300018468|Ga0066662_10234843 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1484
3300018468|Ga0066662_10733111 | All Organisms → cellular organisms → Bacteria | 948
3300021046|Ga0215015_10342269 | All Organisms → cellular organisms → Bacteria | 7761
3300021170|Ga0210400_10575220 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 929
3300021403|Ga0210397_10001009 | All Organisms → cellular organisms → Bacteria | 18340
3300021420|Ga0210394_10064415 | All Organisms → cellular organisms → Bacteria | 3181
3300021420|Ga0210394_10095628 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2569
3300021560|Ga0126371_10017382 | All Organisms → cellular organisms → Bacteria | 6600
3300025155|Ga0209320_10174564 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 945
3300025160|Ga0209109_10029645 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2949
3300025165|Ga0209108_10007699 | All Organisms → cellular organisms → Bacteria | 6091
3300025167|Ga0209642_10589634 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 606
3300025289|Ga0209002_10235698 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1108
3300025313|Ga0209431_10678317 | Not Available | 764
3300025319|Ga0209520_10109770 | All Organisms → cellular organisms → Bacteria | 1759
3300025322|Ga0209641_10020729 | All Organisms → cellular organisms → Bacteria | 5152
3300025913|Ga0207695_10114402 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2673
3300026328|Ga0209802_1001865 | All Organisms → cellular organisms → Bacteria | 15110
3300026551|Ga0209648_10002811 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 14509
3300027674|Ga0209118_1036311 | All Organisms → cellular organisms → Bacteria | 1495
3300027725|Ga0209178_1000880 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 9162
3300027725|Ga0209178_1008726 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3181
3300027725|Ga0209178_1015281 | All Organisms → cellular organisms → Bacteria → Acidobacteria → environmental samples → uncultured Acidobacteria bacterium HF4000_26D02 | 2399
3300027846|Ga0209180_10024303 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3227
3300027867|Ga0209167_10000479 | All Organisms → cellular organisms → Bacteria | 29093
3300027869|Ga0209579_10158281 | All Organisms → cellular organisms → Bacteria | 1209
3300027894|Ga0209068_10485456 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 711
3300027968|Ga0209061_1000676 | All Organisms → cellular organisms → Bacteria | 75445
3300027986|Ga0209168_10156868 | All Organisms → cellular organisms → Bacteria | 1152
3300027986|Ga0209168_10428037 | Not Available | 643
3300028536|Ga0137415_10152947 | All Organisms → cellular organisms → Bacteria | 2143
3300028536|Ga0137415_10494397 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1031
3300028536|Ga0137415_10545313 | Not Available | 969
3300030606|Ga0299906_10313921 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1223
3300031949|Ga0214473_10001827 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 27235
3300031949|Ga0214473_10021674 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 7522
3300031949|Ga0214473_10074171 | All Organisms → cellular organisms → Bacteria | 3979
3300031949|Ga0214473_11812764 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 602
3300031949|Ga0214473_12304710 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 516
3300031962|Ga0307479_10001730 | All Organisms → cellular organisms → Bacteria | 19908
3300031962|Ga0307479_10352368 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidobacterium → Acidobacterium ailaaui | 1455
3300031965|Ga0326597_10193743 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2390
3300031965|Ga0326597_10319718 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1757
3300032770|Ga0335085_10305543 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1883
3300032805|Ga0335078_11300161 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 830
3300032892|Ga0335081_10613258 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1340
3300032896|Ga0335075_10003807 | All Organisms → cellular organisms → Bacteria | 23711
3300033433|Ga0326726_10971778 | All Organisms → cellular organisms → Bacteria | 824
3300033433|Ga0326726_11795661 | All Organisms → cellular organisms → Bacteria | 597
3300034090|Ga0326723_0300011 | Not Available | 721




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.48%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 10.48%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 10.48%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 8.57%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 5.71%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.76%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 4.76%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 3.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.81%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.81%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 3.81%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 3.81%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.86%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 2.86%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 1.90%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.90%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.90%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.90%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.90%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.90%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.95%
Sediment (Intertidal) | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal) | 0.95%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.95%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.95%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.95%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002124 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_3 | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300004019 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 | Environmental
3300004025 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1 | Environmental
3300004080 | Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005524 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen10_05102014_R1 | Environmental
3300005534 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 | Environmental
3300005538 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005836 | Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBB | Environmental
3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009093 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG | Host-Associated
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012964 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 4 metaG | Environmental
3300013307 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaG | Host-Associated
3300017821 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW-S_2 | Environmental
3300017822 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2 | Environmental
3300017970 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MG | Environmental
3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental
3300018074 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2 | Environmental
3300018077 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1 | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300025155 | Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 4 | Environmental
3300025160 | Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2 | Environmental
3300025165 | Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1 | Environmental
3300025167 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_2 (SPAdes) | Environmental
3300025289 | Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 2 | Environmental
3300025313 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes) | Environmental
3300025319 | Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 1 | Environmental
3300025322 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 (SPAdes) | Environmental
3300025913 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes) | Host-Associated
3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental
3300027725 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027867 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes) | Environmental
3300027869 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300027968 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen10_05102014_R1 (SPAdes) | Environmental
3300027986 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300030606 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT145D125 | Environmental
3300031949 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300031965 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185 | Environmental
3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental
3300032805 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 | Environmental
3300032892 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5 | Environmental
3300032896 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.4 | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental
3300034090 | Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00N | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
C687J26631_101004232 | 3300002124 | Soil | MHTLHTIPIEIVVVKRYFQRQNAAMAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYRRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDEVARLLTARAGKYGDSK*
JGI25617J43924_102970191 | 3300002914 | Grasslands Soil | MIKTKAANTITIPLSFRVSPAHLEEIERATKKEVRKYRTELIEMIWEWAWKEYKRAGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDVARSLTARAGRYGDEK*
Ga0055439_100842182 | 3300004019 | Natural And Restored Wetlands | MPTTRAPKDLRVPLSFRIPLSQLEEIEGAVRKEVRRHRSDLVELVWDWAWKEYKRSGSLHALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIEEVGRILTSR
Ga0055433_101789261 | 3300004025 | Natural And Restored Wetlands | SFRIPLSQLEEIEGAVRKEVRRHRSDLVELVWDWAWKEYKRSGSLHALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIEEVGRILTSRAGKYGEAK*
Ga0062385_110039071 | 3300004080 | Bog Forest Soil | SRILEGTMRAVKNKTDRELNVPLSFRVPLRQLHEIEEAVGKEARRHRTDLLEFIWNWAWGEYKKSGSLQALLAGTRTRRYSRRVSEELQDHLYTALETILDRAPSTVVEEIARTLTQRAGKYGTEK*
Ga0066680_100116785 | 3300005174 | Soil | MTKTKAAETLTVPLSFRVSPAHLEEIERATKKEVRKYRTELIGMIWEWAWKEYKRAGSLRDLLDGTRARQYSRRVSEELQDELYTALETILNRAPSAVIEDVARSLTARAGKYGDER*
Ga0066679_100435133 | 3300005176 | Soil | MTKTKSAETLSVPLSFRVPPAHLEEIERAAKKEVRKYRTELIELIWEWAWKEYKRAGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDMARSLTSRAGKYGDER*
Ga0066684_106737081 | 3300005179 | Soil | MTKTKSAETLSVPLSFRVPPAHVEEIERATKREVRKYRTELIELIWEWAWKEYKRAGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDMARSLTSRAGKYGDER*
Ga0070737_100022845 | 3300005524 | Surface Soil | MHTKHTMIREITLVNRKSDGHDDGMKTRVQRDLNVPVSFRIPLQQLHEIEDAVQKEARRHRSDLLEFVWNWAWNEYKKAGSLRDLLAGTRTRRYSRRVSEELQDELFTALETILERAPSAVIEDVARTLTQRAGKYGFEK*
Ga0070735_101384242 | 3300005534 | Surface Soil | MAKSKAADSLTVPLSFRVSPAHLEEIERSTRKEVRKYRSELIELIWEWAWKEYERSGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDVARLLTARAGKYGDEK*
Ga0070735_104790232 | 3300005534 | Surface Soil | MTTTKVAGSLTVPLSFRVSAAHLEEIERATKKEVRKYRTEVIEMIWEWAWKEYKRAGSLRDLLEGTRARRYSRRVSEELQDQLYTALETILNRAPSAVIEDVARSLTARAGKYGDDK*
Ga0070735_104987601 | 3300005534 | Surface Soil | VVKSSCLKYDGVMPQTTASREVKVPLSFRIPLSQLEEIEAAVARESRKHRTDLIDFIWNWAWNEYKKAGSLQALLGGTRTGRYSRRVSEELQDQLYTALATILERAPSTVIENVAHLLTQRAGKYGGE*
Ga0070731_100932551 | 3300005538 | Surface Soil | GVCVVYIQYMRDVVKSRVRAIRWGMPQPTSSRDVKVPLSFRIPLSQLEEIEAAVATESRKHRTDLIDFIWNWAWHEYKKAGSLQALLGGTRTGRYSRRVSEELQDQLYTALATILERAPSTVVENVAHLLTQRAGKYGGE*
Ga0070733_100006965 | 3300005541 | Surface Soil | MRTKPPRELNVPVSFRIPLEQLHEIEEAVHKEARRHRTDLLEFVWNWAWDEYKKAGSLQGLLAGTRTRRYSRRVSEELQDELYTALATILERAPSAVIEDVARTLTQRAGKYGSEK*
Ga0066692_104314101 | 3300005555 | Soil | MTKTKAAETLSVPLSFRVPRAHLEEIERAAKKEVRKYRTELIGMIWEWAWKEYKRAGSLRDLLDGTRARQYSRRVSEELQDELYTALETILNRAPSAVIEDVARSLTARAGKYGDER*
Ga0066704_103040812 | 3300005557 | Soil | MTKTKAAETLSVPLSFRVSPAHLEEIERAAKKEVRKYRTELIGMIWEWAWKEYKRAGSLRDLLDGTRARQYSRRVSEELQDELYTALETILNRAPSAVIEDVARSLTARAGKYGDER*
Ga0066699_100604244 | 3300005561 | Soil | MTKTKSAETLSVPLSFRVPPAHLEEIERAAKKEVRKYRTELIELIWEWAWKEYQRAGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDMARSLTSRAGKYGDER*
Ga0066702_103948861 | 3300005575 | Soil | GQHLIMTKTKSAETLSVPLSFRVPPAHLEEIERAAKKEVRKYRTELIELIWEWAWKEYKRAGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDMARSLTSRAGKYGDER*
Ga0074470_103559692 | 3300005836 | Sediment (Intertidal) | MAMTRNAQDVRVPLSFRIPLSQLEEIEEAVRKEVRKHRTDIVELVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIEDVARILTSRAGKYGDAK*
Ga0075021_102323552 | 3300006354 | Watersheds | VPKTTSEKELRVPLSFRIPLKQLREIEEAVLKETRKHRTDLVEFIWNWAWNEYRKSGSLQALLSGTRTGRYSRRVSEELQDQLYTALETILERAPSTVIEDVAHGLTTRAGKYGTPK*
Ga0079222_109743831 | 3300006755 | Agricultural Soil | MYTLHTLHRISLLSRNIFKGRMLAVLKTRTTREPNVPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYIALGTILERAPSAVIDDVARTLTSRAGKYGSEK*
Ga0066660_101364752 | 3300006800 | Soil | MTKTKSAETLSVPLSFRVPPAHVEEIERATKKEVRKYRTELIELIWEWAWKEYKRAGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDMARSLTSRAGKYGDER*
Ga0079221_100106012 | 3300006804 | Agricultural Soil | MKTRPTRDLNVPVSFRIPLQQLHEIEEAVQKEARRHRSDVLEFVWSWAWSEYKKAGSLQALLAGTRTRHYSRRVSEELQDELYTALTTILERAPSAVIEDVARVLTQRAGKYGGEK*
Ga0079221_100117335 | 3300006804 | Agricultural Soil | MTCLHTLHTMNCEISLVNRNSHGHHDGMRTRPTRDLNVPVSFRIPLQQLHEIEEAVHKEARRHRSDLLEFVWNWAWSEYKKAGSLQALLAGTRTRQYSRRVSEELQDELYTALNTILERAPSAVIEDVARVLTQRAGKYGGEK*
Ga0079221_101234791 | 3300006804 | Agricultural Soil | MYGMYTLHTLHRISLLSRSIFKGRMLAVLKTRTTREPNVPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYIALGTILERAPSAVIDDVAR
Ga0079221_107438961 | 3300006804 | Agricultural Soil | MPTVKPSRELNVPLSFRIPLSQLHEIEEAVRKEARRHRTDVIEFVWNWAWGEYKKAGSLQALLSGTRTRKYTRRVSEELQDQLHTALATIIERAPSAVI
Ga0079220_100024171 | 3300006806 | Agricultural Soil | MYGMYTLHTLHRISLLSRSIFKGRMLAVLKTRTTREPNVPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYIALGTILERAPSAVIDDVARILTSRAGKYGSEK*
Ga0079220_100290453 | 3300006806 | Agricultural Soil | MYTLHTLHRISLLSRNIFKGRMLAVLKTRTTREPNAPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYTALGTILERAPSAVIDDVARTLTSRAGKYGSEK*
Ga0079219_102873861 | 3300006954 | Agricultural Soil | MLAVLKTRTTREPNVPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYTALRTILERAPSAVIDDVARTLTSRAGKYGSEK
Ga0099829_100537584 | 3300009038 | Vadose Zone Soil | MMLRGDIRVVPKSKSDADLKIPLSFRIPLRQLNEIEEAVHNEARRHRTDLIEFIWDWAWNEYKKSGSLQALLTGTRAKRYSRRVSEELQDQLHTALETILDRAPSAVIEDVARLLTLRAGKYGSP*
Ga0099830_111387422 | 3300009088 | Vadose Zone Soil | PLSFRIPLRQLNEIEEAVRNEARRHRTDLIEFIWDWAWNEYKKSGSLQALLTGTRAKRYSRRVSEELQDQLHTALETILDRAPSAVIEDVARLLTLRAGKYGSP*
Ga0105240_100052346 | 3300009093 | Corn Rhizosphere | MLAVLKTRTTREPNVPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYIALGTILERAPSAVIDDVARTLTSRAGKYGSEK
Ga0126378_105623191 | 3300010361 | Tropical Forest Soil | SRDLNVPVSFRIPLEQLHEIEEAVQKEARRHRSDVLEFVWNWAWSEYKKAGSLQALLAGTRTRHYSRRVSEELQDELYTALTTILERAPSAVIEDVARVLTQRAGKYGSEK*
Ga0134125_100567932 | 3300010371 | Terrestrial Soil | MLAVLKTRTTREPNVPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYTALGTILERAPSAVIDDVARTLTSRAGKYGSEK
Ga0134128_104271213 | 3300010373 | Terrestrial Soil | LVNRNFARQDTRVPKARASRELNVPLSFRVPLQQIEEIEDSVRREARRHRTDLVEFIWGWAWEQYKKAGSLQSLLAGMRARRYSRRVSEELQDQLYTALETIIERAPSTVIEDVAGILTQRAGKYGTPK*
Ga0134126_100042327 | 3300010396 | Terrestrial Soil | VPKARASRELNVPLSFRVPLQQIEEIEDSVRREARRHRTDLVEFIWGWAWEQYKKAGSLQSLLAGMRARRYSRRVSEELQDQLYTALETIIERAPSTVIEDVAGILTQRAGKYGTPK*
Ga0134126_101064604 | 3300010396 | Terrestrial Soil | MNRQILLVNRNPHGHHDGMRTRPTRELNVPVSFRIPLQQLHEIEEAVQKEARRHRSDLLEFVWNWAWSEYKKAGSLQALLAGTRTRHYSRRVSEELQDDLFTALETIIDRAPSTVIDDVARVLTQRAGKYGAEK*
Ga0134126_122976451 | 3300010396 | Terrestrial Soil | MLVVLKTRTTREPNVPLSFRVAFTQLQEIEDAVRNEARRHRTDLVEFIWNWAWNEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYTALGTILERAPSAVIDDVARTLTSRAGKYGSEK
Ga0126383_101865703 | 3300010398 | Tropical Forest Soil | MPTAKPGRELNVPLSFRIPLSQLHEIEEAVRKEARRHRTDVIEFVWTWAWGEYKKAGSLQALLSGTRTRKYSRRVSEELQDQLHIALATIIERAPSAVIEDMARTLTARAGKYGSEK*
Ga0134122_106822642 | 3300010400 | Terrestrial Soil | MQTVYTLMHTFSIVNKKVGGQTCFMSPARTAQELRVPLSFRIPISQLEEIEEAVRKEVRKHRTDLVEVIWEWAWKEYKRSGSLRGLVEGRRARKYSRRVSEELQDQLYTAFETILERAPSAVIEDVASVLTTRAGKYGEPK*
Ga0137363_114184111 | 3300012202 | Vadose Zone Soil | MTKTKAAETLSVPLSFRVPPAHLEEIERAAKKEVRKYRTELIGMIWEWAWKEYKRAGSLRDLLDGTRARQYSRRVSEELQDELYTALETILNRAPSAVIEDVARSLTARAGKYGDER*
Ga0137377_110991412 | 3300012211 | Vadose Zone Soil | MTKTKAAETLSVPLSFRVSPAHLEEIERAAKKEVRKYRTELIGMIWEWAWKEYKRAGSLRDLLDGTRARQYSRRVSEELQDELYTALETILNRAPSAVIE
Ga0137387_106631322 | 3300012349 | Vadose Zone Soil | MTKTKAAETLSVPLSFRVSPAHLEEIERAAKKEVRKYRTELIGMIWEWAWKEYKRAGSLRDLLDGTRARQYSRRVSEELQDELYTALETILNRAPSAVIEDVARSLTARA
Ga0137416_104961501 | 3300012927 | Vadose Zone Soil | MTRHICVVPKIIHDKELRVPLSFRIPLQQLQEIEEAVRKETRKHRTDLVEFVWNWAWNEYKKSGSLQALLSGTRTGRYSRRVSEELQDQLYTALATILDRAPSTVIEDVAHTLTARAGKYGTPK*
Ga0137416_119077001 | 3300012927 | Vadose Zone Soil | EIWQVILCIVPKTTNEKELRVPLSFRVPLKQLREIEEAVLKETRKHRTDLLEFIWNWAWGEYRKSGSLQALLSGTRTGRYSRRVSEELQDQLYTALGAILERAPSAVIEDIARILTARAGKYGTPK*
Ga0153915_102297503 | 3300012931 | Freshwater Wetlands | LNVPISFRIPLQQLQDIEEAVHKEARRHRTDLLEFIWGWAWNEYKKSGSLQALLAGTRTRRYSRRVSEELQDQLFTALETILERAPSAVIEDVARALTARAGKYGTEE*
Ga0153916_110417862 | 3300012964 | Freshwater Wetlands | LNVPISFRVPLQQLQDIEEAVHKEARRHRTDLLEFIWGWAWNEYKKSGSLQALLAGTRTRRYSRRVSEELQDQLFTALETILERAPSAVIEDVARALTARAGKYGTEE*
Ga0157372_110766002 | 3300013307 | Corn Rhizosphere | MLAVLKTRTTREPNVPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYTALGTILERAPSAVIDDV
Ga0187812_10777773 | 3300017821 | Freshwater Sediment | MRNMPRSKVERELNVPLSFRVPLRQLEEIEEAVRKEARRHRTDLLEFIWNWAWNEYKKAGSLQALLAGTRAGKYTRRVSEELQDELYIALETILERAPSAVIE
Ga0187802_101171682 | 3300017822 | Freshwater Sediment | MRNMPRSKVERELNVPLSFRVPLRQLEEIEEAVRKEARRHRTDLLEFIWNWAWNEYKKAGSLQALLAGTRAGKYTRRVSEELQDELYIALETILERAPSAVIEDMARTLTQRAGRYGSEK
Ga0187783_100339963 | 3300017970 | Tropical Peatland | MPSSKSERELNVPLSFRIPLSQLQEIEEAVRKESRKYRTDLVEFIWNWAWNEYRRAGSLQGLLAGTRTGKYSRRVSEELQDQLYTALQTILERAPSAVIEDIARTLTQRAGKYGTERK
Ga0184637_105833272 | 3300018063 | Groundwater Sediment | MHTLHTIAIEIAVVNRYFQGHNTAMAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWEWAWKEYKRSGSLRGLLQGARARRYSRRVSEELQDQLYTAFETILERAPSAVIEDVARLLTSKAGKYGEPK
Ga0184640_104557571 | 3300018074 | Groundwater Sediment | MHTLHTIAIEIAVVNRYFQGHNAAMAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWEWAWKEYKRSGSLRGLLQGARARRYSRRVSEELQDQLYTAFETILERAPSAVIEDVARLLTSKAGKYGEPK
Ga0184633_102175991 | 3300018077 | Groundwater Sediment | MATVKSSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWEWAWKEYKRSGSLRGLLQGARARRYSRHVSEELQDQLYAAFETILERAPSAVIEDVAKLLATKAGKCGDPK
Ga0184627_101907331 | 3300018079 | Groundwater Sediment | MHTLHTIAIEIAVVNRYFQGHNTAMAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWEWAWKEYKRSGSLRGLLQGARARRYSRRVSEELQDQLYTAFETILERAPSAVIEDVAKLLATKAGKCGDPK
Ga0066662_102348432 | 3300018468 | Grasslands Soil | MTKTKAAETLSVPLSFRVSPAHLEEIERAAKKEVRKYRTELIGMIWEWAWKEYKRAGSLRDLLDGTRARQYSRRVSEELQDELYTALETILNRAPSAVIEDVARSLTARAGKYGDER
Ga0066662_107331112 | 3300018468 | Grasslands Soil | MTKTKSAETLSVPLSFRVPPAHVEEIERATKKEVRKYRTELIELIWEWAWKEYKRAGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDMARSLTSRAGKYGDER
Ga0215015_103422698 | 3300021046 | Soil | MTLSTAIFCGNIFVMTKTKAAETLTVPLSFRVSPAHLEEIERATKKEVRKYRTELIEMIWEWAWKEYRRAGSLRDLLEGTRARKYSRRVSEELQDELYTALETILNRAPSAVIEDVARSLTARAGKYGDER
Ga0210400_105752202 | 3300021170 | Soil | MPTTKANRELNVPLSFRIPLSQLEEIEEAVRKEARRHRTDLIEFIWSWAWSEYKKAGSLQSLLSGTRSRRYSRRVSEELQDQLYTALETILERAPSAVIDDLARVLTTRAGKYGSPTT
Ga0210397_100010095 | 3300021403 | Soil | MQTGEGTMRAVKNKTDRELNVPLSFRVPLRQLHEIEEAVGKEARRHRTDLLEFIWNWAWGEYKKAGSLQALLAGTRTRRYSRRVSEELQDQLYTALETVLDRAPSTVIEEIGRTLTLRAGKYGTEK
Ga0210394_100644153 | 3300021420 | Soil | MPQPTSSRDVKVPLSFRIPLSQLEEIEAAVATESRKHRTDLIDFIWNWAWHEYKKAGSLQALLGGTRTGRYSRRVSEELQDQLYTALATILERAPSTVVENVAHLLTQRAGKYGGE
Ga0210394_100956281 | 3300021420 | Soil | TTYDSTTITVVKRNVREHNAVMPTTRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETVLERAPSAVIEDVARVLTSRAGKYGEPK
Ga0126371_100173827 | 3300021560 | Tropical Forest Soil | MQKSKTHRELNVPISFRIPLPQLQEIEEAVQKEARRHRTDLLEFVWSWAWNEYRKAGSLQALLGGTRTRKYSRRVSEELQDDLYTALATIFERAPSAVIEDVARILTQRAGKYGSEK
Ga0209320_101745641 | 3300025155 | Soil | MAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYRRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDEVARLLT
Ga0209109_100296452 | 3300025160 | Soil | MAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYRRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDEVARLLTARAGKYGDSK
Ga0209108_100076993 | 3300025165 | Soil | MAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYRRSGSLRALLQGTRARKYSRRVSEELQDQLYTAFETILERAPSAVIDEVARLLTARAGKYGDSK
Ga0209642_105896342 | 3300025167 | Soil | MAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYRRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAV
Ga0209002_102356981 | 3300025289 | Soil | MHTLHTIPIEIVVVKRYFQRQNAAMAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYRRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDEVARLLTARAGKYGDSK
Ga0209431_106783172 | 3300025313 | Soil | MAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYRRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDEIARLLTARAGKYGDSK
Ga0209520_101097702 | 3300025319 | Soil | MHTLHTIPIEIVVVKRYFQRQNAAMAATRTSQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDEVARLLTARAGKYGDAK
Ga0209641_100207298 | 3300025322 | Soil | MAATRASQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWEWAWKEYKRSGSLRALLQGARARRYSRRVSEELQDQLYTAFETILERAPSAVIEDVARLLTTKAGKYGEPK
Ga0207695_101144024 | 3300025913 | Corn Rhizosphere | MYTLHTLHRISLLSRNIFKGRMLAVLKTRTTREPNVPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYTALGTILERAPSAVIDDVARTLTSRAGKYGSEK
Ga0209802_10018659 | 3300026328 | Soil | MTKTKAAETLTVPLSFRVSPAHLEEIERATKKEVRKYRTELIGMIWEWAWKEYKRAGSLRDLLDGTRARQYSRRVSEELQDELYTALETILNRAPSAVIEDVARSLTARAGKYGDER
Ga0209648_1000281114 | 3300026551 | Grasslands Soil | MIKTKAANTITIPLSFRVSPAHLEEIERATKKEVRKYRTELIEMIWEWAWKEYKRAGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDVARSLTARAGRYVRRREVNRPVLAEEQFRPNQCCR
Ga0209118_10363112 | 3300027674 | Forest Soil | MLPMRTTKSPRSLRLPLSFRIPLAQLQEIDKAMQIEVRKDRTDLVEIIWDWAWKEYKRAGSLQALLQGTRARKYSRRVSEELQDQLYIALETILERAPSAVIEDVGRSLTVRAGKYGEPKEE
Ga0209178_10008803 | 3300027725 | Agricultural Soil | VLKTRTTREPNVPLSFRIPFTQLQEIEDAVRAEARRHRTDLVEFIWNWAWSEYKKAGSLQALLGGTRTRRYSRRVSEELQDQLYIALGTILERAPSAVIDDVARTLTSRAGKYGSEK
Ga0209178_10087264 | 3300027725 | Agricultural Soil | MKTRPTRDLNVPVSFRIPLQQLHEIEEAVQKEARRHRSDVLEFVWSWAWSEYKKAGSLQALLAGTRTRHYSRRVSEELQDELYTALTTILERAPSAVIEDIARVLTQRAGKYGGEK
Ga0209178_10152814 | 3300027725 | Agricultural Soil | MTCLHTLHTMNCEISLVNRNSHGHHDGMRTRPTRDLNVPVSFRIPLQQLHEIEEAVHKEARRHRSDLLEFVWNWAWSEYKKAGSLQALLAGTRTRQYSRRVSEELQDELYTALNTILERAPSAVIEDVARVLTQRAGKYGGEK
Ga0209180_100243035 | 3300027846 | Vadose Zone Soil | LRGDIRVVPKSKSDADLKIPLSFRIPLRQLNEIEEAVHNEARRHRTDLIEFIWDWAWNEYKKSGSLQALLTGTRAKRYSRRVSEELQDQLHTALETILDRAPSAVIEDVARLLTLRAGKYGSP
Ga0209167_100004796 | 3300027867 | Surface Soil | MRTKPPRELNVPVSFRIPLEQLHEIEEAVHKEARRHRTDLLEFVWNWAWDEYKKAGSLQGLLAGTRTRRYSRRVSEELQDELYTALATILERAPSAVIEDVARTLTQRAGKYGSEK
Ga0209579_101582812 | 3300027869 | Surface Soil | LPSGVCVVYIQYMRDVVKSRVRAIRWGMPQPTSSRDVKVPLSFRIPLSQLEEIEAAVATESRKHRTDLIDFIWNWAWHEYKKAGSLQALLGGTRTGRYSRRVSEELQDQLYTALATILERAPSTVVENVAHLLTQRAGKYGGE
Ga0209068_104854561 | 3300027894 | Watersheds | LRVPLSFRIPLKQLREIEEAVLKETRKHRTDLVEFIWNWAWNEYRKSGSLQALLSGTRTGRYSRRVSEELQDQLYTALETILERAPSTVIEDVAHGLTTRAGKYGTPK
Ga0209061_100067646 | 3300027968 | Surface Soil | MHTKHTMIREITLVNRKSDGHDDGMKTRVQRDLNVPVSFRIPLQQLHEIEDAVQKEARRHRSDLLEFVWNWAWNEYKKAGSLRDLLAGTRTRRYSRRVSEELQDELFTALETILERAPSAVIEDVARTLTQRAGKYGFEK
Ga0209168_101568682 | 3300027986 | Surface Soil | NIFVMTKSKAADSLTVPLSFRVSPAHLEEIERSTRKEVRKYRSELIELIWEWAWKEYERSGSLRDLLEGTRARKYSRRVSEELQDQLYTALETILNRAPSAVIEDVARLLTARAGKYGDE
Ga0209168_104280371 | 3300027986 | Surface Soil | MTTTKVAGSLTVPLSFRVSAAHLEEIERATKKEVRKYRTEVIEMIWEWAWKEYKRAGSLRDLLEGTRARRYSRRVSEELQDQLYTALETILNRAPSAVIEDVARSLTARAGKYGDDK
Ga0137415_101529472 | 3300028536 | Vadose Zone Soil | MTKTKAAETLSVPLSFRVPPAHLEEIERAAKKEVRKYRTELIGMIWEWAWKEYKRAGSLRDLLDGTRARQYSRRVSEELQDELYTALETILNRAPSAVIEDVARSLTARAGKYGDER
Ga0137415_104943971 | 3300028536 | Vadose Zone Soil | VPKTTNEKELRVPLSFRVPLKQLREIEEAVLKETRKHRTDLLEFIWNWAWGEYRKSGSLQALLSGTRTGRYSRRVSEELQDQLYTALGAILERAPSAVIEDIARILTARAGKYGTPK
Ga0137415_105453131 | 3300028536 | Vadose Zone Soil | MHTIHTLLFEIVLVNKNMTRHICVVPKIIHDKELRVPLSFRIPLQQLQEIEEAVRKETRKHRTDLVEFVWNWAWNEYKKSGSLQALLSGTRTGRYSRRVSEELQDQLYTALATILDRAPSTVIEDVAHTLTARAGKYGTPK
Ga0299906_103139211 | 3300030606 | Soil | MHTLHTIAINIVVVNVYFQRQNAVMATTRASQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDDVARLLTARAGKYGDSR
Ga0214473_100018273 | 3300031949 | Soil | MAGTRTSHEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIEEVARLLTARAGKYGEPK
Ga0214473_100216742 | 3300031949 | Soil | MAATRTSQEIRVPLSFRIPLSQLEEIEEAVRKEVRKHRTDLLEVIWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDDVARLLTARAGKYGEPK
Ga0214473_100741714 | 3300031949 | Soil | LSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDDVARLLTARAGKYGDSK
Ga0214473_118127642 | 3300031949 | Soil | MHTLHTVTLKIVVVNVFVQRQNAVMTTTRASQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDDVARLLTARAGKYGDPR
Ga0214473_123047101 | 3300031949 | Soil | MSATRASHEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDDVARVLTAR
Ga0307479_1000173012 | 3300031962 | Hardwood Forest Soil | VKNKSERELNVPLSFRVPLRELHEIEEAVGKEARRHRTDLIQIIWTWAWGEYKKAGSLQALLAGTRTRRYSRRVSEELQDQLYTALETVIERAPSTVIEEIARTLTLRAGKYGTEK
Ga0307479_103523681 | 3300031962 | Hardwood Forest Soil | PLSFRVPLRQLHEIEEALAKEARRHRTDLLQIIWNWAWSEYKKAGSLQALLAGTRTRKYSRRVSEELQDQLYTALQTILDRAPSPIIEEIARTLTLRAGKYGTEK
Ga0326597_101937432 | 3300031965 | Soil | MATTRASQEIRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDDVARLLTARAGKYGDSR
Ga0326597_103197183 | 3300031965 | Soil | MTISRTSQELRVPLSFRVPLSQLEEIEEAVRKEVRKHRTDLLEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIDDVARLLTARAGKYGDSK
Ga0335085_103055433 | 3300032770 | Soil | VYNVFFVYDKDKHLSSIICSANIFVMTKPKAAESLSVPLSFRVSPAHLEEIERATRKEVRKYRTEVVELIWEWAWREYKRSGSLRDLLEGTRARKYSRRVSAELQDQLYTALETILDRAPGAVIEDVAHLLAMRAGKYGDDK
Ga0335078_113001613 | 3300032805 | Soil | VYNVFFVYDKDKNLSSIICSANIFVMTKPKAAESLSVPLSFRVSPAHLEEIERATRKEVRKYRTEVVELIWEWAWREYKRSGSLRDLLEGTRARKYSRRVSAELQDQLYTALETILD
Ga0335081_106132583 | 3300032892 | Soil | VYNVFFVYDKDKNLSSIICSANIFVMTKPKAAESLSVPLSFRVSPAHLEEIERATRKEVRKYRTEVVELIWEWAWREYKRSGSLRDLLEGTRARKYSRRVSAELQDQLYTALETILDRAPGAVIEDVAHLLAMRAGKYGDDK
Ga0335075_100038076 | 3300032896 | Soil | MEEIEEAVRKEARRHRTDVVEFVWRWAWDRYKKAGSLRALLDGGRSRHYSRRVSEELQDQLYTAIDTIIERAPSTVIEDVANLLTERAGKYGESK
Ga0326726_109717781 | 3300033433 | Peat Soil | TLAMPRTSQDVRVPLSFRIPISQLEEIEEAVRKEVRKHRTDLVEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIEDVARVLTSRAGKYGDAK
Ga0326726_117956613 | 3300033433 | Peat Soil | RIPISQLDEIEEAVRKEVRKHRTDLVEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSTVIEEVARVLTSRAGKYGDAK
Ga0326723_0300011_35_388 | 3300034090 | Peat Soil | MAMPRTSQDVRVPLSFRIPISQLEEIEEAVRKEVRKHRTDLVEVVWDWAWKEYKRSGSLRALLQGTRTRKYSRRVSEELQDQLYTAFETILERAPSAVIEDVARVLTSRAGKYGDAK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice