NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F082505


Overview

Basic Information
Family ID F082505
Family Type Metagenome / Metatranscriptome
Number of Sequences 113
Average Sequence Length 95 residues
Representative Sequence MWPFEHKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSEKQDDQRHAAPVLNLHWRTDAPGTA
Number of Associated Samples 92
Number of Associated Scaffolds 113
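
For downstream tools that expect FASTA input, the representative sequence above can be wrapped into a FASTA record. The following is a minimal sketch, not something the database itself provides; the function name `to_fasta`, the 60-column line width, and the use of the family ID as the record header are all assumptions made for illustration.

```python
# Representative sequence and family ID copied from the Basic Information block.
FAMILY_ID = "F082505"
REPRESENTATIVE = (
    "MWPFEHKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRR"
    "KIGKHPLGRILSEKQDDQRHAAPVLNLHWRTDAPGTA"
)


def to_fasta(identifier, sequence, width=60):
    """Return a FASTA record with the sequence wrapped at `width` columns.

    Hypothetical helper: the header format and line width are illustrative
    choices, not conventions defined by NMPFamsDB.
    """
    if width < 1:
        raise ValueError("width must be positive")
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join([f">{identifier}"] + chunks)


if __name__ == "__main__":
    print(to_fasta(FAMILY_ID, REPRESENTATIVE))
    print(f"length: {len(REPRESENTATIVE)} residues")
```

Comparing the printed length against the reported average sequence length (95 residues) gives a quick check that the sequence was copied without truncation.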

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 73.45 %
% of genes near scaffold ends (potentially truncated) 28.32 %
% of genes from short scaffolds (< 2000 bps) 68.14 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.35

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
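
The percentages in the Quality Assessment block can be turned back into approximate gene counts. The sketch below assumes that each percentage is taken over the 113 sequences of the family; the page does not state the denominator explicitly, so treat the counts as an interpretation rather than reported values. The name `implied_count` is hypothetical.

```python
FAMILY_SIZE = 113  # "Number of Sequences" from the Basic Information block


def implied_count(percent, total=FAMILY_SIZE):
    """Nearest whole number of members implied by a percentage of `total`.

    Assumption: the percentage was computed over `total` members.
    """
    return round(percent / 100 * total)


# Percentages copied from the Quality Assessment block.
quality = {
    "valid RBS motif": 73.45,
    "near scaffold end (potentially truncated)": 28.32,
    "short scaffold (< 2000 bp)": 68.14,
}

if __name__ == "__main__":
    for label, percent in quality.items():
        count = implied_count(percent)
        print(f"{label}: about {count} of {FAMILY_SIZE} genes ({percent} %)")
```

The same conversion applies to the taxonomy and ecosystem percentages on this page, under the same assumption about the denominator.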
Hidden Markov Model

Most Common Taxonomy
Group Bacteria (99.115 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge
(14.159 % of family members)
Environment Ontology (ENVO) Unclassified
(35.398 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline)
(48.673 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 10.40%, β-sheet 16.80%, Coil/Unstructured 72.80%

Predicted 3D Structure

Per-residue confidence (pLDDT) ranges: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.35

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
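
The two confidence measures in this section can be applied programmatically. The sketch below uses only the values shown on the page: the pTM cutoff of 0.7 from the note above and the four pLDDT ranges from the viewer legend. The function names are hypothetical, and the handling of fractional pLDDT scores that fall between the integer ranges (for example 50.5) is an assumption.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the "Low Quality Model" note


def is_low_confidence(ptm, threshold=PTM_THRESHOLD):
    """True when a model's pTM-score falls below the page's cutoff."""
    return ptm < threshold


def plddt_range(score):
    """Map a per-residue pLDDT score to one of the four legend ranges.

    Assumption: a fractional score between two integer ranges is assigned
    to the higher range (50.5 falls in "51-70").
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score <= 50:
        return "0-50"
    if score <= 70:
        return "51-70"
    if score <= 90:
        return "71-90"
    return "91-100"


if __name__ == "__main__":
    # pTM-score reported for family F082505.
    print(is_low_confidence(0.35))
    print(plddt_range(85.0))
```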



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 113 Family Scaffolds
PF13532 | 2OG-FeII_Oxy_2 | 39.82
PF06748 | DUF1217 | 5.31
PF02773 | S-AdoMet_synt_C | 3.54
PF01522 | Polysacc_deac_1 | 2.65
PF01494 | FAD_binding_3 | 2.65
PF03055 | RPE65 | 2.65
PF00012 | HSP70 | 1.77
PF04940 | BLUF | 0.88
PF00106 | adh_short | 0.88
PF01638 | HxlR | 0.88
PF01725 | Ham1p_like | 0.88
PF08712 | Nfu_N | 0.88
PF13561 | adh_short_C2 | 0.88
PF00581 | Rhodanese | 0.88
PF04055 | Radical_SAM | 0.88
PF03062 | MBOAT | 0.88
PF02130 | YbeY | 0.88
PF02518 | HATPase_c | 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 113 Family Scaffolds
COG0654 | 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases | Energy production and conversion [C] | 5.31
COG0192 | S-adenosylmethionine synthetase | Coenzyme transport and metabolism [H] | 3.54
COG0578 | Glycerol-3-phosphate dehydrogenase | Energy production and conversion [C] | 2.65
COG0644 | Dehydrogenase (flavoprotein) | Energy production and conversion [C] | 2.65
COG0665 | Glycine/D-amino acid oxidase (deaminating) | Amino acid transport and metabolism [E] | 2.65
COG0726 | Peptidoglycan/xylan/chitin deacetylase, PgdA/NodB/CDA1 family | Cell wall/membrane/envelope biogenesis [M] | 2.65
COG3670 | Carotenoid cleavage dioxygenase or a related enzyme | Secondary metabolites biosynthesis, transport and catabolism [Q] | 2.65
COG0443 | Molecular chaperone DnaK (HSP70) | Posttranslational modification, protein turnover, chaperones [O] | 1.77
COG0127 | Inosine/xanthosine triphosphate pyrophosphatase, all-alpha NTP-PPase family | Nucleotide transport and metabolism [F] | 0.88
COG0319 | ssRNA-specific RNase YbeY, 16S rRNA maturation enzyme | Translation, ribosomal structure and biogenesis [J] | 0.88
COG1733 | DNA-binding transcriptional regulator, HxlR family | Transcription [K] | 0.88



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.12 %
Unclassified | root | N/A | 0.88 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000271|PBR_1002334 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3434
3300000883|EsDRAFT_10075794 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1797
3300000956|JGI10216J12902_101776240 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1179
3300001800|JGI24115J20150_1006565 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2627
3300001800|JGI24115J20150_1008361 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2159
3300002856|draft_11444063 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1160
3300003278|U2draft_1001729 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae | 17817
3300004071|Ga0055486_10157855 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 556
3300004096|Ga0066177_10377578 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 613
3300004128|Ga0066180_10016626 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2272
3300004463|Ga0063356_103827694 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 648
3300004481|Ga0069718_13708351 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 9762
3300005438|Ga0070701_11136093 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 551
3300005466|Ga0070685_10008418 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5298
3300005660|Ga0073904_10126464 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1535
3300005758|Ga0078117_1103528 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1260
3300005844|Ga0068862_101911615 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 603
3300005982|Ga0075156_10004381 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 8099
3300005982|Ga0075156_10361879 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 750
3300005985|Ga0081539_10456894 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 529
3300005988|Ga0075160_10727967 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 526
3300005988|Ga0075160_10773811 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 507
3300006033|Ga0075012_10117449 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1997
3300006033|Ga0075012_10124328 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1927
3300006033|Ga0075012_10396381 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 912
3300006033|Ga0075012_10520236 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 761
3300006056|Ga0075163_10212847 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2244
3300006056|Ga0075163_11616847 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 623
3300006755|Ga0079222_11518315 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 629
3300006844|Ga0075428_101158970 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 815
3300006954|Ga0079219_11572957 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 600
3300006954|Ga0079219_11647008 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 591
3300009032|Ga0105048_10012000 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 13619
3300009078|Ga0105106_10032685 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 3850
3300009079|Ga0102814_10017501 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4158
3300009086|Ga0102812_10064647 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2025
3300009152|Ga0114980_10042312 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2783
3300009179|Ga0115028_10888268 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 704
3300009688|Ga0116176_10009189 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 6698
3300009778|Ga0116151_10179496 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1083
3300009779|Ga0116152_10032418 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3473
3300009780|Ga0116156_10000877 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 30944
3300009783|Ga0116158_10491967 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 650
3300009868|Ga0130016_10677669 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 631
3300010047|Ga0126382_11208183 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 678
3300010356|Ga0116237_10114596 | All Organisms → cellular organisms → Bacteria | 2715
3300010356|Ga0116237_11214665 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 626
3300010399|Ga0134127_12356543 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 612
3300010401|Ga0134121_10197519 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1738
3300010885|Ga0133913_10071584 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 9303
3300010885|Ga0133913_13173186 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1088
3300012948|Ga0126375_11720196 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 544
3300012956|Ga0154020_11192142 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 576
3300012973|Ga0123351_1125522 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1063
3300013006|Ga0164294_10065280 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2773
3300014323|Ga0075356_1003682 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2367
3300015245|Ga0137409_10602249 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 929
3300015360|Ga0163144_10003358 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 36475
3300015360|Ga0163144_10575764 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1232
3300017944|Ga0187786_10446206 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 573
3300018059|Ga0184615_10159261 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Hydrogenophilalia → Hydrogenophilales → Hydrogenophilaceae → unclassified Hydrogenophilaceae → Hydrogenophilaceae bacterium | 1274
3300018422|Ga0190265_10229686 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1889
3300018481|Ga0190271_10512318 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1312
3300020027|Ga0193752_1206917 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 741
3300020172|Ga0211729_10366701 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 742
3300020205|Ga0211731_10336361 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 524
3300021090|Ga0210377_10009752 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 7492
3300022720|Ga0242672_1111477 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 549
3300023100|Ga0247738_10057508 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1293
3300024502|Ga0255181_1018876 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1362
3300025526|Ga0208492_1000822 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae | 23423
3300025526|Ga0208492_1015755 | Not Available | 1925
3300025772|Ga0208939_1000980 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 22012
3300025772|Ga0208939_1005407 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 7832
3300025772|Ga0208939_1011789 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4638
3300025772|Ga0208939_1089699 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1169
3300025855|Ga0209717_1002370 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 18232
3300025866|Ga0208822_1122568 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → unclassified Burkholderiales → Burkholderiales bacterium | 987
3300025871|Ga0209311_1046243 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2144
3300025871|Ga0209311_1172606 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 885
3300025882|Ga0209097_10364341 | All Organisms → cellular organisms → Bacteria | 552
3300027503|Ga0255182_1121538 | All Organisms → cellular organisms → Bacteria | 590
(restricted) 3300027728|Ga0247836_1000464 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 74753
3300027776|Ga0209277_10289010 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 669
3300027781|Ga0209175_10476121 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 515
3300027818|Ga0209706_10591246 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 500
3300027851|Ga0209066_10096274 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1971
3300027851|Ga0209066_10369351 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 777
3300027878|Ga0209181_10015708 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae → Hirschia → Hirschia maritima | 9225
3300027892|Ga0209550_10054930 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3193
3300027973|Ga0209298_10023590 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3049
3300027974|Ga0209299_1083339 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1274
3300027974|Ga0209299_1247180 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 637
3300028091|Ga0255184_1031556 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1102
3300028380|Ga0268265_11592996 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 658
(restricted) 3300028581|Ga0247840_10016239 | All Organisms → cellular organisms → Bacteria | 8179
3300030114|Ga0311333_11827399 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 528
3300031455|Ga0307505_10089224 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1378
3300031740|Ga0307468_100333313 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1122
3300031740|Ga0307468_100875565 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 775
3300031758|Ga0315907_10673858 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 791
3300031918|Ga0311367_10748797 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 991
3300032144|Ga0315910_10519336 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 919
3300032144|Ga0315910_10593485 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 858
3300032157|Ga0315912_10071124 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2734
3300032157|Ga0315912_10567304 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 904
3300032174|Ga0307470_10219268 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1232
3300032205|Ga0307472_102203691 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 556
3300033433|Ga0326726_11443031 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 670
3300033978|Ga0334977_0004529 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7930
3300034071|Ga0335028_0319922 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 913
3300034111|Ga0335063_0557551 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 547
3300034169|Ga0370480_0334458 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 509

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
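
Each scaffold identifier in the table above appears to combine two parts joined by `|`: a ten-digit number that matches the Taxon OID column of the Associated Samples table, followed by the scaffold name. That reading is inferred from the data on this page rather than stated by it. Under that assumption, the sketch below splits an identifier and counts scaffolds per sample; `split_scaffold_id` is a hypothetical helper, and the four identifiers are copied from the table.

```python
from collections import Counter


def split_scaffold_id(scaffold_id):
    """Split 'TAXON_OID|SCAFFOLD_NAME' into its two parts.

    Assumption: the text before '|' is the sample's Taxon OID. A leading
    '(restricted)' marker, as used in the table, is dropped first.
    """
    cleaned = scaffold_id.replace("(restricted)", "").strip()
    taxon_oid, separator, scaffold_name = cleaned.partition("|")
    if not separator or not taxon_oid or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


# A few identifiers copied from the Associated Scaffolds table.
examples = [
    "3300001800|JGI24115J20150_1006565",
    "3300001800|JGI24115J20150_1008361",
    "3300003278|U2draft_1001729",
    "(restricted) 3300027728|Ga0247836_1000464",
]

# Number of family scaffolds contributed by each sample (Taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in examples)

if __name__ == "__main__":
    for taxon_oid, n_scaffolds in sorted(per_sample.items()):
        print(taxon_oid, n_scaffolds)
```

Grouping this way is consistent with the family listing 113 scaffolds but only 92 associated samples, since some samples contribute more than one scaffold.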




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Anaerobic Digestor Sludge | Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge | 14.16%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 7.08%
Wastewater Effluent | Engineered → Wastewater → Nutrient Removal → Unclassified → Unclassified → Wastewater Effluent | 7.08%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 5.31%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 5.31%
Watersheds | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Watersheds | 5.31%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.54%
Serpentinite Rock And Fluid | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Serpentinite Rock And Fluid | 3.54%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 2.65%
Freshwater | Environmental → Aquatic → Freshwater → River → Unclassified → Freshwater | 2.65%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.65%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 1.77%
Freshwater Microbial Mat | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Microbial Mat | 1.77%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 1.77%
Freshwater | Environmental → Aquatic → Freshwater → Ice → Glacial Lake → Freshwater | 1.77%
Estuarine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine | 1.77%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.77%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.77%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 1.77%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.77%
Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Sediment | 0.89%
Wetland | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Wetland | 0.89%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.89%
Lake Water | Environmental → Aquatic → Freshwater → Lake → Unclassified → Lake Water | 0.89%
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 0.89%
Freshwater And Marine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Freshwater And Marine | 0.89%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.89%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.89%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.89%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.89%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.89%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.89%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.89%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.89%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.89%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.89%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.89%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.89%
Fecal | Host-Associated → Mammals → Digestive System → Large Intestine → Fecal → Fecal | 0.89%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.89%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.89%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.89%
Activated Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Activated Sludge | 0.89%
Wastewater | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Wastewater | 0.89%
Active Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Active Sludge | 0.89%
Down-Flow Hanging Sponge Reactor | Engineered → Bioreactor → Unclassified → Unclassified → Unclassified → Down-Flow Hanging Sponge Reactor | 0.89%
Hydrocarbon Resource Environments | Engineered → Biotransformation → Microbial Solubilization Of Coal → Unclassified → Unclassified → Hydrocarbon Resource Environments | 0.89%
Photobioreactor Incubated | Engineered → Biotransformation → Microbial Enhanced Oil Recovery → Unclassified → Unclassified → Photobioreactor Incubated | 0.89%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000271 | Photobioreactor incubated microbial communities from Hamburg, Germany - Sample 1 | Engineered
3300000883 | Estuary microbial communities from the Columbia River - 5 PSU | Environmental
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300001800 | Serpentinite rock and fluid subsurface biosphere microbial communities from McLaughlin Reserve, California, USA - CR12Mar_CSW14AB | Environmental
3300002856 | Wastewater microbial communities from Syncrude, Ft. McMurray, Alberta - Tailing Pond Surface TP_surface | Engineered
3300003278 | Down-flow hanging sponge reactor microbial communities from the University of Illinois at Urbana-Champaign, USA - U2-648F-DHS | Engineered
3300004071 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushMan_ThreeSqB_D2 | Environmental
3300004096 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MM15.SN (version 2) | Environmental
3300004128 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MM110.SN (version 2) | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004481 | Combined Assembly of Gp0112041, Gp0112042, Gp0112043 | Environmental
3300005438 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaG | Environmental
3300005466 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3L metaG | Environmental
3300005660 | Active sludge microbial communities from Klosterneuburg, Austria, studying microevolution and ecology of nitrifiers - Klosterneuburg WWTP active sludge metagenome KNB14_precipitate | Engineered
3300005758 | Cyanobacteria communities in tropical freswater systems - freshwater lake in Singapore | Environmental
3300005844 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 | Host-Associated
3300005982 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 8/11/14 A brown DNA | Engineered
3300005985 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S4T2R2 | Host-Associated
3300005988 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 9/18/14 C2 DNA | Engineered
3300006033 | Freshwater microbial communities in response to fracking from Pennsylvania, USA - Allegheny Zone_MetaG_DW_15 | Environmental
3300006056 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 10/23/14 1A DNA | Engineered
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300009032 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-05 | Environmental
3300009078 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 | Environmental
3300009079 | Estuarine microbial communities from the Columbia River estuary - Flood tide ETM metaG S.741 | Environmental
3300009086 | Estuarine microbial communities from the Columbia River estuary - Flood tide ETM metaG S.713 | Environmental
3300009152 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_EF_MetaG | Environmental
3300009179 | Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Plant_0915_D1 | Environmental
3300009688 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_STIC08_MetaG | Engineered
3300009778 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from Hong Kong - AD_UKC117_MetaG | Engineered
3300009779 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from Hong Kong - AD_UKC119_MetaG | Engineered
3300009780 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC045_MetaG | Engineered
3300009783 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC052_MetaG | Engineered
3300009868 | Activated sludge microbial diversity in wastewater treatment plant from Tai Wan - Bali plant Bali plant | Engineered
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010356 | AD_USDEca | Engineered
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300010885 | northern Canada Lakes Co-assembly | Environmental
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300012956 | Active sludge microbial communities from wastewater, Klosterneuburg, Austria - Klosneuvirus_20160825_MG | Engineered
3300012973 | Fecal eukaryotic communites from dung pellets of Tule Elk in California, USA - Elk Dung E36 Day 36 Metagenome | Host-Associated
3300013006 | Oligotrophic lake water microbial communities from Sparkling Lake, Wisconsin, USA - GEODES005 metaG | Environmental
3300014323 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - WestPond_CattailB_D1 | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015360 | Freshwater microbial mat bacterial communities from Lake Vanda, McMurdo Dry Valleys, Antarctica - Oligotrophic Lake LV.19.BULKMAT1 | Environmental
3300017944 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_10_20_MG | Environmental
3300018059 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018481 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 T | Environmental
3300020027 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c1 | Environmental
3300020172 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_102 megahit1 | Environmental
3300020205 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_103 megahit1 | Environmental
3300021090 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redo | Environmental
3300022720 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O (Metagenome Metatranscriptome) (v2) | Environmental
3300023100 | Plant litter microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-L013-104B-2 | Environmental
3300024502 | Freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Colum_RepC_8d | Environmental
3300025526 | Serpentinite rock and fluid subsurface biosphere microbial communities from McLaughlin Reserve, California, USA - CR12Mar_CSW14AB (SPAdes) | Environmental
3300025772 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_STIC12_MetaG (SPAdes) | Engineered
3300025855 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC048_MetaG (SPAdes) | Engineered
3300025866 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_STIC08_MetaG (SPAdes) | Engineered
3300025871 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC045_MetaG (SPAdes) | Engineered
3300025882 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC052_MetaG (SPAdes) | Engineered
3300027503 | Freshwater microbial communities from Altamaha River, Georgia, United States - Atl_UVDOM_RepA_8d | Environmental
3300027728 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_14m | Environmental
3300027776 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 8/11/14 A brown DNA (SPAdes) | Engineered
3300027781 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 9/18/14 C2 DNA (SPAdes) | Engineered
3300027818 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm September2015 (SPAdes) | Environmental
3300027851Freshwater microbial communities in response to fracking from Pennsylvania, USA - Allegheny Zone_MetaG_DW_15 (SPAdes)EnvironmentalOpen in IMG/M
3300027878Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-05 (SPAdes)EnvironmentalOpen in IMG/M
3300027892Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MM15.SN (SPAdes)EnvironmentalOpen in IMG/M
3300027973Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_EF_MetaG (SPAdes)EnvironmentalOpen in IMG/M
3300027974Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_MF_MetaG (SPAdes)EnvironmentalOpen in IMG/M
3300028091Freshwater microbial communities from Mississippi River, Louisiana, United States - Miss_Cont_RepA_0hEnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028581 (restricted)Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_17mEnvironmentalOpen in IMG/M
3300030114I_Fen_E2 coassemblyEnvironmentalOpen in IMG/M
3300031455Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_SEnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031758Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA123EnvironmentalOpen in IMG/M
3300031918III_Fen_N3 coassemblyEnvironmentalOpen in IMG/M
3300032144Garden soil microbial communities collected in Santa Monica, California, United States - Edamame soilEnvironmentalOpen in IMG/M
3300032157Garden soil microbial communities collected in Santa Monica, California, United States - V. faba soilEnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033978Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME28Sep2014-rr0002EnvironmentalOpen in IMG/M
3300034071Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME17Oct2008D10-rr0110EnvironmentalOpen in IMG/M
3300034111Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME03Oct2011-rr0186EnvironmentalOpen in IMG/M
3300034169Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_15EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of the restricted sequences below requires a license from the corresponding author(s) of the relevant datasets.

Protein ID Sample Taxon ID Habitat Sequence
PBR_10023343 3300000271 Photobioreactor Incubated MWPFTPRCRQAVVVFHIDQDGIARGAFFRGDADVLIIDERDPKNRIYRMTAESSVEELRRKIGRHPLGSLLEVPRDKERNQTPVLNLHWRSDGVHAG*
EsDRAFT_100757942 3300000883 Freshwater And Marine MWPFQTEKKKAVVVFHIDQDGNARGAWFRGNADVLIIDERDPKSRVYRMESESTDEELRRKIGKHPLGRMLGEQVPTERPTLNLHWRVDGP
JGI10216J12902_1017762402 3300000956 Soil MWPFEHKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSENQADQRHPAPVLNLHWRTDAAGVA*
JGI24115J20150_10065653 3300001800 Serpentinite Rock And Fluid MWPFADKSKKAVVVLHIDNEGHARGAWFRGNADVLIIDERDPSNRVYRMASETPDDELRRKIGRHPLARTLSDKPTAAPLPTLNLHWPVDGPQTA*
JGI24115J20150_10083611 3300001800 Serpentinite Rock And Fluid MWPFADKSKKAVVVLHIDSDGHARGAWFRGNADVLIIDERDPSNRVYRMASETPDDELRRKIGRHPLARTLSDKPTAAPLPTLNLHWPVDGPQTA*
draft_114440633 3300002856 Hydrocarbon Resource Environments AAPDQVSQGLNDMWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILNEKDPDPRRASPVLNLHWDETGARAT*
U2draft_10017299 3300003278 Down-Flow Hanging Sponge Reactor MRGRIMWPFDSRKQKAVVVFHIDNEGRARGAWFRGDADVLIIDERDPFNRVHLMDQETNVDELRRKIGRHPLGRMLNEDCKASALPVLNLHWRTDGKAAAH*
Ga0055486_101578552 3300004071 Natural And Restored Wetlands MWPFETKHRKAVVVFHIDQNGMAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILAEKQSEQRHAAPVLNLHWR
Ga0066177_103775781 3300004096 Freshwater Lake GLSNMWPFETRKKKAVVVFHIDHDGHARGAYFRGDADVLIIDERDPQNRVYRMQSETPVDEIRRKIGKHPIGRTLDDKPNSAQLPVLNLHWRVDGPQTV*
Ga0066180_100166262 3300004128 Freshwater Lake MWPFETRKKKAVVVFHIDHDGHARGAYFRGDADVLIIDERDPQNRVYRMQSETPVDEIRRKIGKHPIGRTLDDKPNSAQLPVLNLHWRVDGPQTV*
Ga0063356_1038276942 3300004463 Arabidopsis Thaliana Rhizosphere MWPFESSQRKKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRIYRMGQETPDEELRRKIGKHPLGRLLADRSVDQDSGAPVLNLHWRTRDSATI*
Ga0069718_137083512 3300004481 Sediment MWPFKARRRQAVVVFHVDQDGYARGAFFRGDADVLIIDERDPQNRVYRMGSESSDIDLRRRIGKHPLGRLLDEPRDKARKVAPVVNLHWANDTVQAN*
Ga0070701_111360931 3300005438 Corn, Switchgrass And Miscanthus Rhizosphere MWPFENKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYKMGQESADDELRRKIGKHPLGRLLAEDKVDQRHPAPVLNLHWGVDATPAS*
Ga0070685_100084184 3300005466 Switchgrass Rhizosphere MWPFENKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYKMGQETADDELRRKIGKHPLGRLLAEDKLDQRHPAPVLNLHWGVDATPAR*
Ga0073904_101264643 3300005660 Activated Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGRHPLGRILNEKESDDRHASPVLNLHWDSKGARAS*
Ga0078117_11035281 3300005758 Lake Water MWPFQNRTKKAVVVLHIDNEGHARGAWFRGDADVLIIDERDPRNRVYRMGSETPDDELRRKIGRHPFGRTLGDRPAARDYPVLNLHGPVDGAQAI*
Ga0068862_1019116151 3300005844 Switchgrass Rhizosphere MWPFEHKHRKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSEKQDDQRHPAPVLNLHWRTDATGVA*
Ga0075156_100043814 3300005982 Wastewater Effluent MWPFNSKRRQAVVVFHIDQDGIARGAFFRGDADVLIIDERDPKNRIYRMTAESSVEELRRKIGRHPLGSLLEEPRDQERNSTPVLNLHWRTDGVHAG*
Ga0075156_103618792 3300005982 Wastewater Effluent MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEDKGGDDRHAAPVLNLHWDGTGARAT*
Ga0081539_104568942 3300005985 Tabebuia Heterophylla Rhizosphere GLAKGAYFRGEADVLIIDERDPKNRVYKMGQETADDELRRKIGKHPLGRLLAEDKVDERHPAPVLNLHWGADATAAR*
Ga0075160_107279671 3300005988 Wastewater Effluent MWPFAHKQKKAVVVFHIDHNGLAKGAFFQGEADVLIIDERDPKNRVYRMSQETPSDELRRKIGKHPLGRLLDTTRPGAERTASPVLSLHWRTDAEMVV*
Ga0075160_107738112 3300005988 Wastewater Effluent RRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEDKGGDDRHAAPVLNLHWDGTGARAT*
Ga0075012_101174492 3300006033 Watersheds MWPFSSRRKRAVVVFHIDPNGQAKGAFFRGDADVLIIDERDQRRNVYRMSQETPDDELRRRIGKHPLRQLLDRKEDRASSPVLNLHWRTDAAQLV*
Ga0075012_101243282 3300006033 Watersheds MWPFETRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETADDEIRRKIGKHPLGRILEEKGDDRHAAPVLNLHWDGTGARAT*
Ga0075012_103963812 3300006033 Watersheds MWPFETRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETADDEIRRKIGKHPLGRILEDKGGDRHAAPVLNLHWDGTGARAT*
Ga0075012_105202362 3300006033 Watersheds VVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMAQESPDDELRRKIGKHPLGRILDEANTGDHNAAPVLNLHWSGTGARAS*
Ga0075163_102128473 3300006056 Wastewater Effluent MWPFAHKQKKAVVVFHIDHNGLAKGAFFQGEADVLIIDERDPKNRVYRMSQETPSDELRRKIGKHPLGRLLDTTRPGAERAANPVLSLHWRTDAEMVV*
Ga0075163_116168472 3300006056 Wastewater Effluent MWPFQTEKKKAVVVFHIDQDGNARGAWFRGNADVLIIDERDPKSRVYRMESESTDEELRRKIGKHPLGRMLGEQVPTERPTLNLHWRVDGPQTA*
Ga0079222_115183152 3300006755 Agricultural Soil MWPFENKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYKMGQESPDDELRRKIGKHPLGRLLAEDKVDERHPAPVLNLHWGVDATPAN*
Ga0075428_1011589702 3300006844 Populus Rhizosphere MWPFEHKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSETQADQRYPAPVLNLHWRTDATGAA*
Ga0079219_115729571 3300006954 Agricultural Soil MWPFENKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYKMGQETADDELRRKIGKHPLGRQITEDHQDERHPAPVLNLHWNVDAPATN*
Ga0079219_116470081 3300006954 Agricultural Soil AVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDEMRRKIGKHPLGQLIADRSVDQRTPAPVLNLHWRTDNVQAA*
Ga0105048_100120002 3300009032 Freshwater MFHVDQDGYARGAFFRGDADVLIIDERDPKNRVYRMGAETSDIDLRRKIGKHPLGRLLDEPQDKTRKSAPVLNLHWPSDPLQAS*
Ga0105106_100326854 3300009078 Freshwater Sediment MWPFETKHRKAVVVFHIDQNGMAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILAEKQSDQRHAAPVLNLHWRTDPAGVA*
Ga0102814_100175015 3300009079 Estuarine MWPFETRKKKAVVVFHIDHDGHARGAYFRGDADVLIIDERDPQNRVYRMQAETTVDEIRRKIGKHPIGRTLDDKPNSAQLPVLNLHWRVDGPQTV*
Ga0102812_100646471 3300009086 Estuarine MWPFETRKKKAVVVFHIDHDGHARGAYFRGDADVLIIDERDPQNRVYRMQAETTVDEIRRKIGKHPIGRTLDDKPNSAQLPVLNLHWRVDGPQAV*
Ga0114980_100423124 3300009152 Freshwater Lake MWPFQTDKKKAVVVFHIDQDGNARGAWFRGNADVLIIDERDPKSRVYRMESESTDEELRRKIGKHPLGRMLGEQVPTERPTLNLHWRVDGPQTA*
Ga0115028_108882682 3300009179 Wetland MWPFETKHRKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILAEKQTDQRHAAP
Ga0116176_100091894 3300009688 Anaerobic Digestor Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEDKGGDDRYAAPVLNLHWDGTGARAT*
Ga0116151_101794963 3300009778 Anaerobic Digestor Sludge MWPFESQRRKAVVVLHVDQEGIAKGAYFRGDADVLIIDERDPRNRVYRMRAETPDAELRRRIGKHPLGRMLDAPKRKARDQLPMLNLHWRTDGAQAS*
Ga0116152_100324182 3300009779 Anaerobic Digestor Sludge MWPFESQRRKAVVVLHVDQEGIAKGAYFRGDADVLIIDERDPRNRVYRMRAETPDAELRRRIGKHPLGRMLDAPKHKARDQLPMLNLHWRTDGAQAS*
Ga0116156_1000087720 3300009780 Anaerobic Digestor Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESADDELRRKIGKHPLGRILEEKQSDDRHAAPVLNLHWDGAGARAV*
Ga0116158_104919672 3300009783 Anaerobic Digestor Sludge GPLQVKFLQGLNNMWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEEKGGDDRHAAPVLNLHWDGTGARAG*
Ga0130016_106776691 3300009868 Wastewater MWPFESRKKKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETADDELRRKIGKHPLGRILDEQSSDDRHASPVLNLHWDGAARAS
Ga0126382_112081831 3300010047 Tropical Forest Soil MWPFESNPRKKAVVVFHIDHNGLAKGAYFRGDADVLIIDERDPKNRIYRMGQETPDEELRRKIGKHPLGRLLADRAVDQDAGAPVLNLHWRTRDSATI*
Ga0116237_101145962 3300010356 Anaerobic Digestor Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDELRRKIGKHPLGRLLDETSTGDRHAAPVLNVHWNGTGARAS*
Ga0116237_112146652 3300010356 Anaerobic Digestor Sludge RRTAVVVLHVDQEGIAKGAYFRGDADVLIIDERDPRNRVYRMRAETPDAELRRRIGKHPLGRMLDAPKRKARNQLPMLNLHWRTDGAQAS*
Ga0134127_123565431 3300010399 Terrestrial Soil MWPFEHRHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSEKQDDQRHPAPVLNLPSRT
Ga0134121_101975192 3300010401 Terrestrial Soil MWPFEQKRRKAVVVLHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSEKQDDPRHAAPVLNVHWRHDSVQA
Ga0133913_100715842 3300010885 Freshwater Lake MGFGFQGFTMWPFQTDKKKAVVVFHIDSDGHARGAWFRGNADVLIIDERDPSSRVYRMGSESSEEELRRKIGKHPLGRMLNEPVPNERPTLNLHWRVDGPQTA*
Ga0133913_131731862 3300010885 Freshwater Lake VFHIDHDGQARGAYFRGDADVLIIDERDPQNRVYRMQSETAADELRRKIGRHAIDRALDDKPNSSQLPVLNLHWRIDGSRTV*
Ga0126375_117201962 3300012948 Tropical Forest Soil MWPFENKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMSQETPDDELRRKLGKHPLGRILSDDQSGQRHPAPVLN
Ga0154020_111921422 3300012956 Active Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGRHPLGRILNEKESDDRHASPVLNLHWDS
Ga0123351_11255222 3300012973 Fecal MWPFESRKKKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETADDELRRKIGKHPLGRILDEQSSDDRHASPVLNLHWDGAARAS*
Ga0164294_100652803 3300013006 Freshwater MWPFETRKKKAVVVFHIDHDGHARGAYFRGDADVLIIDERDPQNRVYRMQSETPVDEIRRKIGKHPIGRTLDDKPNSAQLPVLNLHWRVDGPQTA*
Ga0075356_10036823 3300014323 Natural And Restored Wetlands MWPFENKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYKMGQESPDEELRRKIGKHPLGRLLAEDKADERHPAPVLNLHWSVDATPAS*
Ga0137409_106022492 3300015245 Vadose Zone Soil MWPFESRRKKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDDMRRKIGKHPLGQLIADRTIDERAPAPVLNLHWRTDGAQAA*
Ga0163144_1000335824 3300015360 Freshwater Microbial Mat MWPFQTRKTKAVVVFHIDHDGQARGAYFRGDADVLIIDERDPQNRVYRMQSETPVDEIRRKIGKHPTGRSLDDKPNSSQLPVLNLHWRVDGPQTA*
Ga0163144_105757642 3300015360 Freshwater Microbial Mat MWPFDSRKKKAVVVFHIDHDGQARGAYFRGDADVLIIDERDPQNRVYRMQSETPVDEIRRKIGKHPIGRTLDDKPNSAQLPVLNLHWRVDGPQTA*
Ga0187786_104462062 3300017944 Tropical Peatland MWPFEQKRRKAVVVLHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSETQDDPKHAAPVLNLHWRHDPMQAT
Ga0184615_101592611 3300018059 Groundwater Sediment MWPFEHKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSEKQDDQRHAAPVLNLHWRTDAPG
Ga0190265_102296863 3300018422 Soil MWPFESKRRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILADRSVDEPNGSPVLNLHWRTEPVQA
Ga0190271_105123182 3300018481 Soil VVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGNETPDDELRRKIGKHPLGRILSEKPDDDAKPVLNLHWRTDGAQTV
Ga0193752_12069171 3300020027 Soil MWPFESKHRKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMSQETPDDELRRKIGKHPLGRILAEKQADQRNPAPVLNLHWRHETA
Ga0211729_103667012 3300020172 Freshwater MWPFETRKKKAVVVFHIDHDGHARGAYFRGDADVLIIDERDPQNRVYRMQAETTVDEIRRKIGKHPIGRTLDDKPNSAQLPVLNLHWRVDGPQTV
Ga0211731_103363611 3300020205 Freshwater KKAVVVFHIDHDGHARGAYFRGDADVLIIDERDPQNRVYRMQAETTVDEIRRKIGKHPIGRTLDDKPNSAQLPVLNLHWRVDGPQTV
Ga0210377_100097526 3300021090 Groundwater Sediment MWPFEHKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSEKQDDQRHAAPVLNLHWRTDAPGTA
Ga0242672_11114771 3300022720 Soil IWAVSPNRMMGLTPMWPFQPKHRKAVVVFHIDQNGLAKGAFFRGDADVLIIDERDPRNRVYRMSQETPEDELRRKIGKHPLGRLLREDPKQPAQPVLNLHWRHDPAHVL
Ga0247738_100575082 3300023100 Plant Litter MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETADDELRRKIGKHPLGRILEEKASDDRHASPVLNLHWDGAGARAS
Ga0255181_10188762 3300024502 Freshwater MWPFQNRTRKAVVVLHIDNEGHARGAWFRGDADVLIIDERDPRNRVYRMGSETPDDELRRKIGRHPLGRTLGDRPAARDNPVLNLHWPVDGAQAI
Ga0208492_10008222 3300025526 Serpentinite Rock And Fluid MWPFADKSKKAVVVLHIDNEGHARGAWFRGNADVLIIDERDPSNRVYRMASETPDDELRRKIGRHPLARTLSDKPTAAPLPTLNLHWPVDGPQTA
Ga0208492_10157552 3300025526 Serpentinite Rock And Fluid MWPFADKSKKAVVVLHIDSDGHARGAWFRGNADVLIIDERDPSNRVYRMASETPDDELRRKIGRHPLARTLSDKPTAAPLPTLNLHWPVDGPQTA
Ga0208939_10009808 3300025772 Anaerobic Digestor Sludge MWPFNSKRRQAVVVFHIDQDGIARGAFFRGDADVLIIDERDPKNRIYRMTAESSVEELRRKIGRHPLGSLLEEPRDQERNSTPVLNLHWRTDGVHAG
Ga0208939_10054077 3300025772 Anaerobic Digestor Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEDKGGDDRHAAPVLNLHWDGTGARAT
Ga0208939_10117892 3300025772 Anaerobic Digestor Sludge MGRWFAGGGAGPLQVKFLQGLNNMWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEEKGGDDRHAAPVLNLHWDGTGARAG
Ga0208939_10896992 3300025772 Anaerobic Digestor Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGRHPLGRILNEKESDDRHASPVLNLHWDSKGARAS
Ga0209717_100237020 3300025855 Anaerobic Digestor Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESADDELRRKIGKHPLGRILEEKQSDDRHAAPVLNLHWDGAGARAV
Ga0208822_11225681 3300025866 Anaerobic Digestor Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGRHPLGRILNEKESDDRHASPVQSALG
Ga0209311_10462434 3300025871 Anaerobic Digestor Sludge MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEDKGGDDRHAA
Ga0209311_11726063 3300025871 Anaerobic Digestor Sludge ESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEDKGGDDRHAAPVLNLHWDGTGARAT
Ga0209097_103643411 3300025882 Anaerobic Digestor Sludge NGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEEKGGDDRHAAPVLNLHWDGTGARAG
Ga0255182_11215381 3300027503 Freshwater EGHARGAWFRRDADVLIIDERDPRNRVYRMGSETPDDELRRKIGRHPFGRTLGDRPAARDNPVLNLHWPVDGAQAI
(restricted) Ga0247836_100046414 3300027728 Freshwater VVFHIDHDGQARGAYFRGDADVLIIDERDPQNRVYRMQAETAVDELRRKIGRHPAGCTLDDKPNPSQLPVLNLHWRIDGSQTA
Ga0209277_102890101 3300027776 Wastewater Effluent MWPFESRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEDKGGDDRHAAP
Ga0209175_104761211 3300027781 Wastewater Effluent RRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQESPDDEIRRKIGKHPLGRILEDKGGDDRHAAPVLNLHWDGTGARAT
Ga0209706_105912461 3300027818 Freshwater Sediment MWPFETKHRKAVVVFHIDQNGMAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILAEKQSDQRHAAPVLNLHWRTDPAGVA
Ga0209066_100962742 3300027851 Watersheds MWPFETRRKKAVVVFHIDQNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETADDEIRRKIGKHPLGRILEEKGDGRHAAPVLNLHWDGTGARAT
Ga0209066_103693511 3300027851 Watersheds MWPFSSRRKRAVVVFHIDPNGQAKGAFFRGDADVLIIDERDQRRNVYRMSQETPDDELRRRIGKHPLRQLLDRKEDRASSPVLNLHWRTDAAQLV
Ga0209181_100157086 3300027878 Freshwater MWPFKAGRRQAVVMFHVDQDGYARGAFFRGDADVLIIDERDPKNRVYRMGAETSDIDLRRKIGKHPLGRLLDEPQDKTRKSAPVLNLHWPSDPLQAS
Ga0209550_100549303 3300027892 Freshwater Lake MWPFETRKKKAVVVFHIDHDGHARGAYFRGDADVLIIDERDPQNRVYRMQSETPVDEIRRKIGKHPIGRTLDDKPNSAQLPVLNLHWRVDGPQTV
Ga0209298_100235901 3300027973 Freshwater Lake MWPFQTDKKKAVVVFHIDQDGNARGAWFRGNADVLIIDERDPKSRVYRMESESTDEELRRKIGKHPLGRMLGEQVPTERPTLNLHWRVDGPQTA
Ga0209299_10833392 3300027974 Freshwater Lake MWPFQTDKKKAVVVFHIDQDGNARGAWFRGNADVLIIDERDPKSRVYRMESESTDEELRRKIGKHPLGRMLGEQVPTERPTLNLHWR
Ga0209299_12471801 3300027974 Freshwater Lake MGFGFQGFTMWPFQTDKKKAVVVFHIDSDGHARGAWFRGNADVLIIDERDPSSRVYRMGSESSEEELRRKIGKHPVGRMLNEPVPNERPTLNLHWRVDGPQ
Ga0255184_10315561 3300028091 Freshwater MWPFQNRTKKAVVVLHIDNEGHARGAWFRGDADVLIIDERDPRNRVYRMGSETPDDELRRKIGRHPFGRTLGDRPAARDYPVLNLHGPVDGAQAI
Ga0268265_115929961 3300028380 Switchgrass Rhizosphere MWPFEHRHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSEKQDDQRHPAPVLNLHWRTDATGVA
(restricted) Ga0247840_100162394 3300028581 Freshwater MWPFRSRKKKAVVVFHIDHDGQARGAYFRGDADVLIIDERDPQNRVYRMQAETAVDELRRKIGRHPAGCTLDDKPNPSQLPVLNLHWRIDGSQTA
Ga0311333_118273992 3300030114 Fen MWPFSTKHDRAVVVFHIDQNGLAKGAFFRGQADVLIIDERDPRNRVYRMSQETPSDELRRKIGKHPLNRLLDPARMERNKQERPALSLHWRTDSEAVV
Ga0307505_100892242 3300031455 Soil MWPFETRRRKAVVVFHVDHEGVARGAWFRGDADVLIIDERDPKSRVYRMDGETGDDELRRRIGKHPLGRLLDAPRDKARDVAPVLNLHWRVDGAQTG
Ga0307468_1003333132 3300031740 Hardwood Forest Soil MWPFENKHRKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYKMGQETGDDELRRKIGKHPLGRILAEEQPDTRRTAPVLNLHWRTDAAGAA
Ga0307468_1008755651 3300031740 Hardwood Forest Soil AIVVFHIDQSGLAKGAFFRGDADVLIIDERDPRNRVYRMSQETPDEELRRKIGKHPLGRLIADRSVDARTPAPVLNLHWRTDNVQAA
Ga0315907_106738581 3300031758 Freshwater MWPFGSRRKRAVVVFHIDHDGQARGAYFRGDADVLIIDERDPQNRVYRMQSETAADELRRKIGRHAIDRALDDKPNSSQMPVLNLHWRIDGSQTV
Ga0311367_107487971 3300031918 Fen MWPFSTKHDRAVVVFHIDQNGLAKGAFFRGQADVLIIDERDPRNRVYRMSQETPSDELRRKIGKHPLNRLLDPARMERNKQERPALSLHWRTDAEAVV
Ga0315910_105193361 3300032144 Soil MWPFESNTRKKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRIYRMGQETPDEELRRKIGKHPLGRLLADRSVDQDAGAPVLNLHWRTRDSATI
Ga0315910_105934852 3300032144 Soil MWPFESNQRKKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRIYRMGQETPDEELRRKIGKHPLGRLLADRSVDQDSGAPVLNLHWRTRDSATI
Ga0315912_100711244 3300032157 Soil MWPFESNQRKKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDPKNRIYRMGQETPDEELRRKIGKHPLGRLLADRSVDQQTGAPVLNLHWRTRDSATA
Ga0315912_105673042 3300032157 Soil FESHTRKKAVVVFHIDHNGLAKGAYFRGEADVLIIDERDAKNRIYRMGQETPDEELRRKIGKHPLGRLLADRSVDQQSGAPVLNVHWRTRDSATA
Ga0307470_102192681 3300032174 Hardwood Forest Soil MWPFETKQRKAVVVFHIDQNGLAKGAYFRGDADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILAEEQPDTRRTAPVLNLHWRTDAAGAA
Ga0307472_1022036911 3300032205 Hardwood Forest Soil MWPFEQKRRKAVVVLHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILAEKQDDPRHAAPVLNVHWRHDSVQAT
Ga0326726_114430312 3300033433 Peat Soil WSMWPFEHKRRKAVVVLHIDHNGLAKGAYFRGEADVLIIDERDPKNRVYRMGQETPDDELRRKIGKHPLGRILSEKQDDPRHAAPVLNLHWRHDSAQAI
Ga0334977_0004529_5968_6219 3300033978 Freshwater VVFHIDHDGQARGAYFRGDADVLIIDERDPQNRVYRMQSETAADELRRKIGRHAIDRALDDKPNSSQMPVLNLHWRIDGSQTV
Ga0335028_0319922_538_819 3300034071 Freshwater MWPFALKRNRAVVVFHIDHNGLAKGAFFKGDADVLIIDERDPANRVYRMTQETPDQELRRRIGKHPLRHLINRRDQGQPVINLHWRTDTARLV
Ga0335063_0557551_169_456 3300034111 Freshwater MWPFRSGKKKAVVVFHIDHDGQARGAYFRGDADVLIIDERDPQNRVYRMPSETAVDELRRKIGRHPIGRTLDDKPNASQLPVLNLKRRIDGPQTV
Ga0370480_0334458_46_342 3300034169 Untreated Peat Soil MWPFAHKQKKAVVVFHIDHNGLAKGAFFQGEADVLIIDERDPKNRVYRMSQETPSDELRRKIGRHPLGRLLDATRPGAERTANPVLSLHWRTDAEMVV




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice