Basic Information | |
---|---|
Family ID | F105032 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 100 |
Average Sequence Length | 40 residues |
Representative Sequence | MLFKIGVVLVGGFAIFVLISAMYAHISYTLMKMDEERDI |
Number of Associated Samples | 58 |
Number of Associated Scaffolds | 100 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Viruses |
% of genes with valid RBS motifs | 86.00 % |
% of genes near scaffold ends (potentially truncated) | 17.00 % |
% of genes from short scaffolds (< 2000 bps) | 56.00 % |
Associated GOLD sequencing projects | 54 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.54 |
Most Common Taxonomy | |
---|---|
Group | Predicted Viral (44.000 % of family members) |
NCBI Taxonomy ID | 10239 (predicted) |
Taxonomy | All Organisms → Viruses → Predicted Viral |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake (16.000 % of family members) |
Environment Ontology (ENVO) | Unclassified (41.000 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Water (non-saline) (57.000 % of family members) |
Predicted Topology & Secondary Structure | |
---|---|
Classification | Transmembrane (alpha-helical) |
Signal Peptide | No |
Secondary Structure distribution | α-helix: 55.22%, β-sheet: 0.00%, Coil/Unstructured: 44.78% |
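As a rough sanity check on the transmembrane classification above, the representative sequence can be scanned with a sliding-window hydropathy average. This is a minimal sketch that assumes the Kyte-Doolittle scale and a 19-residue window, which are common conventions for spotting membrane-spanning helices. It is not the predictor used to produce the classification on this page, and the names `KYTE_DOOLITTLE`, `REPRESENTATIVE`, and `hydropathy_profile` are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed here; not necessarily the
# method behind the classification reported in the table above).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Representative sequence copied from the Basic Information table.
REPRESENTATIVE = "MLFKIGVVLVGGFAIFVLISAMYAHISYTLMKMDEERDI"


def hydropathy_profile(sequence, window=19):
    """Return the mean hydropathy of every full-length window."""
    scores = [KYTE_DOOLITTLE[residue] for residue in sequence]
    return [
        sum(scores[start:start + window]) / window
        for start in range(len(scores) - window + 1)
    ]


profile = hydropathy_profile(REPRESENTATIVE)
peak = max(profile)
# A peak well above roughly 1.6 is a commonly used hint that a
# hydrophobic, potentially membrane-spanning stretch is present.
print(f"windows: {len(profile)}, peak mean hydropathy: {peak:.2f}")
```

The hydrophobic N-terminal stretch dominates the profile, while the charged C-terminal tail (`MDEERDI`) pulls the last windows down. That pattern fits a single helical membrane anchor, but a dedicated topology predictor would be needed to confirm it.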
Pfam ID | Name | % Frequency in 100 Family Scaffolds |
---|---|---|
PF01068 | DNA_ligase_A_M | 14.00 |
PF14743 | DNA_ligase_OB_2 | 9.00 |
PF00149 | Metallophos | 5.00 |
PF00136 | DNA_pol_B | 4.00 |
PF10504 | DUF2452 | 3.00 |
PF01764 | Lipase_3 | 3.00 |
PF13392 | HNH_3 | 2.00 |
PF00535 | Glycos_transf_2 | 2.00 |
PF03104 | DNA_pol_B_exo1 | 2.00 |
PF07460 | NUMOD3 | 2.00 |
PF05050 | Methyltransf_21 | 2.00 |
PF09889 | DUF2116 | 2.00 |
PF00085 | Thioredoxin | 2.00 |
PF09293 | RNaseH_C | 1.00 |
PF09834 | DUF2061 | 1.00 |
PF01844 | HNH | 1.00 |
PF13578 | Methyltransf_24 | 1.00 |
PF00383 | dCMP_cyt_deam_1 | 1.00 |
PF01223 | Endonuclease_NS | 1.00 |
PF04434 | SWIM | 1.00 |
PF00588 | SpoU_methylase | 1.00 |
PF13517 | FG-GAP_3 | 1.00 |
PF00692 | dUTPase | 1.00 |
PF00016 | RuBisCO_large | 1.00 |
PF12850 | Metallophos_2 | 1.00 |
PF02773 | S-AdoMet_synt_C | 1.00 |
PF13506 | Glyco_transf_21 | 1.00 |
COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds |
---|---|---|---|
COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 14.00 |
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 14.00 |
COG0417 | DNA polymerase B elongation subunit | Replication, recombination and repair [L] | 6.00 |
COG0192 | S-adenosylmethionine synthetase | Coenzyme transport and metabolism [H] | 1.00 |
COG0219 | tRNA(Leu) C34 or U34 (ribose-2'-O)-methylase TrmL, contains SPOUT domain | Translation, ribosomal structure and biogenesis [J] | 1.00 |
COG0565 | tRNA C32,U32 (ribose-2'-O)-methylase TrmJ or a related methyltransferase | Translation, ribosomal structure and biogenesis [J] | 1.00 |
COG0566 | tRNA G18 (ribose-2'-O)-methylase SpoU | Translation, ribosomal structure and biogenesis [J] | 1.00 |
COG0717 | dCTP deaminase | Nucleotide transport and metabolism [F] | 1.00 |
COG0756 | dUTP pyrophosphatase (dUTPase) | Defense mechanisms [V] | 1.00 |
COG1850 | Ribulose 1,5-bisphosphate carboxylase, large subunit, or a RuBisCO-like protein | Carbohydrate transport and metabolism [G] | 1.00 |
COG1864 | DNA/RNA endonuclease G, NUC1 | Nucleotide transport and metabolism [F] | 1.00 |
COG4279 | Uncharacterized protein, contains SWIM-type Zn finger domain | Function unknown [S] | 1.00 |
COG4715 | Uncharacterized protein, contains SWIM-type Zn finger domain | Function unknown [S] | 1.00 |
COG5431 | Predicted nucleic acid-binding protein, contains SWIM-type Zn-finger domain | General function prediction only [R] | 1.00 |
Name | Rank | Taxonomy | Distribution |
---|---|---|---|
All Organisms | root | All Organisms | 63.00 % |
Unclassified | root | N/A | 37.00 % |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
3300001838|RCM33_1093185 | Not Available | 719 | Open in IMG/M |
3300001844|RCM35_1156882 | Not Available | 743 | Open in IMG/M |
3300005527|Ga0068876_10017745 | All Organisms → Viruses → Predicted Viral | 4528 | Open in IMG/M |
3300005527|Ga0068876_10018389 | All Organisms → cellular organisms → Bacteria | 4442 | Open in IMG/M |
3300005527|Ga0068876_10055596 | All Organisms → Viruses → Predicted Viral | 2407 | Open in IMG/M |
3300005527|Ga0068876_10062810 | All Organisms → Viruses → Predicted Viral | 2248 | Open in IMG/M |
3300005527|Ga0068876_10078323 | All Organisms → Viruses → Predicted Viral | 1986 | Open in IMG/M |
3300005527|Ga0068876_10146913 | All Organisms → Viruses → Predicted Viral | 1387 | Open in IMG/M |
3300005527|Ga0068876_10173742 | All Organisms → Viruses → Predicted Viral | 1258 | Open in IMG/M |
3300005527|Ga0068876_10183694 | All Organisms → Viruses → Predicted Viral | 1218 | Open in IMG/M |
3300005527|Ga0068876_10214336 | All Organisms → Viruses → Predicted Viral | 1113 | Open in IMG/M |
3300005527|Ga0068876_10629898 | All Organisms → cellular organisms → Bacteria | 579 | Open in IMG/M |
3300005662|Ga0078894_10122221 | Not Available | 2318 | Open in IMG/M |
3300005805|Ga0079957_1009249 | Not Available | 7559 | Open in IMG/M |
3300005805|Ga0079957_1033588 | All Organisms → Viruses → Predicted Viral | 3360 | Open in IMG/M |
3300005805|Ga0079957_1059953 | All Organisms → Viruses → Predicted Viral | 2268 | Open in IMG/M |
3300005805|Ga0079957_1075169 | All Organisms → Viruses → Predicted Viral | 1933 | Open in IMG/M |
3300005805|Ga0079957_1092290 | All Organisms → Viruses → Predicted Viral | 1675 | Open in IMG/M |
3300005805|Ga0079957_1123212 | All Organisms → Viruses → Predicted Viral | 1366 | Open in IMG/M |
3300005805|Ga0079957_1169348 | All Organisms → cellular organisms → Bacteria | 1088 | Open in IMG/M |
3300005805|Ga0079957_1218064 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 907 | Open in IMG/M |
3300005805|Ga0079957_1272196 | Not Available | 775 | Open in IMG/M |
3300006030|Ga0075470_10007918 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 3275 | Open in IMG/M |
3300006641|Ga0075471_10008937 | All Organisms → cellular organisms → Bacteria | 6219 | Open in IMG/M |
3300006641|Ga0075471_10074789 | All Organisms → Viruses → Predicted Viral | 1847 | Open in IMG/M |
3300006875|Ga0075473_10189722 | Not Available | 829 | Open in IMG/M |
3300007202|Ga0103274_1208595 | Not Available | 3071 | Open in IMG/M |
3300007541|Ga0099848_1170693 | Not Available | 795 | Open in IMG/M |
3300007974|Ga0105747_1037365 | All Organisms → Viruses → Predicted Viral | 1395 | Open in IMG/M |
3300008107|Ga0114340_1000438 | Not Available | 48591 | Open in IMG/M |
3300008107|Ga0114340_1024802 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 2791 | Open in IMG/M |
3300008107|Ga0114340_1166604 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 790 | Open in IMG/M |
3300008110|Ga0114343_1064685 | All Organisms → cellular organisms → Bacteria | 3545 | Open in IMG/M |
3300008110|Ga0114343_1095613 | Not Available | 1038 | Open in IMG/M |
3300008113|Ga0114346_1015564 | All Organisms → Viruses → Predicted Viral | 4201 | Open in IMG/M |
3300008116|Ga0114350_1006916 | Not Available | 7629 | Open in IMG/M |
3300008117|Ga0114351_1087852 | All Organisms → Viruses → Predicted Viral | 2790 | Open in IMG/M |
3300008120|Ga0114355_1034256 | All Organisms → cellular organisms → Bacteria | 2485 | Open in IMG/M |
3300008266|Ga0114363_1040856 | All Organisms → Viruses → Predicted Viral | 2336 | Open in IMG/M |
3300008267|Ga0114364_1035566 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1905 | Open in IMG/M |
3300008448|Ga0114876_1014504 | All Organisms → Viruses → Predicted Viral | 4304 | Open in IMG/M |
3300008448|Ga0114876_1042148 | All Organisms → Viruses → Predicted Viral | 2133 | Open in IMG/M |
3300010293|Ga0116204_1027684 | All Organisms → Viruses → Predicted Viral | 2250 | Open in IMG/M |
3300010354|Ga0129333_10041737 | All Organisms → Viruses → Predicted Viral | 4314 | Open in IMG/M |
3300010354|Ga0129333_10114242 | Not Available | 2490 | Open in IMG/M |
3300010354|Ga0129333_10265154 | All Organisms → Viruses → Predicted Viral | 1545 | Open in IMG/M |
3300010354|Ga0129333_10672339 | Not Available | 892 | Open in IMG/M |
3300010354|Ga0129333_11214710 | Not Available | 626 | Open in IMG/M |
3300010354|Ga0129333_11551707 | Not Available | 541 | Open in IMG/M |
3300010370|Ga0129336_10412058 | Not Available | 737 | Open in IMG/M |
3300011268|Ga0151620_1007057 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Maricaulales → Robiginitomaculaceae → Robiginitomaculum → unclassified Robiginitomaculum → Robiginitomaculum sp. | 4103 | Open in IMG/M |
3300012968|Ga0129337_1002285 | Not Available | 793 | Open in IMG/M |
3300012968|Ga0129337_1003219 | Not Available | 607 | Open in IMG/M |
3300012970|Ga0129338_1533504 | Not Available | 940 | Open in IMG/M |
(restricted) 3300013126|Ga0172367_10023407 | Not Available | 5680 | Open in IMG/M |
3300013372|Ga0177922_10266034 | Not Available | 712 | Open in IMG/M |
3300019784|Ga0181359_1040457 | All Organisms → Viruses → Predicted Viral | 1806 | Open in IMG/M |
3300020074|Ga0194113_10076896 | All Organisms → Viruses → Predicted Viral | 3041 | Open in IMG/M |
3300020083|Ga0194111_10041794 | All Organisms → Viruses → Predicted Viral | 4164 | Open in IMG/M |
3300020084|Ga0194110_10776143 | Not Available | 584 | Open in IMG/M |
3300020160|Ga0211733_11027181 | All Organisms → Viruses → Predicted Viral | 1857 | Open in IMG/M |
3300020172|Ga0211729_10072468 | Not Available | 12801 | Open in IMG/M |
3300020183|Ga0194115_10045188 | Not Available | 2858 | Open in IMG/M |
3300020183|Ga0194115_10089225 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Rhodospirillaceae → Candidatus Endolissoclinum → unclassified Candidatus Endolissoclinum → Candidatus Endolissoclinum sp. TMED37 | 1757 | Open in IMG/M |
3300021092|Ga0194122_10092027 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Rhodospirillaceae → Candidatus Endolissoclinum → unclassified Candidatus Endolissoclinum → Candidatus Endolissoclinum sp. TMED37 | 1700 | Open in IMG/M |
3300021092|Ga0194122_10151344 | All Organisms → Viruses → Predicted Viral | 1246 | Open in IMG/M |
3300021376|Ga0194130_10028637 | Not Available | 4407 | Open in IMG/M |
3300021961|Ga0222714_10000149 | Not Available | 79905 | Open in IMG/M |
3300021961|Ga0222714_10019442 | Not Available | 5364 | Open in IMG/M |
3300021961|Ga0222714_10038819 | Not Available | 3432 | Open in IMG/M |
3300021962|Ga0222713_10079995 | All Organisms → Viruses → Predicted Viral | 2391 | Open in IMG/M |
3300022179|Ga0181353_1128278 | Not Available | 601 | Open in IMG/M |
3300024510|Ga0255187_1012469 | All Organisms → Viruses → Predicted Viral | 1208 | Open in IMG/M |
3300025585|Ga0208546_1008779 | Not Available | 2705 | Open in IMG/M |
3300025732|Ga0208784_1000764 | Not Available | 14604 | Open in IMG/M |
3300025732|Ga0208784_1087055 | Not Available | 939 | Open in IMG/M |
3300025872|Ga0208783_10048939 | All Organisms → Viruses → Predicted Viral | 1946 | Open in IMG/M |
3300027160|Ga0255198_1024566 | All Organisms → Viruses → Predicted Viral | 1138 | Open in IMG/M |
3300027659|Ga0208975_1144521 | Not Available | 666 | Open in IMG/M |
3300027793|Ga0209972_10261344 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 777 | Open in IMG/M |
3300027805|Ga0209229_10139224 | All Organisms → Viruses → Predicted Viral | 1094 | Open in IMG/M |
3300027816|Ga0209990_10041662 | All Organisms → Viruses → Predicted Viral | 2388 | Open in IMG/M |
3300027816|Ga0209990_10056211 | All Organisms → Viruses → Predicted Viral | 1992 | Open in IMG/M |
3300029930|Ga0119944_1002633 | All Organisms → cellular organisms → Bacteria | 3077 | Open in IMG/M |
3300029933|Ga0119945_1003957 | All Organisms → Viruses → Predicted Viral | 2110 | Open in IMG/M |
3300029933|Ga0119945_1015966 | Not Available | 928 | Open in IMG/M |
3300031758|Ga0315907_10036559 | All Organisms → Viruses → Predicted Viral | 4393 | Open in IMG/M |
3300031787|Ga0315900_10228442 | All Organisms → Viruses → Predicted Viral | 1614 | Open in IMG/M |
3300031857|Ga0315909_10008475 | Not Available | 11259 | Open in IMG/M |
3300031857|Ga0315909_10150510 | All Organisms → Viruses → Predicted Viral | 1917 | Open in IMG/M |
3300031857|Ga0315909_10317557 | All Organisms → Viruses → Predicted Viral | 1155 | Open in IMG/M |
3300031857|Ga0315909_10431278 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 933 | Open in IMG/M |
3300031951|Ga0315904_10069167 | All Organisms → Viruses → Predicted Viral | 3833 | Open in IMG/M |
3300031951|Ga0315904_10320239 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1443 | Open in IMG/M |
3300031963|Ga0315901_10008281 | Not Available | 12109 | Open in IMG/M |
3300031963|Ga0315901_10183152 | All Organisms → Viruses → Predicted Viral | 1831 | Open in IMG/M |
3300031963|Ga0315901_10508024 | Not Available | 937 | Open in IMG/M |
3300032050|Ga0315906_10251161 | All Organisms → Viruses → Predicted Viral | 1625 | Open in IMG/M |
3300032050|Ga0315906_10429867 | All Organisms → Viruses → Predicted Viral | 1140 | Open in IMG/M |
3300032093|Ga0315902_10584253 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 944 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
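The scaffold entries above appear to combine a taxon OID and a scaffold name separated by `|`, and the protein IDs listed further down appear to be the scaffold name with a gene index appended (for example, scaffold `Ga0068876_10017745` and protein `Ga0068876_100177452`). The sketch below relies on that pattern, which is inferred from the rows on this page rather than from a documented specification. The helper names `split_scaffold_entry` and `gene_index` are hypothetical.

```python
def split_scaffold_entry(entry):
    """Split 'taxon_oid|scaffold_name', ignoring a '(restricted)' tag."""
    cleaned = entry.replace("(restricted)", "").strip()
    taxon_oid, scaffold_name = cleaned.split("|", 1)
    return taxon_oid.strip(), scaffold_name.strip()


def gene_index(protein_id, scaffold_name):
    """Return the numeric suffix that follows the scaffold name.

    Assumes (from the rows on this page) that a protein ID is the
    scaffold name followed by a gene index. Returns None otherwise.
    """
    cleaned = protein_id.replace("(restricted)", "").strip()
    if not cleaned.startswith(scaffold_name):
        return None
    suffix = cleaned[len(scaffold_name):]
    return int(suffix) if suffix.isdigit() else None


taxon_oid, scaffold = split_scaffold_entry("3300005527|Ga0068876_10017745")
print(taxon_oid, scaffold, gene_index("Ga0068876_100177452", scaffold))
```

Prefix matching can be ambiguous when one scaffold name is a prefix of another, so restricting candidates to the same taxon OID before matching is a sensible precaution.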
Habitat | Taxonomy | Distribution |
---|---|---|
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 16.00% |
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 14.00% |
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 12.00% |
Freshwater, Plankton | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater, Plankton | 11.00% |
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 11.00% |
Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Lake | 9.00% |
Freshwater To Marine Saline Gradient | Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient | 7.00% |
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 4.00% |
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 3.00% |
Aquatic | Environmental → Aquatic → Freshwater → Drinking Water → Unclassified → Aquatic | 3.00% |
Marine Plankton | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Marine Plankton | 2.00% |
Freshwater | Environmental → Aquatic → Freshwater → River → Unclassified → Freshwater | 2.00% |
Freshwater Lentic | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lentic | 1.00% |
Freshwater And Sediment | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater And Sediment | 1.00% |
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 1.00% |
Anoxic Lake Water | Environmental → Aquatic → Freshwater → Lake → Unclassified → Anoxic Lake Water | 1.00% |
Freshwater | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater | 1.00% |
Estuary Water | Environmental → Aquatic → Marine → Coastal → Unclassified → Estuary Water | 1.00% |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
3300001838 | Marine plankton microbial communities from the Amazon River plume, Atlantic Ocean - RCM33, ROCA_DNA217_0.2um_bLM_C_2a | Environmental | Open in IMG/M |
3300001844 | Marine plankton microbial communities from the Amazon River plume, Atlantic Ocean - RCM35, ROCA_DNA220_0.2um_bLM_C_3a | Environmental | Open in IMG/M |
3300005527 | Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel5S_2200h metaG | Environmental | Open in IMG/M |
3300005662 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MLB.SD (version 4) | Environmental | Open in IMG/M |
3300005805 | Microbial and algae communities from Cheney Reservoir in Wichita, Kansas, USA | Environmental | Open in IMG/M |
3300006030 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_D_<0.8_DNA | Environmental | Open in IMG/M |
3300006641 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_D_>0.8_DNA | Environmental | Open in IMG/M |
3300006875 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_N_>0.8_DNA | Environmental | Open in IMG/M |
3300007202 | Combined Assembly of cyanobacterial bloom in Marina Bay water reservoir, Singapore (Monthly Sampling-Site C) 9 sequencing projects | Environmental | Open in IMG/M |
3300007541 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG | Environmental | Open in IMG/M |
3300007974 | Coastal water column microbial communities from Columbia River Estuary, Oregon, USA - CMOP_DNA_1460C_0.2um | Environmental | Open in IMG/M |
3300008107 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE2, Sample E2014-0046-3-NA | Environmental | Open in IMG/M |
3300008110 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample E2014-0048-3-NA | Environmental | Open in IMG/M |
3300008113 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE4, Sample E2014-0050-3-NA | Environmental | Open in IMG/M |
3300008116 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE2, Sample E2014-0106-3-NA | Environmental | Open in IMG/M |
3300008117 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample E2014-0108-C-NA | Environmental | Open in IMG/M |
3300008120 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample E2014-0108-3-NA | Environmental | Open in IMG/M |
3300008266 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample HABS-E2014-0108-C-NA | Environmental | Open in IMG/M |
3300008267 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, sample HABS-E2014-0024-100-LTR | Environmental | Open in IMG/M |
3300008448 | Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - August 4, 2014 all contigs | Environmental | Open in IMG/M |
3300010293 | Anoxic lake water microbial communities from Lake Kivu, Rwanda to study Microbial Dark Matter (Phase II) - Lake Kivu water 52m metaG | Environmental | Open in IMG/M |
3300010354 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.8_DNA | Environmental | Open in IMG/M |
3300010370 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.2_DNA | Environmental | Open in IMG/M |
3300011268 | Sub-surface freshwater microbial communities from San Francisco Estuary Delta, California, USA. Combined Assembly of Gp0173482, Gp0175554, Gp0175555 | Environmental | Open in IMG/M |
3300012968 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.2_RNA1 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300012970 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.2_RNA2 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300013126 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_10m | Environmental | Open in IMG/M |
3300013372 | Freshwater microbial communities from Lake Erie, Ontario, Canada. Combined Assembly of 10 SPs | Environmental | Open in IMG/M |
3300019784 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.D | Environmental | Open in IMG/M |
3300020074 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015017 Mahale Deep Cast 200m | Environmental | Open in IMG/M |
3300020083 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015033 Kigoma Deep Cast 300m | Environmental | Open in IMG/M |
3300020084 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015032 Kigoma Deep Cast 1200m | Environmental | Open in IMG/M |
3300020160 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_105 megahit1 | Environmental | Open in IMG/M |
3300020172 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_102 megahit1 | Environmental | Open in IMG/M |
3300020183 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015002 Mahale S4 surface | Environmental | Open in IMG/M |
3300021092 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015021 Mahale Deep Cast 10m | Environmental | Open in IMG/M |
3300021376 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015050 Kigoma 12 surface | Environmental | Open in IMG/M |
3300021961 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_3D | Environmental | Open in IMG/M |
3300021962 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_649D | Environmental | Open in IMG/M |
3300022179 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MLB.D.N | Environmental | Open in IMG/M |
3300024510 | Freshwater microbial communities from Mississippi River, Louisiana, United States - Miss_Cont_RepA_8h | Environmental | Open in IMG/M |
3300025585 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_D_<0.8_DNA (SPAdes) | Environmental | Open in IMG/M |
3300025732 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_N_>0.8_DNA (SPAdes) | Environmental | Open in IMG/M |
3300025872 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_D_>0.8_DNA (SPAdes) | Environmental | Open in IMG/M |
3300027160 | Freshwater microbial communities from Mississippi River, Louisiana, United States - Miss_Law_RepC_8h | Environmental | Open in IMG/M |
3300027659 | Freshwater lentic microbial communities from great Laurentian Lakes, MI, USA - Great Lakes metaG ER78MSRF (SPAdes) | Environmental | Open in IMG/M |
3300027793 | Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel1S_2200h metaG (SPAdes) | Environmental | Open in IMG/M |
3300027805 | Freshwater and sediment microbial communities from dead zone in Sandusky Bay, Ohio, USA (SPAdes) | Environmental | Open in IMG/M |
3300027816 | Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel5S_2200h metaG (SPAdes) | Environmental | Open in IMG/M |
3300029930 | Aquatic microbial communities from drinking water treatment plant in Pearl River Delta area, China - influent_20120727 | Environmental | Open in IMG/M |
3300029933 | Aquatic microbial communities from drinking water treatment plant in Pearl River Delta area, China - influent_20120727_2 | Environmental | Open in IMG/M |
3300031758 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA123 | Environmental | Open in IMG/M |
3300031787 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA114 | Environmental | Open in IMG/M |
3300031857 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA125 | Environmental | Open in IMG/M |
3300031951 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA120 | Environmental | Open in IMG/M |
3300031963 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA116 | Environmental | Open in IMG/M |
3300032050 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA122 | Environmental | Open in IMG/M |
3300032093 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA117 | Environmental | Open in IMG/M |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Protein ID | Sample Taxon ID | Habitat | Sequence |
---|---|---|---|
RCM33_10931852 | 3300001838 | Marine Plankton | MLFKIGVFLVGGFAIFVLISAIVAHISYTLMKMDEERDLL* |
RCM35_11568822 | 3300001844 | Marine Plankton | MLFKIGVFLVGGFAIFVLISAIVAHISYTLMKMDEERDIS* |
Ga0068876_100177452 | 3300005527 | Freshwater Lake | MVFKIGVVLVCAFAIFVLISAIYAHISYTLIKMDEERDIS* |
Ga0068876_100183898 | 3300005527 | Freshwater Lake | MLFKISVVLVGLFAIVVLLAAMYAHISYALIKMDEERDIS* |
Ga0068876_100555961 | 3300005527 | Freshwater Lake | MLFKVSIVLVCLFAIVVLFSAMFAHISYYIIKKDEERDIS* |
Ga0068876_100628101 | 3300005527 | Freshwater Lake | MLFKVSIVLACLFAIVVLFSTMYARISYYLIKKDEERDFS* |
Ga0068876_100783233 | 3300005527 | Freshwater Lake | MLFKISVILVGLFAIFVLISAIYAHISYALMKMDEERDIS* |
Ga0068876_101469132 | 3300005527 | Freshwater Lake | MLFKIGVVVVGGFAIFTLLSAIYVHISYALMKMDEEN* |
Ga0068876_101737422 | 3300005527 | Freshwater Lake | MVFKIGVVLVCGFAIFVLISAIYAHISYKLMKMDEERDIS* |
Ga0068876_101836942 | 3300005527 | Freshwater Lake | MLFKIGVVVVGGFAIFTLLSAIYAHISYSLIKMDEERDI* |
Ga0068876_102143361 | 3300005527 | Freshwater Lake | MLFKVSIVLVCLFAIVVLFSAMFAHISYYIIKKDEERDF* |
Ga0068876_106298981 | 3300005527 | Freshwater Lake | MLFKIGVVLIGGFAIFTLLSAIYAHISYALMKMDEERDV* |
Ga0078894_101222213 | 3300005662 | Freshwater Lake | MLFKTSVIIVGLFAIFVLISAIYAHISYAIMKMDSERDIS* |
Ga0079957_100924911 | 3300005805 | Lake | MLFKVGVVLVGGFAIFTLFSAIYAHISYAIMKADKERDI* |
Ga0079957_10335881 | 3300005805 | Lake | SSSGSSSQRTNMLFKIGVIILCTFAILVLFSAMYAHISYYLIKKDEERDFS* |
Ga0079957_10599535 | 3300005805 | Lake | MLFKIGVFLVGGLAIFTLISAMYAHISYYLIKKDEERDIS* |
Ga0079957_10751692 | 3300005805 | Lake | MLFKIGIFLVGGLAIFTLISAMYAHISYYLIKKDEERDFS* |
Ga0079957_10922903 | 3300005805 | Lake | MLFKIGVVLVGGFAIFTLLSAIYAHIVKYLMDKIDEEHEV* |
Ga0079957_11232124 | 3300005805 | Lake | MLFKIGVVLVGGFAIFVLISAMYAHISYTLMKMDEERDI |
Ga0079957_11693484 | 3300005805 | Lake | MLFKISVVLVGVFAIVVLISAIYAHISYALIKMDEERDIS* |
Ga0079957_12180642 | 3300005805 | Lake | MLFKIGVVLTCAFAIFVLISAIYAHISYTLIKMDEERDIS* |
Ga0079957_12721962 | 3300005805 | Lake | MLFKVSVVLVCLFAIVVLISAIMAHVSYHLIKKDEERDFS* |
Ga0075470_100079185 | 3300006030 | Aqueous | MMLFKISVVLVGLFAIAVLISAMVAHVSYELMKRDEERDRT* |
Ga0075471_100089377 | 3300006641 | Aqueous | MLFKISVVLVAGFAVFVLFSAIYAHISYTLMKMDEERDI* |
Ga0075471_100747892 | 3300006641 | Aqueous | MLFKIGVVVVGGFAIFTLLSAIYAHISYALMKMDEERDI* |
Ga0075473_101897222 | 3300006875 | Aqueous | MLFKIGVVLVGTFAVFVVFSAIYAHISYTLMKIDEERDL* |
Ga0103274_12085955 | 3300007202 | Freshwater Lake | MLFKVSVILVGLFAIVVLISAMMAHVSYHLIKKDEERDFS* |
Ga0099848_11706932 | 3300007541 | Aqueous | NMLFKIGVVLVGGFAIFVLISAMYAHISYTLMKMDEERDIS*LILD* |
Ga0105747_10373651 | 3300007974 | Estuary Water | MLFKIGVVLVGGFAIFVLISAMYAHISYALMKMDEERDL* |
Ga0114340_100043871 | 3300008107 | Freshwater, Plankton | MLFKIGVVLVCAFAIFVIFSAIYAHISYALLKIDEERNNL* |
Ga0114340_10248024 | 3300008107 | Freshwater, Plankton | MLFKIGVILACAFAIFVLISAMLAHISYHIIKKDEERDFS* |
Ga0114340_11666043 | 3300008107 | Freshwater, Plankton | MLFKISVVIVGLFAIFVLISAIYAHISYALMKMDE |
Ga0114343_10646855 | 3300008110 | Freshwater, Plankton | MLFKISMVLVCLFAIAVLISAMVAHVSYEIMKRDEERDIS* |
Ga0114343_10956134 | 3300008110 | Freshwater, Plankton | MLFKISVVIVGLFAIFVLISAIYAHISYALMKMDEERDIS* |
Ga0114346_10155641 | 3300008113 | Freshwater, Plankton | MLFKIGVVLVGGFAIFVLISAMYAHISYAIMKMDEERDI* |
Ga0114350_10069165 | 3300008116 | Freshwater, Plankton | MLFKVSIVLACLFAIVVLFSTMYARISYYLIKKDEERDF* |
Ga0114351_10878525 | 3300008117 | Freshwater, Plankton | MLFKIGVVVVGCFAIFTLLSAIYVHISYALMKMDEEN* |
Ga0114355_10342562 | 3300008120 | Freshwater, Plankton | MLFKISMVLVCLFAIAVLISAMVSHVSYEIMKRDEERDIS* |
Ga0114363_10408563 | 3300008266 | Freshwater, Plankton | MLFKIGVVLVCAFAIFVIISAIYAHISYTLIKMDEERDIS* |
Ga0114364_10355663 | 3300008267 | Freshwater, Plankton | MLFKVGVVLVFGFALFVLGSAMYAHISYALMKKDEERNSL* |
Ga0114876_10145041 | 3300008448 | Freshwater Lake | MLFKIGVVLVSGFAIFVLISAIYAHISYTLMKMDEERDL* |
Ga0114876_10421483 | 3300008448 | Freshwater Lake | MMLFKIGVFLVGGFAIFVLCSAIYAHISYALIKADKERDI* |
Ga0116204_10276842 | 3300010293 | Anoxic Lake Water | MLFKVGVVLVCGFAIFVLISAMYAHISYYLIKKDEERDIS* |
Ga0129333_100417372 | 3300010354 | Freshwater To Marine Saline Gradient | MLFKIGVVLIGGFAIFVLLSAIYAHIMYAKWRNEV* |
Ga0129333_101142422 | 3300010354 | Freshwater To Marine Saline Gradient | MLFKIGVVLVGGFAIFTLFSAIYVHISYALMKADKERDI* |
Ga0129333_102651543 | 3300010354 | Freshwater To Marine Saline Gradient | MLFKISVVLVALFAIAVLISAIVAHVSYELMKRDEERDRT* |
Ga0129333_106723391 | 3300010354 | Freshwater To Marine Saline Gradient | IGVVLIGGFAIFTLLSAIYAHISYALMKMDEERDV* |
Ga0129333_112147101 | 3300010354 | Freshwater To Marine Saline Gradient | MLFKIGVVLIGGFAIFTLLSAIYAHISYALMKMDEERDI* |
Ga0129333_115517072 | 3300010354 | Freshwater To Marine Saline Gradient | ILMLFKISVVLVGLFAIVVLLSAIYAHISYALIKMDEERDIS* |
Ga0129336_104120581 | 3300010370 | Freshwater To Marine Saline Gradient | NMLFKIGVVLVGGFAIFVLISAMYAHISYTLMKMDEERDIS* |
Ga0151620_10070576 | 3300011268 | Freshwater | MLFKVSAVLVCLFAIVVLISAIMAQVSYHLIKKDEERDFS* |
Ga0129337_10022852 | 3300012968 | Aqueous | MLFKISVVLVGLFAIVVLLSAIYAHISYALIKMDEERDIS* |
Ga0129337_10032191 | 3300012968 | Aqueous | MLFKISMVLVCLFAIAVLISAMIAHVSYEIMKRDEERDISLTLD* |
Ga0129338_15335042 | 3300012970 | Aqueous | MLFKISVVLVALFAIVVLLSAIYAHISYALIKMDEERDIS* |
(restricted) Ga0172367_100234078 | 3300013126 | Freshwater | MLFKVVGVLLCLFAIVVLGAAMYAHISYYLITKDEERDFS* |
Ga0177922_102660342 | 3300013372 | Freshwater | NMVFKIGVVLVCGFAIFVLISAMYAHISYAIMKMDEERDI*ISD* |
Ga0181359_10404573 | 3300019784 | Freshwater Lake | MLFKIGVVLVFGFAIFTLLSAIYAHISYSLIKMDEERDI |
Ga0194113_100768965 | 3300020074 | Freshwater Lake | MLFKVGVVLVCGFAIFVLISAMYAHISYYLIKKDEERDFS |
Ga0194111_100417947 | 3300020083 | Freshwater Lake | MLFKIGVVLVCAFAIFVLISAMYAHISYYLIKKDEERDFS |
Ga0194110_107761431 | 3300020084 | Freshwater Lake | MLFKISAVLLCLFSIVVLISAMYAHISYHLIKKDEERDFS |
Ga0211733_110271812 | 3300020160 | Freshwater | MLFKIGIILLCIVAIVTLISAIYAHISYTLIKMDEERDIS |
Ga0211729_1007246811 | 3300020172 | Freshwater | MLFKVGVFLVCGFAIFTLISAIYVHISYYFIKKEQRKGL |
Ga0194115_100451882 | 3300020183 | Freshwater Lake | MLFKISAVLLCLLSIVVLISAMYAHISYHLIKKDEERDFS |
Ga0194115_100892252 | 3300020183 | Freshwater Lake | MLFKISAVLLCLFAIVVLISAMYAHISYYLITKDEERDFS |
Ga0194122_100920271 | 3300021092 | Freshwater Lake | FTWFGSEGNMLFKISAVLLCLFSIVVLISAMYAHISYHLIKKDEERDFS |
Ga0194122_101513445 | 3300021092 | Freshwater Lake | MLFKISAVLLCLFAIVVLISAMYAHISYHLIKKDEERDFS |
Ga0194130_1002863712 | 3300021376 | Freshwater Lake | KISAVLLCLFAIVVLISAMYAHISYYLITKDEERDFS |
Ga0222714_10000149138 | 3300021961 | Estuarine Water | MLFKISVVLVGLFAIVVLLSAIYAHISYALIKMDEERDIS |
Ga0222714_100194426 | 3300021961 | Estuarine Water | MLFKVSAVLVCLFAIVVLISAIMAQVSYHLIKKDEERDFS |
Ga0222714_100388197 | 3300021961 | Estuarine Water | MLFKVSVVLVCLFAIVVLISAIMAHVSYHLIKKDEERDFS |
Ga0222713_100799953 | 3300021962 | Estuarine Water | MVFKIGVVLVCGFAIFVLISAIYAHISYKLMKMDEERDIS |
Ga0181353_11282782 | 3300022179 | Freshwater Lake | MLFKISVVLVGLFAIVVLLAAMYAHISYALIKMDEERDIS |
Ga0255187_10124692 | 3300024510 | Freshwater | MLFKIGVVVVGGFAIFTLLSAIYVHISYALMKMDEEN |
Ga0208546_10087796 | 3300025585 | Aqueous | MMLFKISVVLVGLFAIAVLISAMVAHVSYELMKRDEERDRT |
Ga0208784_10007649 | 3300025732 | Aqueous | MLFKISVVLVAGFAVFVLFSAIYAHISYTLMKMDEERDI |
Ga0208784_10870552 | 3300025732 | Aqueous | MLFKIGVVLVGGFAVFVVFSAIYAHLSYARMKMDEERDV |
Ga0208783_100489392 | 3300025872 | Aqueous | MMLFKIGVVVVGGFAIFTLLSAIYAHISYALMKMDEERDI |
Ga0255198_10245662 | 3300027160 | Freshwater | MLFKIGVVLVGGFAIFVLISAMYAHISYALMKMDEERDL |
Ga0208975_11445212 | 3300027659 | Freshwater Lentic | MVFKIGVVLVCAFAIFVLISAIYAHISYTLIKMDEERDIS |
Ga0209972_102613442 | 3300027793 | Freshwater Lake | MLFKVSIVLVCLFAIVVLFSAMFAHISYYIIKKDEERDIS |
Ga0209229_101392241 | 3300027805 | Freshwater And Sediment | LFISSSQRTNMLFKIGIVLVCAFAIFVLISAIYAHISYTLIKMDEERDFS |
Ga0209990_100416621 | 3300027816 | Freshwater Lake | MLFKVSIVLACLFAIVVLFSTMYARISYYLIKKDEERDFS |
Ga0209990_100562113 | 3300027816 | Freshwater Lake | MLFKISVILVGLFAIFVLISAIYAHISYALMKMDEERDIS |
Ga0119944_10026338 | 3300029930 | Aquatic | MLFKVSVVLVCLFAIVVLISAMVAHVSYHLIKKDEEREFS |
Ga0119945_10039573 | 3300029933 | Aquatic | MLFKIGVVLACAFAIFVLISAMLAHISYYLIKKDEERDFS |
Ga0119945_10159662 | 3300029933 | Aquatic | MLFKVSMVLMCLFAIVVLISAMVAQVSYHLIKKDEERDFS |
Ga0315907_100365594 | 3300031758 | Freshwater | MLFKIGVILACAFAIFVLISAMLAHISYHIIKKDEERDFS |
Ga0315900_102284422 | 3300031787 | Freshwater | MLFKIGVVVVGGFAIFTLLSAIYAHISYSLIKMDEERDI |
Ga0315909_100084757 | 3300031857 | Freshwater | MLFKIGVVLIGGFAIFTLLSAIYAHISYALMKMDEERDV |
Ga0315909_101505101 | 3300031857 | Freshwater | MLFKVSIVLVCLFAIVVLFSAMFAHISYYIIKKDEERDF |
Ga0315909_103175571 | 3300031857 | Freshwater | MLFKISMVLVCLFAIAVLISAMVAHVSYEIMKRDEERDIS |
Ga0315909_104312782 | 3300031857 | Freshwater | MLFKVSIVLVCLFAIVVLFSAMFAHISYYIIKKDEERD |
Ga0315904_100691677 | 3300031951 | Freshwater | MLFKVGVVLVFGFALFVLGSAMYAHISYALIKKDDERNAL |
Ga0315904_103202393 | 3300031951 | Freshwater | MLFKVGVVFIFGFALFVVCSAMYAHISYALMKKDEERNSL |
Ga0315901_1000828118 | 3300031963 | Freshwater | MLFKIGVVLVCAFAIFVIFSAIYAHISYALLKIDEERNNL |
Ga0315901_101831522 | 3300031963 | Freshwater | MLFKVGVVLVFGFALFVLGSAMYAHISYALMKKDEERNTL |
Ga0315901_105080242 | 3300031963 | Freshwater | LFKVGVVLVFGFALFVLGSAMYAHISYALMKKDEERNTL |
Ga0315906_102511613 | 3300032050 | Freshwater | KVSIVLVCLFAIVVLFSAMFAHISYYIIKKDEERDF |
Ga0315906_104298671 | 3300032050 | Freshwater | MLFKVGVFLVCGFAIFTLISAIYAHISYSLIKNDEERNNL |
Ga0315902_105842531 | 3300032093 | Freshwater | MLFKISVVIVGLFAIFVLISAIYAHISYALMKMDEERDIS |