


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300009344 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0117984 | Gp0126423 | Ga0103834 |
| Sample Name | Microbial communities of water from the North Atlantic ocean - ACM58 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | University of Georgia |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 45141334 |
| Sequencing Scaffolds | 15 |
| Novel Protein Genes | 19 |
| Associated Families | 15 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 5 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 2 |
| All Organisms → cellular organisms → Eukaryota → Sar | 4 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Xanthomarina → Xanthomarina gelatinilytica | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 2 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → River Water → Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → surface water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | North Pacific Ocean | |||||||
| Coordinates | Lat. (o) | N/A | Long. (o) | N/A | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000237 | Metagenome / Metatranscriptome | 1498 | Y |
| F001145 | Metagenome / Metatranscriptome | 765 | Y |
| F001219 | Metagenome / Metatranscriptome | 744 | Y |
| F001478 | Metagenome / Metatranscriptome | 687 | Y |
| F001583 | Metagenome / Metatranscriptome | 668 | Y |
| F003081 | Metagenome / Metatranscriptome | 508 | Y |
| F004358 | Metagenome / Metatranscriptome | 442 | Y |
| F005505 | Metagenome / Metatranscriptome | 398 | Y |
| F006008 | Metagenome / Metatranscriptome | 384 | Y |
| F013645 | Metagenome / Metatranscriptome | 269 | Y |
| F027191 | Metagenome / Metatranscriptome | 195 | Y |
| F030104 | Metagenome / Metatranscriptome | 186 | Y |
| F051986 | Metagenome / Metatranscriptome | 143 | Y |
| F064686 | Metagenome / Metatranscriptome | 128 | Y |
| F090413 | Metagenome / Metatranscriptome | 108 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0103834_1000226 | Not Available | 1962 | Open in IMG/M |
| Ga0103834_1000300 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 1777 | Open in IMG/M |
| Ga0103834_1000411 | All Organisms → cellular organisms → Eukaryota → Sar | 1629 | Open in IMG/M |
| Ga0103834_1000431 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Xanthomarina → Xanthomarina gelatinilytica | 1606 | Open in IMG/M |
| Ga0103834_1003093 | Not Available | 798 | Open in IMG/M |
| Ga0103834_1004532 | All Organisms → cellular organisms → Eukaryota → Sar | 697 | Open in IMG/M |
| Ga0103834_1006570 | All Organisms → cellular organisms → Eukaryota → Sar | 609 | Open in IMG/M |
| Ga0103834_1006792 | All Organisms → cellular organisms → Eukaryota → Sar | 601 | Open in IMG/M |
| Ga0103834_1007025 | Not Available | 594 | Open in IMG/M |
| Ga0103834_1007110 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 591 | Open in IMG/M |
| Ga0103834_1007386 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 583 | Open in IMG/M |
| Ga0103834_1009005 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 541 | Open in IMG/M |
| Ga0103834_1009060 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 540 | Open in IMG/M |
| Ga0103834_1009797 | Not Available | 524 | Open in IMG/M |
| Ga0103834_1010548 | Not Available | 511 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0103834_1000226 | Ga0103834_10002265 | F064686 | MIKITHEEYQKMWYALNDGIITEQEWRVFCDALFNQVLEENKDVMVRLKFR* |
| Ga0103834_1000300 | Ga0103834_10003002 | F003081 | MFNDTRFGVEVFLMHLRGVDSLMLLSYMHILKKIYLKNYVTSESDG* |
| Ga0103834_1000411 | Ga0103834_10004111 | F001219 | PLETALFILLATIVTFVFLNPISKQLDERAEFIDYTLRKSTILLSFGYEKLSECVGLLTEEISEMNRQIKLVRDYTNSQFENEVSEVQKENMKILSQLKGDLAIKSAYLFSSLSEDLTQLTDKFFAKKFQSA* |
| Ga0103834_1000411 | Ga0103834_10004112 | F001145 | MNWNIFSFFVNSEEPFITFNTNIFDTNIINIGLLIGLLVYANKISFSVTLESRQKEIIQTIENAQKDVLNASNYYYLAEKGFTQSLFWLQSWKVVYEKDKLDIVNTKYNTVKSGLLEVFSTSENLIKNFENRAFLSLQRYIVLTAASKILRKFFFLSEKEQSKLIELTISKLGEFKK* |
| Ga0103834_1000431 | Ga0103834_10004313 | F004358 | MFGLITMLLTTLGATGMGSMLKIVAGTIQSINDSRQQKAQRELARDLAMSNANASFQKAVFEGGSEQESMFTRGTRRIIALIGMLNFATISILCTIWPSTTLVTFTPPENKESISILYGLVKFPSGADVTTAITTGHISLVSIATLGAIIGFYFTPGGKN* |
| Ga0103834_1002614 | Ga0103834_10026141 | F001583 | MYLKNYVSAESDG*LLGGYAFF*FHYIVALGISLSATHLSDLTLTIIANIF*SVFNFAYKTYYIVFTNKHLNTDQLTRLMMFHYFTP*YYLYLVKLHALFCHES*DSDSGENVYEDKSGTYLS*FYDAFLKEIQDA*Y*TLYVFVYFFIHHFDASTVNYHFFER*NISELDEIRFYGVAPH*YFRPLMGLLVVSPSHYEGLM*MGL*FVLLAALPIIYNLYNSNNNYLSIIPMQSSMLQTLFFILFMLSMYCTASMLPCGRYYYAPEG |
| Ga0103834_1003093 | Ga0103834_10030932 | F051986 | MTAITQTTYNVTCKVCAVIAKGFKQLVRKIAYGMQMSANQRVARELVHLGFNQQKEMAQILQRMNDKTNEEYHGRY* |
| Ga0103834_1003839 | Ga0103834_10038391 | F000237 | SGENTYEDKTGTYVS*FYDAFLKEIQDA*Y*VTYIYLYFFLHHFNGSTINYFFFER*NIAELDEIRFYGVAPH*YFRPYMGLLVISPTHYEGLM*MGLFLGSLAFLPVIYNVYNSMHKYVATIPMQNSILQTSAFIFFMLSLFCANSMLPCGRYYYEPEGGYVGNPLVKFSFQYMYLYLC*ILHHLDLIDHYIFYLTQTFLRKLTLDLNPRVSARKTNV* |
| Ga0103834_1004532 | Ga0103834_10045321 | F027191 | MIRKERKERLKNEIEKPLLHSINLVRVVRLLISPKKKRMKIEVSKTMKPAFLLGTAFNIAYWHRKYHSGTICKGVSNALTSKALSGCEREDTPK* |
| Ga0103834_1006570 | Ga0103834_10065702 | F013645 | MKINIEVNNIMNPAFLLGTAFNIAYWHRKYHSGTICKGVTNPQTSNALSGCDKE* |
| Ga0103834_1006643 | Ga0103834_10066431 | F030104 | MTFDDNQEEVIEFIDTGYFYFFILITIYICFKYSIHYFSFLEASIADGKAVSYIVQAGKDTINSGSIFLRFYLLALRINIYDFLDDVMDSYYCLLIDFDDDEYFSELLLSIHGTLFFTNDNHDDRSFL |
| Ga0103834_1006792 | Ga0103834_10067921 | F001219 | MIFPTTIFINKLFDFDLTFVIELGFFITLAIIVTFKFINPISKEIDDRAEFINYNLRKSTILLTFGYEKLSDCISLLTQEINELNRQTKLTRNYCNSNFENEISSIQKENLKILSELKGDLSIKSAFLFSNISNDLNSLTDKFFEKKFQSVS* |
| Ga0103834_1007025 | Ga0103834_10070252 | F001478 | MFRYKDGFVTNEKGKVIAVDGGIDNENRNIVMENKNGQVHQRWKIVYADEYEGEPTKGQFNKRFGLYVERDFYVVSKLPDGRYLDLINNRNMVIKVPNGRRTQV |
| Ga0103834_1007110 | Ga0103834_10071101 | F003081 | ECFYMHVRGVDTLMLLSYMHILKKIYLKNYVLAESDG* |
| Ga0103834_1007386 | Ga0103834_10073862 | F004358 | FAGIADAKAAAAKRELARDLAMSKASIETQQAIFGESNEEISMFTRSTRRIIALIGMCNFFVISVLCTIWPSTTLVTFTPPESKEAWSLAWGLIKIPSGADVTTAITTGHIALVSVATLGAIIGFYFTPGGKN* |
| Ga0103834_1009005 | Ga0103834_10090051 | F005505 | EIRYYGVAPH*YFRPLMGMLIVCPTHYEGVF*LLFYLVLIAALPVIYNIYNSYGKYVSAIPMQNSLLQTTTFILFMMSLYCSASILPCGRYYYEPEGGYVGNP*IKFSYQYIYLYLA*ILHHLDVLDHHIYQFSSTFTRRTNSYNLRVST*RSATSNKNFS* |
| Ga0103834_1009060 | Ga0103834_10090601 | F005505 | *FYDAFLKEIQDA*Y*TIFVFAYFFMHHFNGGTVNYFFFER*NIAELDEIRFYGVAPH*YFRPFMGVLTISPTHYEGLM*FAL*LILLAFLPLINTLYNPHNKYTAVIPMQNSLLQTTFFILFMMSLYCAASMLPCGRYYYEPEGGYVGNP*VKFSYQYGYLYLM*FIHHLDALDHYIF |
| Ga0103834_1009797 | Ga0103834_10097972 | F006008 | MPLVKKDITLAAGATSEQILAGTTYEYVSQNTRLIV |
| Ga0103834_1010548 | Ga0103834_10105481 | F090413 | MKYKHIKGLQLQKPSYLNTSNQKIRELLAGKEVELEKENLEEFESLGVQVQPVK |
| ⦗Top⦘ |