Basic Information | |
---|---|
Family ID | F099653 |
Family Type | Metagenome |
Number of Sequences | 103 |
Average Sequence Length | 62 residues |
Representative Sequence | VRENVLGILVGLLLCGSLQGCSISPAQQDAIRRAWEERDAERARECYRAGRGFVAGGCTGGGGP |
Number of Associated Samples | 85 |
Number of Associated Scaffolds | 103 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | No |
Most common taxonomic group | Bacteria |
% of genes with valid RBS motifs | 73.53 % |
% of genes near scaffold ends (potentially truncated) | 47.57 % |
% of genes from short scaffolds (< 2000 bps) | 75.73 % |
Associated GOLD sequencing projects | 83 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.50 |
Most Common Taxonomy | |
---|---|
Group | Bacteria (72.816 % of family members) |
NCBI Taxonomy ID | 2 |
Taxonomy | All Organisms → cellular organisms → Bacteria |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (16.505 % of family members) |
Environment Ontology (ENVO) | Unclassified (41.748 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (46.602 % of family members) |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | Yes | Secondary Structure distribution: | α-helix: 47.83% β-sheet: 6.52% Coil/Unstructured: 45.65% |
Pfam ID | Name | % Frequency in 103 Family Scaffolds |
---|---|---|
PF04392 | ABC_sub_bind | 34.95 |
PF13458 | Peripla_BP_6 | 7.77 |
PF00583 | Acetyltransf_1 | 4.85 |
PF13432 | TPR_16 | 3.88 |
PF00072 | Response_reg | 1.94 |
PF14559 | TPR_19 | 1.94 |
PF14534 | DUF4440 | 1.94 |
PF13365 | Trypsin_2 | 1.94 |
PF02653 | BPD_transp_2 | 1.94 |
PF13474 | SnoaL_3 | 0.97 |
PF05050 | Methyltransf_21 | 0.97 |
PF04909 | Amidohydro_2 | 0.97 |
PF07589 | PEP-CTERM | 0.97 |
PF01062 | Bestrophin | 0.97 |
PF01025 | GrpE | 0.97 |
PF13240 | zinc_ribbon_2 | 0.97 |
PF01725 | Ham1p_like | 0.97 |
PF01370 | Epimerase | 0.97 |
PF08334 | T2SSG | 0.97 |
PF02581 | TMP-TENI | 0.97 |
COG ID | Name | Functional Category | % Frequency in 103 Family Scaffolds |
---|---|---|---|
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 34.95 |
COG0127 | Inosine/xanthosine triphosphate pyrophosphatase, all-alpha NTP-PPase family | Nucleotide transport and metabolism [F] | 0.97 |
COG0352 | Thiamine monophosphate synthase | Coenzyme transport and metabolism [H] | 0.97 |
COG0576 | Molecular chaperone GrpE (heat shock protein HSP-70) | Posttranslational modification, protein turnover, chaperones [O] | 0.97 |
Name | Rank | Taxonomy | Distribution |
---|---|---|---|
All Organisms | root | All Organisms | 72.82 % |
Unclassified | root | N/A | 27.18 % |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
3300000891|JGI10214J12806_11140695 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium CSP1-6 | 540 | Open in IMG/M |
3300000891|JGI10214J12806_12788459 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 543 | Open in IMG/M |
3300001661|JGI12053J15887_10372542 | All Organisms → cellular organisms → Bacteria | 688 | Open in IMG/M |
3300001661|JGI12053J15887_10458595 | Not Available | 610 | Open in IMG/M |
3300002886|JGI25612J43240_1007395 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1565 | Open in IMG/M |
3300004058|Ga0055498_10100401 | All Organisms → cellular organisms → Bacteria | 585 | Open in IMG/M |
3300004114|Ga0062593_100299023 | All Organisms → cellular organisms → Bacteria | 1370 | Open in IMG/M |
3300004463|Ga0063356_100050684 | All Organisms → cellular organisms → Bacteria | 4182 | Open in IMG/M |
3300004479|Ga0062595_100803290 | All Organisms → cellular organisms → Bacteria | 775 | Open in IMG/M |
3300005204|Ga0068997_10014021 | All Organisms → cellular organisms → Bacteria | 1223 | Open in IMG/M |
3300005213|Ga0068998_10106579 | All Organisms → cellular organisms → Bacteria | 631 | Open in IMG/M |
3300005293|Ga0065715_10833481 | All Organisms → cellular organisms → Bacteria | 558 | Open in IMG/M |
3300005295|Ga0065707_10052541 | All Organisms → cellular organisms → Bacteria | 847 | Open in IMG/M |
3300005295|Ga0065707_10116356 | All Organisms → cellular organisms → Bacteria | 2240 | Open in IMG/M |
3300005295|Ga0065707_11023255 | Not Available | 533 | Open in IMG/M |
3300005434|Ga0070709_10210084 | All Organisms → cellular organisms → Bacteria | 1383 | Open in IMG/M |
3300005440|Ga0070705_100585742 | All Organisms → cellular organisms → Bacteria | 861 | Open in IMG/M |
3300005444|Ga0070694_100329909 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1177 | Open in IMG/M |
3300005445|Ga0070708_100319530 | All Organisms → cellular organisms → Bacteria | 1462 | Open in IMG/M |
3300005468|Ga0070707_100063399 | All Organisms → cellular organisms → Bacteria | 3547 | Open in IMG/M |
3300005468|Ga0070707_100298015 | Not Available | 1567 | Open in IMG/M |
3300005468|Ga0070707_101439306 | Not Available | 656 | Open in IMG/M |
3300005539|Ga0068853_101097562 | All Organisms → cellular organisms → Bacteria | 767 | Open in IMG/M |
3300005547|Ga0070693_100040720 | All Organisms → cellular organisms → Bacteria | 2610 | Open in IMG/M |
3300006047|Ga0075024_100828599 | Not Available | 520 | Open in IMG/M |
3300006049|Ga0075417_10318248 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 757 | Open in IMG/M |
3300006175|Ga0070712_100495985 | Not Available | 1023 | Open in IMG/M |
3300006852|Ga0075433_10041346 | All Organisms → cellular organisms → Bacteria | 3994 | Open in IMG/M |
3300006904|Ga0075424_100053122 | All Organisms → cellular organisms → Bacteria | 4236 | Open in IMG/M |
3300009038|Ga0099829_10267651 | All Organisms → cellular organisms → Bacteria | 1396 | Open in IMG/M |
3300009098|Ga0105245_10043565 | All Organisms → cellular organisms → Bacteria | 4003 | Open in IMG/M |
3300010400|Ga0134122_10234660 | All Organisms → cellular organisms → Bacteria | 1541 | Open in IMG/M |
3300010400|Ga0134122_10457751 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Xanthomonadaceae | 1143 | Open in IMG/M |
3300011269|Ga0137392_10543899 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 964 | Open in IMG/M |
3300011269|Ga0137392_11477401 | All Organisms → cellular organisms → Bacteria | 538 | Open in IMG/M |
3300011270|Ga0137391_10081049 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2798 | Open in IMG/M |
3300011271|Ga0137393_10329435 | All Organisms → cellular organisms → Bacteria | 1303 | Open in IMG/M |
3300011438|Ga0137451_1102358 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 875 | Open in IMG/M |
3300012189|Ga0137388_10807983 | All Organisms → cellular organisms → Bacteria | 869 | Open in IMG/M |
3300012202|Ga0137363_10854871 | Not Available | 772 | Open in IMG/M |
3300012205|Ga0137362_10035851 | All Organisms → cellular organisms → Bacteria | 3963 | Open in IMG/M |
3300012361|Ga0137360_10062884 | All Organisms → cellular organisms → Bacteria | 2719 | Open in IMG/M |
3300012362|Ga0137361_11715443 | All Organisms → cellular organisms → Bacteria | 547 | Open in IMG/M |
3300012922|Ga0137394_10029963 | All Organisms → cellular organisms → Bacteria | 4414 | Open in IMG/M |
3300012923|Ga0137359_10268644 | Not Available | 1520 | Open in IMG/M |
3300012925|Ga0137419_11025289 | All Organisms → cellular organisms → Bacteria | 685 | Open in IMG/M |
3300012929|Ga0137404_12067011 | Not Available | 532 | Open in IMG/M |
3300012944|Ga0137410_10278647 | All Organisms → cellular organisms → Bacteria | 1319 | Open in IMG/M |
3300013297|Ga0157378_12612751 | Not Available | 557 | Open in IMG/M |
3300014884|Ga0180104_1023266 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1550 | Open in IMG/M |
3300015254|Ga0180089_1054038 | All Organisms → cellular organisms → Bacteria | 796 | Open in IMG/M |
3300015259|Ga0180085_1007584 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2969 | Open in IMG/M |
3300015259|Ga0180085_1023905 | Not Available | 1703 | Open in IMG/M |
3300018000|Ga0184604_10026336 | Not Available | 1436 | Open in IMG/M |
3300018028|Ga0184608_10427736 | Not Available | 572 | Open in IMG/M |
3300018056|Ga0184623_10046421 | All Organisms → cellular organisms → Bacteria | 1978 | Open in IMG/M |
3300018056|Ga0184623_10118994 | All Organisms → cellular organisms → Bacteria | 1223 | Open in IMG/M |
3300018061|Ga0184619_10397074 | Not Available | 624 | Open in IMG/M |
3300018075|Ga0184632_10008110 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 4381 | Open in IMG/M |
3300018076|Ga0184609_10027465 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2313 | Open in IMG/M |
3300018076|Ga0184609_10085231 | All Organisms → cellular organisms → Bacteria | 1401 | Open in IMG/M |
3300018078|Ga0184612_10043731 | All Organisms → cellular organisms → Bacteria | 2328 | Open in IMG/M |
3300018422|Ga0190265_11157708 | All Organisms → cellular organisms → Bacteria | 893 | Open in IMG/M |
3300018422|Ga0190265_11866011 | Not Available | 708 | Open in IMG/M |
3300018422|Ga0190265_11876281 | Not Available | 706 | Open in IMG/M |
3300018429|Ga0190272_10172236 | All Organisms → cellular organisms → Bacteria | 1524 | Open in IMG/M |
3300019458|Ga0187892_10239757 | All Organisms → cellular organisms → Bacteria | 939 | Open in IMG/M |
3300019879|Ga0193723_1095173 | Not Available | 845 | Open in IMG/M |
3300019882|Ga0193713_1122559 | Not Available | 714 | Open in IMG/M |
3300019883|Ga0193725_1120718 | All Organisms → cellular organisms → Bacteria | 598 | Open in IMG/M |
3300019886|Ga0193727_1064037 | Not Available | 1153 | Open in IMG/M |
3300019997|Ga0193711_1001337 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_20CM_4_70_14 | 3110 | Open in IMG/M |
3300020004|Ga0193755_1021106 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_20CM_4_70_14 | 2153 | Open in IMG/M |
3300020199|Ga0179592_10100691 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1327 | Open in IMG/M |
3300021073|Ga0210378_10008256 | All Organisms → cellular organisms → Bacteria | 4546 | Open in IMG/M |
3300021073|Ga0210378_10092519 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1180 | Open in IMG/M |
3300021344|Ga0193719_10042738 | All Organisms → cellular organisms → Bacteria | 1963 | Open in IMG/M |
3300021432|Ga0210384_11849248 | Not Available | 509 | Open in IMG/M |
3300025907|Ga0207645_10405317 | Not Available | 917 | Open in IMG/M |
3300025910|Ga0207684_10038670 | All Organisms → cellular organisms → Bacteria | 4048 | Open in IMG/M |
3300025910|Ga0207684_10054674 | All Organisms → cellular organisms → Bacteria | 3387 | Open in IMG/M |
3300025912|Ga0207707_10586112 | All Organisms → cellular organisms → Bacteria | 945 | Open in IMG/M |
3300025921|Ga0207652_11202094 | Not Available | 660 | Open in IMG/M |
3300025923|Ga0207681_11544764 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium CSP1-6 | 556 | Open in IMG/M |
3300025957|Ga0210089_1019093 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 794 | Open in IMG/M |
3300026285|Ga0209438_1003783 | All Organisms → cellular organisms → Bacteria | 5133 | Open in IMG/M |
3300026340|Ga0257162_1000301 | All Organisms → cellular organisms → Bacteria | 4715 | Open in IMG/M |
3300026361|Ga0257176_1059234 | Not Available | 610 | Open in IMG/M |
3300026480|Ga0257177_1013237 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1115 | Open in IMG/M |
3300026535|Ga0256867_10031950 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2201 | Open in IMG/M |
3300028381|Ga0268264_12125306 | All Organisms → cellular organisms → Bacteria | 570 | Open in IMG/M |
3300028716|Ga0307311_10219724 | All Organisms → cellular organisms → Bacteria | 560 | Open in IMG/M |
3300028719|Ga0307301_10159239 | Not Available | 728 | Open in IMG/M |
3300028784|Ga0307282_10323039 | Not Available | 745 | Open in IMG/M |
3300028792|Ga0307504_10080715 | Not Available | 998 | Open in IMG/M |
3300028803|Ga0307281_10000516 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10183 | Open in IMG/M |
(restricted) 3300031197|Ga0255310_10090217 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. Ec3.3 | 818 | Open in IMG/M |
(restricted) 3300031197|Ga0255310_10129133 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 688 | Open in IMG/M |
3300031740|Ga0307468_100128132 | All Organisms → cellular organisms → Bacteria | 1579 | Open in IMG/M |
3300031740|Ga0307468_101034383 | All Organisms → cellular organisms → Bacteria | 726 | Open in IMG/M |
3300031820|Ga0307473_11370776 | All Organisms → cellular organisms → Bacteria | 532 | Open in IMG/M |
3300032180|Ga0307471_101237538 | Not Available | 910 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Habitat | Taxonomy | Distribution |
---|---|---|
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 16.50% |
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 16.50% |
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 10.68% |
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 8.74% |
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 4.85% |
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 3.88% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.88% |
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.88% |
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 2.91% |
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.91% |
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.94% |
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.94% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.94% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.94% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.94% |
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.94% |
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.94% |
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 1.94% |
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.97% |
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 0.97% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.97% |
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.97% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.97% |
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.97% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.97% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.97% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.97% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.97% |
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
3300000891 | Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soil | Environmental | Open in IMG/M |
3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental | Open in IMG/M |
3300002886 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm | Environmental | Open in IMG/M |
3300004058 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 | Environmental | Open in IMG/M |
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental | Open in IMG/M |
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated | Open in IMG/M |
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental | Open in IMG/M |
3300005204 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D2 | Environmental | Open in IMG/M |
3300005213 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleB_D2 | Environmental | Open in IMG/M |
3300005293 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1 | Host-Associated | Open in IMG/M |
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental | Open in IMG/M |
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental | Open in IMG/M |
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental | Open in IMG/M |
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental | Open in IMG/M |
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M |
3300005539 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 | Host-Associated | Open in IMG/M |
3300005547 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaG | Environmental | Open in IMG/M |
3300006047 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 | Environmental | Open in IMG/M |
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated | Open in IMG/M |
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental | Open in IMG/M |
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated | Open in IMG/M |
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated | Open in IMG/M |
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental | Open in IMG/M |
3300009098 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG | Host-Associated | Open in IMG/M |
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental | Open in IMG/M |
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental | Open in IMG/M |
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental | Open in IMG/M |
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental | Open in IMG/M |
3300011438 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT500_2 | Environmental | Open in IMG/M |
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental | Open in IMG/M |
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental | Open in IMG/M |
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental | Open in IMG/M |
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental | Open in IMG/M |
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M |
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental | Open in IMG/M |
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated | Open in IMG/M |
3300014884 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1Da | Environmental | Open in IMG/M |
3300015254 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT860_16_10D | Environmental | Open in IMG/M |
3300015259 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10D | Environmental | Open in IMG/M |
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental | Open in IMG/M |
3300018028 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex | Environmental | Open in IMG/M |
3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental | Open in IMG/M |
3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental | Open in IMG/M |
3300018075 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1 | Environmental | Open in IMG/M |
3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental | Open in IMG/M |
3300018078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coex | Environmental | Open in IMG/M |
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental | Open in IMG/M |
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental | Open in IMG/M |
3300019458 | Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaG | Environmental | Open in IMG/M |
3300019879 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2 | Environmental | Open in IMG/M |
3300019882 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a2 | Environmental | Open in IMG/M |
3300019883 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2 | Environmental | Open in IMG/M |
3300019886 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2 | Environmental | Open in IMG/M |
3300019997 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m2 | Environmental | Open in IMG/M |
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental | Open in IMG/M |
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300021073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redo | Environmental | Open in IMG/M |
3300021344 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2 | Environmental | Open in IMG/M |
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental | Open in IMG/M |
3300025907 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental | Open in IMG/M |
3300025912 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes) | Environmental | Open in IMG/M |
3300025921 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes) | Environmental | Open in IMG/M |
3300025923 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025957 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 (SPAdes) | Environmental | Open in IMG/M |
3300026285 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes) | Environmental | Open in IMG/M |
3300026340 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-A | Environmental | Open in IMG/M |
3300026361 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-B | Environmental | Open in IMG/M |
3300026480 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-B | Environmental | Open in IMG/M |
3300026535 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq) | Environmental | Open in IMG/M |
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated | Open in IMG/M |
3300028716 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_198 | Environmental | Open in IMG/M |
3300028719 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_182 | Environmental | Open in IMG/M |
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental | Open in IMG/M |
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental | Open in IMG/M |
3300028803 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120 | Environmental | Open in IMG/M |
3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental | Open in IMG/M |
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental | Open in IMG/M |
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental | Open in IMG/M |
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI); they are marked "(restricted)" in the tables. Using any features of the restricted sequences requires obtaining a license from the corresponding author(s) of the dataset.
Protein ID | Sample Taxon ID | Habitat | Sequence |
---|---|---|---|
JGI10214J12806_111406952 | 3300000891 | Soil | MGMRQILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGF |
JGI10214J12806_127884591 | 3300000891 | Soil | MTPVLQAVLGGMLLVGALSGCSVSPAQQDAIRQAWEERDAERAKECRRAGRGFVAGGCTG |
JGI12053J15887_103725421 | 3300001661 | Forest Soil | VTSARKAFSAILVGILVAGVLQGCSLTPAQQDAIRRAWAEEDAERARECYRHGVGFAAGGCTSPGA* |
JGI12053J15887_104585952 | 3300001661 | Forest Soil | VTSARKAFSAILAGILVAGALQGCSLTPAQQEAIRQAWEERDAERERECRRAGRGFVAGGCAGGGGP* |
JGI25612J43240_10073952 | 3300002886 | Grasslands Soil | MGIHRILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP* |
Ga0055498_101004012 | 3300004058 | Natural And Restored Wetlands | RFVKGIVAAMVAGILWAGLVQGCSTSPAEQDAIRRAWEERDAERARECHRAGRGFVAGGCTGGGGP* |
Ga0062593_1002990232 | 3300004114 | Soil | MGMRQILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP* |
Ga0063356_1000506843 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MTPVLQAVLGGMLLVGALSGCSVSPAQQDAIRQAWEERDAERAKECRRAGRGFVAGGCTGGGGP* |
Ga0062595_1008032901 | 3300004479 | Soil | RCGAYSLRMHPLPVILVGLLLAGILPGCSISPARQEEIRKAWEERDAERARECYRQGRGFVAGGCTGGGGA* |
Ga0068997_100140212 | 3300005204 | Natural And Restored Wetlands | VKGIVAAMVAGILWAGLVQGCSTSPAEQDAIRRAWEERDAERARECHRAGRGFVAGGCTGGGGP* |
Ga0068998_101065791 | 3300005213 | Natural And Restored Wetlands | IVAAMVAGILWAGLVQGCSTSPAEQDAIRRAWEERDAERARECHRAGRGFVAGGCTGGGGP* |
Ga0065715_108334811 | 3300005293 | Miscanthus Rhizosphere | VRENLPGILVGLLLCGIVHGCSISPDEQEAIRQAWADRDAERARECDRVRGFLVAGSCLPRP* |
Ga0065707_100525412 | 3300005295 | Switchgrass Rhizosphere | MGMRQILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAG |
Ga0065707_101163562 | 3300005295 | Switchgrass Rhizosphere | VRENVLGILVGLLLCGSLQGCSISPAQQDAIRRAWEERDAERARECYRAGRGFVAGGCTGGGGP* |
Ga0065707_110232551 | 3300005295 | Switchgrass Rhizosphere | SGDRVTKTRKTFSAILAGILVAGVLQGCSLTPAQQDAIRQAWEERDAERARECERAGRGFVAGACGGGGGP* |
Ga0070709_102100842 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | VSGWLGGLLVGLLVAGILPGCSISPTQQDAIRRAWEARDAERARECERVGRGFVAGGCTGGGGP* |
Ga0070705_1005857421 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | VRENLPGILVGLLLCGIVHGCSISPDEQEAIRQAWADRDAERARECDRVRGFLVAGSCLP |
Ga0070694_1003299091 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | VRENVLGILVGLLLCGSLQGCSISPAQQDAIRRAWEERDAERASECYRAGRGFVAGGCTGGGGP* |
Ga0070708_1003195302 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MSENLRGFLVGLLAAAILQGCSISPAQQEAIRKAWAERDAERARECYRHELGFANGGCTGPGGP* |
Ga0070707_1000633992 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VRENLPGILVGLLLCGIVHGCSISPDKQEAIRQAWADRDAERARECDRVRGFLVAGSCLPRP* |
Ga0070707_1002980153 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VSGWLGGLLVGLLVAGILLGCSVSPAQQDAIRRAWEARDAERSRECERVGRGFVAGGCTGGGGP* |
Ga0070707_1014393062 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | FLLEAFSAGGSGHRVTRTRKTFPAILAGILVAGVLQGCSLTPAEQDAIRRAWEERDAERAQECQRAGRSFVAGACGGSGGP* |
Ga0068853_1010975622 | 3300005539 | Corn Rhizosphere | MGMRQILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGG |
Ga0070693_1000407202 | 3300005547 | Corn, Switchgrass And Miscanthus Rhizosphere | MRQILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP* |
Ga0075024_1008285991 | 3300006047 | Watersheds | MENLPGLLVGLLLAGILQGCSISPAQQEAIRQAWEERDAERARECHRAGRGFVAGGCAGGGGP* |
Ga0075417_103182482 | 3300006049 | Populus Rhizosphere | VSGWLGGLLVGLLVAGILPGCSISPAQQDAIRRAWEERDAERARECERMGRGFVAGGCTGGGGP* |
Ga0070712_1004959852 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | LKRARGWLGGLLVGLLMAGILPGCSISPAQQDAIRRAWDERDAERARECERMGRGFVAGGCTGGGGP* |
Ga0075433_100413467 | 3300006852 | Populus Rhizosphere | VSGWLGGLLVGLLVAGILPGCSISPAQQDTIRRAWEERDAERARECERMGRGFVAGGCTGGGGP* |
Ga0075424_1000531227 | 3300006904 | Populus Rhizosphere | VSGWLGGLLVGLLVAGILPGCSISPAQQDTIRRAWEERDAERARECERMGRGFVAGGCTG |
Ga0099829_102676512 | 3300009038 | Vadose Zone Soil | VRENVPGILVGLLLCGIVHGCSISPDEQEAIRQAWADRDAERARECDRVRGFLVAGSCLPRP* |
Ga0105245_100435655 | 3300009098 | Miscanthus Rhizosphere | MGMRQILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFV |
Ga0134122_102346602 | 3300010400 | Terrestrial Soil | MTSVLRAILGGMLLLGALSGCSLSPAQQDAIRQAWDERDAERAKECRRAGRGFVAGGCTGGGGP* |
Ga0134122_104577512 | 3300010400 | Terrestrial Soil | MVGMLLVGAFPACSISPAERDAIIQAWEERDAERAQECRRAGRGFVNGGCTGGGGP* |
Ga0137392_104598882 | 3300011269 | Vadose Zone Soil | VGALQACAISSAEQDAIRRAWEERDAERARECQRAGRGFVAGGCTGGGGP* |
Ga0137392_105438992 | 3300011269 | Vadose Zone Soil | MSENLRGFLVGLLVAGILQGCSISPAQQEAIRKAWEERDAERARECYRHELGFANGGCTGPGGP* |
Ga0137392_114774012 | 3300011269 | Vadose Zone Soil | VLRQVLPAILGGLLLVGALQACAISSAEQDAIRRAWEERDAERARECQRAGRGFV |
Ga0137391_100810492 | 3300011270 | Vadose Zone Soil | VLRQVLPAILGGLLLVGALQACAISSAEQDAIRRAWEERDAERARECQRAGRGFVAGGCTGGGGP* |
Ga0137393_103294353 | 3300011271 | Vadose Zone Soil | VRENLPGILVGLLLCGIVHGCSISPDEQEAIRQAWADRDAERARECDRVRGFLV |
Ga0137451_11023581 | 3300011438 | Soil | VREALSAILIGMLLAGTLQGCSLTPAEQDAIRRAWEDRDAERARECRRNGGGFIAGGCV |
Ga0137388_108079831 | 3300012189 | Vadose Zone Soil | MPADLPPAGGSGDRVTNTRKPFSAILAGILVAGVLQGCSLTPAEQDAIRQAWEERDAERARECHRAGRG |
Ga0137363_108548712 | 3300012202 | Vadose Zone Soil | GLLVAGILQGCSISPAQQEAIRKAWEERDAERARECYRHELGFANGGCTGPGGP* |
Ga0137362_100358517 | 3300012205 | Vadose Zone Soil | VGETLLGTLVVLLLAGILHGCSISPDRQEAIRQAWADRDAERAHDCDRVRGFLVAGSCLPRP* |
Ga0137360_100628842 | 3300012361 | Vadose Zone Soil | VGETLLGTLVVLLLAGILHGCSISPDRQEAIRQAWADRDAERAHECDRVRGFLVAGSCLPRP* |
Ga0137361_117154432 | 3300012362 | Vadose Zone Soil | VVGNRSEILVGLLLAGILQGCSISPAQQEAIRQAWEERDAERARECQRAGRGFVAGGC |
Ga0137394_1002996310 | 3300012922 | Vadose Zone Soil | FLVGLLVAGILQGCSISPAQQEAIRKAWEERDAERARECYRHELGFANGGCTGPGGP* |
Ga0137359_102686444 | 3300012923 | Vadose Zone Soil | VRETLPGILVVLLLAGILPGCSISPDRQEAIRQAWADRDAERARECDRVRGFLVA |
Ga0137419_110252892 | 3300012925 | Vadose Zone Soil | VRENLPGILVGLLLCGIVHGCSISPDEQEAIRQAWADRDAERARECDRVRGFLVAGSCLPRPWSVGKAR* |
Ga0137404_120670111 | 3300012929 | Vadose Zone Soil | LVGLLVAGILQGCSISPAQQEAIRKAWEERDAERARECYRHELGFANGGCTGPGGP* |
Ga0137410_102786472 | 3300012944 | Vadose Zone Soil | MGIHRILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFV |
Ga0157378_126127512 | 3300013297 | Miscanthus Rhizosphere | LAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP |
Ga0180104_10232663 | 3300014884 | Soil | VREILSAILIGMLLAGTLLGCSLTPAEQDAIRRAWEDRDAERARECQRAGGGFVAGGCVRGGP* |
Ga0180089_10540382 | 3300015254 | Soil | VREILSAILIGMLLAGTLQGCSLTPAEQDTIRRAWEDRDAERARECRRNGGGFVAGGCVR |
Ga0180085_10075844 | 3300015259 | Soil | VREILSAILIGMLLAGTLQGCSLTPAEQDTIRRAWEDRDAERARECHRAGRGFVAGGCAGGGP* |
Ga0180085_10239051 | 3300015259 | Soil | VTTPDEHTFGHVLLAILIGMLLVGTFQACSISSAEQDAIRRAWEDRDAERARECHRAGRGFVAGGCAGGGGP* |
Ga0184604_100263362 | 3300018000 | Groundwater Sediment | VTSARKGFSAILTGILVAGVLQGCSLTPAQHEAIRQAWEERDAERERECRRAGRGFVAGGCAGGGGP |
Ga0184608_104277362 | 3300018028 | Groundwater Sediment | TSARKGFSAILAGILVAGVLQGCSLTPAQQEAIRQAWEERDAERERECRRAGRGFVAGGCAGGGGP |
Ga0184623_100464212 | 3300018056 | Groundwater Sediment | MPAILGGILLVGALQGCSISPAQQEAIRHAWEERDAERARECYRAGRGFVAGGCAGGGGP |
Ga0184623_101189942 | 3300018056 | Groundwater Sediment | LKRVLPAILGGILLVGALQGCAISPAQQEAIRQAWEERDAERARECHRAGRGFVAGGCAGGGGP |
Ga0184619_103970741 | 3300018061 | Groundwater Sediment | ALSAILAGILVAGVLQGCSLTPAQREAIRQAWEERDAERERECRRAGRGFVAGGCAGGGG |
Ga0184632_100081103 | 3300018075 | Groundwater Sediment | LRQVLPAILGGTLLAGALQGCSMSPAQQEAIRQAWEERDAERARECHRAGRGFVAGGCAG |
Ga0184609_100274653 | 3300018076 | Groundwater Sediment | MKEILPAILAGILWAGILPGCSISPAQQEAIRQAWEERDAERARECYRHGLGFAAGGCTSPF |
Ga0184609_100852312 | 3300018076 | Groundwater Sediment | VREILSAILIGMLLAGTLLGCSLTPDEQDAIRRAWEDRDAERARECQRAGGGFVAGGCVRGGP |
Ga0184612_100437314 | 3300018078 | Groundwater Sediment | VREILSAILIVMVLAGTTLQGCSLTAVEQDAIRRAWEDRDAERARECHRAGRGFVAGGCAGGGGP |
Ga0190265_111577081 | 3300018422 | Soil | LVERSDVLGAVVGGILVAPLAAGLLLVGALQGCSVSPAQQDAIRQAWQEKDAERAAECRRAGRGFVAGGCTGGGGP |
Ga0190265_118660112 | 3300018422 | Soil | MSGFPRGYRLAVPAILVGILLAGTLAGCSLTAAEQDAIRRAWEDRDAERARECRRNGGG |
Ga0190265_118762811 | 3300018422 | Soil | MSGFPRGYRLAVPAILVGMLLAGTLAGCSLTAAEQDAIRRAWEDRDAERARECRRNGGGF |
Ga0190272_101722363 | 3300018429 | Soil | VREVLSAILIGMLLAGTLLGCSLTPVEQDAIRRAWEDRDAERALECRRNGGGFVAGGCARGGP |
Ga0187892_102397573 | 3300019458 | Bio-Ooze | PAILSGILMVAALQGCSVSPAEQEAIRQAWEERDAERARECRRAGRGFVAGGCTGGGGP |
Ga0193723_10951732 | 3300019879 | Soil | VALAGMLWAGLLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP |
Ga0193713_11225592 | 3300019882 | Soil | ILAVALAGILWAGLLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGG |
Ga0193725_11207181 | 3300019883 | Soil | VREILPAILAGILWAAILPGCSISPAQQDAIRQAWEERDAERARECHRAGRGFVAG |
Ga0193727_10640372 | 3300019886 | Soil | RVMPENVLRILVGLFLCGSLQGCSISQAQQEAIRQAWEERDAERARECYRAGRGFVAGGCSGGGGP |
Ga0193711_10013372 | 3300019997 | Soil | MGVYQTLAVALAGILWAGLLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP |
Ga0193755_10211062 | 3300020004 | Soil | MGVYQILAVALAGILWAGLLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP |
Ga0179592_101006912 | 3300020199 | Vadose Zone Soil | MSENLRGFLVGLLVAGILQGCSISPAQQEAIRKAWEERDAERARECYRHELGFANGGCTGPGGP |
Ga0210378_100082566 | 3300021073 | Groundwater Sediment | VREILSAILIGMLLAGTLQGCSLTPTEQDTIRRAWEDRDAERARECRRAGGGFVAGGCVRGGP |
Ga0210378_100925191 | 3300021073 | Groundwater Sediment | VREILPAILAGILWAAILPGCSISPAQQEAIRQAWEERDAERARECHRAGRGFVAGGC |
Ga0193719_100427383 | 3300021344 | Soil | VTSARKGFSAILTGILVAGVLQGCSLTPAQQEAIRQAWEERDAERERECRRAGRGFVAGGCAGGGGP |
Ga0210384_118492481 | 3300021432 | Soil | WLGGLLVGLLVAGILPGCSISPAQQDAIRRAWDERDAERARECERMGRGFVAGGCTGGGG |
Ga0207645_104053172 | 3300025907 | Miscanthus Rhizosphere | MGMRQILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP |
Ga0207684_100386704 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | VRENLPGILVGLLLCGIVHGCSISPDEQEAIRQAWADRDAERARECDRVRGFLVAGSCLPRP |
Ga0207684_100546746 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MSENLRGFLVGLLAAAILQGCSISPAQQEAIRKAWAERDAERARECYRHELGFANGGCTGPGGP |
Ga0207707_105861122 | 3300025912 | Corn Rhizosphere | MTPVLQAVLGGMLLVGALSGCSVSPAQQDAIRQAWEERDAERAKECRRAGRGFVAGGCTGGGGP |
Ga0207652_112020942 | 3300025921 | Corn Rhizosphere | RFAAPRSRGEALAMGMRQILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP |
Ga0207681_115447642 | 3300025923 | Switchgrass Rhizosphere | MGMRQILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGG |
Ga0210089_10190933 | 3300025957 | Natural And Restored Wetlands | RFVKGIVAAMVAGILWAGLVQGCSTSPAEQDAIRRAWEERDAERARECHRAGRGFVAGGCTGGGGP |
Ga0209438_10037834 | 3300026285 | Grasslands Soil | MGIHRILAAVLAGIVWAGFLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP |
Ga0257162_10003014 | 3300026340 | Soil | VRENVPGILVGLLLCGIVHGCSISPDEQEAIRQAWADRDAERARECDRVRGFLVAGSCLPRP |
Ga0257176_10592341 | 3300026361 | Soil | ILVGLLLCGIVHGCSISPDEQEAIRQAWADRDAERARECDRVRGFLVAGSCLPRP |
Ga0257177_10132373 | 3300026480 | Soil | LSQVRAGILVGTLLAGSLQGCSISPAEQDAIRRAWEDRDAERARECRRAGGGFIAGGCVR |
Ga0256867_100319503 | 3300026535 | Soil | LRQVLPAILGGILLAGALQGCSMSPAQQDAIRQAWEERDAERARECHRAGRGFLAGGCAGGGP |
Ga0268264_121253062 | 3300028381 | Switchgrass Rhizosphere | VRENVLGILVGLLLCGSLQGCSISSAQQEAIRQAWEERDAERARECYRAGRGFVAGGCSGGGG |
Ga0307311_102197241 | 3300028716 | Soil | VRENLPGILVVLLLAGILPGCSISPDQQEAIRQAWAERDAERARECERVRGFIVAGSCLPRP |
Ga0307301_101592391 | 3300028719 | Soil | VTSARKGFSAILTGILVAGVLQGCSLTPAQHEAIRQAWEERDAERERECRRAGRGFVAGGCAGGG |
Ga0307282_103230391 | 3300028784 | Soil | GILVAGVLQGCSLTPAQHEAIRQAWEERDAERERECRRAGRGFVAGGCAGGGGP |
Ga0307504_100807152 | 3300028792 | Soil | MGICQILVAVLAGILWVGFLQGCSISPAEQDAIRRAWDARDAERARECSRAGRGFVAGGCSGGGGP |
Ga0307281_100005162 | 3300028803 | Soil | VREALSAILIGMLLAGTLQGCSLTPAEQDAIRRAWEDRDAERARECRRNGGGFIAGGCVRGGP |
(restricted) Ga0255310_100902173 | 3300031197 | Sandy Soil | MENLPGLLVGLLLAGILQGCSISPAQQEAIRQAWEERDAERARECHRAGRGFVAGGCAGG |
(restricted) Ga0255310_101291331 | 3300031197 | Sandy Soil | LKQVLPAMLGGILLVGALQGCSMSLAQQEAIRQAWEERDAERARECHRAGRGFVAGGCGG |
Ga0307468_1001281322 | 3300031740 | Hardwood Forest Soil | MGIYRILAVALAGILWAGLLQGCSISPAEQDAIRRAWDARDAERARECYRAGRGFVAGGCTGGGGP |
Ga0307468_1010343832 | 3300031740 | Hardwood Forest Soil | MVGMLLVGAFPACSISPAERDAIIQAWEERDAERAQECRRAGRGFVNGGCTGGGGP |
Ga0307473_113707761 | 3300031820 | Hardwood Forest Soil | VGAYGDIVRKNRSGIILVGLVLCGILQGCSISPAQQEAIRKAWQERDAERARECQRRGLSFVAGACTGGGGP |
Ga0307471_1012375381 | 3300032180 | Hardwood Forest Soil | IILVGLVLCGILQGCSISPAQQEAIRKAWQERDAERARECQRRGLSFVAGGCTGGGGP |
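
The rows above can be read programmatically. The following is a minimal sketch, not part of the source database tooling, that parses one pipe-delimited row into a record and writes unrestricted records as FASTA. The names `FamilyMember`, `parse_row`, and `to_fasta` are hypothetical. It assumes that a trailing `*` marks a stop codon (a common convention for translated sequences, not stated on this page) and that restricted entries carry the `(restricted)` prefix shown in the table.

```python
from dataclasses import dataclass
from typing import Iterable

RESTRICTED_TAG = "(restricted)"  # prefix used in the table for restricted entries


@dataclass
class FamilyMember:  # hypothetical record type for one table row
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str      # residues only, without a trailing '*'
    restricted: bool
    has_stop: bool     # assumption: trailing '*' denotes a stop codon


def parse_row(line: str) -> FamilyMember:
    """Parse 'Protein ID | Sample Taxon ID | Habitat | Sequence' into a record."""
    cells = [c.strip() for c in line.strip().rstrip("|").split("|")]
    if len(cells) != 4:
        raise ValueError(f"expected 4 columns, got {len(cells)}: {line!r}")
    protein_id, taxon_id, habitat, seq = cells
    restricted = protein_id.startswith(RESTRICTED_TAG)
    if restricted:
        protein_id = protein_id[len(RESTRICTED_TAG):].strip()
    has_stop = seq.endswith("*")
    return FamilyMember(protein_id, taxon_id, habitat,
                        seq.rstrip("*"), restricted, has_stop)


def to_fasta(members: Iterable[FamilyMember],
             include_restricted: bool = False) -> str:
    """Render records as FASTA, skipping restricted entries by default."""
    lines = []
    for m in members:
        if m.restricted and not include_restricted:
            continue
        lines.append(f">{m.protein_id} taxon={m.taxon_id} habitat={m.habitat}")
        lines.append(m.sequence)
    return "\n".join(lines) + ("\n" if lines else "")


if __name__ == "__main__":
    rows = [
        "Ga0137392_104598882 | 3300011269 | Vadose Zone Soil | "
        "VGALQACAISSAEQDAIRRAWEERDAERARECQRAGRGFVAGGCTGGGGP*",
        "(restricted) Ga0255310_100902173 | 3300031197 | Sandy Soil | "
        "MENLPGLLVGLLLAGILQGCSISPAQQEAIRQAWEERDAERARECHRAGRGFVAGGCAGG",
    ]
    print(to_fasta(parse_row(r) for r in rows), end="")
```

Skipping restricted entries by default mirrors the usage note above the table; pass `include_restricted=True` only once the required license has been obtained.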