Basic Information | |
---|---|
Family ID | F095795 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 105 |
Average Sequence Length | 75 residues |
Representative Sequence | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALALWITFNESIGAFLIG |
Number of Associated Samples | 87 |
Number of Associated Scaffolds | 105 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Unclassified |
% of genes with valid RBS motifs | 0.00 % |
% of genes near scaffold ends (potentially truncated) | 0.00 % |
% of genes from short scaffolds (< 2000 bps) | 0.00 % |
Associated GOLD sequencing projects | 87 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.39 |
Hidden Markov Model |
---|
Powered by Skylign |
Most Common Taxonomy | |
---|---|
Group | Unclassified (100.000 % of family members) |
NCBI Taxonomy ID | N/A |
Taxonomy | N/A |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (13.333 % of family members) |
Environment Ontology (ENVO) | Unclassified (25.714 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (55.238 % of family members) |
⦗Top⦘ |
⦗Top⦘ |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Transmembrane (alpha-helical) | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 60.95% β-sheet: 0.00% Coil/Unstructured: 39.05% | Feature Viewer |
|
|||||
Powered by Feature Viewer |
Structure Viewer | |
---|---|
| |
Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.39 |
Powered by PDBe Molstar |
⦗Top⦘ |
Pfam ID | Name | % Frequency in 105 Family Scaffolds |
---|---|---|
PF04173 | DoxD | 20.95 |
PF07681 | DoxX | 8.57 |
PF12833 | HTH_18 | 2.86 |
PF12706 | Lactamase_B_2 | 1.90 |
PF07676 | PD40 | 1.90 |
PF00210 | Ferritin | 1.90 |
PF08240 | ADH_N | 1.90 |
PF00571 | CBS | 0.95 |
PF13387 | DUF4105 | 0.95 |
PF03544 | TonB_C | 0.95 |
PF04191 | PEMT | 0.95 |
PF04982 | HPP | 0.95 |
PF02661 | Fic | 0.95 |
PF00005 | ABC_tran | 0.95 |
PF00128 | Alpha-amylase | 0.95 |
PF04140 | ICMT | 0.95 |
PF00248 | Aldo_ket_red | 0.95 |
PF14357 | DUF4404 | 0.95 |
PF00165 | HTH_AraC | 0.95 |
PF06250 | YhcG_C | 0.95 |
PF13193 | AMP-binding_C | 0.95 |
COG ID | Name | Functional Category | % Frequency in 105 Family Scaffolds |
---|---|---|---|
COG2259 | Uncharacterized membrane protein YphA, DoxX/SURF4 family | Function unknown [S] | 29.52 |
COG4270 | Uncharacterized membrane protein | Function unknown [S] | 8.57 |
COG0296 | 1,4-alpha-glucan branching enzyme | Carbohydrate transport and metabolism [G] | 0.95 |
COG0366 | Glycosidase/amylase (phosphorylase) | Carbohydrate transport and metabolism [G] | 0.95 |
COG0810 | Periplasmic protein TonB, links inner and outer membranes | Cell wall/membrane/envelope biogenesis [M] | 0.95 |
COG1523 | Pullulanase/glycogen debranching enzyme | Carbohydrate transport and metabolism [G] | 0.95 |
COG3280 | Maltooligosyltrehalose synthase | Carbohydrate transport and metabolism [G] | 0.95 |
COG3448 | CBS-domain-containing membrane protein | Signal transduction mechanisms [T] | 0.95 |
COG4804 | Predicted nuclease of restriction endonuclease-like (RecB) superfamily, DUF1016 family | General function prediction only [R] | 0.95 |
⦗Top⦘ |
Name | Rank | Taxonomy | Distribution |
Unclassified | root | N/A | 100.00 % |
Visualization |
---|
Powered by ApexCharts |
Scaffold | Taxonomy | Length | IMG/M Link |
---|
⦗Top⦘ |
Habitat | Taxonomy | Distribution |
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 13.33% |
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 12.38% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 9.52% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 7.62% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.71% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.71% |
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 5.71% |
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 4.76% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.76% |
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.76% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.86% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 2.86% |
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.90% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.90% |
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.90% |
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.90% |
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 1.90% |
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.95% |
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.95% |
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.95% |
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil | 0.95% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.95% |
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.95% |
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.95% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.95% |
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.95% |
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.95% |
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.95% |
Visualization |
---|
Powered by ApexCharts |
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
2162886007 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 | Host-Associated | Open in IMG/M |
2170459002 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 direct MP BIO 1O1 lysis 0-21 cm | Environmental | Open in IMG/M |
2170459004 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect MP BIO 1O1 lysis 0-21cm (2) | Environmental | Open in IMG/M |
2170459006 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect in plug lysis (for fosmid construction) 0-10cm | Environmental | Open in IMG/M |
2170459011 | Grass soil microbial communities from Rothamsted Park, UK - July 2009 indirect Gram positive lysis 0-10cm | Environmental | Open in IMG/M |
2189573004 | Grass soil microbial communities from Rothamsted Park, UK - FG2 (Nitrogen) | Environmental | Open in IMG/M |
2228664022 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
3300002916 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm | Environmental | Open in IMG/M |
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental | Open in IMG/M |
3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental | Open in IMG/M |
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental | Open in IMG/M |
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental | Open in IMG/M |
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M |
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental | Open in IMG/M |
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental | Open in IMG/M |
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental | Open in IMG/M |
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental | Open in IMG/M |
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental | Open in IMG/M |
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental | Open in IMG/M |
3300005564 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG | Host-Associated | Open in IMG/M |
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental | Open in IMG/M |
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental | Open in IMG/M |
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental | Open in IMG/M |
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental | Open in IMG/M |
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental | Open in IMG/M |
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental | Open in IMG/M |
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental | Open in IMG/M |
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental | Open in IMG/M |
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental | Open in IMG/M |
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental | Open in IMG/M |
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated | Open in IMG/M |
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental | Open in IMG/M |
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental | Open in IMG/M |
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental | Open in IMG/M |
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M |
3300010326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG | Environmental | Open in IMG/M |
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental | Open in IMG/M |
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental | Open in IMG/M |
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental | Open in IMG/M |
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental | Open in IMG/M |
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental | Open in IMG/M |
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental | Open in IMG/M |
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental | Open in IMG/M |
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M |
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental | Open in IMG/M |
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012951 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MG | Environmental | Open in IMG/M |
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental | Open in IMG/M |
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental | Open in IMG/M |
3300012988 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MG | Environmental | Open in IMG/M |
3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental | Open in IMG/M |
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated | Open in IMG/M |
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental | Open in IMG/M |
3300014969 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaG | Host-Associated | Open in IMG/M |
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
3300015372 | Soil combined assembly | Host-Associated | Open in IMG/M |
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated | Open in IMG/M |
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental | Open in IMG/M |
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental | Open in IMG/M |
3300019868 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2s1 | Environmental | Open in IMG/M |
3300019870 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m1 | Environmental | Open in IMG/M |
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2 | Environmental | Open in IMG/M |
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental | Open in IMG/M |
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental | Open in IMG/M |
3300026469 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-B | Environmental | Open in IMG/M |
3300026498 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-A | Environmental | Open in IMG/M |
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental | Open in IMG/M |
3300026791 | Grasslands soil microbial communities from Kansas, USA that are Nitrogen fertilized - NN591 (SPAdes) | Environmental | Open in IMG/M |
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental | Open in IMG/M |
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental | Open in IMG/M |
3300030916 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA12 EcM (Eukaryote Community Metatranscriptome) | Environmental | Open in IMG/M |
3300030945 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA10 EcM (Eukaryote Community Metatranscriptome) | Environmental | Open in IMG/M |
3300031128 | Oak Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental | Open in IMG/M |
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental | Open in IMG/M |
3300031469 | Fir Spring Coassembly Site 11 - Champenoux / Amance forest | Environmental | Open in IMG/M |
3300031538 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1 | Environmental | Open in IMG/M |
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental | Open in IMG/M |
3300031941 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080 | Environmental | Open in IMG/M |
3300031942 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176 | Environmental | Open in IMG/M |
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental | Open in IMG/M |
Geographical Distribution | |
---|---|
Zoom: | Powered by OpenStreetMap |
⦗Top⦘ |
Protein ID | Sample Taxon ID | Habitat | Sequence |
SwRhRL2b_0276.00005740 | 2162886007 | Switchgrass Rhizosphere | MREQKDLALLIFRGAGLLLVATFGVQKIGWYWSALLAGKSLTSSGLAQLIAKMGFPIPVALALWITFNESIGAFLVAC |
E1_07145150 | 2170459002 | Grass Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWKAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGCGFLTR |
E4B_03987120 | 2170459004 | Grass Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGCGF |
E4B_03713560 | 2170459004 | Grass Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGXGF |
L01_02280730 | 2170459006 | Grass Soil | PSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGCGFLT |
F64_01308620 | 2170459011 | Grass Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALL |
FG2_08791670 | 2189573004 | Grass Soil | MREQKDFGLLILRGAGFLLAGTFGIQKIGWYWSLFHASKSLSSAGLAPLIARMGFPIPFALALWITFNESIAVVSVGCGF |
INPgaii200_10504191 | 2228664022 | Soil | VNGRRIKMSGFQRFPDRDLGLLILRGAGFLLAATFGLQKIGWYWTAFHTGKALSAIGLAPLIATMGFPIPVVLAFWITFNESIGALLIGCGFLTRILAAS |
INPgaii200_11608543 | 2228664022 | Soil | MLDLGLLALRSAGFLLALTFGFQKIGWYISAFHSDKAFSSVGLAPLIAHVGFPAPDPRCVDYVQ |
JGI25389J43894_10232702 | 3300002916 | Grasslands Soil | MLDIGLLLFRAAGFLLAFTFGVQKIGWYVTAFHAGKPLLSSIGLTPLIAHMGLPLPVILAFWITLNESIGA |
Ga0062593_1031351842 | 3300004114 | Soil | MIDPGLLLLRAAGFLLAFTFGIQKIGWYVTAFHAGKPLSSIGLTPLIAHVGFPLPMILAL |
Ga0062590_1011985851 | 3300004157 | Soil | MRKQTDLGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALAIWITFNESIGA |
Ga0066683_101853553 | 3300005172 | Soil | MRKHKDLGLLLLRGSGLLLALTFGVQKIGWYCSALHVGKSFLSIGLAPLIAKMGFPIPVVLAVWITFNESIGAFL |
Ga0066673_104472132 | 3300005175 | Soil | MSAFQRFPSRDLGFLILRGAGFLLAATFGVQKIGWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLAL* |
Ga0066685_100201211 | 3300005180 | Soil | MLDIGLLLFRAAGFLLAFTFGVQKIGWYVTAFHAGKPLLSSIGLTPLIAHMGLPLPVILALWITLNESIG |
Ga0066676_110199681 | 3300005186 | Soil | MIYSGLLLLRAAGFLLAFTFGIQKIGWYVTGLHAGKPFSSIGLTPLIAHVGFPLPVILALWITLNES |
Ga0066675_109747501 | 3300005187 | Soil | MREQKDLGLLILRGAGPLLALTFGVQKIGWYWSALHAGKPFSSIGLAPLIAKMGFPIPVALAIWITFNESIGAFLIGCGFL |
Ga0065705_100457421 | 3300005294 | Switchgrass Rhizosphere | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIARMGFPIPFALALWITFNESIGAFLIGCGVLTRLLAASAALGMA |
Ga0065705_104479341 | 3300005294 | Switchgrass Rhizosphere | MREQKDFGLLVLRGAGLLLAVTFGLQKIGWYWSVFHAGKSLSSAGLTPLIARMGFPIPFALALWITFNESIGAFL |
Ga0066388_1009467223 | 3300005332 | Tropical Forest Soil | MRKQNDLGLLVLREAGLLLALTFGLQKIGWYWSALHARKSFSSIGLAPLIAKMGFPIPPALALWITFNESIGAF* |
Ga0066388_1085276262 | 3300005332 | Tropical Forest Soil | MLDLGLLALRSAGFLLAFTFGIQKIGWYVMALHANKPFSSIGLAPLIAKFGFPIPVILA |
Ga0070709_113687461 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MINIGLLLSRAAGFLLAFTFGIQKIGWYVAAFHAGKPLASVGLAPLIAKVGFPFPIILALWI |
Ga0070714_1009981972 | 3300005435 | Agricultural Soil | MSAFQRFPSRDLGLLILRGAGFLLAATFGLQKIGWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGALLIGCGFLTRI |
Ga0070711_1014783012 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MLDLGLLALRSAGFLLALTFGFQKIGWYLSAFHSDKAFSSVGLAPLIAHVGFPAPAILAVWITFNE |
Ga0070708_1008902971 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | LKASPPFGYGSVSLHMRKQTDLGLLLLRGSGFLLALTFGAQKIGWYWSGLHAGKSFSSIGLAPLIAKIGFPIPVALAIWTTFNESIGAF |
Ga0070697_1009421342 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MKEQKDLGLLILRGAGLLLAVTFGVQKLGWYWAAFHAGKSLFHAGLAPLIARMGFPIPIVLALWITFN |
Ga0070664_1021469411 | 3300005564 | Corn Rhizosphere | MLDLGLLVLRTAGFFLAFTFGIQKIGWYIAGLHSDKAFSSTGLAPLIAKMGFPAAVILALWVTFNESVGAFLI |
Ga0066693_101179691 | 3300005566 | Soil | MIDIGRLLFRAAGLLLALTFGVQKIGWYVTAFHAGKPLLSSIGLTPLIAHMGLPLPVILALWITLNESIGAFF |
Ga0066703_107995182 | 3300005568 | Soil | MRKQADLGLLLLRGSGFLLALTFGVQKIGWYWSSLHAGKSFSSIGLAPLIAKIGFPIPVALAIWITFNESI |
Ga0066708_101711911 | 3300005576 | Soil | MKEQRDLGLLILRGWYWTALHAGKSLSHAGLAPLIARMGFPIPVVLALWVTFNESIGAFLIGCGFLTRVM |
Ga0066905_1001467605 | 3300005713 | Tropical Forest Soil | MIDLGLLLLRAAGFLLAFTFGIQKIGWYVTAFHSGKPLSSIGLAPLIGHVGFPLPVILA |
Ga0066903_1001996764 | 3300005764 | Tropical Forest Soil | MPSLINIGLLLSRAAGFLLAFTFGIQKIGWYVTAFHAGKPVASIGLAPLIAKVGFPFPIILALWITLNESIGAFFVGIGL |
Ga0066903_1083405991 | 3300005764 | Tropical Forest Soil | MIDRGLLTLRSAGFLLAFTFGIQKIGWYISAFHSEKPFASIGLTPLIAHMGFPVPVVLAL |
Ga0070717_100390676 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSALHAGKSLSSAGLAPLIAKMGFPIPVALAVWITFNESIGAFLIGGGFLTRLLAASAALGMAGA |
Ga0066696_102289251 | 3300006032 | Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIAKMGLPIPFALALWITFNEIGVFLVGCGFLTRLLAASAALGMAGAL |
Ga0066652_1005557283 | 3300006046 | Soil | MRKHKDLGLLLLRGSGLLLALTFGVQKIGWYCSALHVGKSFLSIGLAPLIAKMGFPIPVVLAVWITFNESIG |
Ga0066652_1010897311 | 3300006046 | Soil | MREQKDFGLLILRGVGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIARMGFPIPVALALWITFNESIGAFLIGCGFLTRLLA |
Ga0070765_1006728943 | 3300006176 | Soil | MLDLGLLVLRTAGFLLAFTFGIQKIGWYMAAFHSDKAFSSIGLVPLIAKMGFPAAVILA |
Ga0070765_1014006971 | 3300006176 | Soil | MSAFQRFPSRDLGFLILRGAGFLLAATFGVQKIGWYWTAFHAGKSLSAIGLASLIARMGFPIPVVLALWITFNESIGAFLIGCGFLTRILAASAALGMAG |
Ga0066653_104011021 | 3300006791 | Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALALWITFNESIGAFLIG |
Ga0075433_119232092 | 3300006852 | Populus Rhizosphere | MAYSLVSLPMIKQKDLGLLVLRGAGFLLALTFGVQKIGWYWSALHAGKSFQSIGLAPLIAKMGFPIPVVLSIWIMFNESIGAFFVGC |
Ga0099791_100692034 | 3300007255 | Vadose Zone Soil | MLDLGLLALRSAGILLAFTFGIQKIGWYIAAFHSDKPLSSVGLAPLIAQIGFPAPAILALWVTFNESIGALLI* |
Ga0066710_1005182443 | 3300009012 | Grasslands Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSAVHAGKSFSSIGLAPLIAKMSFPIPVALAIWITFDESIGAFLIGRGFLARL |
Ga0126374_116759791 | 3300009792 | Tropical Forest Soil | MLNVGLLVLRATGFLLAFTFGIQKIGWYIAALHSDKAFSSIGLAPLISKMGFPASVILALWI |
Ga0126373_123373632 | 3300010048 | Tropical Forest Soil | MKITERLMNRDLGLLILRCAGLLLAATFGVQKIGWYWTGLHAGKDLSHIGLATLIARMGFPVPVLLALSVTFNESIGAFL |
Ga0134088_104060591 | 3300010304 | Grasslands Soil | MLDIGLLLFRAAGFLLAFTFGVQKIGWYVRALHAGKPWSSIGLAPLIAHVGFPLPVVLALWITLNESIGA |
Ga0134088_106993421 | 3300010304 | Grasslands Soil | MKEQRDLGLLILRGAGLLLAVTFGVQKVGWYWTALHAGKSLSHAGLAPLIARMGFPIPVVLALWVTFNESIGAFLIGCGF |
Ga0134065_104755122 | 3300010326 | Grasslands Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAIHAGKSLSSAGLAPLIAKMGLPIPFALALWITFNESIGAFLVGCGFLTRLLAASAALGM |
Ga0134063_101102073 | 3300010335 | Grasslands Soil | MLDIGLLLFRAAGFLLAFTFGVQKIGWYVTAFHAGKPLLSSIGLTPLIAHMGLPLPVILAL* |
Ga0126372_117490422 | 3300010360 | Tropical Forest Soil | MIDPGLLVLRAAGFLLAFTFGIQKIGWYVTAFHAGKPLSSIGLAPLIADVGFPIPVILALWITLNESIGAFLV |
Ga0126379_101992615 | 3300010366 | Tropical Forest Soil | MFDLGLLALRGAGFLLAFTFGIQKIGWYVMAFHSNKPFSSIGLAPLIAKVGFPMAVILALWIT |
Ga0126379_110731302 | 3300010366 | Tropical Forest Soil | MRKQKDLGLLLLRGAGLLLALSFGVQKIGWYWSALHAEKPFSSIGLAPLIARMGFPIPVALAIWITFNESIGAFLIGCGFLTRSLSASLALG |
Ga0134125_112765001 | 3300010371 | Terrestrial Soil | MLDLGLLVLRAAGFLLAFTFGIQKIGWYIAGLHSDKAFSSIGLAPLIAKMGFPAAILLAL |
Ga0137391_101518705 | 3300011270 | Vadose Zone Soil | MLDLGLLALRSAGILLAFTFGIQKIGWYIAAFHSDKPLSSVGLAPLIAQIGFPAPAILALWVTFNESIGAL* |
Ga0137383_108389281 | 3300012199 | Vadose Zone Soil | MIDTGLLLLRAAGFLLAFTFGVQKIGWYVTAFHAGKPWSSIGLAPLIAHVGFPLPVILALWITLNESIGAFLIG |
Ga0137363_106266881 | 3300012202 | Vadose Zone Soil | MREQKDLGLLILRGAGFLLAGTFGVQKIGWYWSAFHAGKSLSTVGLAPLIARMGFPIPVALALWITFNESIGAFLIGCGF |
Ga0137386_102557431 | 3300012351 | Vadose Zone Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSAVHAGKSFLSIGLAPLIAKMGFPIPVVLAVWITFNESIGAFLIGCGVLTR |
Ga0137360_110094501 | 3300012361 | Vadose Zone Soil | MREQKDFGLLILRGVGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIARMGFPIPVALALWITFNESIGAFLIGCGFLTRLLAA |
Ga0137360_110221941 | 3300012361 | Vadose Zone Soil | MIDLGLLFLRAAGFLLAFTFGIQKIGWYVTAFHTGKPLSSIGLAPLIAHVGFPLPFILALWITLNESIGG |
Ga0137416_100496643 | 3300012927 | Vadose Zone Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIAKMGFPIPVALAIWITFNELIGAFLIGCGFRHDY* |
Ga0137407_100039941 | 3300012930 | Vadose Zone Soil | MIYSGLLLLRAAGFLLAFTFGIQKIGWYVTGLHAGKPFSSIGLTPLIAHVGFPLPVIL |
Ga0137407_101905303 | 3300012930 | Vadose Zone Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSAVHAGKSFSSIGLAPLIAKMSFPIPVALAIWITFNESIGAFL |
Ga0164300_108742082 | 3300012951 | Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLPSDGLATLIAKIGFPILVALAIWITLDNSIGA |
Ga0164302_115971132 | 3300012961 | Soil | MREQKDSGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALALWITFNESIGAFLIGCGRLTRY* |
Ga0126369_105180854 | 3300012971 | Tropical Forest Soil | VYDLGLLAVRSAGFLLAFTFGIQKIGWYISAFHSKKPFASIALTPLITHMGFPVPVILPLWVTFDTSVGAFLIGCGVFTRVF |
Ga0126369_131062181 | 3300012971 | Tropical Forest Soil | MRKQKDLGLLLLRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPGVLAIWITFNESIGAFL |
Ga0164306_109211701 | 3300012988 | Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLTPLIAKMGFPIPFALALWITFNESIGAFLIGCG |
Ga0164305_115594131 | 3300012989 | Soil | MREQKDSGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALALWITFNESIGAFLIGCGLLT |
Ga0157378_118148481 | 3300013297 | Miscanthus Rhizosphere | MTKQKDLGLLVLRGAGFLLALTFGVQKIGWYWSALHARRPFSSIGLAPLIAKMGFPIPVALAIWVTFNESI |
Ga0134079_103072412 | 3300014166 | Grasslands Soil | MREQKDFGLLILRGAGLLLAGTFGIQKIGWYWSAFHAGKSLSTAGLAPLIARMGFPIPFALALWITFNESIGAFLIGCGFLT |
Ga0157376_104197631 | 3300014969 | Miscanthus Rhizosphere | MREQKDFGLLILRGAGFLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIAKMGFPIPFALALWITFNE |
Ga0157376_128483101 | 3300014969 | Miscanthus Rhizosphere | MINIGLLLSRAAGFLLAFTFGIQKIGWYVAAFHAGKPFASVGLAPLIAKVGFPFPIF |
Ga0137403_111248382 | 3300015264 | Vadose Zone Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSAVHAGKSFSSIGLAPLIAKMSFPIPVALAIWITF |
Ga0132256_1017494712 | 3300015372 | Arabidopsis Rhizosphere | MKEQKDFGLLILRGAGLLLAGTFGIQKIGWYWSAFHAGKSLSSAGLAPLIARMGFPIPFALAFWITFNESIGAFL |
Ga0132255_1046581462 | 3300015374 | Arabidopsis Rhizosphere | MREQKDFGLLILRGAGFLLAGTFGVQKIGWYWSAFHAGKSLSSGGLAPLIARMGFPIPFALALWITF |
Ga0134083_100275364 | 3300017659 | Grasslands Soil | MFDWGLLALRSAGFLLAFTFGLQKIGWYISAFQSDKSFSSIGLAPLIAQVGFPAPVILA |
Ga0184604_101419502 | 3300018000 | Groundwater Sediment | MREQKDFGLLVLRGAGLLLAGTFGVQKIGWYWSAIHAGKSLSSAGLAPLIARMGFPIPFALALWITFNESIGAFLIGCGVLTRRWLRGASNEMN |
Ga0066655_108480891 | 3300018431 | Grasslands Soil | MREQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAVHAGKSLSSAGLAPLIARMGFPIPFALALWITFNESIGAILIGCGFLTRLLA |
Ga0193720_10433552 | 3300019868 | Soil | MRAGFLLAFTFGLQKIGWYISAFQSDKPFSSIGLAPLIAQVGFPAPVILALWITFNESIGALFLGCGLF |
Ga0193746_10114411 | 3300019870 | Soil | MREQKDLGLLILRGAGLLLTLTFGVQKIGWYWPALHAGKSFSSIGLAPLIAKMGLPIPVALALWITFNESVGAFLIG |
Ga0193707_11875291 | 3300019881 | Soil | MREQKDFGLLVLRGAGLLLAGTFGVQKIGWYWSAIHAGKSLSSAGLAPLIARMGFPIPFALALWITFNESIGAFLIGCGVLTRRWLHGASNEMN |
Ga0179596_100601461 | 3300021086 | Vadose Zone Soil | MLDLGLLALRSAGILLAFTFGIQKIGWYIAAFHSDKPLSSVGLAPLIAQIGFPAPAILALWV |
Ga0126371_108274142 | 3300021560 | Tropical Forest Soil | VFDFSLLVLRSAGFLLAFTFGVQKIGWYLIAFHSNKPFSSIGLAPLIAKMGFPAAGYSCALDNIQ |
Ga0126371_111936772 | 3300021560 | Tropical Forest Soil | MRQQKDLGLLVLRTAGFLLTFTFGIQKIGWYVTALRSGKHLSFIGLAPLISQIGFPVSVSVVLAIWVTFNESIGAFLIGCGLF |
Ga0126371_135365952 | 3300021560 | Tropical Forest Soil | VFDIGLLALRSAGFLLAFTFGIQKIGWYIAAFHSDKPLSSIGLAPLIAHVGFPVPVILALWITFNESIAS |
Ga0222622_114810061 | 3300022756 | Groundwater Sediment | MIDSGLLLLRAAGFLLAFTFGVQKIGWYVTAFHAGKPWSSIGLAPLIAHVGFPLPVILALWITLNESIGAFLIGI |
Ga0257169_10847911 | 3300026469 | Soil | MSAFQRFPSRDLGLLILRGAGFLLAATFGLQKIGWYWTAFHPGKSLSAIGLAPLIGRMGFPIPVVLALWITFNESTGALLIACGFLTRILAA |
Ga0257156_10007831 | 3300026498 | Soil | MREQRDFGLLILRIAGFLLVFTFGIQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALAMWITFNESIGAFL |
Ga0209157_12493513 | 3300026537 | Soil | MFDWGLLALRSAGFLLAFTFGLQKIGWYISTFQSDKSFSSIGLAPLIAQVGFPAPVILALWITFNESIGALF |
Ga0208072_1081551 | 3300026791 | Soil | MRKQRDLGLLILRGAGLLLALTFGVQKIGWYWSGLHAGKSFSSIGLAPLIAKMGFPIPVALAIWITFNESIGAFLIGCGFLTRSL |
Ga0209488_108278582 | 3300027903 | Vadose Zone Soil | MRGQKDFGLLILRGAGLLLAGTFGVQKIGWYWSAFHAGKSLSSAGLAPLIAKMGFPIPVALAIWITFNESIGAFLIACGLTRLLAASL |
Ga0307312_104253411 | 3300028828 | Soil | MFDWGLLALRSAGFLLAFTFGLQKIGWYVSAFQSDKPFSSIGLAPLIAQVGFPAAVILALWITFNESIG |
Ga0075386_122061021 | 3300030916 | Soil | MSAFQRFPTRDLGFLILRGAGFLLAVTFGVQKIGWYWTAFHAGKSFSAVGLAPLIARMGFPIPVVLALW |
Ga0075373_113400642 | 3300030945 | Soil | MSAFHRFPSRDLGLLILRGAGFLLAATFGLQKFSWYWTAFHAGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIG |
Ga0170823_136878712 | 3300031128 | Forest Soil | MRKQKDFGLLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKMGFPIPVALAIWITFNESIGAFLIGCGFLT |
Ga0170824_1020581585 | 3300031231 | Forest Soil | LLILRGAGLLLALTFGVQKIGWYWSALHAGKSFSSIGLAPLIAKIGFPIPVALAIWITFNESIGAFLIGCGFL |
Ga0170824_1045280111 | 3300031231 | Forest Soil | MSALQRFLSRDLGLLILRGAGFLLAVTFGVQKIGWYWTAFHAGKSFSAVALAPLIARMGFPIPVVLALWITFNESIGALL |
Ga0170824_1181160203 | 3300031231 | Forest Soil | MKEQKDFGLLILRGTGLLLAGTFGIQKIGWYWSAVHAGKSLSSAGLAPLIAKMGFPIPFALALWITFNESIGAFLVGCGFLTR |
Ga0170824_1232579881 | 3300031231 | Forest Soil | AGFLLALTFGFQKIGWYLSAFHSDKAFASVGLAPLIAHLGFPAPAILAVWITFNE |
Ga0170819_144409651 | 3300031469 | Forest Soil | MSAFQRFPSRDLGFLILRGAGFSLAATFGLQKIGWYWTAFHTGKSLSAIGLAPLIARMGFPIPVVLALWITFNESIGAFLIGCGFLTRILAA |
Ga0310888_104630281 | 3300031538 | Soil | MKEQKDSGLLILRGAGLLLALTFGVQKIGWYWSALHAGKPFSSIGLAPLIAKMGFPISVALALWITFN |
Ga0307475_115563542 | 3300031754 | Hardwood Forest Soil | MKEQKDLGLLILRGAGLLLAVTFGVQKIGWYWTAFHAGKPLSHAGLAPLIARMGFPIPFILAWWVTFNESIGAFLIGCGFLTRTLAAS |
Ga0310912_102710412 | 3300031941 | Soil | MIDFGLLLLRAARFVLAFTFGIQKMGWYVTAFHAGKPLRSVGLAPLIAHVGFPLPIILALWITL |
Ga0310916_106051461 | 3300031942 | Soil | MIDFGLLLLRAARFVLAFTFGIQKMGWYVTAFHAGKPLRSVGLAPLIAHVGFPLPIILALWI |
Ga0307479_114745391 | 3300031962 | Hardwood Forest Soil | LLAVTFGVQKLGWYWAAFHAGKSLLHAGLAPLIARMGFPISAALALWITFNESIGAFLIGCGFLTRIMAGSAALGMAGALYT |
⦗Top⦘ |