NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F052839

Overview

Basic Information
Family ID F052839
Family Type Metagenome / Metatranscriptome
Number of Sequences 142
Average Sequence Length 82 residues
Representative Sequence MDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQQELGLPDGRTLRFHASCAAAWRRLRDVLPLLRR
Number of Associated Samples 117
Number of Associated Scaffolds 142

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 61.27 %
% of genes near scaffold ends (potentially truncated) 26.76 %
% of genes from short scaffolds (< 2000 bps) 76.76 %
Associated GOLD sequencing projects 107
AlphaFold2 3D model prediction No
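
The percentages above can be turned back into approximate gene counts. The following is a minimal sketch that assumes each percentage is taken over the 142 family sequences listed under Basic Information; the page does not state the denominator, and the names `percent_to_count` and `FAMILY_SIZE` are hypothetical, chosen only for illustration.

```python
# Sketch: recover approximate member counts from the reported percentages.
# Assumption: every percentage uses the 142 family sequences as its denominator.

FAMILY_SIZE = 142  # "Number of Sequences" from Basic Information


def percent_to_count(percent: float, total: int) -> int:
    """Return the whole-number count closest to percent % of total."""
    return round(percent / 100 * total)


# Values copied from the Quality Assessment block.
quality_percentages = {
    "valid RBS motifs": 61.27,
    "near scaffold ends": 26.76,
    "short scaffolds (< 2000 bps)": 76.76,
}

quality_counts = {
    label: percent_to_count(value, FAMILY_SIZE)
    for label, value in quality_percentages.items()
}

for label, count in quality_counts.items():
    print(f"{label}: about {count} of {FAMILY_SIZE} genes")
```

Under that assumption, the same helper applies to the other "% of family members" figures on this page.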

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model (HMM logo, powered by Skylign)

Most Common Taxonomy
Group Bacteria (60.563 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere (14.789 % of family members)
Environment Ontology (ENVO) Unclassified (28.873 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (47.887 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 48.78%, β-sheet 2.44%, coil/unstructured 48.78%


Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 142 Family Scaffolds
PF00072	Response_reg	26.06
PF01968	Hydantoinase_A	4.23
PF04392	ABC_sub_bind	3.52
PF00561	Abhydrolase_1	3.52
PF12697	Abhydrolase_6	2.82
PF13174	TPR_6	2.11
PF13424	TPR_12	2.11
PF00487	FA_desaturase	2.11
PF05378	Hydant_A_N	1.41
PF01168	Ala_racemase_N	0.70
PF02538	Hydantoinase_B	0.70
PF07859	Abhydrolase_3	0.70
PF02201	SWIB	0.70
PF13397	RbpA	0.70
PF02954	HTH_8	0.70
PF00589	Phage_integrase	0.70
PF03069	FmdA_AmdA	0.70
PF13358	DDE_3	0.70
PF04239	DUF421	0.70
PF12973	Cupin_7	0.70

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 142 Family Scaffolds
COG2984	ABC-type uncharacterized transport system, periplasmic component	General function prediction only [R]	3.52
COG0145	N-methylhydantoinase A/oxoprolinase/acetone carboxylase, beta subunit	Amino acid transport and metabolism [E]	2.82
COG1398	Fatty-acid desaturase	Lipid transport and metabolism [I]	2.11
COG3239	Fatty acid desaturase	Lipid transport and metabolism [I]	2.11
COG0146	N-methylhydantoinase B/oxoprolinase/acetone carboxylase, alpha subunit	Amino acid transport and metabolism [E]	1.41
COG0657	Acetyl esterase/lipase	Lipid transport and metabolism [I]	0.70
COG2323	Uncharacterized membrane protein YcaP, DUF421 family	Function unknown [S]	0.70
COG2421	Acetamidase/formamidase	Energy production and conversion [C]	0.70
COG5531	DNA-binding SWIB/MDM2 domain	Chromatin structure and dynamics [B]	0.70


Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	60.56 %
Unclassified	root	N/A	39.44 %

Associated Scaffolds


Scaffold	Taxonomy	Length
2088090015|GPICI_8720685	All Organisms → cellular organisms → Bacteria → Proteobacteria	1910
3300002245|JGIcombinedJ26739_100425825	All Organisms → cellular organisms → Bacteria	1205
3300002245|JGIcombinedJ26739_101831609	Not Available	508
3300002914|JGI25617J43924_10169114	Not Available	736
3300003319|soilL2_10089971	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1581
3300004463|Ga0063356_100152835	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2635
3300004479|Ga0062595_100058201	All Organisms → cellular organisms → Bacteria	1819
3300005330|Ga0070690_100117134	All Organisms → cellular organisms → Bacteria → Proteobacteria	1784
3300005338|Ga0068868_100533574	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	1032
3300005347|Ga0070668_100189911	All Organisms → cellular organisms → Bacteria	1682
3300005364|Ga0070673_100043363	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	3476
3300005406|Ga0070703_10148130	Not Available	879
3300005434|Ga0070709_10054420	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	2523
3300005437|Ga0070710_10396154	Not Available	924
3300005440|Ga0070705_100100531	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1825
3300005444|Ga0070694_100661290	Not Available	846
3300005445|Ga0070708_100095982	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2707
3300005445|Ga0070708_100295423	All Organisms → cellular organisms → Bacteria	1525
3300005467|Ga0070706_100009816	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	8897
3300005467|Ga0070706_101439184	Not Available	630
3300005468|Ga0070707_100170285	All Organisms → cellular organisms → Bacteria	2122
3300005471|Ga0070698_100260742	All Organisms → cellular organisms → Bacteria	1665
3300005471|Ga0070698_101517792	Not Available	621
3300005518|Ga0070699_101745856	Not Available	570
3300005529|Ga0070741_10003263	All Organisms → cellular organisms → Bacteria	40800
3300005536|Ga0070697_100052528	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	3311
3300005545|Ga0070695_100415940	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1023
3300005546|Ga0070696_100126200	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1857
3300006041|Ga0075023_100079772	Not Available	1092
3300006047|Ga0075024_100212164	Not Available	912
3300006047|Ga0075024_100241651	Not Available	863
3300006050|Ga0075028_100006426	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	4641
3300006172|Ga0075018_10182042	Not Available	987
3300006172|Ga0075018_10630299	Not Available	573
3300006176|Ga0070765_100165282	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1986
3300006237|Ga0097621_101424492	Not Available	656
3300006237|Ga0097621_102347117	Not Available	510
3300006755|Ga0079222_10047547	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales	1969
3300006755|Ga0079222_10495133	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	893
3300006804|Ga0079221_10411063	Not Available	843
3300006804|Ga0079221_10883737	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Myxococcaceae → unclassified Myxococcaceae → Myxococcaceae bacterium	653
3300006852|Ga0075433_10007552	All Organisms → cellular organisms → Bacteria	8624
3300006854|Ga0075425_100237493	All Organisms → cellular organisms → Bacteria → Proteobacteria	2097
3300006871|Ga0075434_101278981	Not Available	745
3300006880|Ga0075429_101980309	Not Available	504
3300006914|Ga0075436_100160936	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	1582
3300006914|Ga0075436_100911717	Not Available	657
3300006954|Ga0079219_11397012	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	625
3300007265|Ga0099794_10313277	Not Available	813
3300009038|Ga0099829_10443308	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1076
3300009143|Ga0099792_10052766	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1987
3300009143|Ga0099792_10166527	Not Available	1228
3300009162|Ga0075423_10038178	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	4898
3300009162|Ga0075423_10143289	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	2509
3300009176|Ga0105242_10931939	Not Available	871
3300010400|Ga0134122_11031071	Not Available	809
3300010400|Ga0134122_12605739	Not Available	556
3300011269|Ga0137392_10422972	All Organisms → cellular organisms → Bacteria	1106
3300012096|Ga0137389_10820309	Not Available	799
3300012202|Ga0137363_10058665	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2787
3300012203|Ga0137399_10012976	All Organisms → cellular organisms → Bacteria → Proteobacteria	5122
3300012205|Ga0137362_10076874	Not Available	2776
3300012205|Ga0137362_10332421	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1316
3300012363|Ga0137390_11109754	Not Available	740
3300012582|Ga0137358_10239175	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1236
3300012918|Ga0137396_10478746	Not Available	923
3300012923|Ga0137359_10035443	All Organisms → cellular organisms → Bacteria	4301
3300012929|Ga0137404_11639398	Not Available	597
3300012957|Ga0164303_11462916	Not Available	514
3300012986|Ga0164304_10250014	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1188
3300015262|Ga0182007_10053506	All Organisms → cellular organisms → Bacteria → Proteobacteria	1329
3300015264|Ga0137403_10335761	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1401
3300015371|Ga0132258_13402217	Not Available	1092
3300015374|Ga0132255_103109876	All Organisms → cellular organisms → Bacteria → Proteobacteria	708
3300017927|Ga0187824_10119936	Not Available	856
3300017930|Ga0187825_10200082	Not Available	719
3300017961|Ga0187778_10007901	All Organisms → cellular organisms → Bacteria	6741
3300018031|Ga0184634_10516761	Not Available	532
3300018422|Ga0190265_10027088	All Organisms → cellular organisms → Bacteria	4694
3300019877|Ga0193722_1088547	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	753
3300019999|Ga0193718_1085012	Not Available	665
3300020002|Ga0193730_1090645	Not Available	857
3300020006|Ga0193735_1124932	Not Available	695
3300020021|Ga0193726_1059538	All Organisms → cellular organisms → Bacteria	1801
3300020062|Ga0193724_1083722	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	660
3300020070|Ga0206356_10326617	Not Available	1063
3300020199|Ga0179592_10223954	Not Available	849
3300020581|Ga0210399_10905356	All Organisms → cellular organisms → Bacteria	715
3300020583|Ga0210401_10836524	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	780
3300021086|Ga0179596_10270810	Not Available	842
3300021088|Ga0210404_10026100	All Organisms → cellular organisms → Bacteria	2569
3300021088|Ga0210404_10082285	All Organisms → cellular organisms → Bacteria	1588
3300021420|Ga0210394_11476899	Not Available	576
3300021432|Ga0210384_10013404	All Organisms → cellular organisms → Bacteria	8171
3300021432|Ga0210384_11262780	Not Available	643
3300022756|Ga0222622_11178475	Not Available	564
3300025885|Ga0207653_10189403	Not Available	772
3300025904|Ga0207647_10243770	All Organisms → cellular organisms → Bacteria	1032
3300025910|Ga0207684_10013713	All Organisms → cellular organisms → Bacteria → Proteobacteria	6999
3300025910|Ga0207684_10019888	All Organisms → cellular organisms → Bacteria	5738
3300025910|Ga0207684_10034265	All Organisms → cellular organisms → Bacteria	4315
3300025915|Ga0207693_10544047	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales	906
3300025934|Ga0207686_11066636	All Organisms → cellular organisms → Bacteria	658
3300025941|Ga0207711_10528449	All Organisms → cellular organisms → Bacteria	1100
3300025960|Ga0207651_11988279	Not Available	522
3300025972|Ga0207668_10281760	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales	1363
3300026023|Ga0207677_10852992	Not Available	818
3300026095|Ga0207676_10072887	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Armatimonadetes → unclassified Armatimonadetes → Armatimonadetes bacterium CSP1-3	2762
3300026304|Ga0209240_1085338	All Organisms → cellular organisms → Bacteria	1156
3300026358|Ga0257166_1013033	All Organisms → cellular organisms → Bacteria	1046
3300026359|Ga0257163_1046239	Not Available	693
3300026369|Ga0257152_1001860	All Organisms → cellular organisms → Bacteria	2041
3300026371|Ga0257179_1005444	Not Available	1187
3300026376|Ga0257167_1012339	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1156
3300026481|Ga0257155_1019694	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	961
3300026482|Ga0257172_1046133	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	797
3300026494|Ga0257159_1019029	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1114
3300026496|Ga0257157_1008540	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1605
3300026514|Ga0257168_1022886	Not Available	1302
3300026557|Ga0179587_10205550	All Organisms → cellular organisms → Bacteria	1249
3300027894|Ga0209068_10001603	All Organisms → cellular organisms → Bacteria	11261
3300027894|Ga0209068_10011615	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	4192
3300028906|Ga0308309_10129100	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2004
(restricted) 3300031248|Ga0255312_1041362	Not Available	1102
3300031716|Ga0310813_10945255	Not Available	783
3300031720|Ga0307469_10163964	All Organisms → cellular organisms → Bacteria	1685
3300031720|Ga0307469_10165824	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales	1677
3300031720|Ga0307469_10387051	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1188
3300031720|Ga0307469_10483599	All Organisms → cellular organisms → Bacteria	1080
3300031740|Ga0307468_100944026	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	753
3300031754|Ga0307475_11281813	Not Available	568
3300031820|Ga0307473_10016315	All Organisms → cellular organisms → Bacteria → Proteobacteria	2892
3300031820|Ga0307473_10331813	All Organisms → cellular organisms → Bacteria	970
3300032174|Ga0307470_10050992	All Organisms → cellular organisms → Bacteria	2120
3300032174|Ga0307470_10462338	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	915
3300032180|Ga0307471_101693374	All Organisms → cellular organisms → Bacteria	786
3300033432|Ga0326729_1007292	All Organisms → cellular organisms → Bacteria	2028
3300033433|Ga0326726_10250331	Not Available	1649
3300033500|Ga0326730_1010288	Not Available	2065
3300033500|Ga0326730_1016846	Not Available	1550
3300034090|Ga0326723_0127657	Not Available	1110
3300034820|Ga0373959_0007256	All Organisms → cellular organisms → Bacteria → Proteobacteria	1860
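
Each scaffold identifier above appears to join an IMG/M Taxon OID and a scaffold name with a `|` character; the numeric prefix matches the Taxon OID values listed under Associated Samples. The sketch below splits an identifier on that assumption and flags entries marked `(restricted)`. The names `ScaffoldId` and `parse_scaffold_id` are hypothetical helpers, not part of any NMPFamsDB or IMG/M interface.

```python
# Sketch: split a scaffold identifier into its apparent components.
# Assumption: the format is "<Taxon OID>|<scaffold name>", optionally
# preceded by the "(restricted)" marker used in the table above.
from typing import NamedTuple

RESTRICTED_MARKER = "(restricted)"


class ScaffoldId(NamedTuple):
    taxon_oid: str
    scaffold_name: str
    restricted: bool


def parse_scaffold_id(raw: str) -> ScaffoldId:
    """Parse one identifier; raise ValueError if it does not fit the assumed format."""
    text = raw.strip()
    restricted = text.startswith(RESTRICTED_MARKER)
    if restricted:
        text = text[len(RESTRICTED_MARKER):].strip()
    taxon_oid, separator, scaffold_name = text.partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldId(taxon_oid, scaffold_name, restricted)


example = parse_scaffold_id("3300005445|Ga0070708_100095982")
print(example.taxon_oid, example.scaffold_name, example.restricted)
```

Grouping parsed identifiers by `taxon_oid` would show which scaffolds come from the same sample, for example the two scaffolds from Taxon OID 3300005445.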

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	14.79%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	13.38%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	11.97%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	7.75%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	7.04%
Watersheds	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds	5.63%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	5.63%
Agricultural Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil	3.52%
Peat Soil	Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil	3.52%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere	2.82%
Freshwater Sediment	Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment	1.41%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	1.41%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	1.41%
Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil	1.41%
Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil	1.41%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere	1.41%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere	1.41%
Miscanthus Rhizosphere	Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere	1.41%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere	1.41%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	0.70%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	0.70%
Surface Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil	0.70%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere	0.70%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	0.70%
Sugarcane Root And Bulk Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil	0.70%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	0.70%
Tropical Peatland	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland	0.70%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere	0.70%
Sandy Soil	Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil	0.70%
Corn Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere	0.70%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	0.70%
Arabidopsis Thaliana Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere	0.70%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere	0.70%
Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere	0.70%
Rhizosphere Soil	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil	0.70%

Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID	Sample Name	Habitat Type
2088090015	Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil	Environmental
3300002245	Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)	Environmental
3300002914	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm	Environmental
3300003319	Sugarcane bulk soil Sample L2	Environmental
3300004463	Combined assembly of Arabidopsis thaliana microbial communities	Host-Associated
3300004479	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs	Environmental
3300005330	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG	Environmental
3300005338	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2	Host-Associated
3300005347	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG	Host-Associated
3300005364	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG	Host-Associated
3300005406	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG	Environmental
3300005434	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG	Environmental
3300005437	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG	Environmental
3300005440	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG	Environmental
3300005444	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG	Environmental
3300005445	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG	Environmental
3300005467	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG	Environmental
3300005468	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG	Environmental
3300005471	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG	Environmental
3300005518	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG	Environmental
3300005529	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005545	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG	Environmental
3300005546	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG	Environmental
3300006041	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014	Environmental
3300006047	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013	Environmental
3300006050	Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014	Environmental
3300006172	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014	Environmental
3300006176	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5	Environmental
3300006237	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)	Host-Associated
3300006755	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter	Environmental
3300006804	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200	Environmental
3300006852	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2	Host-Associated
3300006854	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4	Host-Associated
3300006871	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3	Host-Associated
3300006880	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3	Host-Associated
3300006914	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5	Host-Associated
3300006954	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control	Environmental
3300007265	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1	Environmental
3300009038	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG	Environmental
3300009143	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2	Environmental
3300009162	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2	Host-Associated
3300009176	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG	Host-Associated
3300010400	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2	Environmental
3300011269	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG	Environmental
3300012096	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG	Environmental
3300012202	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG	Environmental
3300012203	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG	Environmental
3300012205	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG	Environmental
3300012363	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG	Environmental
3300012582	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG	Environmental
3300012918	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG	Environmental
3300012923	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG	Environmental
3300012929	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)	Environmental
3300012957	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG	Environmental
3300012986	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG	Environmental
3300015262	Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-113_1 MetaG	Host-Associated
3300015264	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)	Environmental
3300015371	Combined assembly of cpr5 and col0 rhizosphere and soil	Host-Associated
3300015374	Col-0 rhizosphere combined assembly	Host-Associated
3300017927	Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4	Environmental
3300017930	Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5	Environmental
3300017961	Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MG	Environmental
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300019877Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1EnvironmentalOpen in IMG/M
3300019999Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a1EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m2EnvironmentalOpen in IMG/M
3300020021Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1EnvironmentalOpen in IMG/M
3300020062Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a1EnvironmentalOpen in IMG/M
3300020070Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-1 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025904Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025941Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025960Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026023Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026095Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026358Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-BEnvironmentalOpen in IMG/M
3300026359Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-AEnvironmentalOpen in IMG/M
3300026369Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-AEnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026481Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-AEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026494Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-AEnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033500Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF7AN SIP fractionEnvironmentalOpen in IMG/M
3300034090Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00NEnvironmentalOpen in IMG/M
3300034820Populus rhizosphere microbial communities from soil in West Virginia, United States - WV94_WV_N_2Host-AssociatedOpen in IMG/M

Geographical Distribution
[Interactive map of sample locations, powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
GPICI_033722402088090015SoilMETRTMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR
JGIcombinedJ26739_10042582533300002245Forest SoilMNAASQRIAEKIVARLLPAVADAVSEWEGFGIGLHCDGCDAPVLPSEAQHEPRLPDGRTLRFHAPCAAAWHRLRDVLPLLRR*
JGIcombinedJ26739_10183160933300002245Forest SoilMNPASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVPCATDWHRLRGVLPLLRR*
JGI25617J43924_1016911413300002914Grasslands SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQXELGLPDGRTLRFHASCAAAWRRLRDVLPLLRR*
soilL2_1008997123300003319Sugarcane Root And Bulk SoilMDPASQRVAEKLAARLLPGTEPTSTWEGFGLGLVCDGCDRPVLPSERQREMVLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0063356_10015283543300004463Arabidopsis Thaliana RhizosphereMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0062595_10005820133300004479SoilMNTASRRIAEKIAARLLPAVADPVRVWEGFGIGLPCDGCDLPILPSEAQRDLGLPDGRTLRFHVPCATDWQRLRGVLPLLRR*
Ga0070690_10011713443300005330Switchgrass RhizosphereMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRRRGRRP*
Ga0068868_10053357423300005338Miscanthus RhizosphereMDPASQRVAEKLAARLLPGVEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0070668_10018991113300005347Switchgrass RhizosphereRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0070673_10004336323300005364Switchgrass RhizosphereMETRTMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0070703_1014813013300005406Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK*
Ga0070709_1005442033300005434Corn, Switchgrass And Miscanthus RhizosphereMKTASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQRDLGLPDGRTLRFHVPCATDWQRLRGVLPLLRR*
Ga0070710_1039615423300005437Corn, Switchgrass And Miscanthus RhizosphereMRRLARLLSDVPGPMNPASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQRDLGLPDGRTLRFHVPCATD*
Ga0070705_10010053123300005440Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK*
Ga0070694_10066129023300005444Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQGLGLPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK*
Ga0070708_10009598243300005445Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQQELGLPDGRTLRFHASCAVAWRRLRDVLPLLRR*
Ga0070708_10029542333300005445Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQKELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK*
Ga0070706_10000981653300005467Corn, Switchgrass And Miscanthus RhizosphereMKTASRRIAEKIAARLLPAIADPVRIWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVPCATDWQRLRGVLPLLRR*
Ga0070706_10143918413300005467Corn, Switchgrass And Miscanthus RhizosphereEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQKELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK*
Ga0070707_10017028523300005468Corn, Switchgrass And Miscanthus RhizosphereMDEASQRIAEKLAARILPGTEPAAAWEGFGIGLRCEGCDRPVLPSERQRELVMPDGRTIRFHDQCAADWRR*
Ga0070698_10026074253300005471Corn, Switchgrass And Miscanthus RhizosphereMDEASQRIAEKLAAQILPGTEPAAAWEGFGIGLRCEGCDRPVLPSERQRELVMPDGRTIRFHDQCAADWRR*
Ga0070698_10151779213300005471Corn, Switchgrass And Miscanthus RhizosphereARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK*
Ga0070699_10174585613300005518Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQQELGLPDGRTLRFHASCA
Ga0070741_10003263143300005529Surface SoilMESRALNPASQRVAEKLAARLLPGTEPTSTWEGFGLGLSCDGCDRPVLPSERQRELALPDGRTFSCCCTDSRRRT*
Ga0070697_10005252863300005536Corn, Switchgrass And Miscanthus RhizosphereMKTASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVPCATDWQRLRGVLPLLRR*
Ga0070695_10041594023300005545Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELGLPDGRTLRFHVACAAAWRRLRDVLPLLRR*
Ga0070696_10012620033300005546Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAIADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQAK*
Ga0075023_10007977233300006041WatershedsMNAASRRVAEKIAARLLPAVADPVSEWEGFGIGLHCDGCDAPVLPSEAQHEPRLPDGRTLRFHAPCAAAWHRLRRVLPLLRR*
Ga0075024_10021216423300006047WatershedsMRIKTGSQRIAEKLAARLLPGTESSAMWEGFGIGLRCDGCDQPVLPSERQHDLVLPDGRTIRFHASCAAVWRRLKAVLPLLRR*
Ga0075024_10024165113300006047WatershedsIAARLLPAVADPVSEWEGFGIGLHCDGCDAPVLPSEAQHEPRLPDGRTLRFHAPCAAAWHRLRRVLPLLRR*
Ga0075028_10000642643300006050WatershedsMNAASQRVAEKIAARLLPAVADPVSEWEGFGIGLHCDGCDAPVLPSEAQHEPRLPDGRTLRFHAPCAAAWHRLRRVLPLLRR*
Ga0075018_1018204213300006172WatershedsRVAEKIAARLLPAVADPVSEWEGFGIGLHCDGCDAPVLPSEAQHEPRLPDGRTLRFHAPCAAAWHRLRRVLPLLRR*
Ga0075018_1063029923300006172WatershedsMDTASQRIAEKIAARILPAVADPVSVWEGSGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCATAWRRLRNVLPLLRRQMK*
Ga0070765_10016528223300006176SoilMDAPGQRIAEKLAARILPGYMEPVTTWEGFGIGLHCDGCDLPVLPSESQVETTMPDGRTIRFHAPCAAVWRRLKEVLILLRR*
Ga0097621_10142449223300006237Miscanthus RhizosphereMDTASQRIAEKIAARILPAVAEPVSVWEGFGIGLHCDGCDLPVLPSEPQQELGLPDGRTLRFHVACAAAWRRLRDVLPLLRR*
Ga0097621_10234711733300006237Miscanthus RhizosphereARADPGVMETRTMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0079222_1004754733300006755Agricultural SoilMNTASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVPCATDWHRLRGVLPLLRR*
Ga0079222_1049513313300006755Agricultural SoilRADPGLMETRPMDPASQRVAEKLAARLLPGTEPTSTWEGFGLGLVCDGCDRPVLPSERQREMVLPDGRTLRFHAPCADAWRRLKGVLPLLRR*
Ga0079221_1041106323300006804Agricultural SoilMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLL
Ga0079221_1088373713300006804Agricultural SoilSDVPGPMNPASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVPCATDWHRLRGVLPLLRR*
Ga0075433_1000755273300006852Populus RhizosphereMNTASRRIAEKIAARLLPAIADPVRVWEGFGIGLLCDGCDLPILPSEAQQDLGLPDGRTLRFHVPCATDWHRLRGVLPLLRR*
Ga0075425_10023749323300006854Populus RhizosphereMNPASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVPCATDWRRLRGVLPLLRR*
Ga0075434_10127898123300006871Populus RhizosphereMNPASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVSCATDWRRLRGVLPLLRR*
Ga0075429_10198030933300006880Populus RhizosphereEKLAARLLPGVEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0075436_10016093643300006914Populus RhizosphereMNPASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAEQDLGLPDGRTLRFHVPCATDWRRLRGVLPLLRR*
Ga0075436_10091171723300006914Populus RhizosphereMNTASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVLCATDWHRLRGVLPLLRR*
Ga0079219_1139701223300006954Agricultural SoilMDAASQRVAEKLAARLLPGTEPTSTWEGFGLGLVCDGCDRPVLPSERQREMVLPDGRTLRFHAPCADTWRRLKRVLPLLRR*
Ga0099794_1031327713300007265Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQHELGLPDGRTLRFHASCAAAWRRLRDILPLLRRQMK*
Ga0099829_1044330833300009038Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQQELGLPDGRTLRFHASCAAAWRRLRDVLPLLRR*
Ga0099792_1005276633300009143Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQHELGLPDGRTLRFHASCAAAWRRLRDVLPLLRR*
Ga0099792_1016652733300009143Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTDLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK*
Ga0075423_1003817883300009162Populus RhizosphereMNTASRRIAEKIAARLLPAIADPVRVWEGFGIGLLCDGCDLPILPSEAQQDLGLPDGRTLRFHVLCATDWHRLRGVLPLLRR*
Ga0075423_1014328933300009162Populus RhizosphereMETRTMDPASQRVAEKLAARLLPGVEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0105242_1093193913300009176Miscanthus RhizosphereVDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK*
Ga0134122_1103107123300010400Terrestrial SoilVDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRR*
Ga0134122_1260573913300010400Terrestrial SoilMNPASRRIAEKIAARLLPAIADPVRAWEGFGIGLPCDGCDLPILPGEAQQDLGLPDGRTLRFHAPCATDWHRLRRVLPLLRRHEVGEQEAGGKR*
Ga0137392_1042297213300011269Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQHELGLPDGRTLRFHASCAAAWRRL
Ga0137389_1082030913300012096Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQQELGLPDGRTLRFHASCAAAWRRLRDILPLLRRQMK*
Ga0137363_1005866533300012202Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTDLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQVK*
Ga0137399_1001297633300012203Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTDLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRR*
Ga0137362_1007687433300012205Vadose Zone SoilMDTASRRIAEKIAARILPAIADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQPK*
Ga0137362_1033242113300012205Vadose Zone SoilKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQQELGLPDGRTLRFHASCAVAWRRLRDVLPLLRR*
Ga0137390_1110975413300012363Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCA
Ga0137358_1023917523300012582Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWKGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQPK*
Ga0137396_1047874623300012918Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTDLHCDGCDLPVLPSEPQQELALPDGRPLRFHASGAAAWRRLRDILPLLRRQMK*
Ga0137359_1003544343300012923Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQVK*
Ga0137404_1163939823300012929Vadose Zone SoilMDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRGVLPLLRRQMK*
Ga0164303_1146291613300012957SoilMDTASQRIAEKIAARILPAIADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAARRQLRDVLPLLRRQAK*
Ga0164304_1025001433300012986SoilMDTASQRIAEKIAARILPAIADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK*
Ga0182007_1005350613300015262RhizospherePGVMETRTMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0137403_1033576123300015264Vadose Zone SoilKIAARILPAIADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQPK*
Ga0132258_1340221713300015371Arabidopsis RhizosphereMDVASQRIAEKLAARILPGTEPGATWRGFGIGLHCDGCDRPVLPSEEQVELVLPDGRTLRFHAPCADEWRRLKGVLPLLRR*
Ga0132255_10310987613300015374Arabidopsis RhizosphereRARADPGVMETRTMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR*
Ga0187824_1011993613300017927Freshwater SedimentAGDGSLKERAMDAAAQRIAEKLAARILPGTEPRATWRGFGIGLHCDGCDRPVLPSEEQLELVLPDGRTLRFHAPCAEEWRRLKGVLPLLRR
Ga0187825_1020008223300017930Freshwater SedimentMDAAAQRIAEKLAARILPGTEPRATWRGFGIGLHCDGCDRPVLPSEEQLELVLPDGRTLRFHAPCAEEWRRLKGVLPLLRR
Ga0187778_10007901103300017961Tropical PeatlandMSDSIESPGQRIAEKLAARLLPGSFEPTSTWTGFGIGLHCDGCDVPVLPSEPQVEFTLPDGRTIRFHAPCADMWRRLRSVLPLLRR
Ga0184634_1051676113300018031Groundwater SedimentMRAMDAAPQRIAEKIAARILPVIADPVSIWEGFGTGLHCDGCDLPILPSETQEELRLPDGRTLRFHAPCTASWRRLRGLLPLLRR
Ga0190265_1002708853300018422SoilMVGASIGSTGPMDAPSQRIAEKLASRILPGFEPTATWDGFGIGLHCDGCDQPILPSERQLELALPDGRTIRFHAPCAASWRRLKHVLPLLRR
Ga0193722_108854713300019877SoilMDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAGAWRRLRDVLPLLRRQMK
Ga0193718_108501213300019999SoilMDSAAQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK
Ga0193730_109064513300020002SoilMDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRR
Ga0193735_112493223300020006SoilMDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAGAWRR
Ga0193726_105953833300020021SoilMDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELELPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK
Ga0193724_108372233300020062SoilIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAGAWRRLRDVLPLLRRQMK
Ga0206356_1032661723300020070Corn, Switchgrass And Miscanthus RhizosphereMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR
Ga0179592_1022395423300020199Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTDLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK
Ga0210399_1090535613300020581SoilRHRSDVDTPAQRIAEKLASRMLPGYGEPGATWEGFGIGLVCDGCDQPVLPGHREVDVTLSDGRTIRFHAPCAATWRRLRDVLPLLRREP
Ga0210401_1083652423300020583SoilVDTPAQRIAEKLASRMLPGYGEPGATWEGFGIGLVCDGCDQPVLPGHREVDVTLSDGRTIRFHAPCAATWRRLRDVLPLLRREP
Ga0179596_1027081023300021086Vadose Zone SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQQELGLPDGRTLRFHASCAAAWRRLRDVLPLLRR
Ga0210404_1002610023300021088SoilVDTPAQRIAEKLASRMLLGYGEPGATWEGFGIGLVCDGCDQPVLPGHREVDVTLSDGRTIRFHAPCAATWRRLRDVLPLLRREP
Ga0210404_1008228533300021088SoilMDAPGQRIAEKLAARILPGYMEPVTTWEGFGIGLHCDGCDLPVLPSESQVETTMPDGRTIRFHAPCAAVWRRLKGVLILLRR
Ga0210394_1147689923300021420SoilMDAPGQRIAEKLAARILPGYLEPVTTWEGFGIGLHCDGCDLPVLPSESQVETTMPDGRTIRFHAPCAAVWRRLKGVLILLRR
Ga0210384_1001340463300021432SoilLAGKSRMDAPGQRIAEKLAARILPGYMEPVTTWEGFGIGLHCDGCDLPVLPSESQVETTMPDGRTIRFHAPCAAVWRRLKGVLILLRR
Ga0210384_1126278013300021432SoilMDTASQRIAEKIAARILPAIADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRQLRDVLPLLRRQ
Ga0222622_1117847523300022756Groundwater SedimentMDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRD
Ga0207653_1018940323300025885Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK
Ga0207647_1024377013300025904Corn RhizosphereIDDAIAQRLCWLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAQCADTWRRLKGVLPLLRR
Ga0207684_1001371353300025910Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQQELGLPDGRTLRFHASCAVAWRRLRDVLPLLRR
Ga0207684_1001988863300025910Corn, Switchgrass And Miscanthus RhizosphereMKTASRRIAEKIAARLLPAIADPVRIWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVPCATDWQRLRGVLPLLRR
Ga0207684_1003426563300025910Corn, Switchgrass And Miscanthus RhizosphereMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK
Ga0207693_1054404723300025915Corn, Switchgrass And Miscanthus RhizosphereMKTASRRIAEKIAARLLPAVADPVRVWEGFGIGLPCDGCDLPILPSEAQRDLGLPDGRTLRFHVPCATDWQRLRGVLPLLRR
Ga0207686_1106663623300025934Miscanthus RhizosphereVDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK
Ga0207711_1052844913300025941Switchgrass RhizosphereGRDRARADPGVMETRTMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR
Ga0207651_1198827913300025960Switchgrass RhizosphereETTMDPASQRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR
Ga0207668_1028176033300025972Switchgrass RhizosphereRVAEKLAARLLPGIEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR
Ga0207677_1085299223300026023Miscanthus RhizosphereMDPASQRVAEKLAARLLPGVEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR
Ga0207676_1007288713300026095Switchgrass RhizosphereARLLPGVEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVLPLLRR
Ga0209240_108533813300026304Grasslands SoilMDTASQRIAEKIAARILPAVADPVSVWKGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK
Ga0257166_101303313300026358SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTDLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLR
Ga0257163_104623923300026359SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQQELGLPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK
Ga0257152_100186053300026369SoilMDTASQRIAEKIAARILPAVADPVSVWKGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQV
Ga0257179_100544423300026371SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK
Ga0257167_101233913300026376SoilKIAARILPAVADPVSVWEGFGTDLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK
Ga0257155_101969433300026481SoilMDSASQRIAEKIAARILPAVADPVGVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK
Ga0257172_104613313300026482SoilSTWSAASRPRMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK
Ga0257159_101902913300026494SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQVK
Ga0257157_100854013300026496SoilMDSASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK
Ga0257168_102288633300026514SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEAQHELGLPDGRTLRFHASCAAAWRRLRDVLPLLRR
Ga0179587_1020555033300026557Vadose Zone SoilMDSASQRIAEKIAARILPAVADPVSVWEGFGTDLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK
Ga0209068_10001603143300027894WatershedsMNAASQRVAEKIAARLLPAVADPVSEWEGFGIGLHCDGCDAPVLPSEAQHEPRLPDGRTLRFHAPCAAAWHRLRRVLPLLRR
Ga0209068_1001161553300027894WatershedsMRIKTGSQRIAEKLAARLLPGTESSAMWEGFGIGLRCDGCDQPVLPSERQHDLVLPDGRTIRFHASCAAVWRRLKAVLPLLRR
Ga0308309_1012910023300028906SoilMDAPGQRIAEKLAARILPGYMEPVTTWEGFGIGLHCDGCDLPVLPSESQVETTMPDGRTIRFHAPCAAVWRRLKEVLILLRR
(restricted) Ga0255312_104136213300031248Sandy SoilMDAASQRIAEKLAARILPGTEPRATWLGFGIGLPCDGCDRPVLPSEEQVELVLPDGRTLRFHAPCGQEWRRLKGVLPLLRR
Ga0310813_1094525513300031716SoilMDAASQRIAEKLAARILPGTDPGATWRGFGIGLHCDGCDRPVLPSEEQVELVLPDGRTLRFHAACADEWRRLKGVLPLLRR
Ga0307469_1016396413300031720Hardwood Forest SoilMEAAPQRIAEKIAARILPVIADPVSIWKGFGTGLHCDGCDIPILPSEPQEELRLADGRTLRFHAPC
Ga0307469_1016582433300031720Hardwood Forest SoilMNLASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVTCATDWHRLRGVLPLLRR
Ga0307469_1038705133300031720Hardwood Forest SoilARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQKELALPDGRTLRFHASCAAAWRRLRDILPLLRRQMK
Ga0307469_1048359933300031720Hardwood Forest SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGTGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAA
Ga0307468_10094402613300031740Hardwood Forest SoilPRMAETGGYRYGTAGTVRAMEAAPQRIAEKIAARILPVIADPVSIWKGFGTGLHCDGCDIPILPSEPQEELRLADGRTLRFHAPCAVDWRRLRGVLPLLRRWLPNG
Ga0307475_1128181333300031754Hardwood Forest SoilMKTASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPIMPSEAQQDLGLPDGRTLRFHVPCATDWQRLRGVLPLLRR
Ga0307473_1001631553300031820Hardwood Forest SoilMKTASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVPCATDWQRLRGVLPLLRR
Ga0307473_1033181323300031820Hardwood Forest SoilVDTPAQRIGEKLASRMLPGYGEPGATWEGFGIGLVCDGCDQPVLPGHREVDVTLSGGRTIRFHAPCAAAWRRLRDVLPLLRREPRRG
Ga0307470_1005099223300032174Hardwood Forest SoilMDTASQRIAEKIAARILPAVADPVSVWEGFGIGLHCDGCDLPVLPSEPQQELALPDGRTLRFHASCAAAWRRLRDVLPLLRRQMK
Ga0307470_1046233823300032174Hardwood Forest SoilMEAAPQRIAEKIAARILPVIADPVSIWKGFGTGLHCDGCDIPILPSEPQEELRLADGRTLRFHAPCAVDWRRLRGVLPLLRRWLPNG
Ga0307471_10169337423300032180Hardwood Forest SoilMRRLARLLSDVPGPMNPASRRIAEKIAARLLPAIADPVRVWEGFGIGLPCDGCDLPILPSEAQQDLGLPDGRTLRFHVTCATDWHRLRGVLPLLRR
Ga0326729_100729233300033432Peat SoilMDAAAQRIAEKLAARILPGTEPGATWRGFGIGLHCDGCDRPALPSEEQVELVLPDGRTLRFHAPCADEWRRLKGVLPLLRR
Ga0326726_1025033133300033433Peat SoilMDVASQRIAEKLVARILPGTEPGATWRGFGIGLHCDGCDRPALPSEEQVELVLPDGRTLRFHAPCADEWRRLKGVLPLLRR
Ga0326730_101028823300033500Peat SoilMDVASQRIAEKLAARILPGTEPGATWRGFGIGLHCDGCDRPALPSEEQVELVLPDGRTLRFHAPCADEWRRLKGVLPLLRR
Ga0326730_101684633300033500Peat SoilMDAAAQRIAEKLAARILPGTEPGATWRGFGIGLHCDGCDRPVLPSEEQVELVLPDGRTLRFHAPCADEWRRLKGVLPLLRR
Ga0326723_0127657_705_9503300034090Peat SoilMDAAAQRIAEKLAARILPGTESGATWRGFGIGLHCDGCDRPVLPSEEQVELVLPDGRTLRFHAPCADEWRRLKGVLPLLRR
Ga0373959_0007256_1618_18603300034820Rhizosphere SoilMETRTMDPASQRVAEKLAARLLPGVEPTSTWEGFGLGLVCDGCDRPVLPSERQRELMLPDGRTLRFHAPCADTWRRLKGVL
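
Each row of the Family Sequences table above runs its four fields together without delimiters. The following is a minimal sketch of how one such row might be split back into fields, not an official NMPFamsDB or IMG/M parser. It assumes that the sample taxon ID is always exactly 10 digits (as in every row listed here), that the habitat is written in title case and ends with a lowercase letter, and that the sequence is all uppercase. It also treats a trailing `*` as a stop-codon marker, which is a common convention for translated sequences but is an assumption here. The names `ROW` and `parse_row` are hypothetical.

```python
import re

# Assumed row layout: optional "(restricted) " prefix, protein ID,
# 10-digit sample taxon ID, title-case habitat, uppercase sequence,
# optional trailing "*".
ROW = re.compile(
    r"^(?:\(restricted\)\s*)?"
    r"(?P<protein_id>\S+?)"                 # lazy: leaves the last 10 digits for the taxon ID
    r"(?P<taxon_id>\d{10})"
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"   # ends at the last lowercase letter
    r"(?P<sequence>[A-Z]+)"
    r"(?P<stop>\*?)$"
)


def parse_row(line):
    """Split one fused table row into a dict of fields, or return None."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    # Record whether the row carried a trailing "*" (assumed stop marker).
    fields["has_stop"] = fields.pop("stop") == "*"
    fields["restricted"] = line.lstrip().startswith("(restricted)")
    return fields


example = (
    "Ga0070741_10003263143300005529Surface Soil"
    "MESRALNPASQRVAEKLAARLLPGTEPTSTWEGFGLGLSCDGCDRPVLPSERQRELALPDGRTFSCCCTDSRRRT*"
)
parsed = parse_row(example)
```

Because the protein ID is matched lazily and the taxon ID must be followed by a capital letter, the split lands on the last 10 digits before the habitat. Rows that do not fit the assumed layout return `None` rather than a partial result.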




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice