NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F086249

Metagenome Family F086249

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F086249
Family Type Metagenome
Number of Sequences 111
Average Sequence Length 201 residues
Representative Sequence MASLRGFRTVTLALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Number of Associated Samples 93
Number of Associated Scaffolds 111

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 65.77 %
% of genes near scaffold ends (potentially truncated) 42.34 %
% of genes from short scaffolds (< 2000 bps) 67.57 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (54.054 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere
(20.721 % of family members)
Environment Ontology (ENVO) Unclassified
(25.225 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(46.847 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: Yes Secondary Structure distribution: α-helix: 43.54%    β-sheet: 14.35%    Coil/Unstructured: 42.11%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 111 Family Scaffolds
PF12867DinB_2 29.73
PF01042Ribonuc_L-PSP 9.01
PF00291PALP 3.60
PF12706Lactamase_B_2 3.60
PF03466LysR_substrate 3.60
PF00753Lactamase_B 3.60
PF00126HTH_1 1.80
PF07690MFS_1 0.90
PF02112PDEase_II 0.90
PF01757Acyl_transf_3 0.90
PF06114Peptidase_M78 0.90

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 111 Family Scaffolds
COG0251Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 familyDefense mechanisms [V] 9.01
COG5212cAMP phosphodiesteraseSignal transduction mechanisms [T] 0.90


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms54.05 %
UnclassifiedrootN/A45.95 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002245|JGIcombinedJ26739_100217459All Organisms → cellular organisms → Bacteria1804Open in IMG/M
3300005176|Ga0066679_10217783Not Available1222Open in IMG/M
3300005341|Ga0070691_10103217Not Available1418Open in IMG/M
3300005406|Ga0070703_10077197Not Available1126Open in IMG/M
3300005445|Ga0070708_100352841All Organisms → cellular organisms → Bacteria1386Open in IMG/M
3300005445|Ga0070708_100543574Not Available1096Open in IMG/M
3300005467|Ga0070706_100003824All Organisms → cellular organisms → Bacteria14709Open in IMG/M
3300005467|Ga0070706_100336964All Organisms → cellular organisms → Bacteria1406Open in IMG/M
3300005471|Ga0070698_100266920All Organisms → cellular organisms → Bacteria1643Open in IMG/M
3300005536|Ga0070697_100387394Not Available1211Open in IMG/M
3300005545|Ga0070695_100700911Not Available804Open in IMG/M
3300005546|Ga0070696_100398947Not Available1076Open in IMG/M
3300005875|Ga0075293_1003872Not Available1453Open in IMG/M
3300005878|Ga0075297_1007594Not Available999Open in IMG/M
3300005888|Ga0075289_1049791Not Available656Open in IMG/M
3300006049|Ga0075417_10391938Not Available686Open in IMG/M
3300006050|Ga0075028_100009505All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia3960Open in IMG/M
3300006163|Ga0070715_10116664All Organisms → cellular organisms → Bacteria1267Open in IMG/M
3300006173|Ga0070716_100185849Not Available1369Open in IMG/M
3300006755|Ga0079222_10558585Not Available859Open in IMG/M
3300006804|Ga0079221_11653804Not Available520Open in IMG/M
3300006852|Ga0075433_10026575All Organisms → cellular organisms → Bacteria → Proteobacteria4902Open in IMG/M
3300006854|Ga0075425_100237124All Organisms → cellular organisms → Bacteria2099Open in IMG/M
3300006854|Ga0075425_100403252All Organisms → cellular organisms → Bacteria1575Open in IMG/M
3300006871|Ga0075434_100500779All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia1235Open in IMG/M
3300006903|Ga0075426_10227716Not Available1354Open in IMG/M
3300007255|Ga0099791_10006423All Organisms → cellular organisms → Bacteria4843Open in IMG/M
3300007255|Ga0099791_10338851Not Available719Open in IMG/M
3300007265|Ga0099794_10003966All Organisms → cellular organisms → Bacteria5745Open in IMG/M
3300009038|Ga0099829_10051611All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria3049Open in IMG/M
3300009143|Ga0099792_10034453All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2370Open in IMG/M
3300009147|Ga0114129_10277556All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2240Open in IMG/M
3300011119|Ga0105246_11636612Not Available610Open in IMG/M
3300012189|Ga0137388_11095024Not Available733Open in IMG/M
3300012202|Ga0137363_10300163All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1318Open in IMG/M
3300012205|Ga0137362_10113136All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2296Open in IMG/M
3300012205|Ga0137362_10400847Not Available1188Open in IMG/M
3300012210|Ga0137378_11108840Not Available706Open in IMG/M
3300012361|Ga0137360_10071515All Organisms → cellular organisms → Bacteria2570Open in IMG/M
3300012362|Ga0137361_10041252All Organisms → cellular organisms → Bacteria3745Open in IMG/M
3300012685|Ga0137397_10009511All Organisms → cellular organisms → Bacteria6767Open in IMG/M
3300012923|Ga0137359_10369992All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1273Open in IMG/M
3300012923|Ga0137359_11112257Not Available676Open in IMG/M
3300012927|Ga0137416_10170802All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1721Open in IMG/M
3300012931|Ga0153915_10226218All Organisms → cellular organisms → Bacteria2063Open in IMG/M
3300015371|Ga0132258_12116949Not Available1414Open in IMG/M
3300015372|Ga0132256_102744793Not Available591Open in IMG/M
3300015373|Ga0132257_100580373Not Available1383Open in IMG/M
3300017930|Ga0187825_10073437All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1169Open in IMG/M
3300017936|Ga0187821_10227081Not Available724Open in IMG/M
3300017994|Ga0187822_10202495Not Available662Open in IMG/M
3300019881|Ga0193707_1016724All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2449Open in IMG/M
3300019890|Ga0193728_1061039All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1813Open in IMG/M
3300020002|Ga0193730_1030712All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1557Open in IMG/M
3300020021|Ga0193726_1043769All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2161Open in IMG/M
3300020579|Ga0210407_10035620All Organisms → cellular organisms → Bacteria3702Open in IMG/M
3300021086|Ga0179596_10358205Not Available733Open in IMG/M
3300021088|Ga0210404_10028467All Organisms → cellular organisms → Bacteria2477Open in IMG/M
3300021178|Ga0210408_10020718All Organisms → cellular organisms → Bacteria → Proteobacteria5258Open in IMG/M
3300021404|Ga0210389_10220889Not Available1480Open in IMG/M
3300021478|Ga0210402_10412904Not Available1253Open in IMG/M
3300025885|Ga0207653_10302288Not Available620Open in IMG/M
3300025905|Ga0207685_10291714Not Available804Open in IMG/M
3300025906|Ga0207699_10112914All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1744Open in IMG/M
3300025910|Ga0207684_10022945All Organisms → cellular organisms → Bacteria → Proteobacteria5329Open in IMG/M
3300025910|Ga0207684_10047626All Organisms → cellular organisms → Bacteria3635Open in IMG/M
3300025910|Ga0207684_10056847All Organisms → cellular organisms → Bacteria3319Open in IMG/M
3300025910|Ga0207684_10352764All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1266Open in IMG/M
3300025915|Ga0207693_10599696Not Available857Open in IMG/M
3300025922|Ga0207646_10010367All Organisms → cellular organisms → Bacteria → Acidobacteria9113Open in IMG/M
3300025922|Ga0207646_10069054All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria3156Open in IMG/M
3300025939|Ga0207665_10025183All Organisms → cellular organisms → Bacteria3925Open in IMG/M
3300026005|Ga0208285_1016687Not Available588Open in IMG/M
3300026285|Ga0209438_1025526All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1970Open in IMG/M
3300026340|Ga0257162_1000037All Organisms → cellular organisms → Bacteria7838Open in IMG/M
3300026351|Ga0257170_1030339Not Available728Open in IMG/M
3300026376|Ga0257167_1014069Not Available1099Open in IMG/M
3300026482|Ga0257172_1058725Not Available705Open in IMG/M
3300026482|Ga0257172_1105355Not Available518Open in IMG/M
3300026496|Ga0257157_1001563All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria3139Open in IMG/M
3300026507|Ga0257165_1005855All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1782Open in IMG/M
3300026508|Ga0257161_1003747All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2608Open in IMG/M
3300026514|Ga0257168_1005823All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2123Open in IMG/M
3300026514|Ga0257168_1018342Not Available1424Open in IMG/M
3300026551|Ga0209648_10013856All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria7043Open in IMG/M
3300026557|Ga0179587_10549488Not Available759Open in IMG/M
3300027645|Ga0209117_1022187All Organisms → cellular organisms → Bacteria2041Open in IMG/M
3300027894|Ga0209068_10058019All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1977Open in IMG/M
3300028047|Ga0209526_10010340All Organisms → cellular organisms → Bacteria → Proteobacteria6427Open in IMG/M
3300028828|Ga0307312_10326617All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1002Open in IMG/M
3300031716|Ga0310813_11778754Not Available578Open in IMG/M
3300031720|Ga0307469_10712042Not Available911Open in IMG/M
3300031720|Ga0307469_10815156Not Available857Open in IMG/M
3300031740|Ga0307468_100010443All Organisms → cellular organisms → Bacteria3607Open in IMG/M
3300031740|Ga0307468_100658181Not Available867Open in IMG/M
3300031820|Ga0307473_10291404All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1022Open in IMG/M
3300031820|Ga0307473_10334278Not Available967Open in IMG/M
3300031943|Ga0310885_10779051Not Available542Open in IMG/M
3300031962|Ga0307479_10017577All Organisms → cellular organisms → Bacteria6738Open in IMG/M
3300032174|Ga0307470_10121480Not Available1543Open in IMG/M
3300032180|Ga0307471_100207928All Organisms → cellular organisms → Bacteria1966Open in IMG/M
3300032180|Ga0307471_101129167All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria949Open in IMG/M
3300032180|Ga0307471_102749267Not Available624Open in IMG/M
3300032205|Ga0307472_100210635Not Available1487Open in IMG/M
3300032205|Ga0307472_100790375Not Available865Open in IMG/M
3300033412|Ga0310810_10247002All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1966Open in IMG/M
3300033432|Ga0326729_1004608All Organisms → cellular organisms → Bacteria2678Open in IMG/M
3300033433|Ga0326726_10269362All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1590Open in IMG/M
3300033486|Ga0316624_11326859Not Available657Open in IMG/M
3300033500|Ga0326730_1003341All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria3691Open in IMG/M
3300034817|Ga0373948_0094438Not Available698Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere20.72%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil16.22%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil13.51%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil11.71%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil6.31%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere6.31%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil3.60%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment2.70%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.70%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil2.70%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.80%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.80%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.80%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.80%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil0.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.90%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.90%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.90%
Rhizosphere SoilHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil0.90%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.90%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005875Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101EnvironmentalOpen in IMG/M
3300005878Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104EnvironmentalOpen in IMG/M
3300005888Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_80N_103EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300017994Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019890Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c1EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a1EnvironmentalOpen in IMG/M
3300020021Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c1EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026005Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 (SPAdes)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026340Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-AEnvironmentalOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300033500Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF7AN SIP fractionEnvironmentalOpen in IMG/M
3300034817Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_1Host-AssociatedOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10021745913300002245Forest SoilMIGPISGRLDAARLWRELRAMAFSRGFRMATLALLVGGLAGCAAEPRQPASPAQRFTLLEHVQVAGACILENGGVSARYAGFVERWRDVKRGRDLLHGSVSTIDRFVDPGRSGHRPPLSIQFGEIDVPPLSAGQLVLTGTSEGSQPSRATCTLDVTAREQPARTEGRLLASLLGRDPVSFPPRSVLSG
Ga0066679_1021778323300005176SoilMASSPVFRAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDAAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEARLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW*
Ga0070691_1010321723300005341Corn, Switchgrass And Miscanthus RhizosphereMARSAMTPALTLASLAWMLAACTADPGRAAAPARRFALLEHVQVTGACVPESGGPAARYSGFVERWHDAQKQRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLTADRLMLKGLSGGSPPSPATCTLDVTSREQPARHPGRLLAGVLDPRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL*
Ga0070703_1007719723300005406Corn, Switchgrass And Miscanthus RhizosphereTLLEHAQVAGACVLENGAASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0070708_10035284123300005445Corn, Switchgrass And Miscanthus RhizosphereMASLGGFRSVTLALLVGGLAGCAAEPRQPGSPAPRFTLLEHVQVAGACVLENGAASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGPRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPPRQEGRLLASLLGREPVSLPPRSVLSGAAVLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0070708_10054357413300005445Corn, Switchgrass And Miscanthus RhizosphereMAHWRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGREPVALPPRAAVSALALLIGIVAGVRLIRHQVRRSSASWLLGLALLLQLVSLLLS*
Ga0070706_100003824123300005467Corn, Switchgrass And Miscanthus RhizosphereMAHSRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGREPVALPPRAAVSALALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0070706_10033696423300005467Corn, Switchgrass And Miscanthus RhizosphereMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAEHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0070698_10026692013300005471Corn, Switchgrass And Miscanthus RhizosphereMASSPVFRAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDAAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEARLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSL
Ga0070697_10038739423300005536Corn, Switchgrass And Miscanthus RhizosphereVTLALLVGGLAGCAAEPRQPGSPAPRFTLLEHVQVAGACVLENGAASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGPRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPPRQEGRLLASLLGREPVSLPPRSVLSGAAVLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0070695_10070091113300005545Corn, Switchgrass And Miscanthus RhizosphereVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETAAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGHEPVALPPRAAVSALALLIGIVAGVRLIRHQVRRSSASWLLGLALLLQLVSLLLS*
Ga0070696_10039894723300005546Corn, Switchgrass And Miscanthus RhizosphereMAHWRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGHEPVALPPRAAVSALALLIGIVAGVRLIRHQVRRSSASWLLGL
Ga0075293_100387223300005875Rice Paddy SoilMARSAMTPALTLASLAWMLAACTADPGRPAAPARRFALLEHVQVTGACVPESGGPAARYSGFVERWHDAQKQRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLTADRLMLKGLSGGSPPSPATCTLDVTSREQPARHPGRLLAGVLDPRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL*
Ga0075297_100759423300005878Rice Paddy SoilMARSAMTPALTLASLAWMLAACTADPGRPAAPARRFALLEHVQVTGVCVPESGGPAARYSGFVERWYDAQKRRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLTADRLTLKGLSGGSPPSPATCTLDVTSREQPARHPGRLLAGVLDPRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL*
Ga0075289_104979123300005888Rice Paddy SoilRRLAACTADPGRAAAPARRFALLEHVQVTGACVPESGGPAARYSGFVERWHDAQKQRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLTADRLMLKGLSGGSPPSPATCTLDVTSREQPARHPGRLLAGVLDPRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL*
Ga0075417_1039193813300006049Populus RhizosphereRFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGEIDAAPLSAGQLSLTGVSEGADPRRATCTLTVTKREGPARREGKLLASLLGREPVALPPRAAVSVLALLIGIVAGVRLVRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0075028_10000950533300006050WatershedsMASSPICRAMALAALVGGLAGCAGEPRESAAPAPRFTLQEHVQVAGACVLENGGASARYAGFVERWHDVQKGRDLLHGSVSTVDRFVDPGRTGIRPPLSIQFGDSDVAPLTAARLVLGGVSEGSQPSRATCTLDVINREQPARPGGRLLASLLGRHPVPLPPRSVVSGAALLIGLVGGIRLIRHQVRRTSASWLLGLAFLLQLVSLFLW*
Ga0070715_1011666423300006163Corn, Switchgrass And Miscanthus RhizosphereMAHSRSRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGHEPVALPPRAAVSALALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0070716_10018584923300006173Corn, Switchgrass And Miscanthus RhizosphereMAHWRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGHEPVALPPRAAVSALALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0079222_1055858513300006755Agricultural SoilPAVGLAALVWLLSACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTIDRFADPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTKREGPARREGKLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLVRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0079221_1165380413300006804Agricultural SoilLRLEHVQVTGACAPENGGQAARYTGFIERWHDVQKRRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLTADHLVLKGLSGGSPPSPATCTLDVTSRERPARLAGRLASVLDPRSVVSGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL*
Ga0075433_1002657543300006852Populus RhizosphereMAHSRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTIDRFADPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTKREGPARREGKLLASLLGREPVALPPRAAVSVLALLIGIVAGVRLVRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0075425_10023712423300006854Populus RhizosphereMAHSRRRPVVRPVVGLAALVWLLAACSTEPRQPGSPGARFTLLEHVQVSGACALDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGEIDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPPRPEGKLLASLLGREPIALPPRAAVSAAALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0075425_10040325233300006854Populus RhizosphereMAHSRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTIDRFADPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTKREGPARREGKLLASLLGREPVALPPRAAVSVLALLIGIVAGVRLVRHQVRRTSASWLLGL
Ga0075434_10050077913300006871Populus RhizosphereMAHSRRRPVVRPVVGLAALVWLLAACSTEPRQPGSPGARFTLLEHVQVSGACALDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGEIDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPPRPEGKLLASLLGREPIALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0075426_1022771623300006903Populus RhizosphereMAHSRRRPVVRPVVGLAALVWLLAACSTEPRQPGSPGARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGEIDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPPRPEGKLLASLLGREPIALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0099791_1000642343300007255Vadose Zone SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGAVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0099791_1033885113300007255Vadose Zone SoilGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW*
Ga0099794_1000396653300007265Vadose Zone SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGAVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0099829_1005161123300009038Vadose Zone SoilMASSPVFRAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW*
Ga0099792_1003445333300009143Vadose Zone SoilMASSPVFRAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASAHYAGFVERWRDVRKGRVFLNGSVSTVDRFVDPGRTGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW*
Ga0114129_1027755623300009147Populus RhizosphereVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTIDRFADPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTKREGPARREGKLLASLLGREPVALPPRAAVSVLALLIGIVAGVRLVRHQVRRTSASWLLGLALLLQLVSLLLS*
Ga0105246_1163661223300011119Miscanthus RhizosphereQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGREPVSLPPRSVLGGAALLIGLVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLW*
Ga0137388_1109502423300012189Vadose Zone SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGAVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIR
Ga0137363_1030016323300012202Vadose Zone SoilMASSPVFQAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDAAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW*
Ga0137362_1011313623300012205Vadose Zone SoilMASSPVFRAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDAAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW*
Ga0137362_1040084723300012205Vadose Zone SoilGMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARPEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0137378_1110884013300012210Vadose Zone SoilAMASSPVFRAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDAAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEARLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW*
Ga0137360_1007151513300012361Vadose Zone SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPAVRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGNRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARPEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0137361_1004125243300012362Vadose Zone SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPAVRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGNRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0137397_1000951193300012685Vadose Zone SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGPAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0137359_1036999223300012923Vadose Zone SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0137359_1111225713300012923Vadose Zone SoilSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDAAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEARLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW*
Ga0137416_1017080213300012927Vadose Zone SoilMAFSRGFRMATLALLVGGLAGCAAEPRQPASPAQRFMLLEHVQVAGACILENGGVSARYAGFVERWRDVKRGRDLLHGSVSTIDRFVDPGRSGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG*
Ga0153915_1022621833300012931Freshwater WetlandsMTPALPLAVLVWMLAACTAEPRQAVAPARRFTLLEHVQVTGACVPENGGQAARYTGFIERWHDVQKKRELLHGSVSNFDRFIDPGRPGDRPPPSIHFGETDAAPLTADHLVLKGLSGGSQPSPATCTLDVTSREQPARRQGRLLASILDPRSAVSGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL*
Ga0132258_1211694923300015371Arabidopsis RhizosphereMIRMLTLAALGWTLAACTAEPGRAAAPVRRFTLLEHVQVTGACVPDNGGQSARYTGFVERWHDVQKRRDLLHGSVANFDRFIDPGRPGDRPPPSIQFGETDATPLAADHLVLKGLSGGSPPSPVTCTLDVTSRERPARLAGRLAGVPDPRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL*
Ga0132256_10274479313300015372Arabidopsis RhizosphereMARSAMTRMLTLAALGWTLAACTAEPGRAAAPVRRFTLLEHVQVTDACVPDNGGQSARYTGFVERWHDVQKRRDLLHGSVANFDRFIDPGRPGDRPPPSIQFGETDATPLAADHLVLKGLSGGSPPSPVTCTLDVTSRERPARLAGRLAGVPDPRSAVGGAALLVGLGAG
Ga0132257_10058037323300015373Arabidopsis RhizosphereMIRMLTLAALGWTLAACTAEPGRAAAPVRRFTLLEHVQVTGACVPDNGGQSARYTGFVERWHDVQKRRDLLHGSVANFDRFIDPGRPGDRPPPSIQFGETDATPLAADHLVLKGLSGGSPPSPVTCTLDVTSRERPARLAGRLAGVPDPRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALL
Ga0187825_1007343713300017930Freshwater SedimentMARSAMTRMLTLAALGWTLAACTAEPRQAVAPARRFTLLEHVQVTGACAPENGGQAARYTGFIERWHDVQKRRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLATDHLVLKGLAGGSPPSPATCTLDVTSRERPARLAGRLASVLDLRSVVSGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL
Ga0187821_1022708113300017936Freshwater SedimentMARSAMTRMLTLATLGWMLAACTAEPRQAAAPARRFTLLEHVQVTGACAPENGGQAARYTGFIERWHDVQKRRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLTADHLVLKGLSGGSPPSPATCTLDVTSRERPARLAGRLASVLDPRSVVSGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL
Ga0187822_1020249513300017994Freshwater SedimentMARSAMTRMLTPAALGWTLAACTAEPRQAVAPARRFTLLEHVQVTGACAPENGGQAARYTGFIERWHDVQKRRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLAADHLVLKGLSGGSPPSPATCTLDVTSRERPARLAGRLASVLDPRSAVSGAALLVGLVAGIRLIRHQVR
Ga0193707_101672423300019881SoilMVTLALLVGGLAGCAAEPRQPASPAQRFTLLEHVQVAGACILESGGVSARYAGFVERWRDVKRGRDLLHGSVSTIDRFVDPGRSGHRPPLSIQFGEIDVPPLSAGQLVLTGTSEGSQPSRATCTLDVTAREQPARTEGRLLASLLGREPVSFPPRSVLSGAALLIGLVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0193728_106103933300019890SoilVRAMAFSRGFWVVRLALLVGGLAGCAAEPGQPVSPAQRFTLLEHVQVVGACVLESGGVSARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGETDVAPLSAGQLVLTGTSEGSQPSRATCTLDVTAREQPARTEGRLLASLLGREPLSFPPRSVLSGAALLIGLVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0193730_103071233300020002SoilLAGCAAEPGQPVSPAQRFTLLEHVQVVGACVLESGGVSARYAGFVERWRDVRKGRDLLHGSVSTVDRFADPGRTGHRPPLSIQFGETDVAPLSAGQLVLTGTSEGSQPSRATCTLDVTAREQPARTEGRLLASLLGREPLSFPPRSVLSGAALLIGLVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0193726_104376943300020021SoilMVTLALLVGGLAGCAAEPRQPASPAQRFTLLEHVQVAGACVLESGGVSARYAGFVERWRDVKRGRDLLHGSVSTVDRFVDPGRGGHRPPLSIQFGEIDVAPLSAGQLVLTGTSEGSQPSRATCTLDVIARENPERQDGRLLASLLGREPLSLPPRSVLSGAALLIGVVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0210407_1003562033300020579SoilMAHSGRRRVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAAQLSLTGVSEGADPRRATCTLTVTNREGPARPEGRLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0179596_1035820513300021086Vadose Zone SoilAVRVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW
Ga0210404_1002846743300021088SoilLLAACSTEPRQPGSPAARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAAQLSLTGVSEGADPRRATCTLTVTNREGPARPEGRLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0210408_1002071843300021178SoilMAHSRRQPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAAQLSLTGVSEGADPRRATCTLTVTNREGPARPEGRLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0210389_1022088923300021404SoilMAHSGRRRVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAAQLSLTGVSEGADPRRATCTLTVTNREGPARPEGRLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLG
Ga0210402_1041290423300021478SoilMAHSGRRRVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAAQLSLTGVAEGADPRRATCTLTVTNREGPARPEGRLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0207653_1030228813300025885Corn, Switchgrass And Miscanthus RhizosphereRVGDDQFAAAVAVQVPKPHAQVAGACVLENGAASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0207685_1029171413300025905Corn, Switchgrass And Miscanthus RhizosphereMAHSRSRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGHEPVALPPRAAVSALAL
Ga0207699_1011291413300025906Corn, Switchgrass And Miscanthus RhizosphereQPGSPAARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAAQLSLTGVSEGADPRRATCTLTVTNREGPARPEGRLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0207684_1002294573300025910Corn, Switchgrass And Miscanthus RhizosphereMASSPVFRAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDAAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEARLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW
Ga0207684_1004762623300025910Corn, Switchgrass And Miscanthus RhizosphereMASLGGFRSVTLALLVGGLAGCAAEPRQPGSPAPRFTLLEHVQVAGACVLENGAASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGPRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPPRQEGRLLASLLGREPVSLPPRSVLSGAAVLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0207684_1005684733300025910Corn, Switchgrass And Miscanthus RhizosphereMAHSRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGREPVALPPRAAVSALALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0207684_1035276423300025910Corn, Switchgrass And Miscanthus RhizosphereMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAEHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0207693_1059969623300025915Corn, Switchgrass And Miscanthus RhizosphereMAHWRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0207646_1001036713300025922Corn, Switchgrass And Miscanthus RhizosphereRAVALVVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDAAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEARLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW
Ga0207646_1006905423300025922Corn, Switchgrass And Miscanthus RhizosphereMASLGGFRSVTLALLVGGLAGCAAEPRQPGSPAPRFTLLEHVQVAGACVLENGAASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPPRQEGRLLASLLGREPVSLPPRSVLSGAAVLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0207665_1002518353300025939Corn, Switchgrass And Miscanthus RhizosphereMAHWRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGHEPVALPPRAAVSALALLIGIVAGVRLIRHQVRRSSASWLLGLALLLQLVSLLLS
Ga0208285_101668713300026005Rice Paddy SoilMARSAMTPALTLASLAWMLAACTADPGRPAAPARRFALLEHVQVTGACVPESGGPAARYSGFVERWHDAQKQRDLLHGSVSNFDRFIDPGRPGDRPLPSIQFGETDATPLTADRLMLKGLSGGSPPSPATCTLDVTSREQPARHPGRLLAGVLDPRSAVGGAALLVGLVAGIRLIRHQV
Ga0209438_102552633300026285Grasslands SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGAVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0257162_100003783300026340SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGAVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0257170_103033913300026351SoilLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGAVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0257167_101406923300026376SoilMASLRGFRTVTLALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0257172_105872513300026482SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGAVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQL
Ga0257172_110535513300026482SoilLAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGRDPVSLPPRSVLSGAALLIGMVAGIRLIR
Ga0257157_100156313300026496SoilMIGPISGRLDAARLWRELRAMAFSRGFRTVTLALLVGGLAGCAAEPRQPAAPAQRFTLLEHVQVAGACVLESGGVSARYAGFVERWRDVKRGRDLLHGSVSTIDRFVDPGRSGHRPPLSIQFGEIDVPPLSAGQLVLTGTSEGSQPSRATCTLDVTAREQPARTEGRLLASLLGREPVSFPPRSVLSGAALLIGLV
Ga0257165_100585513300026507SoilEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGPAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0257161_100374733300026508SoilMIGPISGRLDAARLWRELRAMAFSRGFRTVTLALLVGGLAGCAAEPRQPAAPAQRFTLLEHVQVAGACVLESGGVSARYAGFVERWRDVKRGRDLLHGSVSTIDRFVDPGRSGHRPPLSIQFGEIDVPPLSAGQLVLTGTSEGSQPSRATCTLDVTAREQPARTEGRLLASLLGREPVSFPPRSVLSGAALLIGLVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLIG
Ga0257168_100582323300026514SoilMASLRGFRTVALALLVGGLAGCAAEPRQPGSPALRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARPEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0257168_101834223300026514SoilMASSPGFRAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW
Ga0209648_1001385693300026551Grasslands SoilMASSPVFRAVALAVLVGGLAGCAAEPRQSASPGPRFTLLEHVQVAGACVLESGGASARYAGFVERWRDVRKGRDLLHGSVSTVDRFVDPGRTGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVTNREQPARQEGRLLASLLGRDPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLAFLLQLVSLLLW
Ga0179587_1054948823300026557Vadose Zone SoilRFMLLEHVQVAGACILENGGVSARYAGFVERWRDVKRGRDLLHGSVSTIDRFVDPGRSGHRPPLSIQFGEIDVPPLSAGQLVLTGTSEGSQPSRATCTLDVTAREQPARTEGRLLASLLGRDPVSFPPRSVLSGAGLLIGLVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0209117_102218733300027645Forest SoilMIGPISGRLDAARLWRELRAMAFSRGFRTVTLALLVGGLAGCAAEPRQPAAPAQRFTLLEHVQVAGACVLESGGVSARYAGFVERWRDVKRGRDLLHGSVSTIDRFVDPGRSGHRPPLSIQFGDIDVPPLSAGQLVLTGTSEGSQPSRATCTLDVTAREQPARTEGRLLASLLGREPVSFPPRSVLSGAALLIGLVAG
Ga0209068_1005801913300027894WatershedsMASSPICRAMALAALVGGLAGCAGEPRESAAPAPRFTLQEHVQVAGACVLENGGASARYAGFVERWHDVQKGRDLLHGSVSTVDRFVDPGRTGIRPPLSIQFGDSDVAPLTAARLVLGGVSEGSQPSRATCTLDVINREQPARPGGRLLASLLGRHPVPLPPRSVVSGAALLIGLVGGIRLIRHQVRRTSASWLLGLAFLLQLVSLFLW
Ga0209526_1001034063300028047Forest SoilMIGPISGRLDAARLWRELRAMAFSRGFRMATLALLVGGLAGCAAEPRQPASPAQRFTLLEHVQVAGACVLESGGVSARYAGFVERWRDVKRGRDLLHGSVSTIDRFVDPGRSGHRPPLSIQFGEIDVPPLSAGQLVLTGTSEGSQPSRATCTLDVTAREQPARTEGRLLASLLGRDPVSFPPRSVLSGAALLIGLVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0307312_1032661723300028828SoilMAFSRGFRTVTLALLVGGLAGCAAEPRQPASPAQRFTLLEHVQVAGACVLESGGVSARYAGFVERWRDVKRGRDLLHGSVSTVDRFVDPGRGGHRPPLSIQFGEIDVAPLSAGQLVLTGTSEGSQPSRATCTLDVVARENPARQDGRLLASLLGREPLSLPPRSVLSAAALLIGLVAGVRLIRHQVRRTSASWLLGLALLLQLVALLLG
Ga0310813_1177875413300031716SoilMASSTMARMLILAALGWMLAACTAEPGRAAAPARRFTLLEHVQVTGACVPDNGGQSARYTGFVERWHDVQKRRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATSLTADHLVLKGLSGGSQPSPVTCTLDVTSREQAPRRQGRMMAGFLDVRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGL
Ga0307469_1071204213300031720Hardwood Forest SoilMAPSHRQPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAAQLSLTGVSEGADPRRATCTLTVTNREGPARPEGRLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0307469_1081515613300031720Hardwood Forest SoilMAHSRRRPVVAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPGGKLLASLLGREPVALPPRAAVSALALLIGIVAGVRLIRHQVRRSSASWLLGLALLLQ
Ga0307468_10001044353300031740Hardwood Forest SoilMAHSRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGHEPVALPPRAAVSALALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0307468_10065818113300031740Hardwood Forest SoilMAPSHRQPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAAQLSLTGVSEGADPRRATCTLTVTNREGPARPEGRLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLV
Ga0307473_1029140413300031820Hardwood Forest SoilMAHSRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGREPVALPPRAAVSALALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0307473_1033427823300031820Hardwood Forest SoilMASLGGFRSVTLALLVGGLAGCAAEPRQPGSPAPRFTLLEHAQVAGACVLENGAASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0310885_1077905113300031943SoilTADPGRAAAPARRFALLEHVQVTGACVPESGGPAARYSGFVERWHDAQKQRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLTADRLMLKGLSGGSPPSPATCTLDVTSREQPARHPGRLLAGVLDPRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL
Ga0307479_1001757793300031962Hardwood Forest SoilMAHSRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGREPAALPPRAAVSALALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0307470_1012148023300032174Hardwood Forest SoilMAHWRRRPVVRPAVGLAALVWLLAACSTEPRQPGSPAARFTLLEHVQVAGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPGRAEHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPLSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0307471_10020792833300032180Hardwood Forest SoilMASLGGFRSVTLALLVGGLAGCAAEPRQPGSPAPRFTLLEHAQVAGACVLENGAASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLLGLALLLQLVSLLLG
Ga0307471_10112916723300032180Hardwood Forest SoilWLLSACSTEPRQPGSPAARFMLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0307471_10274926713300032180Hardwood Forest SoilRQPGSPAARFTLLEHVQVSGACVLDDGGGSARYAGFVERWQDVRKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAAQLSLTGVSEGADPRRATCTLTVTNREGPARPEGRLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0307472_10021063523300032205Hardwood Forest SoilMAHSRRRPVVRPAVGLAALVWLLSACSTEPRQPGSPTARFMLLEHVQVAGACVLDDGGGSARYAGFVERWQDVKKGKSLLHGSVSTVDRFVDPPRSGHRPPLSIQFGETDAAPLSAGQLSLTGVSEGADPRRATCTLTVTNREGPARPEGKLLASLLGREPVALPPRAAVSAVALLIGIVAGVRLIRHQVRRTSASWLLGLALLLQLVSLLLS
Ga0307472_10079037513300032205Hardwood Forest SoilMASLGGFRSVTLALLVGGLAGCAAEPRQPGSPAPRFTLLEHAQVAGACVLENGAASARYAGFVERWRDVKKGRDLLHGSVSTVDRFVDPGRAGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLDVIAREQPARQEGRLLASLLGREPVSLPPRSVLSGAALLIGLVAGIRLIRHQVRRTSASWLL
Ga0310810_1024700233300033412SoilMASSTMARMLILAALGWMLAACTAEPGRAAAPARRFTLLEHVQVTGACVPDNGGQSARYTGFVERWHDVQKRRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATSLTADHLVLKGLSGGSQPSPVTCTLDVTSREQAPRRQGRMMAGFLDVRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL
Ga0326729_100460843300033432Peat SoilMARYAVARTLTLAALGCMLAACTAEPGPAAAPARRFTLLEHVQVTGACVPENGGQSARYTGFIERWHDVQKRRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLTADHLVLKGLSGGSQPSPATCTLDVTSREQPARQHGRLLAGLLDPRSAVSGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL
Ga0326726_1026936233300033433Peat SoilMAPYAVARTLTLAALGWMLAACTAAPGPAAAPARRFTLLEHVQVTGACIPENGGQSARYTGFIERWHDVQKRRDLLHGSVSNFDRFIDPGRPGDRPPPSIQFGETDATPLTADHLVLKGLSGGSQPSPATCTLDVTSREQPARQHGRLLAGLLDPRSAVSGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL
Ga0316624_1132685913300033486SoilMARSAMTPTLTLAALAGMLAACTADPGRAVTPARRFALLEHVQVTGACVGEGGGAAARYTGFVERWHDAQTRRDLLHGSVSNFDRFIDPGRPGDRPPPSILFGETDARPLTANHLVLKGLSGGSPPSQATCTLDVTSRGQPARQPGRLLAGGLDPRSAVGGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL
Ga0326730_100334123300033500Peat SoilMAPYAVARTLTLAALGWMLAACTAAPGPAAAPARRFTLLEHVQVTGACVPENGGQSARYTGFIERWHDVQKRRDLLHGSVANFDRFIDPGRPGDRPPPSIQFGETDATPLTADHLVLKGLSGGSQPSPATCTLDVTSREQPARQHGRLLAGLLDPRSAVSGAALLVGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLL
Ga0373948_0094438_86_6973300034817Rhizosphere SoilFRTLTLALVVGGLAGCAAETPPPASSAQRFTLLEHVQVAGACVLENGGASARYAGFVERWRDVKKGRDLLHGSVSTVERFVDPGRTGHRPPLSIQFGEIDVAPLTAGQLVLTGVSEGSQPSRATCTLEVTNRERPARQEGRLLASLLGREPVSLPARSMLSVAALLIGLVAGIRLIRHQVRRTSAPWLLGLALLLQLVALLLW


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.