NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F074055



Overview

Basic Information
Family ID: F074055
Family Type: Metagenome
Number of Sequences: 120
Average Sequence Length: 169 residues
Representative Sequence: MVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Number of Associated Samples: 102
Number of Associated Scaffolds: 120
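
The representative sequence can be compared against the family's average length with a few lines of Python. This is a minimal sketch: the sequence is copied from the record above, and the `summarize` helper is a hypothetical name introduced for illustration, not part of any NMPFamsDB tooling.

```python
from collections import Counter

# Representative sequence copied from the Basic Information block,
# split into chunks only for readability.
REPRESENTATIVE = (
    "MVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQ"
    "AHPETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFK"
    "APVPVIPDTNTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKAFAACLTRRGYTVTDVTRLK"
)

# Average sequence length reported for the family.
FAMILY_AVERAGE_LENGTH = 169


def summarize(seq, average=FAMILY_AVERAGE_LENGTH):
    """Return the length, its offset from the family average,
    and the three most frequent residues of a protein sequence."""
    counts = Counter(seq)
    return {
        "length": len(seq),
        "delta_vs_average": len(seq) - average,
        "most_common": counts.most_common(3),
    }


summary = summarize(REPRESENTATIVE)
print(summary)
```

The representative is only one of the 120 members, so its length need not match the reported average.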

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 57.14%
% of genes near scaffold ends (potentially truncated): 37.50%
% of genes from short scaffolds (< 2000 bps): 70.00%
Associated GOLD sequencing projects: 93
AlphaFold2 3D model prediction: No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group: Bacteria (70.000% of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (21.667% of family members)
Environment Ontology (ENVO): Unclassified (25.833% of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (47.500% of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution:
α-helix: 30.56%
β-sheet: 10.00%
Coil/Unstructured: 59.44%
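
The secondary-structure percentages can be converted back into approximate residue counts. The sketch below assumes, which the page does not state, that the distribution was computed over the representative sequence, whose length (180 residues) was counted from the Basic Information block. The function name is illustrative.

```python
# Assumption: the prediction covers the 180-residue representative sequence.
REPRESENTATIVE_LENGTH = 180

# Percentages as reported in the Structure & Topology section.
SECONDARY_STRUCTURE = {
    "alpha_helix": 30.56,
    "beta_sheet": 10.00,
    "coil_unstructured": 59.44,
}


def residues_per_state(percentages, length):
    """Convert per-state percentages into rounded residue counts."""
    return {state: round(pct * length / 100) for state, pct in percentages.items()}


residue_counts = residues_per_state(SECONDARY_STRUCTURE, REPRESENTATIVE_LENGTH)
print(residue_counts)
```

If the rounded counts add up to the assumed length, the assumption is at least consistent with the reported figures.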

Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 120 Family Scaffolds
PF02653 | BPD_transp_2 | 29.17
PF02321 | OEP | 14.17
PF17186 | Lipocalin_9 | 5.83
PF07143 | CrtC | 1.67
PF12704 | MacB_PCD | 1.67
PF02687 | FtsX | 0.83
PF03916 | NrfD | 0.83
PF12399 | BCA_ABC_TP_C | 0.83
PF00005 | ABC_tran | 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 120 Family Scaffolds
COG1538 | Outer membrane protein TolC | Cell wall/membrane/envelope biogenesis [M] | 28.33
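
Because the frequencies above are percentages of the 120 family scaffolds, each one corresponds to a whole number of scaffolds. A minimal sketch of that conversion follows; the dictionary and function names are illustrative, and the values are copied from the two tables above.

```python
# Total number of scaffolds in the family, as stated in the table headers.
FAMILY_SCAFFOLDS = 120

# Domain or COG identifier -> % frequency, copied from the tables above.
NEIGHBOR_FREQUENCY = {
    "PF02653": 29.17,
    "PF02321": 14.17,
    "PF17186": 5.83,
    "PF07143": 1.67,
    "PF12704": 1.67,
    "PF02687": 0.83,
    "PF03916": 0.83,
    "PF12399": 0.83,
    "PF00005": 0.83,
    "COG1538": 28.33,
}


def scaffold_count(percent, total=FAMILY_SCAFFOLDS):
    """Recover the number of scaffolds behind a rounded percentage."""
    return round(percent * total / 100)


neighbor_counts = {name: scaffold_count(pct) for name, pct in NEIGHBOR_FREQUENCY.items()}
for name, count in neighbor_counts.items():
    print(f"{name}: {count} of {FAMILY_SCAFFOLDS} scaffolds")
```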



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 70.83%
Unclassified | root | N/A | 29.17%


Associated Scaffolds


Scaffold | Taxonomy | Length | IMG/M Link
3300002245|JGIcombinedJ26739_100590076All Organisms → cellular organisms → Bacteria988Open in IMG/M
3300005341|Ga0070691_10021810All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales2967Open in IMG/M
3300005406|Ga0070703_10011990All Organisms → cellular organisms → Bacteria2454Open in IMG/M
3300005436|Ga0070713_100973188All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium818Open in IMG/M
3300005439|Ga0070711_101472056Not Available594Open in IMG/M
3300005440|Ga0070705_100196117All Organisms → cellular organisms → Bacteria1380Open in IMG/M
3300005445|Ga0070708_100441483All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1227Open in IMG/M
3300005445|Ga0070708_100943594Not Available809Open in IMG/M
3300005445|Ga0070708_101248689All Organisms → cellular organisms → Bacteria694Open in IMG/M
3300005467|Ga0070706_100372479All Organisms → cellular organisms → Bacteria1330Open in IMG/M
3300005467|Ga0070706_101003980Not Available769Open in IMG/M
3300005468|Ga0070707_100457092All Organisms → cellular organisms → Bacteria1238Open in IMG/M
3300005468|Ga0070707_100537009All Organisms → cellular organisms → Bacteria1131Open in IMG/M
3300005545|Ga0070695_100022535All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales3864Open in IMG/M
3300005875|Ga0075293_1003842All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1457Open in IMG/M
3300005876|Ga0075300_1008323Not Available1128Open in IMG/M
3300005879|Ga0075295_1014537All Organisms → cellular organisms → Bacteria868Open in IMG/M
3300005883|Ga0075299_1027511All Organisms → cellular organisms → Bacteria → Terrabacteria group → unclassified Terrabacteria group → Terrabacteria group bacterium ANGP1597Open in IMG/M
3300006041|Ga0075023_100036193All Organisms → cellular organisms → Bacteria → Proteobacteria1481Open in IMG/M
3300006163|Ga0070715_10963892All Organisms → cellular organisms → Eukaryota → Viridiplantae → Chlorophyta → core chlorophytes → Trebouxiophyceae → Chlorellales → Chlorellaceae → Chlorella clade → Chlorella → Chlorella variabilis529Open in IMG/M
3300006173|Ga0070716_100103440All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales1751Open in IMG/M
3300006175|Ga0070712_100154410All Organisms → cellular organisms → Bacteria1766Open in IMG/M
3300006804|Ga0079221_10520223All Organisms → cellular organisms → Bacteria778Open in IMG/M
3300006852|Ga0075433_10000880All Organisms → cellular organisms → Bacteria → Proteobacteria21092Open in IMG/M
3300006903|Ga0075426_10907865All Organisms → cellular organisms → Bacteria → Terrabacteria group → unclassified Terrabacteria group → Terrabacteria group bacterium ANGP1664Open in IMG/M
3300007255|Ga0099791_10007032All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales4650Open in IMG/M
3300007265|Ga0099794_10013577All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3549Open in IMG/M
3300009038|Ga0099829_10165425All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium1774Open in IMG/M
3300009143|Ga0099792_10481832All Organisms → cellular organisms → Bacteria774Open in IMG/M
3300009147|Ga0114129_10067702All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria4979Open in IMG/M
3300009162|Ga0075423_10902588All Organisms → cellular organisms → Bacteria937Open in IMG/M
3300010400|Ga0134122_11035428All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium807Open in IMG/M
3300010401|Ga0134121_10456435All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1167Open in IMG/M
3300011119|Ga0105246_12217999Not Available535Open in IMG/M
3300011269|Ga0137392_10277541All Organisms → cellular organisms → Bacteria1383Open in IMG/M
3300011270|Ga0137391_10449146All Organisms → cellular organisms → Bacteria1097Open in IMG/M
3300011270|Ga0137391_11097275Not Available644Open in IMG/M
3300011271|Ga0137393_10104609All Organisms → cellular organisms → Bacteria2316Open in IMG/M
3300011271|Ga0137393_10991814All Organisms → cellular organisms → Bacteria715Open in IMG/M
3300012096|Ga0137389_11226080Not Available642Open in IMG/M
3300012189|Ga0137388_10092518All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium2578Open in IMG/M
3300012203|Ga0137399_10983152Not Available711Open in IMG/M
3300012205|Ga0137362_10115298All Organisms → cellular organisms → Bacteria2274Open in IMG/M
3300012361|Ga0137360_10211944All Organisms → cellular organisms → Bacteria1574Open in IMG/M
3300012362|Ga0137361_10797802All Organisms → cellular organisms → Bacteria859Open in IMG/M
3300012363|Ga0137390_11263034All Organisms → cellular organisms → Bacteria685Open in IMG/M
3300012582|Ga0137358_10017255All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales4597Open in IMG/M
3300012917|Ga0137395_10308713All Organisms → cellular organisms → Bacteria1122Open in IMG/M
3300012922|Ga0137394_10092824All Organisms → cellular organisms → Bacteria2535Open in IMG/M
3300012923|Ga0137359_10799433Not Available817Open in IMG/M
3300012923|Ga0137359_10812203Not Available810Open in IMG/M
3300012931|Ga0153915_10023957All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria6053Open in IMG/M
3300012931|Ga0153915_10254826All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1946Open in IMG/M
3300015371|Ga0132258_10954898All Organisms → cellular organisms → Bacteria2165Open in IMG/M
3300015372|Ga0132256_101860105Not Available709Open in IMG/M
3300017930|Ga0187825_10007738All Organisms → cellular organisms → Bacteria3595Open in IMG/M
3300017936|Ga0187821_10027576All Organisms → cellular organisms → Bacteria → Proteobacteria1998Open in IMG/M
3300017993|Ga0187823_10382768Not Available508Open in IMG/M
3300017994|Ga0187822_10005104All Organisms → cellular organisms → Bacteria2919Open in IMG/M
3300020062|Ga0193724_1115585Not Available534Open in IMG/M
3300020579|Ga0210407_10505572All Organisms → cellular organisms → Bacteria944Open in IMG/M
3300021088|Ga0210404_10063616All Organisms → cellular organisms → Bacteria1775Open in IMG/M
3300021170|Ga0210400_10331555All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1251Open in IMG/M
3300021432|Ga0210384_11674391Not Available541Open in IMG/M
3300021478|Ga0210402_10374277All Organisms → cellular organisms → Bacteria1322Open in IMG/M
3300021479|Ga0210410_11593013Not Available546Open in IMG/M
3300021559|Ga0210409_10006915All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae11766Open in IMG/M
3300025885|Ga0207653_10091414All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1066Open in IMG/M
3300025910|Ga0207684_10002886All Organisms → cellular organisms → Bacteria17035Open in IMG/M
3300025910|Ga0207684_10045789All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae3710Open in IMG/M
3300025910|Ga0207684_10615031All Organisms → cellular organisms → Bacteria927Open in IMG/M
3300025922|Ga0207646_11031886Not Available726Open in IMG/M
3300025928|Ga0207700_11233492Not Available667Open in IMG/M
3300026005|Ga0208285_1013286All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium641Open in IMG/M
3300026354|Ga0257180_1007768All Organisms → cellular organisms → Bacteria1230Open in IMG/M
3300026355|Ga0257149_1010834All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1182Open in IMG/M
3300026490|Ga0257153_1031760All Organisms → cellular organisms → Bacteria1088Open in IMG/M
3300026496|Ga0257157_1052916Not Available684Open in IMG/M
3300026508|Ga0257161_1040551All Organisms → cellular organisms → Bacteria929Open in IMG/M
3300026514|Ga0257168_1050820All Organisms → cellular organisms → Bacteria908Open in IMG/M
3300026515|Ga0257158_1124018Not Available520Open in IMG/M
3300026551|Ga0209648_10014958All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales6773Open in IMG/M
3300026551|Ga0209648_10532229Not Available664Open in IMG/M
3300027645|Ga0209117_1098947All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium799Open in IMG/M
3300027651|Ga0209217_1038841All Organisms → cellular organisms → Bacteria1468Open in IMG/M
3300027671|Ga0209588_1033031All Organisms → cellular organisms → Bacteria1659Open in IMG/M
3300027894|Ga0209068_10121161All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi1396Open in IMG/M
3300027915|Ga0209069_10046578All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium2036Open in IMG/M
3300028047|Ga0209526_10124009All Organisms → cellular organisms → Bacteria1815Open in IMG/M
3300028047|Ga0209526_10606569All Organisms → cellular organisms → Bacteria700Open in IMG/M
3300028536|Ga0137415_11363505Not Available529Open in IMG/M
(restricted) 3300031197|Ga0255310_10128579Not Available690Open in IMG/M
3300031716|Ga0310813_10124981All Organisms → cellular organisms → Bacteria2036Open in IMG/M
3300031720|Ga0307469_10346294All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1245Open in IMG/M
3300031720|Ga0307469_11764180Not Available597Open in IMG/M
3300031740|Ga0307468_100032223All Organisms → cellular organisms → Bacteria2515Open in IMG/M
3300031820|Ga0307473_10469646All Organisms → cellular organisms → Bacteria842Open in IMG/M
3300031962|Ga0307479_10051641All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales3952Open in IMG/M
3300032174|Ga0307470_10199158All Organisms → cellular organisms → Bacteria1278Open in IMG/M
3300032180|Ga0307471_100451800All Organisms → cellular organisms → Bacteria1424Open in IMG/M
3300032205|Ga0307472_101873421Not Available597Open in IMG/M
3300033432|Ga0326729_1020590Not Available1083Open in IMG/M
3300033433|Ga0326726_10096601All Organisms → cellular organisms → Bacteria → Terrabacteria group → Deinococcus-Thermus → Deinococci → Thermales → Thermaceae → Thermus → Thermus islandicus2638Open in IMG/M
3300033433|Ga0326726_10542824All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1114Open in IMG/M
3300033480|Ga0316620_11259241Not Available726Open in IMG/M
3300033486|Ga0316624_10285263All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1330Open in IMG/M
3300033486|Ga0316624_11898877Not Available552Open in IMG/M
3300033500|Ga0326730_1002224All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi4538Open in IMG/M
3300033502|Ga0326731_1004232All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi3758Open in IMG/M
3300033502|Ga0326731_1014633All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium1934Open in IMG/M
3300033513|Ga0316628_101190155All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1014Open in IMG/M
3300034090|Ga0326723_0005430All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Stellaceae → Stella → Stella humosa4837Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
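
Each scaffold identifier in the table above has the form `<taxon OID>|<scaffold name>`, where the numeric prefix matches the Taxon OID column of the Associated Samples table, and restricted entries carry a `(restricted)` prefix. The sketch below splits such an identifier into its parts. `ScaffoldRef` and `parse_scaffold_id` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric IMG/M taxon (sample) identifier
    scaffold: str     # scaffold name within that sample
    restricted: bool  # True when the entry is flagged "(restricted)"


def parse_scaffold_id(text):
    """Split '<taxon OID>|<scaffold name>' into its components."""
    text = text.strip()
    marker = "(restricted)"
    restricted = text.startswith(marker)
    if restricted:
        text = text[len(marker):].strip()
    taxon_oid, sep, scaffold = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unrecognized scaffold identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold, restricted)


open_ref = parse_scaffold_id("3300002245|JGIcombinedJ26739_100590076")
restricted_ref = parse_scaffold_id("(restricted) 3300031197|Ga0255310_10128579")
print(open_ref)
print(restricted_ref)
```

Keeping the taxon OID as a string avoids accidental numeric formatting when it is later used as a lookup key.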




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 21.67%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 19.17%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 11.67%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 8.33%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 5.83%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 4.17%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.17%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.17%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 3.33%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 3.33%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 2.50%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 1.67%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.67%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.67%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.67%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.67%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.83%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.83%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.83%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.83%
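
The habitat table lists the distribution at the deepest level of the ecosystem path. Summing the same percentages at a shallower level gives a coarser view. The sketch below is one way to do that; the `HABITATS` list is copied from the table above, and `distribution_by_level` is a hypothetical helper, not an NMPFamsDB function.

```python
from collections import defaultdict

# (ecosystem path, % of family members), copied from the habitat table.
HABITATS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 21.67),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 19.17),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 11.67),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 8.33),
    ("Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil", 5.83),
    ("Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil", 4.17),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 4.17),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 4.17),
    ("Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment", 3.33),
    ("Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil", 3.33),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds", 2.50),
    ("Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands", 1.67),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 1.67),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil", 1.67),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil", 1.67),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 1.67),
    ("Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil", 0.83),
    ("Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere", 0.83),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere", 0.83),
    ("Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere", 0.83),
]


def distribution_by_level(rows, depth):
    """Sum percentages over the first `depth` levels of each ecosystem path."""
    totals = defaultdict(float)
    for path, percent in rows:
        levels = [part.strip() for part in path.split("→")]
        totals[" → ".join(levels[:depth])] += percent
    return {key: round(value, 2) for key, value in totals.items()}


by_root = distribution_by_level(HABITATS, depth=1)
by_type = distribution_by_level(HABITATS, depth=2)
print(by_root)
print(by_type)
```

Because the source percentages are already rounded to two decimals, the aggregated totals can differ from 100% by a small rounding error.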




Associated Samples

Taxon OID | Sample Name | Habitat Type | IMG/M Link
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005875Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101EnvironmentalOpen in IMG/M
3300005876Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401EnvironmentalOpen in IMG/M
3300005879Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_301EnvironmentalOpen in IMG/M
3300005883Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_302EnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300017993Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3EnvironmentalOpen in IMG/M
3300017994Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2EnvironmentalOpen in IMG/M
3300020062Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a1EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026005Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 (SPAdes)EnvironmentalOpen in IMG/M
3300026354 Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-B Environmental
3300026355 Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-A Environmental
3300026490 Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-A Environmental
3300026496 Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-A Environmental
3300026508 Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-A Environmental
3300026514 Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B Environmental
3300026515 Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-A Environmental
3300026551 Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) Environmental
3300027645 Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) Environmental
3300027651 Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) Environmental
3300027671 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) Environmental
3300027846 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) Environmental
3300027894 Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) Environmental
3300027915 Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) Environmental
3300028047 Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) Environmental
3300028536 Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) Environmental
3300031197 (restricted) Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 Environmental
3300031716 Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3 Environmental
3300031720 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 Environmental
3300031740 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 Environmental
3300031820 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 Environmental
3300031962 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 Environmental
3300032174 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 Environmental
3300032180 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 Environmental
3300032205 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 Environmental
3300033432 Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fraction Environmental
3300033433 Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN Environmental
3300033480 Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_B Environmental
3300033486 Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_A Environmental
3300033500 Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF7AN SIP fraction Environmental
3300033502 Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF9FY SIP fraction Environmental
3300033513 Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_C Environmental
3300034090 Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00N Environmental

Geographical Distribution




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10059007623300002245Forest SoilLAAGCASEGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYPVTDVTRVK*
Ga0070691_1002181043300005341Corn, Switchgrass And Miscanthus RhizosphereMATVADLLRTRRPGALRLLGAAVGALILTGCAGEGGIDLRGGQTAAQLENDRAQCLPFVQAHTDASADLAETACLIARDYRAPLTIAQGPSRIGYLYVDKHGEAPAMLTEFQACQVEAFKTPMPVIPDTDTSGIFSNLFAKLFPRGMASKPPSPDDWVLKYFAACLTRRGYAVSGVTRFQ
Ga0070703_1001199023300005406Corn, Switchgrass And Miscanthus RhizosphereMVTGANEWRSRTGTRRRLGACLVTMMILGGCATESGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADLAEAACLVARGYRAPVTFSQGPARIGYLYAVSRGEAPTIVADFQGCQVEAFKAPIPVIPDTNTSGIFSNVFSKLFPRGFTSQPPTPDDWAFKAYATCLTRRGYTVTDVTRLK
Ga0070713_10097318813300005436Corn, Switchgrass And Miscanthus RhizosphereVKSVRSTRRGGRLAAGLAAMMLAAGCASEGGGIDLRGGQTQVQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPITFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFRTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGY
Ga0070711_10147205613300005439Corn, Switchgrass And Miscanthus RhizosphereLAAGLAAMTLAAGCASEGGGIDLRGGQTQVQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFRTPPLVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFGACLTRRGYAVTDVTRVK*
Ga0070705_10019611723300005440Corn, Switchgrass And Miscanthus RhizosphereVKSVRSTRRGGRLAAGLAAMMLAAGCASEGGGIDLRGGQTQVQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPITFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFRTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFGACLTRRGYAVTDVTRVK*
Ga0070708_10044148323300005445Corn, Switchgrass And Miscanthus RhizosphereMTLAAGCASEGSGINLRGGQTHAQLEDDRAKCIPFVQAHTETTAEIAEGACLVARGYRVPITFAQGPARIGYLSVTSRGEAAAIVGDFQGCQVEAFQTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAITDVTRVK*
Ga0070708_10094359413300005445Corn, Switchgrass And Miscanthus RhizosphereMVTRAKEWRSRAGERGRLGACLVAMVILGGCATEGGIDLRGGQTAAQLEEDRTKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSDTSGIFSNFFAKLYPRGFTSQPPTPDDWALKAFAACLTRRGYTVTDVARLK
Ga0070708_10124868913300005445Corn, Switchgrass And Miscanthus RhizosphereRFGHSESRTDMVTGANEGRSRTGTRGRLGACLVATVILGGCATDGGIDLRGGQTAKQLEEDRAKCLPFVQAHPDTTADVAEAACLVARGYRTPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNLFSKLYPRGFTSHPPTPDDWAFKAFAACLTRRGYTVTDVTRLK*
Ga0070706_10037247923300005467Corn, Switchgrass And Miscanthus RhizosphereMTLAAGCASEGSGIDLRGGQTHAQLEDDRAKCIPFVQAHTETTAEIAEGACLVARGYRVPITFAQGPARIGYLSVTSRGEAAAIVGDFQGCQVEAFQTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAITDVTRVK*
Ga0070706_10100398013300005467Corn, Switchgrass And Miscanthus RhizosphereMVTRAKEWRSRAGERGRLGACLVAMVILGGCATEGGIDLRGGQTAAQLEEDRTKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTVVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTDVTRLK
Ga0070707_10045709223300005468Corn, Switchgrass And Miscanthus RhizosphereVKSVRSTRRGGRLAAGLAAMTLAAGCASEGGGIDLRGGQTQVQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPITFAQGPARIGYLSVTSRGEAAAIVGDFQGCQVEAFQTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAITDVTRVK*
Ga0070707_10053700923300005468Corn, Switchgrass And Miscanthus RhizosphereMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIFSNFFAKLYPRGFTSQPPTPDDWALKAFAACLTRRGYTVTDVTRLK
Ga0070695_10002253543300005545Corn, Switchgrass And Miscanthus RhizosphereMATVADLLRTRRPGALRLLGAAVGALILTGCAGEGGIDLRGGQTAAQLENDRAQCLPFVQAHTDASADLAETACLIARGYRAPLTIAQGPSRIGYLYVDKHGEAPAMLTEFQACQVEAFKTPMPVIPDTDTSGIFSNLFAKLFPRGMASKPPSPDDWVLKYFAACLTRRGYAVSGVTRFQ
Ga0075293_100384213300005875Rice Paddy SoilMATVADLLRTRRPGALRLLGAAVGALILTGCAGEGGIDLRGGQTAAQLENDRAQCLPFVQAHTDASADLAETACLIARDYRAPLTIAQGPSRIGYLYVDKHGEAPAMLTEFQACQVEAFKTPMPVIPDTDTSGIFSNLFAKLFPRGMASKPPSPDDWVLKYFAACLTRRGY
Ga0075300_100832323300005876Rice Paddy SoilMTTVADRLRARRGCALRLLGSAVGALILTGCAGEGGIDLRGGQTAAQLENDRAQCLPFVQAHTDASADLAETACLIARGYRAPLTIAQGPSRIGYLYVDKRGEAPAMLTEFQACQVEAFKTPMPVIPDTDTSGIFSNLFAKLFPRGMASKPPSPDDWVLKYFAACLTRRGYAVSGVTRFQ
Ga0075295_101453723300005879Rice Paddy SoilMTTVADRLRARRGCALRLLGAAVGALILTGCAGEGGIDLRGGQTAAQLENDRAECLPFVQAHTDASADLAETACLIARGYRAPLTIAQGPSRIGYLYVDKRGEAPAMLTEFQACQVEAFKTPMPVIPDTDTSGIFSNLFAKLFPRGMASKPPSPDDWVLKYFAACLTRRGYAVSGVTRFQ
Ga0075299_102751113300005883Rice Paddy SoilMATVADLLRTRRPGALRLLGAAVGALILTGCAGEGGIDLRGGQTAAQLENDRAQCLPFVQAHTDASADLAETACLIARDYRAPLTIAQGPSRIGYLYVDKHGEAPAMLTEFQACQVEAFK
Ga0075023_10003619323300006041WatershedsVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGIDLRGGQTQAQLEDDRAKCIPFVQAHTEVTAETAEAACLVARGYRVPITFAQGPARIGYLYATSRGEAAAIVGDFQGCRVEAFKTPVPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYTVTDVTRVK*
Ga0070715_1096389213300006163Corn, Switchgrass And Miscanthus RhizospherePPTPRAAFRPPAPSGIVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGIDLRGGQTHAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFRTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFA
Ga0070716_10010344013300006173Corn, Switchgrass And Miscanthus RhizosphereVKSVRSTRRGGRLAAGLAAMMLAAGCASEGGGIDLRGGQTQVQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPITFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFRTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYPVTGVTRTK*
Ga0070712_10015441023300006175Corn, Switchgrass And Miscanthus RhizosphereVKSVRSTRRGGRLAAGLAAMMLAAGCASEGGGIDLRGGQTQVQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFGACLTRRGYAVTDVTRVK*
Ga0079221_1052022323300006804Agricultural SoilEGGGIDLRGGQTSAQLEDDRAKCIPFVQTHTETTAEMAEAACLVARGYRVPVTLAQGPARIGYLYVTSRGEAAAILGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAVTDVTRVK*
Ga0079220_1180880123300006806Agricultural SoilVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGIDLRGGQTHAQLEDDRAKCIPFVQAHTETTAEIAEGACLVARGYRVPITFAQGPARIGYLSVTSRGEAAAIVGDFQGCQVEAFQTPPPVIPDSDTS
Ga0075433_10000880103300006852Populus RhizosphereVKSVRSTSRRARGRLAAGLAAMTLAAGCATEGGGIDLRGGQTSAQLEDDRAKCIPFVQTHTETTAEMAEAACLVARGYRVPVTLAQGPARIGYLYVTSRGEAAAILGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFGACLTRRGYAVTDVTRVK*
Ga0075426_1090786513300006903Populus RhizosphereVKSVRSTSRRARGRLAAGLAAMTLAAGCATEGGGIDLRGGQTSAQLEDDRAKCIPFVQTHTETTAEMAEAACLVARGYRVPVTLAQGPARIGYLYVTSRGEAAAILGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGF
Ga0075436_10138211423300006914Populus RhizosphereVKSVRSTRRGGRGRLAAGVAAMTLAAGCASAGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTLAQGPARIGYLYVTSQGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFP
Ga0099791_1000703223300007255Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFPKLYPRGFTSQPPTPDDWALKAFAVCLPRRGYAVTDVTRLK
Ga0099794_1001357753300007265Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTDVTRLK
Ga0099829_1016542533300009038Vadose Zone SoilMVILGGCATAGGIDLRGGQTAKQLEEDRAKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTDTSGIFSNVFSKLFPRGFASKPPTPDDWAFKAFAACLTRRGYTVTDV
Ga0099792_1048183223300009143Vadose Zone SoilMVTLASECRSRPSVRGRLGAALVAMVILGGCATEGGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNFFSKLFPRGFTSKPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0114129_1006770253300009147Populus RhizosphereVKSVRSTSRRARGRLAAGLAAMTLAAGCATEGGGIDLRGGQTSAQLEDDRAKCIPFVQTHTETTAEMAEAACLVARGYRVPVTLAQGPARIGYLYVTSRGEAAAILGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAVTDVTRVK*
Ga0075423_1090258823300009162Populus RhizosphereMTLAAGCASAGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTLAQGPARIGYLYVTSQGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYPVTDVTRVK*
Ga0134122_1103542813300010400Terrestrial SoilMRPGARGRLGAALAATAIVGGCATGGGIDPRGGQTAAQLEEDRARCLPFVQAHTEVTADVAEAACLVARGYRAPVTFAQGPARIGYLYVTSRGEAPAIVADFQGCQVEAFKAPMPVLPDTNTSGIFSNFFSKLFPRGLTSKPPTP
Ga0134121_1045643523300010401Terrestrial SoilMVTLASECRSRPSARGRLGAVLVAMVILGGCATEGGIDLRGGQTAKQLEEDRAKCLPFVQAHTEVTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRAEAPTVVADFQGCQVEAFKAPVPVIPDTDTSGIFTNFFSKVFPRGFTSKPPTPDDWAFKAFAGCLSRRGYTVTDVTRLK
Ga0105246_1221799913300011119Miscanthus RhizosphereSRTDDMVIRGAERRMRPGARGRLGAALAATAIVGGCATGGGIDPRGGQTAAQLEEDRARCLPIVQAHTEVTADVAEAACLVARGYRAPVTFAQGPARIGYLYVTSRGEAPAIVADFQGCQVEAFKAPMPVLPDTNTSGIFSNFFSRIFPRGVTSHPPTPDDWAFKSFATCLARRGYA
Ga0137392_1027754123300011269Vadose Zone SoilMVTGANEGRSRTGTRRRLGACLVATVILGGCATDGGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNFFSKLFPRGFTSKPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0137391_1044914623300011270Vadose Zone SoilMVTGANEGRSRTGTRGRLGACLVATVILGGCATESGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTDTSGIFSNFFSKLFPRGFTSQPPTPDDWAFKAFAACLTRRGYTVTEVTRLK
Ga0137391_1109727513300011270Vadose Zone SoilMVTRAKEWRSRPGARGRLGACLVAIVLLTGCATEGGIDLRGGQTAKQLEEDRAKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIFSNVFSKLYPRGFTSQPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0137393_1010460933300011271Vadose Zone SoilMVTGANEGRSRTGTRGRLGACLVATVILGGCATESGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNLFSKLYPRGFTSQPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0137393_1099181413300011271Vadose Zone SoilMVILGGCATAGGIDLRGGQTAKQLEEDRAKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTDATRLK*
Ga0137389_1122608013300012096Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWAFKAF
Ga0137388_1009251833300012189Vadose Zone SoilMVTGANEGRSRTGTRGRLGACLVATVILGGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFPSQPPTPDDWTFKAFAVCLTRRGYSVTDATRLK
Ga0137399_1098315213300012203Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTAEVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSNTSGIFSNFFAKLYPRGITSQPPTPDDWALKAFAVCLTRRGYTVTDVTRLK
Ga0137362_1011529823300012205Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIFSNFFAKLYPRGITSQPPTPDDWAFKAFAVCLTRRGYSVTDVTRLK
Ga0137360_1021194423300012361Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGESPVVVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTDVTRLK
Ga0137361_1079780223300012362Vadose Zone SoilMVTGANEWRSRTGTRRRLGACLVATVILGGCATESGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNLFSKLYPRGFTSQPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0137390_1126303413300012363Vadose Zone SoilMVTGANEWRSRTGTRRRLGACLVATVILGGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADLAEAACLVARGYRVPVTFAQGPARIGYLYATSRGEAPVVVANFEGCQVEAFKAPIPVIPDTNTSGIFSNLFSKLYPRGFTSQPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0137358_1001725563300012582Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSNTSGIFSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTDVTRLK
Ga0137395_1030871323300012917Vadose Zone SoilMVTGANEWRSRTGTRRRLGACLVATVILGGCATDGGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNFFSKLFPRGFTSKPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0137394_1009282423300012922Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFRGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWAFKAFAVCLTRRGYTVTDVTRLK
Ga0137359_1079943313300012923Vadose Zone SoilMVTGANEWRSRTGTRRRLGACLVATMILGGCAADGGIDLRGGQTAKQLEEDRAKCLPFVQAHPDTTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNLFSKLYPRGFTSHPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0137359_1081220313300012923Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRREAPVVVADFQGCQVEALKAPIPVIPDSNTSGIVSNIFSKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTNVTRLK
Ga0137419_1171601213300012925Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTAEVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSNTSGIFSNF
Ga0153915_1002395743300012931Freshwater WetlandsMATVVDLLRSRRPGAPGLLGVALAAMIVAGCAGEGGIELRGQTAAQLEDDRAKCLPFVQAHTEAAADLAEAACLIAKGYRAPITIAQGPSRIGYLYVDARGEAPAMLTDFQGCQVEAFRSPMPVIPDTDTSGIFSNLLAKLFPRGMASKPPSPDDWVLKSFATCLTRGGYTVSGVSRFK*
Ga0153915_1025482633300012931Freshwater WetlandsMPIVVDVPRARRAGALRPLGAALVALILTGCAGEGGIDLRGGQTAAQLENDRAQCLPFVQAHTEAGADLAEAACLIARGYRAPLTIAQGPSRIGYLYVSARGEAPAMLTEFQGCQVEAFKTPMPVIRDTDTSGIFSNLLAKLFPRGMASKPPSPDDWVLKSFATCLTRRGYTVSGVARFE
Ga0137418_1111587813300015241Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTAEVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSNTSGIFSNFFAKLY
Ga0132258_1095489823300015371Arabidopsis RhizosphereMLLAGCAGESGIGLHGQTAAQLEEDRAKCLPFVQAHTDAGAELAEAACLIARGYRAPLSIAQGPSRIGYLYVEKRGEAPGMLTDFQACQVEAFKTPMPVIPDTDTSGIFSNLWAKLFPRGMTSKPPSPDDWVLKYFAACVTRHGYAVSGASRYQ*
Ga0132256_10186010513300015372Arabidopsis RhizosphereAALGAALVSMLLAGCAGESGIGLHGQTAAQLEEDRAKCLPFVQAHTDAGAELAEAACLIARGYRAPLSIAQGPSRIGYLYVEKRGEAPGMLTDFQACQVEAFKTPMPVIPDTDTSGIFSNLWAKLFPRGMTSKPPSPDDWVLKYFAACVTRHGYAVSGASRYQ*
Ga0187825_1000773833300017930Freshwater SedimentMATVVDLLRSWRPCAPGLLEAALVSMLLAGCAGESGIGLHGGQTAAQLEEDRAKCLPFVQAHTEAGADLAETACLIARGYRAPLSIVQGPSRIGYLYVEKRGEAPGMLTDFQGCQVEAFKTPTPVIPDTDTSGIFSNLWAKFFPRGMTSKPPSPDDWVLKYFAACLTRRGYAVSGVTRYQ
Ga0187821_1002757623300017936Freshwater SedimentMATVVDLLRSWRPCAPGLLGATLVSMLLAGCASESGIGLHGGQTAAQLEEDRTKCLPFVQAHTDAGADLAETACLIARGYRAPLSIVQGPSRIGYLYVEKRGEAPGMLTDFQGCQVEAFKTPTPVIPDTDTSGIFSNLWAKFFPRGMTSKPPSPDDWVLKYFAACLTRRGYAVSGATRYE
Ga0187823_1038276813300017993Freshwater SedimentMATVVDLLRSWRPCAPGLLEAALVSMLLVGCAGESGIGLHGGQTAGQLEEDRAKCLPFVQAHTEAGADLAETACLIARGYRAPLSIAQGPSRIGYLYVEKRGEAPGMLTDFQGCQVEAFKTPTPVIPDTDTSGIFSNLYAKFFPRGMT
Ga0187822_1000510433300017994Freshwater SedimentMATVVDLLRSWRPCAPGLLEAALVSMLLAGCASESGIGLHGGQTAAQLEEDRAKCLPFVQAHTEAGADLAETACLIARGYRAPLSIVQGPSRIGYLYVEKRGEAPGMLTDFQGCQVEAFKTPTPVIPDTDTSGIFSNLYAKFFPRGMTSKPPSPDAWVLRYFAACLTRRGYAVSGVTRYQ
Ga0193724_111558513300020062SoilGWLGAALVAMVILGGCATEGGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPAIVGDFQGCQVEAFKAPVPVIPDTDTSGIFSNFFSKVFPRGFTSKPPTPDDWSFKAFAGCLSRRGYTVTDVTRLN
Ga0210407_1050557213300020579SoilGIVKSVRSTRRGGRGRLAAGLAAITLAAGCASAGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYPVTDVTRVK
Ga0210404_1006361613300021088SoilFRPPAPSGIVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAVTDVTRVK
Ga0210400_1033155513300021170SoilVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGIDLRGGQTHAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAVTDVT
Ga0210384_1167439113300021432SoilVKSVRSTRRGGRLAVGLAVMTLAAGCASEGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPT
Ga0210402_1037427723300021478SoilAAFRPPAPSGIVKSVRSTRRGGRGRLAAGLAAMTLAAGCASGGGGIDLRGGQTHAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYPVTDVTRVK
Ga0210410_1159301323300021479SoilMTLAAGCASEGGIDLRGGQTHAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTP
Ga0210409_1000691523300021559SoilVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGIDLRGGQTRAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYPVTGVTRTK
Ga0207653_1009141423300025885Corn, Switchgrass And Miscanthus RhizosphereMVTGANEWRSRTGTRRRLGACLVTMMILGGCATESGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADLAEAACLVARGYRAPVTFAQGPARIGYLYAVSRGEAPTIVADFQGCQVEAFKAPIPVIPDTNTSGIFSNVFSKLFPRGFTSQPPTPD
Ga0207685_1030301013300025905Corn, Switchgrass And Miscanthus RhizosphereVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGIDLRGGQTHAQLEDDRAKCIPFVQAHPETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTP
Ga0207684_10002886173300025910Corn, Switchgrass And Miscanthus RhizosphereMVTGANEWRSRTGTRRRLGACLVATMILGGCATESGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADLAEAACLVARGYRAPVTFSQGPARIGYLYAVSRGEAPTIVADFQGCQVEAFKAPIPVIPDTNTSGIFSNVFSKLFPRGFTSQPPTPDDWAFKAYATCLTRRGYTVTDVTRLK
Ga0207684_1004578943300025910Corn, Switchgrass And Miscanthus RhizosphereMVTRAKEWRSRAGERGRLGACLVAMVILGGCATEGGIDLRGGQTAAQLEEDRTKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSDTSGIFSNFFAKLYPRGFTSQPPTPDDWALKAFAACLTRRGYTVTDVTRLK
Ga0207684_1061503123300025910Corn, Switchgrass And Miscanthus RhizosphereMVTGANEGRSRTGTRGRLGACLVATVILGGCATDGGIDLRGGQTAKQLEEDRAKCLPFVQAHPDTTADVAEAACLVARGYRTPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNLFSKLYPRGFTSHPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0207646_1103188623300025922Corn, Switchgrass And Miscanthus RhizosphereMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTDVTRLK
Ga0207700_1123349213300025928Corn, Switchgrass And Miscanthus RhizosphereVKSVRSTRRGGRLAAGLAAMMLAAGCASEGGGIDLRGGQTQVQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPITFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFRTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYPVTGVTRTK
Ga0208285_101328613300026005Rice Paddy SoilMATVADLLRTRRPGALRLLGAAVGALILTGCAGEGGIDLRGGQTAAQLENDRAQCLPFVQAHTDASADLAETACLIARDYRAPLTIAQGPSRIGYLYVDKHGEAPAMLTEFQACQVEAFKTPMPVIPDTDTSGIFSNLFAKL
Ga0257180_100776823300026354SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTDATRLK
Ga0257149_101083423300026355SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
Ga0257153_103176013300026490SoilVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPVVVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTDATRLK
Ga0257157_105291613300026496SoilMVTLASECKSRPSGRGRLGAALVAMVILGGCATEGGIDLRGGQTAKQLEEDRAKCLPFVQAHTEITADVAEAACLVGRGYRAPVTFAQGPARIGYLYVTSRAEAPTVVADFQGCQVEAFKAPVPVIPDTNTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKAFAGCLSRRGYTVTD
Ga0257161_104055123300026508SoilMVTLASECRSRPSGRGRLGAALVAMVILGGCATEGGIDLRGGQTAKQLEEDRAKCLPFVQAHTEITADVAEAACLVGRGYRAPVTFAQGPARIGYLYVTSRAEAPTVVADFQGCQVEAFKAPVPVIPDTNTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKAFAGCLSRRGYTVTDVTRLK
Ga0257168_105082023300026514SoilMVTRAKEWRSRAGARGRLGACLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSNTSGIFSNFFAKLYPRGITSQPPTPDDWALKAFAVCLTRRGYTVTDVTRLK
Ga0257158_112401813300026515SoilAAVVAMVILGGCATEGGIDLRGGQTAKQLEEDRAKCLPFVQAHTEITADVAEAACLVGRGYRAPVTFAQGPARIGYLYATSRAEAPTVAADFQGCQVEAFKAPVPVIPDTNTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKAFAGCLSRRGYTVTDVTRLK
Ga0209648_1001495863300026551Grasslands SoilMVTGANEWRSRTGTRRRLGACLVATVILGGCATDGGIDLRGGQTAKQLEEDRGKCLPFVQAHTETTADVAEGACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNLFSKLYPRGFTSQPPTPDDWAFKAFAACLTRRGYTVTEVTRLK
Ga0209648_1053222913300026551Grasslands SoilMVTRAKEWRSRAGTRGRLGACLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSNTSGIFSNFFAKLYPRGITSQPPTPDDWALKAFAVCLTRRGYTVTDVTRLK
Ga0209117_109894713300027645Forest SoilMVTGANEWRSRTGTRRRLGACLVAIVLLTGCATEGGIDLRGGQTAKQLEEDRAKCLPFVQAHPETTADLAEAGCLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNVFSKLFP
Ga0209217_103884123300027651Forest SoilVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYPVTDVTRVK
Ga0209588_103303123300027671Vadose Zone SoilMVTRAKEWRSRAGARGRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRAKCLPFVQAHPETTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSNTSGIVSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYTVTEVTRLK
Ga0209180_1079065113300027846Vadose Zone SoilMVTGANEWRSRTGTRGRLGACLVAMVILGGCATAGGIDLRGGQTAKQLEEDRAKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTDTSGIFSN
Ga0209068_1012116123300027894WatershedsLAAMTLAAGCASEGGIDLRGGQTQAQLEDDRAKCIPFVQAHTEVTAETAEAACLVARGYRVPITFAQGPARIGYLYATSRGEAAAIVGDFQGCRVEAFKTPVPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYTVTDVTRVK
Ga0209069_1004657823300027915WatershedsVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGIDLRGGQTQAQLEDDRAKCIPFVQAHTEVTAETAEAACLVARGYRVPITFAQGPARIGYLYATSRGEAAAIVGDFQGCRVEAFKTPVPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYTVTDVTRVK
Ga0209526_1012400923300028047Forest SoilMVTLASECRLRPSVRGRLGAALVAMVILGGCATEGGIDLRGGQTAKQLEEDRAKCLPFVQAHTELTADVAEAACLVARGYRAPVTFAQGPARIGYLYATSRAEAPTVAADFQGCQVEAFKAPVPVIPDTNTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKAFAGCLSRRGYTVTDVTRLK
Ga0209526_1060656913300028047Forest SoilLAAGLAAMTLAAGCASEGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYPVTDVTRVK
Ga0137415_1136350513300028536Vadose Zone SoilRSRAGARGRLDACLVAMVILGGCATAGGIDLRGGQTAKQLEEDRAKCLPFVQAHPETTADLAEAACLVARGYRAPVTFSQGPARIGYLYATSRGEAPTIVADFQGCQVEAFKAPVPVIPDTNTSGIFSNVFSKLFPRGFASKPPTPDDWAFKAFAACLTRRGYTVTDVTRLK
(restricted) Ga0255310_1012857913300031197Sandy SoilMATVVDLLRSRRPCAPGLLGAALGSMLLAGCAGESGIGLHGGQTAAQLEEDRAKCLPFVQAHTEAGADLAETACLIARGYRAPLSIVQGPSRIGYLYVDKRGEAPGMLTDFQGCQVEAFKAPVPVIPDTDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKAFAGCLSRRGYTVTDV
Ga0310813_1012498123300031716SoilVDRLRSRPRRALLGAALVSMLLAGCASESGIGLHGQTAAQLEEDRAKCLPFVQAHTDAGADLAEAACLIARGYRAPISIAQGPSRIGYLYVDRRGEAPGMLTDFQACQVEAFKTPMPVIPDTNTSGIFSNLWAKLFPRGMTSKPPSPDDWVLKYFGACLTRHGYAVSGASRYP
Ga0307469_1034629423300031720Hardwood Forest SoilMVTRAKEWRSRAGERGRLGACLVAMVILGGCATEGGIDLRGGQTAAQLEEDRTKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSDTSGIFSNFFAKLYPRGFTSQPPTPDDWA
Ga0307469_1176418013300031720Hardwood Forest SoilMVTGANEWRSRTGTRRRLGACLVATMILGGCATESGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADLAEAACLVARGYRAPVTFSQGPARIGYLYAVSRGEAPTIVADFQGCQVEAFKAPIPVIPDTNTSGIFSNVFSKLFPRGFTSQPPTPD
Ga0307469_1223057213300031720Hardwood Forest SoilVKSVRSTRRGGRGRLAAGLAAMTLAAGCASEGGIDLRGGQTHAQLEDDRAKCIPFVQAHTETTAEMAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDT
Ga0307468_10003222333300031740Hardwood Forest SoilVKSVRSTRRGGRLAAGLAAMMLAAGCASEGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAEMAEGACLVARGYRVPITFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFRTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFGACLTRRGYAVTDVTRVK
Ga0307473_1046964613300031820Hardwood Forest SoilIVKSVRSTRRGGRLAAGLAAMTLAAGCASEGGGIDLRGGQTHAQLEDDRAKCIPFVQVHTETTAEMAEGACLVARGYRVPITFAQGPARIGYLSVTSRGEAAAIVGDFQGCQVEAFQTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAVTDVTRVK
Ga0307479_1005164123300031962Hardwood Forest SoilVKSVRSTRRGGRLAAGLAAMTLAAGCASEGGGIDLRGGQTHAQLEDDRAKCIPFVQVHTETTAEMAEGACLVARGYRVPITFAQGPARIGYLSVTSRGEAAAIVGDFQGCQVEAFQTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAVTDVTRVK
Ga0307470_1019915823300032174Hardwood Forest SoilRLGAGLVAMVILTGCATEGGIDLRGGQTAAQLEEDRTKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSDTSGIFSNFFAKLYPRGFTSQPPTPDDWTFKAFAVCLTRRGYSVTDVARLK
Ga0307471_10045180023300032180Hardwood Forest SoilPSGIVKSVRSTRWGGRLAAGLAAMTLAAGCASEGGGIDLRGGQTQAQLEDDRAKCIPFVQAHTETTAETAEAACLVARGYRVPVTFAQGPARIGYLYVTSRGEAAAIVGDFQGCQVEAFKTPPPVIPDSDTSGIFSNFFSKVFPRGFTSKPPTPDDWAFKYFAACLTRRGYAVTDVTRVK
Ga0307472_10086655813300032205Hardwood Forest SoilMVTGANEWRSRTGTRRRLGACLVATMILGGCATESGIDLRGGQTAKQLEEDRAKCLPFVQAHTETTADLAEAACLVARGYRAPVTFSQGPARIGYLYAVSRGEAPTIVADFQGCQVEAFKAPIPVIPDTNTSGIFSNVF
Ga0307472_10187342123300032205Hardwood Forest SoilMVTRAKEWRSRAGERGRLGACLVAMVILGGCATEGGIDLRGGQTAAQLEEDRTKCLPFVQAHPETTADLAEAACLVARGYRAPVTFAQGPARIGYLYATSRGEAPTIVADFQGCQVEALKAPIPVIPDSDTSGIFSNFFAKLYPRGFTSQPPTPDDWAFKA
Ga0326729_102059023300033432Peat SoilDMTTVVDLLRTRRPSATGLLGAALVSMLLAGCAGESGIGLHGQTAAQLEEDRAKCLPFVQAHTDAGADLAEAACLIARGYRAPLSIAQGPSRIGYLYVEKRGEAPGMLTDFQACQVEAFKTPMPVIPDTDTSGIFSNLWAKFFPRGMTSKPPSPDDWVLKYFGDCLTRRGYAVSGVSRYQ
Ga0326726_1009660133300033433Peat SoilMATVVDVLRSRRPGAPGLLGAALAAMILVGCAGEGGIELRGQTAAQLEDDRAKCLPFVQAHTEAGADLAEAACLIAKGYRAPITIAQGPSRIGYLYAGARGEASAMLTDFQGCQVEAFKTPMPVIPDTDTSGIFSNLLAKLFPRGMASKPPSPDDWVLKSFATCLTRRGYTVSEVSRFK
Ga0326726_1054282423300033433Peat SoilMTTVVDLLRTRRPSAAGLLGAALVSMLLAGCARESGIGLHGQTTAQLEEDRAKCLPFVQAHTDAGADLAEAACLIARGYRAPLSIAQGPSRIGYLYVEKRGEAPGMLTDFQACQVEAFKTPMPVIPDTDTSGIFSNLWAKFFPRGMTSKPPSPDDWVLKYFGDCLTRRGYAVSGVSRYQ
Ga0316620_1125924123300033480SoilAAMIVAGCAGEGGIELRGQTAAQLEDDRAKCLPFVQAHTEAAADLAEAACLIAKGYRAPITIAQGPSRIGYLYVDARGEAPAMLTDFQGCQVEAFRSPMPVIPDTDTSGIFSNLLAKLFPRGMASKPPSPDDWVLKSFATCLTRGGYTVSGVSRFK
Ga0316624_1028526313300033486SoilMAIVVDVPRARRAGGLRPLGAALVALILTGCAGEGDIELRGGQTAAQLENDRAQCLPFVQAHTEAGADLAEAACLIARGYRAPLTIAQGPSRIGYLYVSARGEAPAMLTEFQGCQVEAFKTPMPVIPDTDTSGIFSNLLAKLFPRG
Ga0316624_1189887713300033486SoilGSGIVESRTDDMATVVDLLRSRRPGAPGLLGVALAAMIVAGCAGEGGIELRGQTAAQLEDDRAKCLPFVQAHTEAAADLAEAACLIAKGYRAPITIAQGPSRIGYLYPNARGEAPAMLADFQGCQVEAFKTPMPTIPDTDTSGIFSNLFAKLFPRGMASKPPSPDDWVLKSFAACLTRRGYTV
Ga0326730_100222413300033500Peat SoilMTTVADLLRTRRPSAAGLLGAALVSMLLAGCARESGIGLHGQTTAQLEEDRAKCLPFVQAHTDAGADLAEAACLIARGYRAPLSIAQGPSRIGYLYVEKRGEAPGMLTDFQACQVEAFKTPMPVIPDTDTSGIFSNLWAKFFPRGMTSKPPSPDDWVLKYFGDCLTRRGYAVSGVSRYQ
Ga0326731_100423213300033502Peat SoilMTTVVDLLRTRRPSAAGLLGAALVSMLLAGCARESGIGLHGQTAAQLEEDRAKCLPFVQAHTDAGADLAETACLIGRGYRAPLSIAQGPSRIGYLYVEKRGEASGMLTDFQACQVEAFKTPMPVIPDTDTSGIFSNLWAKLFPRGMTSKPPSPDDWVLKYFADCLTRRGYAVSGATRYQ
Ga0326731_101463323300033502Peat SoilMATVVDLLRSRRPGAPGLLGAALAAMILVGCAGEGGIELRGQTAAQLEDDRAKCLPFVQAHTEAGADLAEAACLIAKGYRAPITIAQGPSRIGYLYAGARGEASAMLTDFQGCQVEAFKTPMPVIPDTDTSGIFSNLLAKLFPRGMASKPPSPDDWVLKSFATCLTRRGYTVSEVSRFK
Ga0316628_10119015523300033513SoilVESRTDDMATVVDLLRSRRPGAPGLLGVALAAMIVSGCAGEGGIELRGQTAAQLEDDRAKCLPFVQAHTEAAADLAEAACLIAKGYRAPITIAQGPSRIGYLYVDARGEAPAMLTDFQGCQVEAFRSPMPVIPDTDTSGIFSNLLAKLFPRGMASKPPSPDDWVLKSFAT
Ga0326723_0005430_4086_46253300034090Peat SoilMTTVVDLLRTRRPSATGLLGAALVSMLLAGCARESGIGLHGQTTAQLEEDRAKCLPFVQAHTDAGADLAEAACLIARGYRAPLSIAQGPSRIGYLYVEKRGEAPGMLTDFQACQVEAFKTPMPVIPDTDTSGIFSNLWAKFFPRGMTSKPPSPDDWVLKYFGDCLTRRGYAVSGVSRYQ




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice