NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F044678

Metagenome Family F044678

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F044678
Family Type Metagenome
Number of Sequences 154
Average Sequence Length 183 residues
Representative Sequence MKISCFLLVFLCQSAVAAIPTPSPSPLPTALASPAATITQSVSELSAQRAATERSKIYRSGAAYGRWLDQIAKDSGKAFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRLRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVTGINSRPTRIFWGRGLTAILYAGWIIAL
Number of Associated Samples 126
Number of Associated Scaffolds 154

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 19.51 %
% of genes near scaffold ends (potentially truncated) 77.92 %
% of genes from short scaffolds (< 2000 bps) 75.97 %
Associated GOLD sequencing projects 118
AlphaFold2 3D model prediction Yes
3D model pTM-score0.32

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (79.870 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(28.571 % of family members)
Environment Ontology (ENVO) Unclassified
(38.312 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(65.584 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: Yes Secondary Structure distribution: α-helix: 68.69%    β-sheet: 0.00%    Coil/Unstructured: 31.31%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.32
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 154 Family Scaffolds
PF07690MFS_1 2.60
PF00654Voltage_CLC 1.95
PF16576HlyD_D23 1.95
PF01070FMN_dh 1.30
PF11563Protoglobin 1.30
PF01757Acyl_transf_3 1.30
PF12833HTH_18 1.30
PF00529CusB_dom_1 1.30
PF00924MS_channel 1.30
PF13533Biotin_lipoyl_2 0.65
PF07681DoxX 0.65
PF00753Lactamase_B 0.65

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 154 Family Scaffolds
COG0038H+/Cl- antiporter ClcAInorganic ion transport and metabolism [P] 1.95
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 1.30
COG0668Small-conductance mechanosensitive channelCell wall/membrane/envelope biogenesis [M] 1.30
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 1.30
COG3264Small-conductance mechanosensitive channel MscKCell wall/membrane/envelope biogenesis [M] 1.30
COG2259Uncharacterized membrane protein YphA, DoxX/SURF4 familyFunction unknown [S] 0.65
COG4270Uncharacterized membrane proteinFunction unknown [S] 0.65


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms79.87 %
UnclassifiedrootN/A20.13 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2088090014|GPIPI_17316685All Organisms → cellular organisms → Bacteria11174Open in IMG/M
2124908045|KansclcFeb2_ConsensusfromContig886685All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia516Open in IMG/M
2228664022|INPgaii200_c1160442All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia703Open in IMG/M
2228664022|INPgaii200_c1160984All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia832Open in IMG/M
3300000363|ICChiseqgaiiFebDRAFT_10816795All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia808Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_100837215All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia862Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_100837800All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1319Open in IMG/M
3300000559|F14TC_100101601All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia916Open in IMG/M
3300000709|KanNP_Total_F14TBDRAFT_1007166All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia789Open in IMG/M
3300000956|JGI10216J12902_104325029All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia589Open in IMG/M
3300000956|JGI10216J12902_106935523All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia572Open in IMG/M
3300001431|F14TB_100501398All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1031Open in IMG/M
3300001431|F14TB_103740490All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia615Open in IMG/M
3300002245|JGIcombinedJ26739_101798663All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia513Open in IMG/M
3300002903|JGI24801J43971_1001131All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → actinobacterium SCGC AAA044-N04954Open in IMG/M
3300004463|Ga0063356_103383203All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium D5687Open in IMG/M
3300004479|Ga0062595_102205466All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micrococcales → Micrococcaceae → Micrococcus → Micrococcus luteus540Open in IMG/M
3300005166|Ga0066674_10002450All Organisms → cellular organisms → Bacteria6847Open in IMG/M
3300005171|Ga0066677_10168949All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1208Open in IMG/M
3300005176|Ga0066679_10844117All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia581Open in IMG/M
3300005179|Ga0066684_10850036All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia599Open in IMG/M
3300005184|Ga0066671_10554164All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia742Open in IMG/M
3300005356|Ga0070674_102106524All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia514Open in IMG/M
3300005450|Ga0066682_10191809All Organisms → cellular organisms → Bacteria1308Open in IMG/M
3300005536|Ga0070697_101795189All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia549Open in IMG/M
3300005540|Ga0066697_10648865All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia581Open in IMG/M
3300005546|Ga0070696_101466262All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia583Open in IMG/M
3300005554|Ga0066661_10819033All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia545Open in IMG/M
3300005559|Ga0066700_10516546All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia833Open in IMG/M
3300005560|Ga0066670_10527913All Organisms → cellular organisms → Bacteria → Synergistetes → Synergistia → Synergistales → Synergistaceae → Aminiphilus → Aminiphilus circumscriptus723Open in IMG/M
3300005566|Ga0066693_10297882All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. HCCB10043644Open in IMG/M
3300005569|Ga0066705_10814046All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. HCCB10043556Open in IMG/M
3300005575|Ga0066702_10103127All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1634Open in IMG/M
3300005575|Ga0066702_10158687All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1344Open in IMG/M
3300005576|Ga0066708_10205470All Organisms → cellular organisms → Bacteria1237Open in IMG/M
3300005598|Ga0066706_10464598All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1008Open in IMG/M
3300005598|Ga0066706_11484952All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia510Open in IMG/M
3300006028|Ga0070717_11998375All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia522Open in IMG/M
3300006031|Ga0066651_10839438All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia500Open in IMG/M
3300006175|Ga0070712_101234613All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Bacillaceae → Virgibacillus → Virgibacillus halodenitrificans650Open in IMG/M
3300006854|Ga0075425_101118197All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micrococcales → Micrococcaceae → Micrococcus → Micrococcus luteus897Open in IMG/M
3300006854|Ga0075425_101334024All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia813Open in IMG/M
3300009012|Ga0066710_100050327All Organisms → cellular organisms → Bacteria5145Open in IMG/M
3300009098|Ga0105245_12814763All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia539Open in IMG/M
3300010304|Ga0134088_10222282All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia906Open in IMG/M
3300010320|Ga0134109_10323482All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia599Open in IMG/M
3300010326|Ga0134065_10293126All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia621Open in IMG/M
3300010329|Ga0134111_10301893All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia667Open in IMG/M
3300010333|Ga0134080_10049502All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1650Open in IMG/M
3300010333|Ga0134080_10184402All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia898Open in IMG/M
3300010335|Ga0134063_10250432All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia843Open in IMG/M
3300010337|Ga0134062_10192823All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia925Open in IMG/M
3300010362|Ga0126377_10657418All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1098Open in IMG/M
3300010373|Ga0134128_11040363All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia905Open in IMG/M
3300011269|Ga0137392_10817746All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia769Open in IMG/M
3300011271|Ga0137393_10212351All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1636Open in IMG/M
3300012199|Ga0137383_10610469All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia798Open in IMG/M
3300012200|Ga0137382_10663700All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia746Open in IMG/M
3300012201|Ga0137365_11041592All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia592Open in IMG/M
3300012203|Ga0137399_11064475All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia681Open in IMG/M
3300012208|Ga0137376_10144565All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2043Open in IMG/M
3300012208|Ga0137376_10859927All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia779Open in IMG/M
3300012354|Ga0137366_10689384All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia729Open in IMG/M
3300012356|Ga0137371_10826563All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia705Open in IMG/M
3300012360|Ga0137375_10821369All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia746Open in IMG/M
3300012532|Ga0137373_10714995All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia746Open in IMG/M
3300012918|Ga0137396_10640055All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia786Open in IMG/M
3300012918|Ga0137396_11096592All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia569Open in IMG/M
3300012922|Ga0137394_10611892All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia921Open in IMG/M
3300012930|Ga0137407_12051276All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia546Open in IMG/M
3300012944|Ga0137410_10577079All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia927Open in IMG/M
3300012951|Ga0164300_10151019All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1088Open in IMG/M
3300012957|Ga0164303_11322634All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia534Open in IMG/M
3300012960|Ga0164301_11435535All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia565Open in IMG/M
3300012960|Ga0164301_11545081All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia548Open in IMG/M
3300012972|Ga0134077_10073487All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1293Open in IMG/M
3300012977|Ga0134087_10005839All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3994Open in IMG/M
3300013296|Ga0157374_10936261All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia885Open in IMG/M
3300014326|Ga0157380_13341851All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia513Open in IMG/M
3300015077|Ga0173483_10466745All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia665Open in IMG/M
3300015241|Ga0137418_10560169All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia903Open in IMG/M
3300015264|Ga0137403_10378591All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1298Open in IMG/M
3300015356|Ga0134073_10042871All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1186Open in IMG/M
3300015358|Ga0134089_10207315All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia790Open in IMG/M
3300017659|Ga0134083_10222542All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia784Open in IMG/M
3300018066|Ga0184617_1075840All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia903Open in IMG/M
3300018431|Ga0066655_10159036All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1351Open in IMG/M
3300018433|Ga0066667_10410227All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1097Open in IMG/M
3300018468|Ga0066662_12059011All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia598Open in IMG/M
3300018482|Ga0066669_10198351All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1534Open in IMG/M
3300018482|Ga0066669_10694311All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia896Open in IMG/M
3300018482|Ga0066669_11891091All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia555Open in IMG/M
3300019362|Ga0173479_10212640All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia824Open in IMG/M
3300019881|Ga0193707_1060371All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1193Open in IMG/M
3300020004|Ga0193755_1069850All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1135Open in IMG/M
3300020004|Ga0193755_1216599All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia535Open in IMG/M
3300020170|Ga0179594_10413256All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia512Open in IMG/M
3300020581|Ga0210399_11265943All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia582Open in IMG/M
3300021413|Ga0193750_1076391All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia636Open in IMG/M
3300021418|Ga0193695_1028764All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1186Open in IMG/M
3300022694|Ga0222623_10355372All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia559Open in IMG/M
3300025927|Ga0207687_11280349All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia630Open in IMG/M
3300026088|Ga0207641_12049983All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia573Open in IMG/M
3300026315|Ga0209686_1152032All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia707Open in IMG/M
3300026317|Ga0209154_1199615All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia771Open in IMG/M
3300026330|Ga0209473_1257170All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia597Open in IMG/M
3300026342|Ga0209057_1215564All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia545Open in IMG/M
3300026342|Ga0209057_1247927All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia507Open in IMG/M
3300026498|Ga0257156_1125168All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia535Open in IMG/M
3300026523|Ga0209808_1257173All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia557Open in IMG/M
3300026536|Ga0209058_1000502All Organisms → cellular organisms → Bacteria33402Open in IMG/M
3300026551|Ga0209648_10826319All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia507Open in IMG/M
3300026702|Ga0208708_102374All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia534Open in IMG/M
3300027635|Ga0209625_1035451All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1113Open in IMG/M
3300028717|Ga0307298_10232361All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia546Open in IMG/M
3300028819|Ga0307296_10279037All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia910Open in IMG/M
3300028824|Ga0307310_10507425All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia608Open in IMG/M
3300028884|Ga0307308_10474437All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia600Open in IMG/M
3300031720|Ga0307469_10231968All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1469Open in IMG/M
3300031740|Ga0307468_101630220All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia604Open in IMG/M
3300031753|Ga0307477_10649963All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia708Open in IMG/M
3300031754|Ga0307475_10446776All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1039Open in IMG/M
3300031820|Ga0307473_10779544All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia680Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil28.57%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil16.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil11.69%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil11.04%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil6.49%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil5.84%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.25%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.25%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.30%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.30%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.30%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.30%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.30%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.65%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.65%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.65%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.65%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.65%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.65%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.65%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.65%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.65%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.65%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
2124908045Soil microbial communities from Great Prairies - Kansas assembly 1 01_01_2011EnvironmentalOpen in IMG/M
2228664022Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000363Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000709Amended soil microbial communities from Kansas Great Prairies, USA - Total DNA F1.4 TB amended with BrdU and acetate no abondanceEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002903Soil microbial communities from Manhattan, Kansas, USA - Sample 200um NexteraEnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005356Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaGHost-AssociatedOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012951Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MGEnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300015077Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018066Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019362Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2)EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021413Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c1EnvironmentalOpen in IMG/M
3300021418Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s2EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300023058Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m1EnvironmentalOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026305Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026498Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-AEnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026702Grasslands soil microbial communities from Kansas, USA, that are Nitrogen fertilized - NN593 (SPAdes)EnvironmentalOpen in IMG/M
3300027635Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028711Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150EnvironmentalOpen in IMG/M
3300028717Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028881Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_009574302088090014SoilMFICASALGADPVVSPSPPAASTSPAATITQSVTDLSAQRAATERSKIYRTGAAYGRWLDQVAQDSGNAFLQRSVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRKRAGEIQSTRYQSWLALSAAAIRKPLALFLWTCGGAFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALL
KansclcFeb2_107052902124908045SoilIASEANLTPSPPAPSTPSTSPAATLAPSVSEFSALNAATERSRIYRAGVAYSEWLDQIAKDSRSAFLQQRVFDRVTWMRLLASVGALALLSIFAGSFVWIVRRRAGEIQSKRYQSALALTVSAIRKPLALFLWMCGGAFALMPIATGIIGQPSRIFWVDLLTAILYAGPIV
INPgaii200_116044212228664022SoilIYCAFVMFICASALGADPVVSPSPPAASTSPAATITQSVTDLSAQRAATERSKIYRTGAAYGRWLDQVAQDSGNAFLQRSVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRKRAGEIQSTRYQSWLALSAAAIRKPLALFLWTCGGAFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALL*LVFRAIRAVEKRMRLWAER
INPgaii200_116098412228664022SoilMRIYCAFVMFICASALGADPVVSPSPPAASASPAATITQSVTDLSAQRAATERSKIYRTGAAYGRWLDQVAQDSGNAFLQRSVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRKRAGEIQSTRYQSWLALSAAAIRKPLALFLWTCGGAFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALL
ICChiseqgaiiFebDRAFT_1081679513300000363SoilMTATRILFALIGFACVSAMGAEPSATPSLSASPGASESPTATLTQSVSQLLERSKVYRAGAAYSKWLDQIAKDSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALAASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWKGALTAIFEAGWIIA
INPhiseqgaiiFebDRAFT_10083721523300000364SoilMKISCLLIVFLSQAAVAAIPTPSPSPPLSESASPAATITQSVTELSAQRAATERSKIYRTGADYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGIDSRPTRIFWARAL
INPhiseqgaiiFebDRAFT_10083780013300000364SoilMRIYCAFVMFICASALGADPVVSPSPPAASTSPAATITQSVTDLSAQRAATERSKIYRTGAAYGRWLDQVAQDSGNAFLQRSVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRKRAGEIQSTRYQSWLALSAAAIRKPLALFLWTCGGAFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALL*
F24TB_1101820123300000550SoilMKISCLLIVFICQSALAALPTPSPSPSPRESVSPAATITQSVSELSAQRAATERSKIYRTGAAYGRWLDQVAKDSGNAFLQRTVFDRVTWMRLLSCAAALALLSVVTGWFVWIVRRRAGEINSTRYQSWLALSAAAIRKPVALF
F14TC_10010160113300000559SoilMRIYCAFVMFVYASATGAEPVVSPTPPTASASPAATITQSVTELSAQRAAAERSKIFHAGEAYGRWLDQVAKDSGNTFLQRAVFDRVTWMRLLSCAVALAFLSLFSGWFVWIVRRRAGEIQSTQYQSWLALSAAATRKPVALFIWMCGGAFALLPIAAGIVSRPTRIFWASALTAILYA
KanNP_Total_F14TBDRAFT_100716623300000709SoilMRIYCAFVMFVYASATGAEPVVSPTPPTASASPVATITQSVTELSAQRAAAERSKIFHAGEAYGRWLDQVAKDSGNTFLQRAVFDRVTWMRLLSCAVALAFLSLFSGWFVWIVRRRAGEIQSTQYQSWLALSAAATRKPVALFIWMCGGAFALLPIAAGIVSRPTRIFWASALTAILYAGWIIALLWLVF
JGI10216J12902_10432502913300000956SoilMKIFCLLIIFLCQPVFAAIATPSPSPSPAVSASPAATITQSVTELSAQRAAAERSKIYWTGAAYGRWLDQVAKDSGNTFLQRAVFDRVTWMRLLSSAVALALLSLLSAWFVWIVRQRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIAAGIVDRPTRIFWARALTAILYAGWIIALLW
JGI10216J12902_10693552313300000956SoilIGAFAAESGPPASPLPMASASPAATITQSVSELSAQRAATDRSKIYRAGRDYGFWLDQIAGDSGNTFLQRVVFDRVTWMRVLSCVVALTLLSLLNCWFIWFVRRRAGEIESKRYQSWLALSAAAIRKPLALLLWMCGGAFALLPIAAGIVDRPTRIFWARALTAILYAGWIIASLWLVFRAIRAIEKRMR
JGI10216J12902_11651554313300000956SoilMKISCLLIVFLCQSAVAAIQTPSPSPLPTAPASPAATITQSVSELSAQRAATERSKIYRSGAAYGRWLDQIAKDSGNAFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFA
JGI10216J12902_11801437613300000956SoilQTVTLTNKESFVSQATHWNIEPVNFEICMKISCLLIVFLSQAAVAAIPTPSPSPPLSESASPAATITQSVTELSAQRAATERSKIYRTGADYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKP
JGI10216J12902_11940809823300000956SoilMRTVISIYCAFLMLFCTGAFAAESGPSPSPPPTASASPGATITQSVAELSAQRAATERSKIYRVGRDYGLWLDRVAKDSANDFLQQSVFDRVTWMRLLSCVAALALLSLLSGWFVWIVRRRAGEIQSTRYQSWLALSAAAIRKPVALF
F14TB_10050139823300001431SoilMILMRIYCAFVMFVYASATGAEPVVSPTPPTASASPAATITQSVTELSAQRAAAERSKIFHAGEAYGRWLDQVAKDSGNTFLQRAVFDRVTWMRLLSCAVALAFLSLFSGWFVWIVRRRAGEIQSTQYQSWLALSAAATRKPVALFIWMCGGAFALLPIAAGIVSRPTRIFWASALTAILYAGWIIALLWLVFRAIRAVEKRMRLWAERT
F14TB_10374049013300001431SoilMKIFCLLIIFFCEPVLAEVATPSPSPSPAASATPGATITQSVSELSAQRAATERSKIYRTGAAYGRWLDDLAKESRNTFLQQPAFDRVTWIRVLSCVVTLALLSLLSGWFVWMVRRRAGEIGSTRYQSWLALSAAAIRKPVALFLWTCGGAFALLPIVAGINSRPTRIFWGRGL
JGIcombinedJ26739_10179866313300002245Forest SoilTASSPATFEQSVSQLSAQSAVAERSKIYRAGAAYSSWLDQVAKNSGNAFLQRQVFDRVTWMRLATCGVTLALLSMLAGWILWIVRRRAGEIQSKRHQSWLALSAAAIRKPVALFLWMCGGGFALMPIVTGIVSRPARLFWAGALTGILYAGWIIALLWLVFRAIRAVEKR
JGI24801J43971_100113113300002903SoilMPATRILFALIGFACVSAMGAEPSATPSLSASPGASESPTATLTQSVSQLLERSKVYRAGAAYSKWLDQIAKDSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALAASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWKGALTAIFEAGWIIALPWLVFRAIRAVE
Ga0063356_10338320313300004463Arabidopsis Thaliana RhizosphereMMNATQVLFALIGFVCVSAIASEPDLTPSPSAPPAASESPAAMLAPSVSELSAQNAATERSRIHRAGAAYSKWLDQIAKDSRSAFLQQRVFDRVTWMRLLASVGALSLLSIFAGSFVWIVRRRAGEIQSKRYQSALALTASAIRKHLALFLWMCGGAFALMPIATGIVGQPSRIFWVDMLTAILYAGPIAALL
Ga0062595_10220546613300004479SoilMTARLLALVLIGVVALSAFAAPVSPTPTATASPSVSPAESPATLVESVSELSAQRVAAERSKIYQQGVDYSKWLDQVAKDSGSAFLQRRVFDHVTWMRLIASVVGLVLLSILAGTFVWFVRRRAGEIQSHRHQSALQLTAAALRKPLALFLWMCGGAFALMPIATGIVGRPTRVFYVGL
Ga0066674_1000245013300005166SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTQLSTQRAATERSKIYRTGAAYGRWLDQVAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALLWLVFRAIRAVEKRMRLWAERTRSLPGKIVVPIVG
Ga0066677_1016894923300005171SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPPATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWVIALPWLVFRAIRAVEKRMRQWADRTASLVGKVMTPIV
Ga0066673_1062941013300005175SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVA
Ga0066679_1084411713300005176SoilSSLMMKGTRVLCALLIGFTCISAIAAEPSPSPSSSESEEQTLAQSVTDLSAQRAAMERSRIYHAGADYSRWLDQVAKDSGSAFLQRSVFERVTWMRLLASAGALTLLSIFAGIFVWIVRRRAGEIQSKRYQSWPALSASAIRKPFAFLLVVCGGGFSLMPIVTGIVGRPTRVFWASTLTAILYAGWIIALLWL
Ga0066684_1055238013300005179SoilMKVFCALIIFLCQSVLAAIPTPSPSPSPVASASPVATITQSVTELSAQRAAAERSQIYRTGATYGRWLDQVAKDSGNSFLQRAVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGTFALLPIVAGINS
Ga0066684_1085003613300005179SoilLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTQLSAQRAATERSNIYRAGAAYGHWLDQMAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQRAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALLWLVFRAIRAVEKRMRLWA
Ga0066671_1055416423300005184SoilMSSRKKFVWVLSGLLGLGLSWGAAAEPTATPSPSPSPAESASPPVTIAQSVSELSAQRAAAERSKIYRTGRAYSEWLDQVAKDSGSAFLQRPIYDRITWMRVFAPAAAIVLLGLLALWFVRFVRRRAGEIQSNRYQSWLAVSASAIRKPIALFLLMCGGGFALMPIVTGIASRPTR
Ga0070674_10210652413300005356Miscanthus RhizosphereSSATTLTQSVSELSAQRAAAERSQIYRAGATYGRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWVVRRRAGLLQSKQEQSWLAVGAAAIRKPLALFIWVVGGAFALMPVVNGIASRPTRVFWARTLTGIVYAGEIIALLWLVFRAVLAIEKRMHL
Ga0066682_1003737313300005450SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVAPFSIQCGSSAQAFWLRLELQAS*
Ga0066682_1019180933300005450SoilMKISCFLLVFLCQFAVAAIPTPSPSPLPTAPASPAATITQSVSELSAQRAATERSKMYRSGAAYGRWLDQIAKDSGNAFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIF
Ga0070697_10179518913300005536Corn, Switchgrass And Miscanthus RhizosphereSELTAQRAAAERSKIYQAGATYGRWLDGIAKNSGHAFLQRQVVDRVTWMRLLTCAGTLAFLALLTGWFLWIVRRRAGAIQSKRHQSWLALSAAAIRKPLALYLWVVGGAFAFMPIVTGIVSRPTRVFWAGALTAILYAGQIVTVLWLVFRAIRAVEKRTRLWAERTGSVLGKVIVPILGQTL
Ga0066697_1064886513300005540SoilMTATRILFALIGFACISAMGAEPSATPPPSASPAASESPAATLTQSVSQLLERSKVYRAGAAYSKWLDQIAKDSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTSLFWVGALTAIFEAGWIIALPWLV
Ga0070696_10146626213300005546Corn, Switchgrass And Miscanthus RhizosphereVGAEPVVSPSPSPTVSATPATTITQSVAELSAQRAAAERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRRRAGELESTRYQSWIALSLAAIRKPVALFLWTCGGAFALLPIAAGIVSRPTRIFWARTFTAILYAGWIIALLWLVFRAIRAVEKRMRLWAERT
Ga0066661_1081903313300005554SoilELSAQRAAAERSKIYRSGATYSRWLDQVAKNSGNAFLQRPVFDRVTWVRLLSCAGALALLAVIASWFVWIVRRRAGEIQSKRYQSWLAVSASALRKPVALFLLMCGGGFALMPIVTGIVSRPSRVFWASTLTAVLYAGWIIAVLWLIFRAIRAVEKRMNLWAERTDSLLGRVIVPILGHT
Ga0066707_1081236423300005556SoilMKISCLLIVLLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRYQSWLALSAAAIRK
Ga0066700_1051654613300005559SoilMKISCFLLVFLCQSAVAAIPTPSPSPLPTALASPAATITQSVSELSAQRAATERSKIYRSGAAYGRWLDQIAKDSGKAFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRLRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVTGINSRPTRIFWGRGLTAILYAGWIIAL
Ga0066670_1052791313300005560SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPPATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWVIALPWLVFRAIRAVE
Ga0066693_1000794133300005566SoilMKISCLLIVFLCQSAVAPIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVAPFSIQCGSSAQAFWLRLELQAS*
Ga0066693_1029788213300005566SoilMKISCFLLVFLCQFAVAAIPTPSPSPLPTAPASPAATITQSVSELSAQRAATERSKVYRSGAAYGRWLDQTAKDSGNVFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVRIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIFWG
Ga0066705_1081404613300005569SoilMKISCFLLVFLRQFAVAAIPTPSPSPLPTAPASPAATITQSVSELSAQRAATERSKVYRSGAAYGRWLDQTAKDSGNVFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIFWGRGLTAIL
Ga0066694_1026147913300005574SoilMKISCFLLVFLCQFAVAAIPTPSSSPLPTAPASPAATITQSVSELSAQRAATERSKIYRSGAAYGRWLDQIAKGSCNVFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVRIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIF
Ga0066702_1010312733300005575SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPPATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFNCVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWVIALPWLVFRAIRAVEKRMRQWADRTASLVGKVMTPIVGDTLRLAVPLLVIILLLPLLRLP
Ga0066702_1015868713300005575SoilMKISCFLLVFLCQSAVAAIPTPSPSPLPTALASPAATITQSVSELSAQRAATERSKIYRSGAAYGRWLDQIAKDSGNAFLQRAVFDQVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIFWGRGLTAILYAGWIIALLWLVFRAIRAVEKRMRLWAEQTRSLLGKILVPIVG
Ga0066708_1020547013300005576SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPAATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFNCVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAG
Ga0066706_1046459823300005598SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPAATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAG
Ga0066706_1148495213300005598SoilQEFSELSAQRAAAERSKIYRAGAAYSRWLDQVAKDSGNAFLQRPVFDRVTWIRLLTCAITLAMLALLTGWFLWIVRRHAGELKSKQEQSWPALSASAIRKPLVLFVWVVGGAFALMPIVTGIVSRPTRLFWAGALTAILYAGWIIALLWLVFRAIRAVGKRMNLWAERT
Ga0070717_1199837513300006028Corn, Switchgrass And Miscanthus RhizosphereERSQIYRASAAYSRWLDQVAKNSGSEFLRTPLFDRVTWMRLLTCVTSLGLLALITGWFLWVVRRRAGLLQSKKEQSWLAVSASAIRKPLALFVWVVGGAFALMPIVNGIASRPTRIFWARTLTGIVYAGEIIALLWLVFRAVLAIEKRMHLWAERTNSVLGKVIVPITGQTLR
Ga0066651_1083943813300006031SoilFSAIAQEPSASLSPAASASPPPTIAESFSQLSTQNAAAERSKIYRAGAAYGRWLDQLAKDSDSSFLQRSVFDRVICMRLLSSGFALALIALLAGWFVWIVRRRAGEIQSNRYQSWLALSASAIRKPIALFLWMCGGAFALMPIATGIASRSTRIFWVGALTAIFYA
Ga0070716_10089841313300006173Corn, Switchgrass And Miscanthus RhizosphereMMTLIGIYCAFVMFICAGALVAEPVVSPSPSPTVSATPAATITQSVAELSAQRAAAERSKIYRTGAAYGRWLDQVAKDSGNTFLQRAVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRRRAGELESTRYQSWIALSLAAIRKPVALFFWTCGGA
Ga0070712_10123461313300006175Corn, Switchgrass And Miscanthus RhizosphereMTARLLALVLISVVCLSAVAEPPPPTPIPTTLTQSVSELSAQRAAAERSQIYRAGAAYSRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWIVRRRAGLLQSKKEQSWLAVSAAAIRKPLALFVWVIGGAFALMPIVNGIASRPTRVFWARTLTGIVYAGEIIALLWLVFRAVLAIEKRMH
Ga0075425_10111819713300006854Populus RhizosphereMTARLLALVLIGVVALSAFAAPVSPTPTATASPSVSPAESPATLVESVSELSAQRVAAERSKIYQQGVDYSKWLDQVAKDSGSAFLQRRVFDHVTWMRLIASVVGLVLLSILAGTFVWFVRRRAGEIQSHRHQSALQLAAAALRKPLALFLWMCGGAFALMPIATGIDGRPTRVFY
Ga0075425_10133402413300006854Populus RhizosphereMFICASVVEAEPVISPSPSPAASATPSTTITQSVAELSAQRAATERSKIYRTGAAYGRWLDQLAKESGNAFLQRPVLDRITWMRLISCAVALALLSLLSGWFVWIVRRRAGEIESKRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIAAGVIDRPTRIFWARALTAILYAGWIIALLWMVFRAIRAVEKRMRFWAERTRS
Ga0066710_10005032713300009012Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVALFIWMCCGTFALLPIAAGIVSRSTRIFWARALTAILYAGWIIALLWLVFRAIRAVEK
Ga0105245_1281476313300009098Miscanthus RhizosphereFPMNSTRILFALIWLTCASAMVADPTATPISSPSPAGSESPTLVESVSELSAQRAAADRSKIYRAGADYSRWLDQLAKDLDNPFLQRTVFERVTWIRLVASAGALALISVLAGTFIWVVRRRVGEIQSKRYQSTLALTLTALRKPLAFFLWMCGGAFALMPIATGMIGRPTRVFYVGLL
Ga0126374_1023779123300009792Tropical Forest SoilMEPSASPAISPTAFQPSATSQSGLQLSAQSARVERSKIYRAGAAYTQWLDQAAENSGNAFLQRQIFGRVTWMRLLSAAGALVVLSLLAGSFVWIVRRRAGEIQSKQHQSWLALTASAIRKPFALFLLMCGGALALTPIVAGIVSRPTRLFFAGALTDIFYAGSIIALLWLVFRAIRVLGKRMHLWAER
Ga0134088_1022228213300010304Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTQLSTQRAATERSKIYRTGAAYGRWLDQVAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQRAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALLWLIFRAIRAVEKRMRLWAERT
Ga0134109_1032348213300010320Grasslands SoilILFALIGFACVSAMGAEPSATPPPSASPAASESPAATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWIIALPWLVFRAIRAVEKRMQQWA
Ga0134084_1035736113300010322Grasslands SoilVMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVAPFSIQCGSSAQAFWLRLELQAA*
Ga0134086_1033733813300010323Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVALFIWMCCGTFA
Ga0134065_1029312613300010326Grasslands SoilSARFCSIIYLVCFSAIAQEPSASLSPAASASPPPTIAESFSQLSTQNAAAERSKIYRAGAAYGRWLDQLAKDSGSSFLQRSVFDRVTCMRLLNSGFALALIALLAGWFVWIVRRRAGEIQSKRYQSWLAVSASALRKPVALFLLMCGGGFALMPIVTGIVGRPTRLFWAGALTAILYAGWIIAVLWLIFRAIRAVEKRMNLWTERT
Ga0134111_1030189313300010329Grasslands SoilMIVWKKDLKQFNDLMRIYCAFVMFSCVSAIAAEPTPSPSASPTTSASPAATITQSVTQLSTQRAATERSKIYRTGAAYGRWLDQVAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIAL
Ga0134080_1004950223300010333Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALLWLVFRAIRAVEKRMRLWAERTRSLP
Ga0134080_1018440223300010333Grasslands SoilMKVFCALIIFLCQSVLAAIPTPSPSPSPVASASPVATITQSVTELSAQRAAAERSQIYRTGATYGRWLDQVAKDSGNSFLQRAVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGTFALLPIVAGINSRPTRIFWGRALTAILYAGWIIALLWLVFRAIRAVEKRMRVWAERTRSLPGKIVVPIVG
Ga0134063_1025043213300010335Grasslands SoilMYRSGAAYGRWLDQIAKDSGNAFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIFWGRGLTAILYAGWIIALLWLVFRAIRAVEKRMRLWAERTRSLPGKIIVPIVGQS
Ga0134062_1019282313300010337Grasslands SoilMKISCFLLVFLCQFAVAAIPTPSPSPLPTAPASPAATITQSVSELSAQRAATERSKIYRAGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIFWGRGLTAILYAGWIIALLWLVFRAIRAVEKRMRFWAEQTRSLLGKILVPI
Ga0126377_1065741813300010362Tropical Forest SoilMRTYCAFVIFICASAFGAEPVASPSPLPTTSGSPAATITQSVTELSAQRAATDRSKIYRTGAAYGRWLDQIAKDSGNAFLQRAVFDQVTWMRVLSCAAALALLSLFSGWFVWMVRRRAGEIQSTREQSWLALSAAAIRKPVALFLWVCGGAFVLLPIVAGISSRPTRIFWGRALTAILYAGWIIALLWLIFRAI
Ga0134128_1104036323300010373Terrestrial SoilVSPSSATTLTQSVSELSAQRAAAERSQIYRAGAAYSRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWIVRRRAGLLQSKKEQSWLAVSAAAIRKPLALFVWVIGGAFALMPIVNGIASRPTRVFLARTLTGSVYAGEIIARLWLVFRAVLAIEKRMH
Ga0137392_1081774613300011269Vadose Zone SoilMISRKKIHGSASLMWALCALIGLVYYASGSMAAEPSASPPPSPVASPATSPVATLEQSFSELSAQRAAVERSKIYQAGATYSRWLDGVAKNSGYTFLQRQVFDRVTWMRLLTCAGTLAFLALLTGWFLWIVRRRAGEIQSKRHQSWLALSAAAIRKPLALYLWVVGGAFAFMPIVTGIVSRPTRVFWAGALTAILYAGQIVTVL
Ga0137393_1021235113300011271Vadose Zone SoilMISRKKIHGSASLMWALCALIGLVYYASGSMAAEPSASPPPSPVASPATSPVATLEQSFSELSAQRAAAERSKIYQAGATYSRWLDGVAKNSGYTFLQRQVFDRVTWMRLLTCAGTLAFLALLTGWFLWIVRRRAGEIQSKRHQSWLALSAAAIRKPLALYLWVVGGAFAFMPIVTGIVSRPTRV
Ga0137383_1061046913300012199Vadose Zone SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPAATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTTGLFWVGALTAIFEAGWVIALPWLVFRAIRAV
Ga0137382_1066370013300012200Vadose Zone SoilMKISCFLLVFLCQFAVAAIPTPSPSPLPTAPASPAATITQSVSELSAQRAATERSKVYRSGAAYGRWLDQTAKDSGNVFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVRIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIFWGRGLTA
Ga0137365_1104159213300012201Vadose Zone SoilALIIFLCQSVLAAIPTPSPSPSPVASASPVATITQSVTELSAQRAAAERSQIYRTGATYGRWLDQVAKDSGNTFLQRAVFDRVTWMRLLSSAVALALLSLLSAWFVWIVRQRAGEIESTRYQSWLALSLAAIRKPVALFLWMCGGAFALLPIAAGIDARPTRIFWARALTAILYAGWIIALLWLVFRAIRAVEKRM
Ga0137399_1106447513300012203Vadose Zone SoilMPSPSPSPVESESPAATIVQSVSELSAQRAAAERSKIYRSGATYSRWLDQVAKNSGSAFLQRPVFDRVTWMRLFASARALALLALLAGWFVWIVRRRAGTIQSGRPQSWLALSASAIRKPFAFLLMVCGGGFALMPIATGIASRPTRVFWVGVLTGLLYAGWIIAVLWLIFRAIRAVEKRMNLWAERTNSLL
Ga0137399_1162858213300012203Vadose Zone SoilMPTPSASPSLPASASPVTTITQSVTELSAQGAAAERSKIYRTGAAYGRWLDQLAKDSGNAFLQGAVFDRVTWMRLLSCVLALALLSSFTAWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWTCGGAFALPPIVAGIDS
Ga0137381_1008115113300012207Vadose Zone SoilMMSSRTTFFLYALIVCVCVSAFAAEPSPTPVPSPSPSASESPVATIAQSVTELSAQRAAAERSKIYRSGATYSRWLDQVAKDSGSAFLQRTVFDRVTWMRLLSCAAALALFAAIASWFVWIVRRRAGEIQSKRYQSWLAVSASALRKPVALFLLMCG
Ga0137376_1014456533300012208Vadose Zone SoilMKISCFLLVFLCQFAVAAIPTPSPSPLPTAPASPAATITQSVSELSAQRAATERSKVYRSGAAYGRWLDQTAKDSGNVFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVRIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIFWGRGLTAIFYAGWIIALLWLVFR
Ga0137376_1085992713300012208Vadose Zone SoilMKISCLLIVFLCQSAVAAIPTPLPSSSVTASVSPAATITQSVTQLSAQRAATERSNIYRAGAAYGHWLDQMAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQRAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGSIIALLWLVFRAIRAVEKRMRLWAERTRSLPGKII
Ga0137366_1068938413300012354Vadose Zone SoilMFVCASVVGAEPVVSPSSSPMISASPAATITQSVSELSAQRAATERSKIYWTGAAYGRWLDQVAKDSGNAFLQRPVFDRVTWVRLLSCVVALALLSLLTGWFVWIVRRRAGEIESTRYQSWLALSAAAVRKPVALFLWMCGGAFALLPIAAGIISRPTRIFWARALTAILYAGWIIALLWLVFRAIRAVEKR
Ga0137371_1082656313300012356Vadose Zone SoilMKVFCALIIFLCQSVLAAIPTPSPSPSPVASASPVATITQSVTELSAQRAAAERSQIYRTGATYGRWLDQVAKDSGNAFLQRPVFDRVTWMRLLSCVVALALLSLLSGWFVWIVRQHAGEIESTRYQSWLALSGAAIRKPVALFLWMCGGTFALLPIVAGINSRPTRIFW
Ga0137375_1082136913300012360Vadose Zone SoilMKLFYLLIVFLWQPALAAVPTPSPSPPLSESASPAATITQSVSELSAQRAATERSKIYWTGAAYGRWLDQVAKDSGNAFLQRPVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRRRAGEIESTRYQSWLALSAAAVRKPVALFLWMCGGAFALLPIAAGIISRPTRIFWARALTAILYAGWIIALLWLVFRAIRA
Ga0137373_1019935823300012532Vadose Zone SoilMKISCLLIVFLCQSAVAAIPTPSPSPSLTASASPAARITQSVTELSAQRAATERSKIYWTGAAYGRWLDQVAKDSGNAFLQRPVFDRVTWMRLLSSVVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLP
Ga0137373_1048248313300012532Vadose Zone SoilMFVCASVVGAEPVISPSPSPPASVTPVTTITQSVTELSAQRAATDRSKIYRTGAAYGHWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCVIALGLLSLLTRWFVWIVRRRAGEIGSTRYQSWLALSAAAIRKPVAL
Ga0137373_1071499513300012532Vadose Zone SoilMRLFYLLIVFLWQPALAAVPTPSPSPPLSESASPAATITQSVSELSAQRAATERSKIYWTGAAYGRWLDQVAKDSGNAFLQRPVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRRRAGEIESTRYQSWLALSAAAVRKPVALFLWMCGGAFALLPIAAGIISRPTRIFWARALTAILYAGWIIALLWLVFRAIRA
Ga0137396_1064005513300012918Vadose Zone SoilMFVCASVVGAEPVVSPSPSPTVSATPATTITQSVAELSAQRAAAERSKIYRTGAAYGRWLDQVAENSGNTFLQRAVFDRVTWMRLLSCAVALALLSLLNFWFVSIVRRRAGEIASTRYQSWLALSAAAIRKPVAFFLCVCGGAFALLPIAAGIASRPPRIFWARALTAILYAGWIIALLWLVFRAIRAIEKRMRLWAERTRSLPGKIVVPIVGHT
Ga0137396_1109659213300012918Vadose Zone SoilPAASESPAATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTALFWVGALTAIFEAGWVIALPWLVFRAIRAVEKRMRQWADRTASLVGKVMTPIVGD
Ga0137394_1061189213300012922Vadose Zone SoilMTARLLALVLISVVCLSAVATPLSPAPTATASPSVSPAESPATLVESVSELSAQRVAAERSKIYRQGVDYSKWLDQVAKDSGSAFLQRRVFDHVTWMRLIASVVGLVLLSILGGTFVWFVRRRAGEIQSHRHQSALQLTAAALRKPLALFLWMCGGAFALMPIATGIVGRPTRVFYVGLLTAIL
Ga0137407_1165489313300012930Vadose Zone SoilMFICASVVGAEPIISPSPSPTVSAAPATTITQSVSELSAQRAATERSKIYRAGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRVFSCVVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALF
Ga0137407_1205127613300012930Vadose Zone SoilALNGLVWLSTSAMAVEPNASPSPSPAASPPATLTEEFSELSAQRAAAERSKIYRAGAAYSRWLDQVAENSGSAFLQRPVFDRVTRMRLLTCAITLAMLALLTGWFLWIVRRHAGELKSKREQSWLALSASAIRKPLVLFVWVVGGAFALMPIVTGIVSRPTRLFWAGALTAILYAGWIIA
Ga0137410_1057707923300012944Vadose Zone SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPAATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWVIALPWLVFRAIRAVEKRMRQWADRTASLVGKVMTPI
Ga0164300_1015101913300012951SoilVSPSSATTLTQSVSELSAQRAAAERSQIYRAGAAYSRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWIVRRRAGLLQSKKEQSWLAVSAAAIRKPLALFVWVIGGAFALMPIVNGIASRPTRVFWARTLTGIVYAGEIIALLWLVFRAVHAIEKRMHLWAERTNSVLGKVIVPITGQTLRLAVP
Ga0164303_1132263413300012957SoilTLTQSVSELSAQRAAAERSQIYRAGATYGRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWVVRRRAGLLQSKQEQSWLAVGAAAIRKPLALFVWVVGGAFALMPVVNGIASRPTRVFWARTLTGIVYAGEIIALLWLVFRAVLAIEKRMHLWAERTNSVLGK
Ga0164301_1143553513300012960SoilMTARLLALVLIGVVALSAFAAPLSPTPTATASPSVSPAESPATLVESVSELSAQRVAAERSKIYQQGVDYSKWLDQVAKDSGSAFLQRRVFDHVTWMRLIASVVGLVLLSILAGTFVWFVRRRAGEIQSHRHQSALQLTAAALRKPLALFLWMCGGAFALMPIATGIVGRPTRVFYVGLLTAILYA
Ga0164301_1154508113300012960SoilSELSAQRAAAERSQIYRAGATYGRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWVVRRRAGLLQSKQEQSWLAVGASAIRKPLALFVWVVGGAFALMPVVNGIAFRPTRVFWARTLTGIVYAGEIIALLWLVFRAVLAIEKRMHLWAERTNSVLGKVIVPITGQTL
Ga0134077_1007348723300012972Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRHQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALLWLVFRAIRAVEKRMRLW
Ga0134087_1000583913300012977Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTQLSAQRAATERSNIYRAGAAYGHWLDQMAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQRAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALLWLVFRAIRA
Ga0157374_1093626113300013296Miscanthus RhizosphereVSPSSATTLTQSVSELSAQRAAAERSQIYRAGAAYSRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTTLGLLALITGWFLWIVRRRAGLLQSKKEQSWLAVSAAAIRKPLALFVWVVVGAFALMPIVNGIASRPTRVFWARTLTGIVYAGEI
Ga0134079_1025581213300014166Grasslands SoilMKISCFLLVFLCQFAVAAIPTPSPSPLPTAPASPAATITQSVSELSAQRAATERSKVYRSGAAYGRWLDQTAKDSGNVFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSR
Ga0157380_1334185113300014326Switchgrass RhizosphereATPISSPSPAGSESPTLVESVSELSAQRAAADRSKIYRAGADYSRWLDQLAKDLDNPFLQRTVFERVTWIRLVASAGALALISVLAGTFVWVVRRRVGEIQSKRYQSTLALTLTALRKPLAFFLWMCGGAFALMPIATGMIGRPTRVFYVGLLTAIFYAGWIIALLWLVF
Ga0173483_1046674513300015077SoilVSPSSATTLTQSVSELSAQRAAAERSQIYRAGAAYSRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWIVRRRAGLLQSRKEQSWLAVSAAAIRKPLALFVWVIGGAFALMPIVNGIASRPTRVFWARTLTGIV
Ga0137418_1056016923300015241Vadose Zone SoilMPSPSPSPVESESPAATIVQSVSELSAQRAAAERSKIYRSGATYSRWLDQVAKNSSSAFLQRTVFDRVTWMRLLASAGALALLALLAGWFVWIVRRRAGEIQSNRPQSWLALSASAIRKPFAFLLVVCGGGFALMPIVTGIVGRPTRLFWAGALTGLLY
Ga0137403_1037859113300015264Vadose Zone SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTQLSAQRAATERSNIYRAGAAYGHWLDQMAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQRAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALLWLVFRAIRAVEKRMRLWAERTRSLPG
Ga0134073_1004287113300015356Grasslands SoilSPASSESPAATLTQSVSQLSALNAAMERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWVIALPLARFPRHSRG*
Ga0134089_1020731513300015358Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAI
Ga0134074_124700513300017657Grasslands SoilMKVSCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRYQSWLALSAAAIRKPVAL
Ga0134083_1022254213300017659Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQRAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALLWLVFRAIRAVEKRMRLWAERTRSLPGK
Ga0184617_107584013300018066Groundwater SedimentMIPMRIYCALVMFLCASVVWAEPDISPSPSPTASTTLSTTITQSVADLSTQKAAAERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAAALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIAAGVIDRPTRIFWARALTAILYAGWIIALLWLVFRAIRAAEKRMRL
Ga0066655_1015903613300018431Grasslands SoilMTTTRVLVALIGFVCISAIRAEPSATPSPSASPASSESPAATLTQSVSQLLERSKLYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWVIALPWLVFRAIRAVEK
Ga0066667_1012863413300018433Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTQLSAQRAATERSNIYRAGAAYGHWLDQMAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAA
Ga0066667_1040526113300018433Grasslands SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVAPFSIQCGSSAQAFWLRLELQAS
Ga0066667_1041022723300018433Grasslands SoilMRIFCLLIVFLYQTALAATPTPSSSPSPIASASPAATITQSVSELSAQRAATERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRVFSCVVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRVLWG
Ga0066662_1205901113300018468Grasslands SoilAGAEPSASPSPIESASPAATIEKSISDLSAQRAAAERSKIYRSGATYSRWLDRVAKNSGRAFLQRTVFDRVTWMRLLSCAAALALLAVIASWFVWIVRRRAGEIQSKRYQSWLAVSASALRKPVALFLLMCGGGFALMPIVTGIVGRPTRLFWAGALTAILYAGWIIAVLWLIFRAIRAVEKRMNLWAERTNSLLGKVI
Ga0066669_1019835123300018482Grasslands SoilMICLVCFSAIAQEPSASLSPAASASPPPTIAESFSQLSTQNAAAERSKIYRAGAAYGRWLDQLAKDSGNSFLQRSVFDRVICMRLLSSGFALALIALLAGWFVWIVRRRAGEIQSNRYQSWLALSASAIRKPIALFLWMCGGAFALMPIATGIASRSTRIFWVGALTAIFYAGWIIAVLWLVFRAIRAVEKRMRLWAERTGSLVG
Ga0066669_1069431123300018482Grasslands SoilMRIFCLLIVFLYQTALAATPTPSSSPSPIASASPAATITQSVSELSAQRAATERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVNWMRVFSCVVALALLSLLSGWLVWIVRRHAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIFWGRALTAILYA
Ga0066669_1189109113300018482Grasslands SoilSPTPSPSPSPVESASPAATLTQSVSELSAQRAAAERSKIYRTGVTYSRWLDQVAKDSGNAFLQRPVFDHVTWMRLLSCVAALVLLALLASWFVWIVRRRAGEIQSDRPQSWLALSASAIRKPFALLLLACGGGFVLMPIATGIVSRPTRLFWVGALTGVLYAGWIIALLWLVFRAIRAVEKRMN
Ga0173479_1021264023300019362SoilMTARLLALFLISVMCFSAVAEPPSPTPTTSPTGSPAATLTQSVSELSAQRAAAERSQIYRTGATYGRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWIVKRRAGLLQSKKEQSWLAVSAAAIRKPLALFVWVIGGAFALMPIVNGIASRPTRVFWARTLTGIVYAGEIIALLWLVFRAVH
Ga0193707_106037123300019881SoilMTARLLALVLIGVVALSALATPLSPTPTATASPSVSPAESPATLVESVSELSAQRVATERSKIYQQGVDYSKWLDQVAKDSGSAFLQRRVFDHVTWMRLIASVVGLVLLSILGGTFVWFVRRRAGEIQSHRHQSALQLTAAALRKPLALFLWMCGGAFALMPIATGIVGRPTRVFYVGLLTAILYAGWIVALLWLVFRAIRAIEKR
Ga0193755_106985023300020004SoilMTLMRIYCAFVMFVYASVVGAEPIVSPSPSPTVSATPAATITQSVAELSAQRAAAERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAVALALLSLLNCWFVWIVRRRAGEIGSTRYQSWLALSAAAIRKPVALFLWTCGGAFALLPIAAGIVSRPSRIFWASALTAILYAGWIIALLWLVFRAIRAVEKRMRMWAERTRSLPGKI
Ga0193755_121659913300020004SoilAASESPAATLTQSVSELSAQRAAAERSQIYRAGAAYSRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWVVRRRAGLLQSKKEQSWLAVGAAAIRKPLALFVWVVGGAFALMPIVNGIASRPTRVFWARTLTGIVYAGEIIALLWLVFRAVLAIEKRMHLWA
Ga0179594_1041325613300020170Vadose Zone SoilMRIYCALGMFLCASVVGAEPVISPSPASSPTPSTTITQSVAELSAERAAAERSKIYRAGAAYGRWLDEIAKDSRNAFLQRAVFDRVTWMRLLSCGAALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIAAGIVDRPMRIFWGR
Ga0210399_1126594313300020581SoilTSPTGSPAVSPSSATTLTQSVSELSAQRAAAERSQIYRAGATYGRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWVVRRRAGLLQSKQEQSWLAVGAAAIRKPLALFVWVVGGAFALMPVVNGIASRPTRIFWARTLTGIVYAGEIIALLWLVFRAVLAIEKRMHLWAERTNSVLGKV
Ga0193750_107639113300021413SoilEPVVSPSPPPSESPSPVATITQSVAELSAQRAATERSKIYRSGAAYGRCLDQVAKDSGNAFLQRTVFDRVTWMRLLSSALALALLSVVTGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFVLLPIAAGIISRPTRIFWARALTAILYAGWIIALLWLVFRAIRAVEKRMRLWAERTRSLPGKIVVPIVGHSLRLSVP
Ga0193695_102876423300021418SoilMTARLLALVLIGVVALSALATPLSPTPTATASPSVSPAESPATLVESVSELSAQRVATERSKIYQQGVDYSKWLDQVAKDSSSAFLQQRVFDHVTWMRLIASVVGLVLLSILGGTFVWFVRRRAGEIQSHRHQSALQLTAAALRKPLALFLWMCGGAFALMPIATGIVGRPTRVFYVGLLTAILYAGWIVALLWLVFRAI
Ga0222623_1035537213300022694Groundwater SedimentAFGMFVCASVVGAEPVVSPSPSPTVSATPATTITQSVAELSAQRAATERSKIYRTGAAYGRWLDEIAKDSGNAFLQRGVFDRVTWMRLLSCAVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGIDSRPTRIFWGRALTAILYAGWIIALLWLVF
Ga0193714_100582613300023058SoilMIPMRIYCALVMFLCASVVWAEPDISPSPSPTASTTLSTTITQSVADLSTQKAAAERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAAALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFVL
Ga0207687_1128034913300025927Miscanthus RhizosphereMNSTRILFALIWLTCASAMVADPTATPISSPSPAGSESPTLVESVSELSAQRAAADRSKIYRAGADYSRWLDQLAKDLDNPFLQRTVFERVTWIRLVASAGALALISVLAGTFIWVVRRRVGEIQSKRYQSTLALTLTALRKPLAFFLWMCGGAFALMPIATGMIGRPTRVFYVGLL
Ga0207641_1204998313300026088Switchgrass RhizospherePSSATTLTQSVSELSAQRAAAERSQIYRAGAAYSRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWIVRRRAGLLQSKKEQSWLAVSAAAIRKPLALFVWVIGGAFALMPIVNGIASRPTRVFWARTLTGIVYAGEIIALLWLVFRAVHAIEKRMHLWAERTNSVLGKVIVPITGQ
Ga0209688_109127513300026305SoilMKISCFLLVFLCQFAVAAIPTPSPSPLPTAPASPAATITQSVSELSAQRAATERSKVYRSGAAYGRWLDQTAKDSGNVFLQRAVFDRVTWMRLLSCAVALALLSLLSGWFVRIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINSRPTRIFW
Ga0209686_115203213300026315SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPPATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWVIALPWLVFRAIRAVEK
Ga0209154_119961513300026317SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPPATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFNRVTWIRLLSSVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWVIALPWLVFRAIRAIEKRMRQWADRTASLVGKVMTPIVGDTLRLAVPLLVIILLLPLLRLPKN
Ga0209470_104831833300026324SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTQLSTQRAATERSKIYRTGAAYGRWLDQVAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTR
Ga0209473_101485033300026330SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTQLSAQRAATERSNIYQAGAAYGHWLDQMAKSSGNTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVWIVRQRAGEIESTRYQSWLALSAAAIRKPVALFIWMCGGTFALLPIAAGIVSRPTR
Ga0209473_125717013300026330SoilMTATRILFALIGFACVSAMGAEPSATPPPSASPAASESPPATLTQSVSQLLERSKVYRAGAAYSKWLDQIATHSGNAFLQRPVFNCVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWVGALTAIFEAGWV
Ga0209057_121556413300026342SoilAQRAAAERSQIYRTGATYGRWLDQVAKDSGNSFLQRAVFDRVTWMRLLSCVVALALLSLLTGWFVWIVRQHAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGTFALLPIVAGINSRPTRIFWGRALTAILYAGWIIALLWLVFRAIRAVEKRMRVWAERTRSLPGKIVVPIVGHSLRF
Ga0209057_124792713300026342SoilSPASSESPAATLTQSVSQLSALNAAMERSKVYRAGAAYSKWLDQIAKDSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALTASAIRKPVALFLWMCGGAFALMPIATGIIGRPTSLFWVGALTAIFEAGWIIALPWLVFRAI
Ga0257156_112516813300026498SoilAASESPVATIAQSVSELSAQRAAAERSKIYRAGADYGRWLDNLAKNSGSAFLQRPVFDRVAWMRLLASAGALTLLALLAGWFVWIVRRRAGEIQSSRPQSWLALSASAIRKPFALFLLMCGGGFALMPIVTGIVGRPTRLFWAGALTAILYAGWIIAVLWLIFRAIRAVEKRTNLWAE
Ga0209808_125717313300026523SoilASPAATLTQSVSELSAQRAAAERSKIYRTGVTYSRWLDQVAKDSGNAFLQRPVFDHVTWMRLLSCVAALVLLALLASWFVWIVRRRAGEIQSDRPQSWLALSASAIRKPFALLLLACGGGFALMPIVTGIVGRPTRLFWAGALTGVLYAGWIIALLWLIFRAIRAVEKRMNLWAERMNSLLGKVV
Ga0209058_100050213300026536SoilMKISCLLIVFLCQSAVAAIPTPLPSPSVTASVSPAATITQSVTELSAQRAAAERSKIYRTGAAYGRWLDQVAKNSGSTFLQRAIFDRVTWVRLLSCVVALALLSLLTGWFVGIVRQHAGEIESTRYQSWLALSAAAIRKPVALFIWMCCGTFALLPIAAGIVSRPTRIFWARALTAILYAGWIIALL
Ga0209156_1037579213300026547SoilMKISCFLLVFLCQSAVAAIPTPSPSPLPTALASPAATITQSVSELSAQRAATERSKIYRSGAAYGRWLDQIAKDSGNAFLQRAVFDRVTWMRVFSCVVALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIVAGINS
Ga0209648_1082631913300026551Grasslands SoilVLISVVCLNAVAEPPLPTPIPTKSPAGSPAVSPSSATTLTQSVSELSAQRAAAERSQIYRAGATYSRWLDQVAKNSGSEFLRRPLFDRVTWMRLLTCVTSLGLLALITGWFLWVVRRRAGLLQSKQEQSWLAVGAAAIRKPLALFVWVVGGAFALMPVVNGIASRPTR
Ga0208708_10237413300026702SoilIGFACVSAMGAEPSATPSPSASPGASESPTATLTQSVSQLLERSKVYRAGAAYSKWLDQIAKDSGNAFLQRPVFDRVTWIRLLASVGALALLSIFAGWFVWFVRRHAGEIQSHRYQSALALAASAIRKPVALFLWMCGGAFALMPIATGIIGRPTGLFWKGALTAIFEAGWIIALPW
Ga0209625_103545123300027635Forest SoilMATEPSASPSPSPVGSPAAFPVATLEQSFSELTAQRAAAERSKIYRAGATYSRWLDGIAKNSGYAFLQRQVVDRVTWMRLLTCAGTLAFLALLTGWFLWIVRRRAGAIQSKRHQSWLALSAAAIRKPLALYLWVVGGAFAFMPIVTGIVSRPTRVFWAGALTAILYS
Ga0137415_1101147413300028536Vadose Zone SoilMPTPSASPSLPASASPVTTITQSVTELSAQGAAAERSKIYRTGAAYGRWLDQLAKDSGNAFLQGAVFDRVTWMRLLSCVLALALLSSFTAWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVAPFLWTCG
Ga0307293_1017394723300028711SoilMIPMRIYCALVMFLCASVVWAEPDISPSPSPTASTTLSTTITQSVADLSTQKAAAERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAAALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCG
Ga0307298_1023236113300028717SoilSTTLSTTITQSVADLSTQKAAAERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAAALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIAAGIVDRPTRIFWGRAFTAILYAGGIIALLWLVFRAIRAVEKRMRLWAERTRS
Ga0307296_1027903713300028819SoilMIPMRIYCALVMFLCASVVWAEPDISPSPSPTASTTLSTTITQSVADLSTQKAAAERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAAALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFVLLPIAAGIVDRPTRIFWGRALTAT
Ga0307310_1050742513300028824SoilVFMIPMRIYCALVMFLCASVVWAEPDISPSPSPTASTTLSTTITQSVADLSTQKAAAERSKIYRTGAAYGRWLDQVAKDSGNAFLQRAVFDRVTWMRLLSCAAALALLSLLSGWFVWIVRRRAGEIESTRYQSWLALSAAAIRKPVALFLWMCGGAFALLPIAAGIVDRPTRIFWGRAFTAILYAGGIIALLWLVFRAIRAV
Ga0307277_1012356333300028881SoilMKLFYLLIVFLWQPALAAIPTPSTSPPPSESASPAATITQSVSELSAQRAATERSKIYWTGAAYGHWLDEVAKDSGNAFLQRPVFDRVTWMRLLSSVVALALLSLLTGWFVWIVRRRAGEIESTRYQSWLALSAAAVRKPV
Ga0307308_1047443713300028884SoilRILCALICLLCLSSGWMTAAEPSASPSPSPIESESPPATIAQSVSELSAQRAAAERSKIYRAGVAYSRWLDQAGKDSGNAFLQRPVFDRVTWVRLLSCAGALALLAVIASWFVWIIRRRAGEIQSKRYQSWLAVSASAIRKPVALFLLMCGGGFALMPIVTGIVSRPTRVFWAGTLTAVLYAGWIISSLWLIFRAIRAV
Ga0307469_1023196833300031720Hardwood Forest SoilMAATRIFLIGFICVSAIAAEPGSTPVPSPSPSASESPVATIAQSVTELSAQRVAAERSKIYRSGATYSRWLDQVAKDSDSAFLQRPVYDRITWMRLLVSAGAVALLGLISLWFVRFVRRHAGTIQSNRYQSWLAVSASAIRKPAALFLLMCGGGFALMPIVTGIASRPTRVFWA
Ga0307468_10163022013300031740Hardwood Forest SoilFICVSAIAAEPGSTPVPSPSASASESPVATIAQSVTELSAQRVAAERSKIYRSGATYSRWLDQVAKDSDSAFLQRPVYDRITWMRLLVSASAVALLGLISLWFVRFVRRHAGTIQSNRYQSWLAVSASAIRKPAALFLLMCGGGFALMPIVTGIASRPTRVFWAGTLTAVLYAGWIIALLWLMFRAVRAVEKRMTLWAERT
Ga0307477_1064996313300031753Hardwood Forest SoilMKRILCALICFGCLSSSWIKAAEPSPSPAPSPVETVSTAATLTQSVSELSAQRAMAERSKIYRAGADYSRWLDQIAKDSGSAFLQRTVFDRVTWMRLLSCAAALFLLGLVASWFVWIVRRRAGEIQSNRYQSWLALSASAIRKPFAFLLVVCGGGFALMPIVTAIVGRPTRVFWAGTLTAVLYAGWIIAFLWLI
Ga0307475_1044677613300031754Hardwood Forest SoilMKRILCALICFGCLSSSWIKAAEPSPSPAPSPVETESTAATLTQSVSELSAQRAMDERSKIYRAGADYSRWLDQIAKDSGSAFLQRTVFDRVTWMRLLSCAAALFLLTLIASWFVWIVRRRAGEIQSNRYQSWLALSASAIRKPFAFLLVVCGGGFALMPIVTAIVGRPTRVFWAGTLTAVLYAGWIIAFLWLIFRAIRAVEKRMNQWAEQTDSLL
Ga0307473_1077954413300031820Hardwood Forest SoilMATEPSPASSPSPVASPAASPLATLEQSVSELTAQRAAAERSKIYQAGATYGRWLDGIAKNSGHAFLQRQVVDRVTWMRLLTCAGTLAFLALLTGWFLWIVRRRAGAIQSKRHQSWLALSAAAIRKPLALYLWVVGGAFAFMPIVTGIASRPTRVFWAGALTAILYAGQIVTVLWLVFRAIRAVEKRTRLWAERTGSVLGKVIVPILGQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.