NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F058048


Overview

Basic Information
Family ID F058048
Family Type Metagenome
Number of Sequences 135
Average Sequence Length 190 residues
Representative Sequence MLLLATACERPTTAGPAPSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGNGADAFGFRAQGSVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ
Number of Associated Samples 92
Number of Associated Scaffolds 135

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 55.97 %
% of genes near scaffold ends (potentially truncated) 42.96 %
% of genes from short scaffolds (< 2000 bps) 63.70 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.66


Most Common Taxonomy
Group Bacteria (99.259 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(35.556 % of family members)
Environment Ontology (ENVO) Unclassified
(51.111 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.852 % of family members)
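
The percentages above can be turned back into approximate sequence counts. The following is a minimal sketch, assuming each percentage is taken over the 135 sequences listed under "Number of Sequences"; the function name and dictionary labels are illustrative and the resulting counts are derived here, not reported by the database.

```python
FAMILY_SIZE = 135  # "Number of Sequences" reported for family F058048


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage of family members to the nearest whole count."""
    return round(percent / 100 * total)


# Percentages copied from the Most Common Taxonomy / Ecosystem entries.
reported = {
    "Bacteria (NCBI taxonomy)": 99.259,
    "Vadose Zone Soil (GOLD)": 35.556,
    "Unclassified (ENVO)": 51.111,
    "Soil, non-saline (EMPO)": 51.852,
}

# Assumption: every percentage uses the same denominator (FAMILY_SIZE).
counts = {label: members_from_percent(pct) for label, pct in reported.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} sequences")
```

Under that assumption each percentage maps to a whole number of sequences, which is a quick consistency check on the reported figures.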




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 6.36%, β-sheet: 33.18%, Coil/Unstructured: 60.45%

Predicted 3D Structure

pTM-score: 0.66

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 135 Family Scaffolds
PF07676 | PD40 | 16.30
PF05685 | Uma2 | 6.67
PF00202 | Aminotran_3 | 5.19
PF01042 | Ribonuc_L-PSP | 3.70
PF00041 | fn3 | 2.96
PF12770 | CHAT | 2.96
PF13205 | Big_5 | 2.96
PF13540 | RCC1_2 | 2.96
PF03704 | BTAD | 2.22
PF01144 | CoA_trans | 0.74
PF06224 | HTH_42 | 0.74
PF08281 | Sigma70_r4_2 | 0.74
PF16326 | ABC_tran_CTD | 0.74
PF12441 | CopG_antitoxin | 0.74
PF00578 | AhpC-TSA | 0.74
PF03781 | FGE-sulfatase | 0.74
PF06439 | 3keto-disac_hyd | 0.74
PF03825 | Nuc_H_symport | 0.74
PF12728 | HTH_17 | 0.74
PF00801 | PKD | 0.74
PF00005 | ABC_tran | 0.74
PF00082 | Peptidase_S8 | 0.74
PF00069 | Pkinase | 0.74
PF02838 | Glyco_hydro_20b | 0.74
PF00326 | Peptidase_S9 | 0.74

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 135 Family Scaffolds
COG4636 | Endonuclease, Uma2 family (restriction endonuclease fold) | General function prediction only [R] | 6.67
COG0251 | Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 family | Defense mechanisms [V] | 3.70
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 2.96
COG3629 | DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain | Transcription [K] | 2.22
COG3947 | Two-component response regulator, SAPR family, consists of REC, wHTH and BTAD domains | Transcription [K] | 2.22
COG1262 | Formylglycine-generating enzyme, required for sulfatase activity, contains SUMF1/FGE domain | Posttranslational modification, protein turnover, chaperones [O] | 0.74
COG1788 | Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit | Lipid transport and metabolism [I] | 0.74
COG2057 | Acyl-CoA:acetate/3-ketoacid CoA transferase, beta subunit | Lipid transport and metabolism [I] | 0.74
COG2814 | Predicted arabinose efflux permease AraJ, MFS family | Carbohydrate transport and metabolism [G] | 0.74
COG3214 | DNA glycosylase YcaQ, repair of DNA interstrand crosslinks | Replication, recombination and repair [L] | 0.74
COG3525 | N-acetyl-beta-hexosaminidase | Carbohydrate transport and metabolism [G] | 0.74
COG4670 | Acyl CoA:acetate/3-ketoacid CoA transferase | Lipid transport and metabolism [I] | 0.74



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.26 %
Unclassified | root | N/A | 0.74 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002560|JGI25383J37093_10065506 | All Organisms → cellular organisms → Bacteria | 1140
3300002908|JGI25382J43887_10049202 | All Organisms → cellular organisms → Bacteria | 2290
3300005166|Ga0066674_10071654 | All Organisms → cellular organisms → Bacteria | 1587
3300005172|Ga0066683_10097776 | All Organisms → cellular organisms → Bacteria | 1779
3300005174|Ga0066680_10031091 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 3021
3300005180|Ga0066685_10001783 | All Organisms → cellular organisms → Bacteria | 9609
3300005181|Ga0066678_10352905 | All Organisms → cellular organisms → Bacteria | 970
3300005440|Ga0070705_100250847 | All Organisms → cellular organisms → Bacteria | 1242
3300005446|Ga0066686_10301669 | All Organisms → cellular organisms → Bacteria | 1088
3300005446|Ga0066686_10525544 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 805
3300005451|Ga0066681_10259236 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1057
3300005467|Ga0070706_100793082 | All Organisms → cellular organisms → Bacteria | 877
3300005536|Ga0070697_101156218 | All Organisms → cellular organisms → Bacteria | 689
3300005540|Ga0066697_10598190 | All Organisms → cellular organisms → Bacteria | 612
3300005546|Ga0070696_100923081 | All Organisms → cellular organisms → Bacteria | 725
3300005552|Ga0066701_10014644 | All Organisms → cellular organisms → Bacteria | 3717
3300005553|Ga0066695_10271193 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1072
3300005554|Ga0066661_10228500 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1153
3300005555|Ga0066692_10082184 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1873
3300005555|Ga0066692_10549137 | All Organisms → cellular organisms → Bacteria | 731
3300005555|Ga0066692_10857430 | All Organisms → cellular organisms → Bacteria | 557
3300005556|Ga0066707_10019045 | All Organisms → cellular organisms → Bacteria | 3605
3300005556|Ga0066707_10173983 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1377
3300005558|Ga0066698_10730903 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 649
3300005559|Ga0066700_10006181 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 5875
3300005568|Ga0066703_10607880 | All Organisms → cellular organisms → Bacteria | 636
3300005586|Ga0066691_10052725 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 2184
3300005586|Ga0066691_10491855 | All Organisms → cellular organisms → Bacteria | 733
3300005598|Ga0066706_10629035 | All Organisms → cellular organisms → Bacteria | 852
3300005598|Ga0066706_10919960 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 680
3300006034|Ga0066656_10992043 | All Organisms → cellular organisms → Bacteria | 539
3300006755|Ga0079222_11875725 | All Organisms → cellular organisms → Bacteria | 583
3300006791|Ga0066653_10502631 | All Organisms → cellular organisms → Bacteria | 614
3300006794|Ga0066658_10066316 | All Organisms → cellular organisms → Bacteria | 1601
3300006796|Ga0066665_10005061 | All Organisms → cellular organisms → Bacteria | 7028
3300006796|Ga0066665_10087850 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 2251
3300006844|Ga0075428_100390132 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1492
3300006845|Ga0075421_102411306 | All Organisms → cellular organisms → Bacteria | 550
3300006852|Ga0075433_11297176 | All Organisms → cellular organisms → Bacteria | 631
3300006854|Ga0075425_100059580 | All Organisms → cellular organisms → Bacteria | 4281
3300006854|Ga0075425_103001226 | All Organisms → cellular organisms → Bacteria | 516
3300006903|Ga0075426_10124468 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1853
3300006904|Ga0075424_100121852 | All Organisms → cellular organisms → Bacteria | 2755
3300009012|Ga0066710_103050979 | All Organisms → cellular organisms → Bacteria | 650
3300009012|Ga0066710_103459773 | All Organisms → cellular organisms → Bacteria | 598
3300009090|Ga0099827_10000338 | All Organisms → cellular organisms → Bacteria | 20417
3300009090|Ga0099827_10631172 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_69_5 | 925
3300009137|Ga0066709_100006383 | All Organisms → cellular organisms → Bacteria | 9946
3300009137|Ga0066709_100049961 | All Organisms → cellular organisms → Bacteria | 4709
3300009137|Ga0066709_100480128 | All Organisms → cellular organisms → Bacteria | 1746
3300009137|Ga0066709_102160163 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 768
3300010323|Ga0134086_10295795 | All Organisms → cellular organisms → Bacteria | 627
3300010329|Ga0134111_10060747 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1391
3300010333|Ga0134080_10162633 | All Organisms → cellular organisms → Bacteria | 951
3300010333|Ga0134080_10164470 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 946
3300010337|Ga0134062_10558847 | All Organisms → cellular organisms → Bacteria | 584
3300012198|Ga0137364_10040250 | All Organisms → cellular organisms → Bacteria | 3052
3300012199|Ga0137383_10002793 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae → Gemmatirosa → Gemmatirosa kalamazoonesis | 10808
3300012199|Ga0137383_10011392 | All Organisms → cellular organisms → Bacteria | 5964
3300012200|Ga0137382_10019318 | All Organisms → cellular organisms → Bacteria | 3870
3300012200|Ga0137382_10185416 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_69_5 | 1425
3300012201|Ga0137365_10012960 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6568
3300012201|Ga0137365_10019519 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 5277
3300012201|Ga0137365_10085460 | All Organisms → cellular organisms → Bacteria | 2376
3300012201|Ga0137365_10167060 | All Organisms → cellular organisms → Bacteria | 1650
3300012201|Ga0137365_10219085 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1420
3300012204|Ga0137374_10706452 | All Organisms → cellular organisms → Bacteria | 757
3300012206|Ga0137380_10005984 | All Organisms → cellular organisms → Bacteria | 10835
3300012206|Ga0137380_10006753 | All Organisms → cellular organisms → Bacteria | 10245
3300012206|Ga0137380_10056746 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 3586
3300012206|Ga0137380_10248340 | All Organisms → cellular organisms → Bacteria | 1604
3300012207|Ga0137381_10234680 | All Organisms → cellular organisms → Bacteria | 1592
3300012208|Ga0137376_10094023 | All Organisms → cellular organisms → Bacteria | 2532
3300012209|Ga0137379_10095694 | All Organisms → cellular organisms → Bacteria | 2857
3300012209|Ga0137379_10300365 | All Organisms → cellular organisms → Bacteria | 1516
3300012209|Ga0137379_10663969 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 948
3300012211|Ga0137377_10406199 | All Organisms → cellular organisms → Bacteria | 1300
3300012285|Ga0137370_10087970 | All Organisms → cellular organisms → Bacteria | 1733
3300012349|Ga0137387_10007739 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_40CM_3_65_5 | 6143
3300012349|Ga0137387_10008759 | All Organisms → cellular organisms → Bacteria | 5845
3300012350|Ga0137372_10057307 | All Organisms → cellular organisms → Bacteria | 3408
3300012350|Ga0137372_10154821 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_40CM_3_65_5 | 1869
3300012350|Ga0137372_10198346 | All Organisms → cellular organisms → Bacteria | 1609
3300012350|Ga0137372_10204948 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1576
3300012351|Ga0137386_10001622 | All Organisms → cellular organisms → Bacteria | 14223
3300012354|Ga0137366_10021764 | All Organisms → cellular organisms → Bacteria | 4993
3300012354|Ga0137366_10217089 | All Organisms → cellular organisms → Bacteria | 1425
3300012356|Ga0137371_10027812 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 4361
3300012356|Ga0137371_10069857 | All Organisms → cellular organisms → Bacteria | 2720
3300012356|Ga0137371_10582883 | All Organisms → cellular organisms → Bacteria | 860
3300012357|Ga0137384_10209615 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1630
3300012357|Ga0137384_10679059 | All Organisms → cellular organisms → Bacteria | 837
3300012358|Ga0137368_10610439 | All Organisms → cellular organisms → Bacteria | 692
3300012362|Ga0137361_10456621 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1173
3300012927|Ga0137416_10161892 | All Organisms → cellular organisms → Bacteria | 1763
3300012927|Ga0137416_10572850 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 980
3300014150|Ga0134081_10111295 | All Organisms → cellular organisms → Bacteria | 870
3300014154|Ga0134075_10097286 | All Organisms → cellular organisms → Bacteria | 1242
3300014166|Ga0134079_10045892 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1521
3300015372|Ga0132256_100000030 | All Organisms → cellular organisms → Bacteria | 58939
3300017657|Ga0134074_1207747 | All Organisms → cellular organisms → Bacteria | 696
3300018431|Ga0066655_10048324 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 2180
3300018433|Ga0066667_10043422 | All Organisms → cellular organisms → Bacteria | 2652
3300018433|Ga0066667_10227764 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1394
3300018468|Ga0066662_10649477 | All Organisms → cellular organisms → Bacteria | 997
3300018482|Ga0066669_12158364 | All Organisms → cellular organisms → Bacteria | 527
3300025910|Ga0207684_11168973 | All Organisms → cellular organisms → Bacteria | 638
3300025922|Ga0207646_11815779 | All Organisms → cellular organisms → Bacteria | 521
3300026296|Ga0209235_1051495 | All Organisms → cellular organisms → Bacteria | 1976
3300026296|Ga0209235_1087087 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1378
3300026297|Ga0209237_1026393 | All Organisms → cellular organisms → Bacteria | 3233
3300026298|Ga0209236_1044452 | All Organisms → cellular organisms → Bacteria | 2261
3300026309|Ga0209055_1271375 | All Organisms → cellular organisms → Bacteria | 528
3300026313|Ga0209761_1073749 | All Organisms → cellular organisms → Bacteria | 1798
3300026313|Ga0209761_1084276 | All Organisms → cellular organisms → Bacteria | 1636
3300026326|Ga0209801_1000486 | All Organisms → cellular organisms → Bacteria | 26348
3300026329|Ga0209375_1042834 | All Organisms → cellular organisms → Bacteria | 2324
3300026332|Ga0209803_1117678 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1067
3300026334|Ga0209377_1170630 | All Organisms → cellular organisms → Bacteria | 788
3300026529|Ga0209806_1000420 | All Organisms → cellular organisms → Bacteria | 28858
3300026532|Ga0209160_1007946 | All Organisms → cellular organisms → Bacteria | 8066
3300026532|Ga0209160_1092009 | All Organisms → cellular organisms → Bacteria | 1562
3300026537|Ga0209157_1106227 | All Organisms → cellular organisms → Bacteria | 1323
3300026538|Ga0209056_10008365 | All Organisms → cellular organisms → Bacteria | 10615
3300026538|Ga0209056_10013580 | All Organisms → cellular organisms → Bacteria | 8160
3300027748|Ga0209689_1045844 | All Organisms → cellular organisms → Bacteria | 2482
3300027882|Ga0209590_10001183 | All Organisms → cellular organisms → Bacteria | 9720
3300027882|Ga0209590_10169502 | All Organisms → cellular organisms → Bacteria | 1365
3300028536|Ga0137415_10016474 | All Organisms → cellular organisms → Bacteria | 7325
3300028536|Ga0137415_10194775 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1848
3300028536|Ga0137415_11421108 | All Organisms → cellular organisms → Bacteria | 516
3300031199|Ga0307495_10074750 | All Organisms → cellular organisms → Bacteria | 752
3300031720|Ga0307469_12355379 | All Organisms → cellular organisms → Bacteria | 519
3300031965|Ga0326597_10044468 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 5586
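
Each scaffold identifier in the table above joins two parts with a `|`: a ten-digit number that matches the Taxon OID column of the Associated Samples table, and a scaffold name. A minimal sketch of splitting the identifier and flagging short scaffolds follows. The `Scaffold` type and `parse_scaffold` helper are hypothetical names introduced for illustration, the four rows are copied from the table, and the 2000 bp cutoff is the one used in the Quality Assessment section.

```python
from typing import NamedTuple

SHORT_SCAFFOLD_BP = 2000  # threshold quoted under Quality Assessment


class Scaffold(NamedTuple):
    taxon_oid: str       # part before "|", matches the sample's Taxon OID
    scaffold_name: str   # part after "|"
    length_bp: int


def parse_scaffold(identifier: str, length_bp: int) -> Scaffold:
    """Split '<taxon OID>|<scaffold name>' into its two components."""
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return Scaffold(taxon_oid, scaffold_name, length_bp)


# First few rows of the Associated Scaffolds table (identifier, length).
rows = [
    ("3300002560|JGI25383J37093_10065506", 1140),
    ("3300002908|JGI25382J43887_10049202", 2290),
    ("3300005166|Ga0066674_10071654", 1587),
    ("3300005180|Ga0066685_10001783", 9609),
]

scaffolds = [parse_scaffold(identifier, length) for identifier, length in rows]
short = [s for s in scaffolds if s.length_bp < SHORT_SCAFFOLD_BP]
short_fraction = len(short) / len(scaffolds)

print(f"{len(short)} of {len(scaffolds)} example scaffolds are shorter than "
      f"{SHORT_SCAFFOLD_BP} bp ({short_fraction:.0%})")
```

Running the same filter over the full table would give a figure comparable to the "% of genes from short scaffolds" entry, although that entry is reported per gene, so the two numbers need not agree exactly.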




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 35.56%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 30.37%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 14.07%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 6.67%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 5.19%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.44%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.48%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.74%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.74%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.74%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental
3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300015372 | Soil combined assembly | Host-Associated
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes) | Environmental
3300026329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes) | Environmental
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031199Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_SEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031965Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185EnvironmentalOpen in IMG/M


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25383J37093_1006550623300002560Grasslands SoilMRPQYLIAVGLAALLPATGXESPIRVAGQAQSAQFDFMNGPADLPNVFRGDSVLLFVWADVASDLVIVVNAPSGGVHALRRCGGSLTPEPQPVQTVGEMQDVLRQVRLLRDVNIHLYSPVPVPFRNFMDLCQLTPFAQGTGALTSTDNDRHVTGNGANAFGFRAQGIVDLASGGSARVIAESERLIEPDGTLTEILVNNVMLVPR*
JGI25382J43887_1004920233300002908Grasslands SoilMSSQILVAVGSAMLLLATGCERPTTAGPAPSAQFDFMNGPSDLPNVFRGDSVLLFVWADVSTNYVIVVNAPTGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRHVTGSGANAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0066674_1007165433300005166SoilMRAHPLLTAGLAVLFPVTGCDNPTTVAARPPSVQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPMQTVGEMQDVLRQLRLLRDVNIHVYSPVPSPFRNFMDLCQLSPYAQGTGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGTARVLAESERMIAPDGTVTEILVKNVRLIPQ*
Ga0066683_1009777623300005172SoilMRAHPLLTAGLAVLFPVTGCDNPTTVAARPPSVQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPSGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGSGNLTSTDNDRHVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTITERLVNNVRLIPQ*
Ga0066680_1003109123300005174SoilMKPQSLVTLGLLLPLLPVACDRPATVVTSAPPPQFDFTNGPSDLPNGFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH*
Ga0066685_1000178333300005180SoilMKTKSIAAVAVLVPLLALGCEQSPPVANDMPAVQADFMNGPSDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSARVTAESERLIKPDGTREVLASYVMLHPF*
Ga0066678_1035290513300005181SoilMRSHILGAVGSAALLLIIGCEGPPTSTVGARTAQFDFMNGPSDLPNVFRGDSVLLFVWADLSSNLVIVVNAPTAGVGALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGTLTSTDNDRQVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVTLIPR*
Ga0070705_10025084723300005440Corn, Switchgrass And Miscanthus RhizosphereMKLWMLATVSLLMPLLAVACDPPAAFTANTPSPQFDFSNGPADLPNVFRGDSILLFVWPDYTTNLVIAVNAPPGGVSSVRRCGGTLRPAPQPVQTVGEMQNVMRQLRLLRNVNIHVYSPVPEGFSGFLSLCNATPLAAGIGNLTSTDNDRMVTGHGANAFGFRAEGIVALAAGGSARVLAELQMLILPDGTCCLQRASHVTLLEH*
Ga0066686_1030166913300005446SoilQSPPVANDMPAVQADFMNGPSDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSARVTAESERLIKPDGTREVLASYVMLHPF*
Ga0066686_1052554423300005446SoilTTAGSARSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTNYVIVVNAPLGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGSGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVKNVRLIPQ*
Ga0066681_1025923623300005451SoilMRAHPLLTAGLAVLFPVTGCDNPTTVAARPPSVQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPMQTVGEMQDVLRQLRLLRDVNIHVYSPVPSPFKNFMDLCQLSPYAQGTGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGTARVLAESERMIAPDGTVTEILVKNVRLIPQ*
Ga0070706_10079308223300005467Corn, Switchgrass And Miscanthus RhizosphereMKLRMLATVSLLMPLLAVACDPPAAFTANTPSPQFDFSNGPADLPNVFRGDSILLFVWPDYTTNLVIAVNAPPGGVSSVRRCGGTLRPAPQPVQTVGEMQNVMRQLRLLRNVNIHVYSPVPEGFSGFLSLCNATPLAAGIGNLTSTDNDRMVTGHGANAFGFRAEGIVALAAGGSARVLAELQMLILPDGTCCLQRASHVTLLEH*
Ga0070697_10115621823300005536Corn, Switchgrass And Miscanthus RhizosphereFSNGPADLPNVFRGDSILLFVWPDYTTNLVIAVNAPPGGVSSVRRCGGTLRPAPQPVQTVGEMQNVMRQLRLLRNVNIHVYSPVPEGFSGFLSLCNATPLAAGIGNLTSTDNDRMVTGHGANAFGFRAEGIVALAAGGSARVLAELQMLILPDGTCCLQRASHVTLLEH*
Ga0066697_1059819013300005540SoilAVAVLVPLLALGCEQSPPVANDMPAVQADFMNGPSDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSARVTAESERLIKPDGTREVLASYVMLHPF*
Ga0070696_10092308123300005546Corn, Switchgrass And Miscanthus RhizosphereADLPNVFRGDSILLFVWPDYTTNLVIAVNAPPGGVSSVRRCGGTLRPAPQPVQTVGEMQNVMRQLRLLRNVNIHVYSPVPEGFSGFLSLCNATPLAAGIGNLTSTDNDRMVTGHGANAFGFRAEGIVALAAGGSARVLAELQMLILPDGTCCLQRASHVTLLEH*
Ga0066701_1001464413300005552SoilMRPQILVAVGAAMLLLATACEHPTTAGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTNYVIVVNAPAGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRQVTGNGADAFGFRAQGIVDLVNGGSARLLAESERLIAPDGTVTEILVNNVTLIPR*
Ga0066695_1027119333300005553SoilVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPMQTVGEMQDVLRQLRLLRDVNIHVYSPVPSPFRNFMDLCQLSPYAQGTGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGTARVLAESERMIAPDGTVTEILVKNVRLIPQ*
Ga0066661_1022850023300005554SoilMRSHILGAVGSAALLLIIGCEGPPTSTVGARTAQFDFMNGPSDLPNVFRGDSVLLFVWADLSSNLVIVVNAPTAGVGALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGTLTSTDNDRQVTGSGADAFGFRAQGIV
Ga0066692_1008218423300005555SoilMKPQSLVTLGLLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVRSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH*
Ga0066692_1054913713300005555SoilMRSHILVAVGSAALLLIIGCEGPPTSTVGARTAQFDFMNGPSDLPNVFRGDSVLLFVWADLSSNLVIVVNAPTAGVGALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGTLTSTDNDRQVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVTLIPR*
Ga0066692_1085743013300005555SoilQYLIAVGLAALLPATGCESPIRVAGQAQSAQFDFMNGPADLPNVFRGDSVLLFVWADVASDLVIVVNAPSGGVHALRRCGGSLTPEPQPVQTVGEMQDVLRQVRLLRDVNIHLYSPVPVPFRNFMDLCQLTPFAQGTGALTSTDNDRHVTGNGANAFGFRAQGIVDLASGGSARVIAESERLIEP
Ga0066707_1001904533300005556SoilMRPQSLLAVGLAVLLPAAGCESPTTGAARPRSAQFDFMNGPSDLPNVFRGDSVLLFVWADVSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPTPFRNFIDLCQLSPYAQGSGNLTSTDNDRTVRGNGADAFGFRAQGIVDLLSGGSARVVAESERLIAPDGTVTEILVRNVRLIPQ*
Ga0066707_1017398313300005556SoilDLPNVFRGDSVLLFVWADLSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGNGADAFGFRAQGSVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0066698_1073090313300005558SoilTTAGSARSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTNYVIVVNAPLGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGSGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVGGGSARVLAESERLIAPDGTVTEILVRNVRLIPQ*
Ga0066700_1000618153300005559SoilMKPQSLVTLGLLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH*
Ga0066703_1060788013300005568SoilLLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH*
Ga0066691_1005272523300005586SoilMKPQSLVTLELLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLISTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH*
Ga0066691_1049185513300005586SoilMRSHILGAVGSAALLLIIGCEGPPTSTVGARTAQFDFMNGPSDLPNVFRGDSVLLFVWADLSSNLVIVVNAPTAGVDALRRCGGQLTPEPQPVQTVGEMQDVLRQLRLLRDVNIHLYSPVPSPFRTFMDLCQLSPFAQGTGNLTSTDNDRQVTGSGANAFGFRAQGIVNLASGGIGRVIAESERLIAPDGTLTDILVNNVTLSPR*
Ga0066706_1062903513300005598SoilPQILVAVGAAMLLLATACEHPTTAGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADLSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGNGADAFGFRAQGSVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0066706_1091996013300005598SoilMKPQLAATVLLVPLLVVACDPPAAVTTRSPNPQFDFMNGPADLPNVFRGDSVLLFVWADVTTNYVIVVNAPPGGVHALRRCGGTLTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGSGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILV
Ga0066656_1099204313300006034SoilVLLPVTGCDNPTTVAARPPSVRFDFMNGPSDLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHVYSPVPSPFRNFTDLCQLSPYGQGSGNLTSTDNDRQVSGNGADAFGFRAQGIVDLVSGGRARVLAESERMIAPDGT
Ga0079222_1187572513300006755Agricultural SoilCDTGAPIAGQTSGATFDFSNGPPDLPNVFRGDSILLFVWPDYATNLVIAVNAPAGGVSSVRRCGGSLRPDPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPDGFSGFMSLCGATPIAQGRGNLTSTDNDRLVTGNGANAFGFRAQGIVELVGGGSARVTAELQQLIRPDGTCCATHVSRVVLH*
Ga0066653_1050263113300006791SoilMRAHPLLTAGLAVLFPVTGCDNPTTVAARPPSVQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPSGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGSGNLTSTDNDRHVTGNGADAFGFRAQGI
Ga0066658_1006631623300006794SoilMKPQSLVTLELLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH*
Ga0066665_1000506163300006796SoilMKPQLAATVLLVPLLVVACDPPAAVTTRSPNPQFDFMNGPADLPNVLRGDSVLLFVWADVTTNYVIVVNAPPGGVHALRRCGGTLTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGSGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVKNVRLIPQ*
Ga0066665_1008785033300006796SoilMRPQILVAVGAAMLLLATACEHPTTAGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADLSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGNGADAFGFRAQGSVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0075428_10039013213300006844Populus RhizosphereHFDFMNGPADLPNVVRGDSVLLFLWADLSSNYVIVVNAPAAGVSALRRCGGPLTPELQPVQTVGEMQDVLRQVRLLRDVNIHLYSPARPGGTFADLCPLSPVAQGTGNLTSTDNDRTVTGHGADAFGFRAQGVVTFAGGGSARVTAESQRLIRPDGSCCDILVNNVTVHAP*
Ga0075421_10241130613300006845Populus RhizosphereMRPQIPLAVSLAVLLPATGCENPTTVAARAPSPHFDFMNGPADLPNVVRGDSVLLFLWADLSSNYVIVVNAPAAGVSALRRCGGPLTPELQPVQTVGEMQDVLRQVRLLRDVNIHLYSPALPGGTFNDLCPLSPVAQGTGNLTSTDNDRTVTGHGADAFGFRAQGVVTFAGGGS
Ga0075433_1129717613300006852Populus RhizosphereMRPHTLVTIGLAALLPAIGCERPTSVDRARSAQFDFMNGPSDLPNVFRGDSTLLFVWADTSTDLVIVINAPAAGVSAVRRCGGPSTPEPQPVQTAGELQDVLHQVRLWRDVNIHIYSPVPSPFRSFLDLCQLSPIATGTGNFTSTDNDRHNAGSGANAFGFRAEGIVDLVGGGSARVLAESERLIAPDGTITEILVNNVRLIPQ*
Ga0075425_10005958023300006854Populus RhizosphereMRPHTLVTIGLAALLPAIGCERPTSVDRARSAQFDFMNGPSDLPNVFRGDSTLLFVWADTSTDLVIVINAPAAGVSAVRRCGGPSTPEPQPVQTAGELQDVLRQVRLWRDVNIHIYSPVPSPFRNFLDLCQLSPIATGTGNFTSTDNDRHNAGSGANAFGFRAEGIVDLVGGGSARVLAESERLIAPDGTITEILVNNVRLIPQ*
Ga0075425_10300122613300006854Populus RhizosphereLLFALACDKAAPVANDAPAIRADFTNGPSDLPNVFRGDSVLLFVWADVASDFVIVVNAPTGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGSGTLTSTDNDRTVTGNGADAFGFRAQGIVDLASGGSARVLAESER
Ga0075426_1012446823300006903Populus RhizosphereMRTRSFCAAGVAALLFALACDKAAPVANDAPAIRADFTNGPSDLPNVFRGDSVLLFVWADVASDFVIVVNAPTGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGSGTLTSTDNDRTVTGNGADAFGFRAQGIVDLASGGSARVLAESERLIAPDGTITAILVNNVRLVPQ*
Ga0075424_10012185223300006904Populus RhizosphereMRPQIFFAVGAAMLLVSGCERPTTAASAGSAQFDFMNGPTDLPNVVRGDSTLLFVWADTSTDLVIVVNAPAAGVSAVRRCGGPSTPEPQPVQTAGELQDVLHQVRLWRDVNIHIYSPVPSPFRSFLDLCQLSPIATGTGNFTSTDNDRHNAGSGANAFGFRAEGIVDLVSGGSARVLAESERLIAPDGTITEILVNNVSLIPQ*
Ga0066710_10305097913300009012Grasslands SoilMRPHILVAVSSAMLLLATACERPTTAGPAPSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGNGADAFGFRAQGSVDLVSGGSARVLAE
Ga0066710_10345977313300009012Grasslands SoilVALFAACTGCERPTTAGSARSAQFDFMNGPSDLPNVFRGDSVLLFVWADLSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPTPFRNFIDLCQLSPYAQGSGNLTSTDNDRTVRGNGADAFGFRAQGIVDLLSGGSARVVAESERLIAPD
Ga0099827_1000033853300009090Vadose Zone SoilMRPQILVAVGAAMLLVAAGCERPTSAGSARSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTNYVIVVNAPLGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGTLTSTDNDRQVTGNGADAFGFRGQGIVDLVSGGSARVLAESQRLIAPDGTITEILVSNVRLIPQ*
Ga0099827_1063117213300009090Vadose Zone SoilMRPQILVAVGAAMLLVATGCEHPTTAGSAGSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGSGANAFGFRAEGIVDLVSGGSTRV
Ga0066709_10000638333300009137Grasslands SoilMPAVQADFMNGPSDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSARVTAESERLIKPDGTREVLASYVMLHPF*
Ga0066709_10004996163300009137Grasslands SoilMRPQYLIAVGLAALLPATGCESPITVAGQAQSAQFDFMNGPADLPNVFRGDSVLLFVWADVASDLVIVVNAPSGGVHALRRCGGSLTPEPQPVQTVGEMQDVLRQVRLLRDVNIHLYSPVPVPFRNFMDLCQLTPFAQGTGALTSTDNDRHVTGNGANAFGFRAQGIVDLASGGSARVIAESERLIEPDGTLTEILVNNVMLVPR*
Ga0066709_10048012813300009137Grasslands SoilMLLLATACERPTTAGPAPSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVVVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLGDVNIHIYSPVPSPFRNFMDLCQLSPYAQGTGSLTSTDNDRHVTGNGADAFGFRAQGSVDLVSGGSARVLAESERLIAPDGTVTEI
Ga0066709_10216016323300009137Grasslands SoilAAVTTRSPNPQFDFMNGPADLPNVFRGDSVLLFVWADVTTNYVIVVNAPPGGVHALRRCGGTLTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGSGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVKNVRLIPQ*
Ga0134086_1029579513300010323Grasslands SoilMPAVQADFMNGPSDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSARVTAESER
Ga0134111_1006074723300010329Grasslands SoilMLGVATGCERPNMAGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPSGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGNGAAAFGFRAQGSVDLVRGASARVLAESERLIAPDGTVTEILVKNVRLIPQ*
Ga0134080_1016263323300010333Grasslands SoilLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSARVTAESERLIKPDGTREVLASYVMLHPF*
Ga0134080_1016447023300010333Grasslands SoilMRPQILVAVGAAMLLVAAGCERPTTAGSARSAQFDFMNGPADLPNVFRGDSVLLFVWADVTTNYVIVVNAPPGGVHALRRCGGALTPEPQPMQTVGEMQDVLRQLRLLRDVNIHVYSPVPSPFRNFMDLCQLSPYAQGTGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGTARVLAESERMIAPDGTVTE
Ga0134062_1055884713300010337Grasslands SoilMLVVATGCERPTMAGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADLSTNYVIVVNAPAGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRSFMDLCQLSPVAQGTGNLTSTDNDRQVTGNGADAFGFRAQGIVDLVSGGSAGVLAESERLIAPDG
Ga0137364_1004025043300012198Vadose Zone SoilMLLLATACEHPTTAGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIAVNAPTGGVHTLRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGTLTSTDNDRHVTGNGADAFGFRAQGIVDLVSGGSARVLAESQRLIAPDGAITEILVNNVRLIAQ*
Ga0137383_1000279373300012199Vadose Zone SoilMLVVATGCERPTMAGPAGSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPSGGVHALRRCGGALTPEPQPMQTVGEMQDVLRQLRLLGDVNIHIYSPVPSPFRNFMDLCQLTPFAQGSGNLTSTDNDRHVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTITEILVNNVRLIPQ*
Ga0137383_1001139223300012199Vadose Zone SoilMKPQLAATVLLVPLLVVACDPPAAVTTRSPNPQFDFMNGPADLPNVFRGDSVLLFVWADVTTNYVIVVNAPLGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGSGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVKNVRLIPQ*
Ga0137382_1001931823300012200Vadose Zone SoilMRPQILVAVGAAMLLVAAGCERPTTAGSARSAQFDFMNGPADLPNVVRGDSVLLLVWADVSTNYVIVVNAPPDGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGTGNLTSTDNDRHVTGSGANAFGFRAEGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0137382_1018541633300012200Vadose Zone SoilMLLLATACERPTTAGSAGSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRTFMDLCQLSPFAQGTGTLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIGTDGTITEILVNNVRLIPQ*
Ga0137365_1001296053300012201Vadose Zone SoilMRPQILVAAGAAMLVVATGCERPTMAGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPSGGVHALRRCGGALTPEPQPMQTVGEMQDVLRQLRLLGDVNIHIYSPVPSPFRNFMDLCQLTPFAQGSGNLTSTDNDRHVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTITEILVNNVRLIPQ*
Ga0137365_1001951913300012201Vadose Zone SoilMLVAVGAAMLLLATGCERPTTPGPARSAQFDFMNGPSDLPNVFRGDSVLLFVWADLSTNYVIVVNAPPDGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLRDVNIHIYSPVPSPFRNFTDLCQLSPFAQGTGNLTSTDNDRHVTGSGANAFGFRAEGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ
Ga0137365_1008546033300012201Vadose Zone SoilMLVAVGSAMLLLATGCERPTTAGSARSAQFDFMNGPADLPNVVRGDSVLLLVWADVSTNYVIVVNAPPDGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGTGNLTSTDNDRHVTGSGANAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ
Ga0137365_1016706023300012201Vadose Zone SoilMNGPPDLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAKGTGNLTSTDNDRQVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILANNVRLIPQ*
Ga0137365_1021908523300012201Vadose Zone SoilMRSPILVAVGAAMLLVATGCERPTTAGPAPIAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPMQTVGEMQDVLRQLRLLRDVNIHVYSPVPSPFRNFMDLCQLSPYAQGTGNLTSTDNDRKVTGNGADAFGFRAQGIVDLVSGGTARVLAESERLIAPDGTVTEILVKNVRLIPQ*
Ga0137374_1070645213300012204Vadose Zone SoilFMNGPPDLPNVFRGDSILLFVWADFATDLVIVVNAPTGGVHALVRCGGTLRPDPQPVQTVGELQDVARQVRLLRDVNIHLYRPVPPGFAGFADLCQVSPFAVGTGNLTSTDNDRHVTGDGANAFGFRVEGIVDFADGGSAGVTAESERLIKPDGTREVLTNTVRLHPR*
Ga0137380_1000598423300012206Vadose Zone SoilMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAKGTGNLTSTDNDRQVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVTLIPR*
Ga0137380_10006753113300012206Vadose Zone SoilMNGPSDLPNVFRGDSVLLFVWADVSTDLVIVVNAPAGGVRALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPTPFRNFMDLCQLSPYAQGSGNLTSTDNDRTVTGNGADAFGFRAQGVVDLVSGGSARVLAESERLIAPDGTVTEILVRNVRLIQQ*
Ga0137380_1005674623300012206Vadose Zone SoilMLVAVGAAMLLLATGCERPTTPGPARSAQFDFMNGPSDLPNVFRGDSVLLFVWADLSTNYVIVVNAPPDGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLRDVNIHIYSPVPSPFRNFTDLCQLSPFAQGTGNLTSTDNDRHVTGSGANAFGFRAEGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0137380_1024834013300012206Vadose Zone SoilLAALLPATGCESPITVAGQAQSAQFDFMNGPADLPNVFRGDSVLLFVWADVASDLVIVVNAPSGGVHALRRCGGSLTPEPQPVQTVGEMQDVLRQVRLLRDVNIHLYSPVPVPFRNFMDLCQLTPFAQGTGALTSTDNDRHVTGNGANAFGFRAQGIVDLASGGSARVIAESERLIEPDGTLTEILVNNVMLVPR*
Ga0137381_1023468023300012207Vadose Zone SoilMKVQSFLAGAMATLLPLLACESPARLASDTPGVEFDFTNGPSDLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGSGTLTSTDNDRHVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0137381_1154846313300012207Vadose Zone SoilMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRHVTGSGANAFGFRAEGIVDLV
Ga0137376_1009402333300012208Vadose Zone SoilMLLLATACERPTTAGSAGSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRTFMDLCQLSPFAQGTGTLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIGPDGTITEILVNNVRLIPQ*
Ga0137379_1009569413300012209Vadose Zone SoilMLVAVGAAMLLLATGCERPTTPGPARSAQFDFMNGPSDLPNVFRGDSVLLFVWADLSTNYVIVVNAPPDGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLRDVNIHIYSPVPSPFRNFTDLCQLSPFAQGTGNLTSTDNDRHVTGSGANAFGFRAEGIVDLVSGGSARV
Ga0137379_1030036523300012209Vadose Zone SoilMLLLATACERPTTAGPAPSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQVRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGNGADAFGFRAQGSVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0137379_1066396923300012209Vadose Zone SoilMKVQSFLAGAMATLLPLLACQSPARLASDTPGVEFDFTNGPSDLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGSGTLTSTDNDRHVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIGPDGTITEILVNNVRLIPQ*
Ga0137377_1040619913300012211Vadose Zone SoilMRPQYLIAVGLAALLPATGCESPITVAGQAQSAQFDFMNGPADLPNVFRGDSVLLFVWADVASDLVIVVNAPSGGVHALRRCGGSLTPEPQPVQTVGEMQDVLRQVRLLRDVNIHLYSPVPVPFRNFMDLCQLTPFAQGTGALTSTDNDRHVTGNGANAFGFRAQGIV
Ga0137370_1008797033300012285Vadose Zone SoilMLLLATACEHPTTAGPARSAQFDFMNGPADLPNVFRGDSVWLCVWADVSTDLVIAVNAPTGGVHTLRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGTLTSPDNDRHVTGNGADAFGFRAQGIVDLVSGGSARVLAESQRLIAPDGAITEILVNNVRLIAQ*
Ga0137387_1000773913300012349Vadose Zone SoilMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRHVTGSGANAFGFRAEGIVDLVSGGSAR
Ga0137387_1000875913300012349Vadose Zone SoilPNVFRGDSVLLFVWADLSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAKGTGNLTSTDNDRQVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVTLIPR*
Ga0137372_1005730763300012350Vadose Zone SoilMKTEFFSAASLLVPLLALGCEQPAPVANDISVVQADFMNGPPDLPNVFRGDSILLFVVADFATDLVIVVNAPTGGVHALVRCGGTLRPDPQPVQTVGELQDVARQVRLLRDVTIHLYRPVPPGFAGFADLCQASPFAVGSGNLTSTDNDRHVTGNGANAFGFRVEGIVDFADGRSAGVTAESERLIKPDGTREVLVSSVMLHSQ*
Ga0137372_1015482123300012350Vadose Zone SoilMLVAVGSAVLLLVTACEHPTTAGSARSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFTDLCQLSPFAQGTGTLTSTDNDRTVTGNGADAFGFRAHGIVDLVSGGSARVLAESERLIGPDGTITRILVNNVRLIPQ*
Ga0137372_1019834633300012350Vadose Zone SoilMKTLPLFTSAVLVPLMAFGCEQSPPVANDMPTVQADFMNGPPDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQDVARQVRLLRDVNIDVYSPVPPAFRGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSAGVTAESERLIKPDGTREVLASYVMLHPF*
Ga0137372_1020494823300012350Vadose Zone SoilMKVQSFLAAAMATLLPLLACESPAPLASDTPGVEFDFTNGPSDLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGSGTLTSTDNDRHVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIGPDGTITEILVNNVRLIPQ*
Ga0137386_1000162213300012351Vadose Zone SoilMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAKGTGNLTSTDNDRQVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLI
Ga0137366_1002176463300012354Vadose Zone SoilMRPHLLVVVGSAMLLLATGCERPTTVGPTRSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVAEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFTDLCQLSPFAQGTGTLTSTDNDRQVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLI
Ga0137366_1021708933300012354Vadose Zone SoilMKTLPLFTSAVLVPLMAFGCEQSPPVANDMPTVQADFMNGPPDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQDVARQVRLLRDVNIDVYSPVPPEFSGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSAGVTAESERLIKPDGTREVLASYVMLHPF*
Ga0137371_1002781223300012356Vadose Zone SoilMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPSGGVHALRRCGGALTPEPQPMQTVGEMQDVLRQLRLLGDVNIHIYSPVPSPFRNFMDLCQLTPFAQGSGNLTSTDNDRHVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTITEILVNNVRLIPQ*
Ga0137371_1006985743300012356Vadose Zone SoilFGSAALLLTTGCDNPFTAGARPPSAQFDFMNGPADLPNVVRGDSVLLFVWADVSTDLVIVVNAPAGGVHALRRCGGPFTPEPQPVQTAGELQDVLHQVRLWRDVNIHIYSPVPSPFRNFLDLCQLSPIATGTGNFTSTDNDRHNTGSGANAFGFRAQGIVNLVGGGSARVLAESERLIAPDGTITDILVNNVRLIPQ*
Ga0137371_1058288323300012356Vadose Zone SoilMNGPPDLPNVFRGDSVLLFVWADVSTDLVIGVNAPPGGVHALRRGGGALTPEPQPVQTVGEMQDVLRQLRLLSDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGSGANAFGFRAQGIVNLASGGSGRVIAESERLIAPDGTLTDILVNNVTLIPR*
Ga0137384_1020961523300012357Vadose Zone SoilMKPQLAATVLLVPLLVVACDPPAAVTTRSPNPQFDFMNGPADLPNVFRGDSVLLFVWADVTTNYVIVVNAPLGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFTDLCQLSPFAQGTGNLTSTDNDRHVTGSGANAFGFRAEGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0137384_1067905913300012357Vadose Zone SoilMKVQSFLAAAMATLLPLLACESPAPLASDTPGVEFDFTNGPSDLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFTDLCQLSPFAQGTGTLTSTDNDRQVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVTLIPR*
Ga0137368_1061043913300012358Vadose Zone SoilLLPLLACESPAPLASDTPGVQFDFMNGPPDLPNVFRGDSILLFVWADVATDLVIVVNAPAGGVHALVRCGGTLRPDPQPMQTVGELQDVARQVRLLRDVNIHLYRPVPPGFAGFADLCQVSPFAVGTGNLTSTDNDRHVTGDGANAFGFRVEGIVDFADGGSAGVTAESERLIKPDGTREVLTNNVRLHPR*
Ga0137361_1045662123300012362Vadose Zone SoilMSSPILVAVGSAMLLLATGCERPTTAGPAPSAQFDFMNGPSDLPNVFRGDSVLLFVWADVSTNYVIVVNAPTGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRTVRGNGADAFGFRAQGIVDLLSGGSARVVAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0137416_1016189223300012927Vadose Zone SoilMNGPSDLPNVFRGDSVLLFVWADLSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPTPFRNFMDLCQLGPYAQGSGNLTSTDNDRTVRGNGADAFGFRAQGIVDLLSGGSARVVAESERLIAPDGTVTEILVNNVRLIPQ*
Ga0137416_1057285033300012927Vadose Zone SoilSAQFDFMNGPSDLPNVFRGDSVLLFVWADVSTSYVIVVNAPTGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRHVTGSGANAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLILQ*
Ga0134081_1011129523300014150Grasslands SoilVQADFMNGPSDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSARVTAESERLIKPDGTREVLASYVMLHPF*
Ga0134075_1009728623300014154Grasslands SoilMKPQLAATVLLVPLLVVACDPPAAVTTRSPNPQFDFMNGPADLPNVFRGDSVLLFVWADVTTNYVIVVNAPPGGVHALRRCGGTLTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPTPFRNFMDLCQLSPYAQGSGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVKNVRLIPQ*
Ga0134079_1004589223300014166Grasslands SoilMRPYILVAGAAMLLLATGCEHPTTVGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADLSTNYVIVVNAPAGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRSFMDLCQLSPVAQGTGNLTSTDNDRQVTGNGADAFGFRAQGIVDLVSGGSAGVLAESERLIAPDGTLTDILVNNVTLIPR*
Ga0132256_100000030373300015372Arabidopsis RhizosphereMLLVSGCERPTTAASAGSAQFDFMNGPTDLPNVVRGDSTLLFVWADTATDLVIVVNAPAAGVSAVRRCGGPSTPEPQPVQTAGELQDVLHQVRLWRDVNIHIYSPVPSPFRSFLDLCQLSPIATGTGNFTSTDNDRHNAGSGANAFGFRAEGIVDLVSGGSARVLAESERLIAPDGTITEILVNNVSLIPQ*
Ga0134074_120774713300017657Grasslands SoilMKTKSIAAVAVLVPLLALGCEQSPPVANDMPAVQADFMNGPSDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDL
Ga0066655_1004832423300018431Grasslands SoilMKTKSIAAVAVLVPLLALGCEQSPPVANDMPAVQADFMNGPSDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSARVTAESERLIKPDGTREVLASYVMLHPF
Ga0066667_1004342243300018433Grasslands SoilMRPQSLLAVGLAVLLPAAGCESPTTGAARPRSAQFDFMNGPSDLPNVFRGDSVLLFVWADVSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPTPFRNFIDLCQLSPYAQGSGNLTSTDNDRTVRGNGADAFGFRAQGIVDLLSGGSARVVAESERLIAPDGTVTEILVRNVRLIPQ
Ga0066667_1022776423300018433Grasslands SoilFMNGPADLPNVFRGDSVLLFVWADVSTDLVVVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLGDVNIHIYSPVPSPFRNFMDLCQLSPYAQGTGSLTSTDNDRHVTGNGADAFGFRAQGSVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ
Ga0066662_1064947713300018468Grasslands SoilMKPQSLVTLELLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH
Ga0066669_1215836413300018482Grasslands SoilLATGCEHPTTVGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADLSTNYVIVVNAPAGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRSFMDLCQLSPVAQGTGNLTSTDNDRQVTGNGADAFGFRAQGIVDLVSGGSAGVLAESERLIAPDGT
Ga0207684_1116897313300025910Corn, Switchgrass And Miscanthus RhizosphereAVACDPPAAFTANTPSPQFDFSNGPADLPNVFRGDSILLFVWPDYTTNLVIAVNAPPGGVSSVRRCGGTLRPAPQPVQTVGEMQNVMRQLRLLRNVNIHVYSPVPEGFSGFLSLCNATPLAAGIGNLTSTDNDRMVTGHGANAFGFRAEGIVALAAGGSARVLAELQMLILPDGTCCLQRASHVTLLEH
Ga0207646_1181577913300025922Corn, Switchgrass And Miscanthus RhizosphereQFDFSNGPADLPNVFRGDSILLLLWADPTTDLVIVVNAPDGGVSRVRRCGGPDRPALQPVQTVGEMQGVMRQLRLLRDVNIHIYSPVPPGFSSFLDLCDKTPIAAGTGNLTSTDNDRLVTGNGSDAFGFRAEGTVTLVAGGSARVHAESQRLILPDGACCRVLVSSVSLLPR
Ga0209235_105149533300026296Grasslands SoilMRPQYLIAVGLAALLPATGCESPIRVAGQAQSAQFDFMNGPADLPNVFRGDSVLLFVWADVASDLVIVVNAPSGGVHALRRCGGSLTPEPQPVQTVGEMQDVLRQVRLLRDVNIHLYSPVPVPFRNFMDLCQLTPFAQGTGALTSTDNDRHVTGNGANAFGFRAQGIVDLASGGSARVIAESERLIEPDGTLTEILVNNVMLVPR
Ga0209235_108708723300026296Grasslands SoilMSSQILVAVGSAMLLLATGCERPTTAGPAPSAQFDFMNGPSDLPNVFRGDSVLLFVWADVSTNYVIVVNAPTGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRHVTGSGANAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ
Ga0209237_102639333300026297Grasslands SoilMSSPILVAVGSAMLLLATGCERPTTAGPAPSAQFDFMNGPSDLPNVFRGDSVLLFVWADVSTNYVIVVNAPTGGVHALRRCGGALTPVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRHVTGSGANAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIP
Ga0209236_104445233300026298Grasslands SoilMSSQILVAVGSAMLLLATGCERPTTAGPAPSAQFDFMNAPSDLPNVFRGDSVLLFVWADVSTNYVIVVNAPTGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRHVTGSGANAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ
Ga0209055_12713751 3300026309 Soil MRSQILVAVGSAALLLIIGCEGPPTSTVGARTAQFDFMNGPSDLPNVFRGDSVLLFVWADLSSNLVIVVNAPTAGVGALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGTLTSTDNDRQVTGSGADAFGFRAQGIVD
Ga0209761_10737492 3300026313 Grasslands Soil MKPQSLVTLGLLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH
Ga0209761_10842761 3300026313 Grasslands Soil MSSQILVAVGSAMLLLATGCERPTTAGPAPSAQFDFMNGPSDLPNVFRGDSVLLFVWADVSTNYVIVVNAPTGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRHVTGSGANAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVR
Ga0209801_10004862 3300026326 Soil MRSHILGAVGSAALLLIIGCEGPPTSTVGARTAQFDFMNGPSDLPNVFRGDSVLLFVWADLSSNLVIVVNAPTAGVGALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGTLTSTDNDRQVTGSGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVTLIPR
Ga0209375_10428342 3300026329 Soil MRAHPLLTAGLAVLFPVTGCDNPTTVAARPPSVQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPMQTVGEMQDVLRQLRLLRDVNIHVYSPVPSPFRNFMDLCQLSPYAQGTGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGTARVLAESERMIAPDGTVTEILVKNVRLIPQ
Ga0209803_11176781 3300026332 Soil MRSHILGAVGSAALLLIIGCEGPPTSTVGARTAQFDFMNGPSDLPNVFRGDSVLLFVWADLSSNLVIVVNAPTAGVGALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGTLTSTDNGRQVTGSGADAFGF
Ga0209377_11706301 3300026334 Soil MKPQSLVTLELLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVRSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH
Ga0209806_10004201 3300026529 Soil LLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLISTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH
Ga0209160_10079462 3300026532 Soil MKPQSLVTLGLLLPLLPVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGSSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH
Ga0209160_10920092 3300026532 Soil MRSHILGAVGSAALLLIIGCEGPPTSTVGARTAQFDFMNGPSDLPNVFRGDSVLLFVWADLSSNLVIVVNAPTAGVGALRRCGGALTPEPQPVQTVCEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGTLTSTDNDRQVTGSGADAFGFRAQGIVD
Ga0209157_11062273 3300026537 Soil VLVPLLALGCEQSPPVANDMPAVQADFMNGPSDLPNVFRGDSILLFVWADFTRDLVIVVNAPTGGVHALRRCGGKLTPDPQPVQTVGELQGVARQVRLLRDVNIDVYSPVPPGFGGFADLCQASPFAHGTGNLTSTDNDRHVTGDGANAFGFRAEGIVDLADGGSARVTAESERLIKPDGTREVLASYVMLHPF
Ga0209056_100083654 3300026538 Soil MKPQLAATVLLVPLLVVACDPPAAVTTRSPNPQFDFMNGPADLPNVFRGDSVLLFVWADVTTNYVIVVNAPPGGVHALRRCGGTLTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPYAQGSGNLTSTDNDRTVTGNGADAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVKNVRLIPQ
Ga0209056_100135805 3300026538 Soil MRPQILVAVGAAMLLLATACEHPTTAGPARSAQFDFMNGPADLPNVFRGDSVLLFVWADLSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLGDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGNGADAFGFRAQGSVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLIPQ
Ga0209689_10458442 3300027748 Soil MKPQSLVTLGLLLPLLAVACDRPATVVTSAPPPQFDFTNGPSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLISTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH
Ga0209590_100011835 3300027882 Vadose Zone Soil MRPQILVAVGAAMLLVAAGCERPTSAGSARSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTNYVIVVNAPLGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGTLTSTDNDRQVTGNGADAFGFRGQGIVDLVSGGSARVLAESQRLIAPDGTITEILVSNVRLIPQ
Ga0209590_101695022 3300027882 Vadose Zone Soil MRPQILVAVGAAMLLVATGCEHPTTAGSAGSAQFDFMNGPADLPNVFRGDSVLLFVWADVSTDLVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFAQGTGNLTSTDNDRHVTGSGANAFGFRAEGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLTPQ
Ga0137415_100164744 3300028536 Vadose Zone Soil MRPKSLLAVGLAMLLPTGACESPTTGTARPRSAQFDFMNGPSDLPNVFRGDSVLLFVWADLSTNYVIVVNAPPGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPTPFRNFMDLCQLGPYAQGSGNLTSTDNDRTVRGNGADAFGFRAQGIVDLLSGGSARVVAESERLIAPDGTVTEILVNNVRLIPQ
Ga0137415_101947753 3300028536 Vadose Zone Soil DLPNVFRGDSVLLFVWADVSTSYVIVVNAPTGGVHALRRCGGALTPEPQPVQTVGEMQDVLRQLRLLRDVNIHIYSPVPSPFRNFMDLCQLSPFARGTGNLTSTDNDRHVTGSGANAFGFRAQGIVDLVSGGSARVLAESERLIAPDGTVTEILVNNVRLILQ
Ga0137415_114211081 3300028536 Vadose Zone Soil PSDLPNVFRGDSILLFVWPDYTTNLVIAVNAPDGGVSSLRRCGGSLRPEPQPVQTVGEMQDVMRQLRLLRDVNIHVYSPVPAGFSGFMSLCGATPIAQGTGNLTSTDNDRLVTGNGANAFGFRAEGTVDLVAGGSARVLAEEQMLILPDGTCCLLRARHVMLLAH
Ga0307495_100747502 3300031199 Soil MRPHMLVAFGLAALLVTTGCDNRITAGARPPSAQFDFMNGPADLPNVVRGDSTLLFVWADTSTDLVIVVNAPAAGVSAVRRCGGAFTPEPQPVQTAGELQDVLHQVRLWRDVNIHIYSPVPSPFRNFLDLCQLSPIATGTGNFTSTDNDRHNAGSGANAFGFRAQGIVDLVGGGSAGVLAESERMIAPDGTITEILVSNVRLIP
Ga0307469_123553791 3300031720 Hardwood Forest Soil VTANMPSPQFDFSNGPADLPNVFRGDSILLFVWPDYTTNLVIAVNAPPGGVSSVRRCGGTLRPAPQPVQTVGEMQDVMRQLRLLRNVHIHVYSPVPAGFSGFMSLCSATPIAAGTGNLTSTDNDRMVTGNGANAFGFRAEGIVALAAGGSARVLAELQMLILPDGACCLQRA
Ga0326597_100444684 3300031965 Soil MKIQSLSAAGFLVPLLAFGCEKPVPVANDAPIAQLDFTNGPSDLPNVLRGDSILLFAWADPATDLVIVINAPSGGVHEVVRCGGSQRPERQPVQSVGEVQGVLRQLRILRDANIHLYRPVPPGFTNVLDLCQHMPFAHGTGNLTSTDNDRFVTGDGANAFGSRAEGVVDFVAGGSARVLAERQALVKPDGTREVLVSNVVLLPH




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice