NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F066187


Overview

Basic Information
Family ID F066187
Family Type Metagenome
Number of Sequences 127
Average Sequence Length 162 residues
Representative Sequence VEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQ
Number of Associated Samples 112
Number of Associated Scaffolds 127

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 67.21 %
% of genes near scaffold ends (potentially truncated) 88.19 %
% of genes from short scaffolds (< 2000 bps) 94.49 %
Associated GOLD sequencing projects 105
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.45

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Unclassified (77.953 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil (11.024 % of family members)
Environment Ontology (ENVO) Unclassified (29.921 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (40.945 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification Globular
Signal Peptide No
Secondary Structure Distribution α-helix: 46.03%, β-sheet: 3.17%, Coil/Unstructured: 50.79%

Predicted 3D Structure

pTM-score: 0.45

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 127 Family Scaffolds
PF01527	HTH_Tnp_1	49.61
PF13551	HTH_29	1.57
PF13683	rve_3	1.57
PF02604	PhdYeFM_antitox	0.79
PF05988	DUF899	0.79
PF09601	DUF2459	0.79
PF14319	Zn_Tnp_IS91	0.79
PF13565	HTH_32	0.79
PF13701	DDE_Tnp_1_4	0.79
PF05090	VKG_Carbox	0.79
PF02518	HATPase_c	0.79
PF00156	Pribosyltran	0.79
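
The frequencies above are stated relative to the 127 family scaffolds, so each percentage can be turned back into an approximate scaffold count. The following is a minimal sketch, assuming every value is a whole-number count divided by 127 and rounded to two decimals; the function name `frequency_to_count` is hypothetical and not part of NMPFamsDB.

```python
# Total taken from the table header: "% Frequency in 127 Family Scaffolds".
FAMILY_SCAFFOLDS = 127


def frequency_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a percentage frequency into the nearest whole number of scaffolds.

    Assumption: the percentage was derived from an integer count over `total`.
    """
    return round(percent / 100 * total)


# A few rows copied from the Pfam table above.
pfam_frequencies = {
    "PF01527": 49.61,
    "PF13551": 1.57,
    "PF02604": 0.79,
}

pfam_counts = {
    pfam_id: frequency_to_count(pct) for pfam_id, pct in pfam_frequencies.items()
}

for pfam_id, count in pfam_counts.items():
    print(f"{pfam_id}: about {count} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under this assumption, a frequency of 0.79 corresponds to a single scaffold, so most of the neighboring domains listed here were observed only once.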

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 127 Family Scaffolds
COG2161	Antitoxin component YafN of the YafNO toxin-antitoxin module, PHD/YefM family	Defense mechanisms [V]	0.79
COG4118	Antitoxin component of toxin-antitoxin stability system, DNA-binding transcriptional repressor	Defense mechanisms [V]	0.79
COG4312	Predicted dithiol-disulfide oxidoreductase, DUF899 family	General function prediction only [R]	0.79



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	77.95 %
All Organisms	root	All Organisms	22.05 %


Associated Scaffolds


Scaffold	Taxonomy	Length
3300000709|KanNP_Total_F14TBDRAFT_1006506	Not Available	817
3300000787|JGI11643J11755_11163191	All Organisms → cellular organisms → Bacteria → Thermotogae → Thermotogae → Thermotogales → Thermotogaceae → Pseudothermotoga → Pseudothermotoga thermarum	616
3300000953|JGI11615J12901_10120402	Not Available	830
3300000956|JGI10216J12902_100740047	All Organisms → cellular organisms → Bacteria	1219
3300002917|JGI25616J43925_10150941	Not Available	929
3300004024|Ga0055436_10163173	Not Available	685
3300004025|Ga0055433_10095001	Not Available	670
3300005178|Ga0066688_10296517	All Organisms → cellular organisms → Bacteria	1042
3300005181|Ga0066678_10250632	All Organisms → cellular organisms → Bacteria	1145
3300005294|Ga0065705_10353823	Not Available	951
3300005343|Ga0070687_101208233	Not Available	558
3300005356|Ga0070674_100419921	Not Available	1097
3300005457|Ga0070662_100718514	Not Available	846
3300005536|Ga0070697_102102551	Not Available	505
3300005543|Ga0070672_100464898	Not Available	1091
3300005614|Ga0068856_102545791	Not Available	518
3300005719|Ga0068861_102655314	Not Available	505
3300005836|Ga0074470_11202683	All Organisms → cellular organisms → Bacteria	847
3300005889|Ga0075290_1043074	Not Available	608
3300005981|Ga0081538_10170179	All Organisms → cellular organisms → Bacteria	952
3300006049|Ga0075417_10237024	Not Available	872
3300006163|Ga0070715_10319716	Not Available	837
3300006806|Ga0079220_11293647	Not Available	610
3300006847|Ga0075431_100278528	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria	1694
3300006852|Ga0075433_10370471	All Organisms → cellular organisms → Bacteria	1265
3300006853|Ga0075420_100221830	All Organisms → cellular organisms → Bacteria	1647
3300006853|Ga0075420_101305542	Not Available	623
3300006871|Ga0075434_102228119	Not Available	551
3300006871|Ga0075434_102234137	Not Available	551
3300006904|Ga0075424_101306197	Not Available	771
3300007004|Ga0079218_10404096	All Organisms → cellular organisms → Bacteria	1165
3300007076|Ga0075435_100685237	Not Available	890
3300009081|Ga0105098_10187477	Not Available	949
3300009093|Ga0105240_12406982	Not Available	545
3300009094|Ga0111539_12144224	Not Available	648
3300009444|Ga0114945_10170949	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium	1252
3300009789|Ga0126307_10520285	Not Available	960
3300009792|Ga0126374_10153050	Not Available	1400
3300009801|Ga0105056_1009676	Not Available	1066
3300009817|Ga0105062_1022786	Not Available	1062
3300010166|Ga0126306_10624786	Not Available	860
3300010166|Ga0126306_10740362	Not Available	790
3300010304|Ga0134088_10251674	Not Available	850
3300010400|Ga0134122_11421992	Not Available	708
3300011422|Ga0137425_1051356	Not Available	933
3300012198|Ga0137364_10755178	Not Available	735
3300012285|Ga0137370_10698399	Not Available	630
3300012355|Ga0137369_10957532	Not Available	571
3300012474|Ga0157356_1001164	All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → unclassified Candidatus Omnitrophica → Candidatus Omnitrophica bacterium	1118
3300012493|Ga0157355_1015393	Not Available	653
3300012511|Ga0157332_1002338	All Organisms → cellular organisms → Bacteria	1396
3300012582|Ga0137358_10908426	Not Available	577
3300012672|Ga0137317_1020575	Not Available	626
3300012922|Ga0137394_10628308	Not Available	907
3300012931|Ga0153915_13157085	Not Available	536
3300013296|Ga0157374_12390835	Not Available	556
3300013297|Ga0157378_10215406	All Organisms → cellular organisms → Bacteria → Proteobacteria	1823
3300013297|Ga0157378_10220270	All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Methanomicrobia → Methanosarcinales → unclassified Methanosarcinales → Methanosarcinales archaeon	1804
3300013306|Ga0163162_11104581	Not Available	898
3300014262|Ga0075301_1012521	Not Available	1333
3300014497|Ga0182008_10450741	Not Available	699
3300015262|Ga0182007_10038589	All Organisms → cellular organisms → Bacteria	1601
3300015374|Ga0132255_101480389	Not Available	1027
3300015374|Ga0132255_102934313	Not Available	729
3300017939|Ga0187775_10363270	Not Available	589
3300017966|Ga0187776_10636555	Not Available	747
3300018052|Ga0184638_1195862	Not Available	712
3300018053|Ga0184626_10292820	Not Available	676
3300018061|Ga0184619_10198064	All Organisms → cellular organisms → Bacteria	923
3300018075|Ga0184632_10009633	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → Planctomycetaceae	4050
3300018081|Ga0184625_10231393	All Organisms → cellular organisms → Bacteria	969
3300018081|Ga0184625_10658092	Not Available	507
3300018084|Ga0184629_10526886	Not Available	611
3300018089|Ga0187774_11013569	Not Available	580
3300018422|Ga0190265_10243137	All Organisms → cellular organisms → Bacteria	1842
3300018422|Ga0190265_10725463	Not Available	1116
3300018422|Ga0190265_12691085	Not Available	594
3300018465|Ga0190269_10768160	Not Available	687
3300018468|Ga0066662_10142588	All Organisms → cellular organisms → Bacteria	1796
3300019377|Ga0190264_10699627	All Organisms → cellular organisms → Bacteria	748
3300020003|Ga0193739_1067740	All Organisms → cellular organisms → Bacteria	911
3300020199|Ga0179592_10261189	Not Available	775
3300021510|Ga0222621_1139718	Not Available	516
3300022563|Ga0212128_10502573	Not Available	741
3300025562|Ga0210099_1129331	Not Available	510
3300025912|Ga0207707_10619508	Not Available	914
3300025918|Ga0207662_10522629	Not Available	819
3300025919|Ga0207657_11442501	Not Available	516
3300025945|Ga0207679_11583082	Not Available	600
3300026118|Ga0207675_102542307	Not Available	522
3300026536|Ga0209058_1332246	Not Available	525
3300026557|Ga0179587_11089484	Not Available	526
3300027163|Ga0209878_1029973	Not Available	710
3300027379|Ga0209842_1022286	Not Available	1218
3300027880|Ga0209481_10426956	Not Available	681
3300027907|Ga0207428_10900208	Not Available	625
3300027956|Ga0209820_1067005	Not Available	959
3300028592|Ga0247822_10695119	Not Available	822
3300028828|Ga0307312_10616696	All Organisms → cellular organisms → Bacteria	718
3300028828|Ga0307312_10619938	All Organisms → cellular organisms → Bacteria	716
3300031226|Ga0307497_10353244	Not Available	690
(restricted) 3300031248|Ga0255312_1067403	Not Available	861
3300031847|Ga0310907_10773894	Not Available	535
3300031852|Ga0307410_11733101	Not Available	554
3300031858|Ga0310892_11117322	Not Available	559
3300031943|Ga0310885_10902473	Not Available	506
3300031944|Ga0310884_10209305	Not Available	1045
3300032000|Ga0310903_10254824	Not Available	846
3300032000|Ga0310903_10285816	Not Available	806
3300032012|Ga0310902_10067433	All Organisms → cellular organisms → Bacteria	1808
3300032017|Ga0310899_10063976	Not Available	1390
3300032122|Ga0310895_10762638	Not Available	507
3300032205|Ga0307472_100815732	Not Available	854
3300032211|Ga0310896_10542864	Not Available	642
3300032211|Ga0310896_10610705	Not Available	609
3300032211|Ga0310896_10707663	Not Available	570
3300033412|Ga0310810_10403600	Not Available	1409
3300033480|Ga0316620_11925010	Not Available	587
3300033486|Ga0316624_10037845	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium	2959
3300034176|Ga0364931_0017572	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium	1982
3300034417|Ga0364941_046499	Not Available	963
3300034417|Ga0364941_058544	Not Available	876

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
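
Each scaffold identifier in the table above has the form `<number>|<name>`, and the number before the `|` matches a Taxon OID in the Associated Samples table (for example, 3300000709). The sketch below splits an identifier on that basis. It is an illustration under that reading of the data, not an NMPFamsDB or IMG/M API; `ScaffoldRef` and `parse_scaffold_id` are hypothetical names.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two parts of a scaffold identifier."""

    taxon_oid: str       # assumed to be the Taxon OID of the source sample
    scaffold_name: str   # scaffold name within that sample
    restricted: bool     # True when the row carries a "(restricted)" marker


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split an identifier such as '3300000709|KanNP_Total_F14TBDRAFT_1006506'."""
    restricted = "(restricted)" in identifier
    cleaned = identifier.replace("(restricted)", "").strip()
    taxon_oid, separator, scaffold_name = cleaned.partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"Unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_name, restricted)


# Two identifiers copied from the Associated Scaffolds table.
open_ref = parse_scaffold_id("3300000709|KanNP_Total_F14TBDRAFT_1006506")
restricted_ref = parse_scaffold_id("(restricted) 3300031248|Ga0255312_1067403")

print(open_ref)
print(restricted_ref)
```

Grouping parsed identifiers by `taxon_oid` would then show how many family scaffolds come from each sample, and the `restricted` flag marks rows covered by the JGI data usage note.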




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil	11.02%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	9.45%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	8.66%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	7.09%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	5.51%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	4.72%
Groundwater Sand	Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand	3.15%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere	3.15%
Natural And Restored Wetlands	Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands	2.36%
Serpentine Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil	2.36%
Tropical Peatland	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland	2.36%
Sediment	Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment	2.36%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere	2.36%
Freshwater Sediment	Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment	1.57%
Thermal Springs	Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs	1.57%
Soil	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil	1.57%
Unplanted Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Unplanted Soil	1.57%
Agricultural Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil	1.57%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	1.57%
Soil	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil	1.57%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	1.57%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere	1.57%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere	1.57%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	1.57%
Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere	1.57%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere	1.57%
Freshwater Wetlands	Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands	0.79%
Freshwater Wetlands	Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands	0.79%
Sediment (Intertidal)	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)	0.79%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	0.79%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	0.79%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	0.79%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil	0.79%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere	0.79%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	0.79%
Natural And Restored Wetlands	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands	0.79%
Rice Paddy Soil	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil	0.79%
Soil	Environmental → Terrestrial → Soil → Loam → Grasslands → Soil	0.79%
Corn Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere	0.79%
Sandy Soil	Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil	0.79%
Corn Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere	0.79%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	0.79%
Tabebuia Heterophylla Rhizosphere	Host-Associated → Plants → Roots → Rhizosphere → Unclassified → Tabebuia Heterophylla Rhizosphere	0.79%
Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere	0.79%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere	0.79%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID	Sample Name	Habitat Type
3300000709	Amended soil microbial communities from Kansas Great Prairies, USA - Total DNA F1.4 TB amended with BrdU and acetate no abondance	Environmental
3300000787	Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil	Environmental
3300000953	Soil microbial communities from Great Prairies - Kansas Corn soil	Environmental
3300000956	Soil microbial communities from Great Prairies - Kansas, Native Prairie soil	Environmental
3300002917	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm	Environmental
3300004024	Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2	Environmental
3300004025	Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1	Environmental
3300005178	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137	Environmental
3300005181	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127	Environmental
3300005294	Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil	Environmental
3300005343	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG	Environmental
3300005356	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG	Host-Associated
3300005457	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG	Host-Associated
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005543	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG	Host-Associated
3300005564	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG	Host-Associated
3300005614	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2	Host-Associated
3300005719	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2	Host-Associated
3300005836	Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBB	Environmental
3300005889	Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_80N_201	Environmental
3300005981	Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S5T2R1	Host-Associated
3300006049	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1	Host-Associated
3300006163	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG	Environmental
3300006806	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100	Environmental
3300006847	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5	Host-Associated
3300006852	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2	Host-Associated
3300006853	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4	Host-Associated
3300006871	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3	Host-Associated
3300006904	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3	Host-Associated
3300007004	Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost	Environmental
3300007076	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4	Host-Associated
3300009081	Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm May2015	Environmental
3300009093	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG	Host-Associated
3300009094	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)	Host-Associated
3300009167	Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 3 metaG - Illumina Assembly (version 2)	Environmental
3300009444	Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3	Environmental
3300009789	Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot28	Environmental
3300009792	Tropical forest soil microbial communities from Panama - MetaG Plot_12	Environmental
3300009801	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_20_30	Environmental
3300009817	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20	Environmental
3300010166	Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot27	Environmental
3300010304	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015	Environmental
3300010400	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2	Environmental
3300011422	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT640_2	Environmental
3300012198	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG	Environmental
3300012285	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG	Environmental
3300012355	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG	Environmental
3300012474	Unplanted soil (control) microbial communities from North Carolina - M.Soil.3.yng.040610	Environmental
3300012493	Unplanted soil (control) microbial communities from North Carolina - M.Soil.10.yng.090610	Environmental
3300012511	Unplanted soil (control) microbial communities from North Carolina - M.Soil.8.old.080610_10	Environmental
3300012582	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG	Environmental
3300012672	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT266_2	Environmental
3300012922	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG	Environmental
3300012931	Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG	Environmental
3300013296	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG	Host-Associated
3300013297	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG	Host-Associated
3300013306	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG	Host-Associated
3300014262	Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D1	Environmental
3300014497	Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-129_1 metaG	Host-Associated
3300015262	Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-113_1 MetaG	Host-Associated
3300015374	Col-0 rhizosphere combined assembly	Host-Associated
3300017939	Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MG	Environmental
3300017966	Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MG	Environmental
3300018052	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2	Environmental
3300018053	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1	Environmental
3300018061	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1	Environmental
3300018071	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1	Environmental
3300018075	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1	Environmental
3300018081	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1	Environmental
3300018084	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1	Environmental
3300018089	Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_20_MG	Environmental
3300018422	Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T	Environmental
3300018465	Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 IS	Environmental
3300018468	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111	Environmental
3300019377	Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 T	Environmental
3300020003	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2	Environmental
3300020199	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)	Environmental
3300021510	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_coex	Environmental
3300022563	OV2_combined assembly	Environmental
3300025562	Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Joice_ThreeSqA_D2 (SPAdes)	Environmental
3300025912	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)	Environmental
3300025918	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes)	Environmental
3300025919	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes)	Host-Associated
3300025945	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG (SPAdes)	Host-Associated
3300026118	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)	Host-Associated
3300026536	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)	Environmental
3300026557	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal	Environmental
3300027163	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_40_50 (SPAdes)	Environmental
3300027379	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 (SPAdes)	Environmental
3300027880	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes)	Host-Associated
3300027907	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)	Host-Associated
3300027956	Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm May2015 (SPAdes)	Environmental
3300028592	Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30	Environmental
3300028828	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202	Environmental
3300031226	Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_S	Environmental
3300031248 (restricted)	Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5	Environmental
3300031847	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D4	Environmental
3300031852	Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-3	Host-Associated
3300031858	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D2	Environmental
3300031943	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2	Environmental
3300031944	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D1	Environmental
3300032000	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D3	Environmental
3300032012	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D3	Environmental
3300032017	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D4	Environmental
3300032122	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D4	Environmental
3300032205	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05	Environmental
3300032211	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D1	Environmental
3300033412	Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NC	Environmental
3300033480	Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_B	Environmental
3300033486	Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_A	Environmental
3300034176Sediment microbial communities from East River floodplain, Colorado, United States - 21_j17EnvironmentalOpen in IMG/M
3300034417Sediment microbial communities from East River floodplain, Colorado, United States - 17_s17EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of their features below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID Sample Taxon ID Habitat Sequence
KanNP_Total_F14TBDRAFT_100650623300000709SoilVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGXT
JGI11643J11755_1116319113300000787SoilEVEAARVAEGVTVKQAMKLLGLSRSSYYRRVRGMKDYRARGRQAPSSQHREVLREVALKRVEAGHRRVRAYAVAWGKLTAGESGSSRMSCYRLLKGEGLLQSKRIGHDLRQAAEQRRQRLEAPGKINQVLQSDFTDYVTEDTEKHHIGCVTEYVSRFNLVSAVSDTETALDLIAVVEAGLKEIAELGHTLTDEIILVTDNGPAM
JGI11615J12901_1012040223300000953SoilVEAARVAEGLTVKQAMKLIGLSRTSYYRQVRGMKDYRKRAREIQSAKHTEVLREVAIKRAEAGHRRVRAYALAWGKITTDATGSSRMSCYRVLKSEGMIQPKRMGHNLREAAEQRRQRLTAPDTLNAVLQGDFTDYVTADGEKY
JGI10216J12902_10074004723300000956SoilMKLLGLSRTSYYRRIRGMKDYRVRGRQAASAKHKEVLREVAIKRVEAGHRRVRAYALAWGKISLQAAGSSRMSCYRVLKSEGLIQPKCIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEGALKEIAKLGHELASQIILVTDNGPAMQGAPV*
JGI25616J43925_1015094113300002917Grasslands SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRQRPRDVPSAKHIEVLREVAIKRVEAGHRRVRAYALAWGKIPTQATGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRHQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEA
Ga0055436_1016317323300004024Natural And Restored WetlandsMKLLGLSRSSYYRQVRGMKDYRVRGRQAASAKHKDVLRDVAIKRVEAGHRRVRAYALAWGKITADAAGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEK
Ga0055433_1009500113300004025Natural And Restored WetlandsRSSEVNRRASHRDPFFKTAVEELRPSWAEVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRVRGRQAASAKHKDVLRDVAIKRVEAGHRRVRAYALAWGKITADGAGSSRMSCYRVLKGEGLIQPKRIGHGLREAAEQRRQRLTAPDKLNTVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALREITDLGHKLVAQV
Ga0066688_1029651723300005178SoilVKAAMEILGLSRSSYYRRVRGMSDYKAHPRKVGSTQHAEVLREVALKRVEAGHRRVRAYAVAWGKLAQDSPASSRMSCYRVLKSEGLIQPKRIGHDLRETAERRRQMLKAPEKINEVLQGDFTDYTTEDGEKYRIGCVTEYVSRFNLVSEVLDTETALDLIAVTEAALNEIIELGHELAAQVILVTDNGPAMKSRR
Ga0066678_1025063223300005181SoilVKAAMEILGLSRSSYYRRVRGMSDYKAHPRKVGSTQHAEVLREVALKRVEAGHRRVRAYAVAWGKLAQDSPASSRMSCYRVLKSEGLIQPKRIGHDLRETAERRRQMLKAPEKINEVLQGDFTDYTTEDGEKYRIGCVTEYVSRFNLVSEVLDTETALDLIAVTEAARNEIIELGHELAAQVILVTDNGPAMKSRRFKN
Ga0065705_1035382313300005294Switchgrass RhizosphereVARVAEGLTVKQAMKLLKLSRTSYYRQVRGMKDYRVRGRRAASTKHKEALREVALKRVEAGHRRVRAYALAWGKLTDGTTGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQWLQAPEKLNQVLQADFTDYVTEDNEKHRIGCVTEYLSRFNLIS
Ga0070687_10120823313300005343Switchgrass RhizosphereAEVEVARVAEGLTVKRAMKLSGLSRSSYYRQVRGMKDYHVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRREGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVPSTPRRRWT*
Ga0070674_10041992133300005356Miscanthus RhizosphereVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYHVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQWLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVL
Ga0070662_10071851423300005457Corn RhizosphereVARVAEGLTVKQAMKLLKLSRTSYYRQVRGMKDYRVRGRRAASTKHKEALREVALKRVEAGHRRVRAYALAWGKLTDGTTGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQWLQAPEKLNQVLQADFTDYVTEDNEKHRIGCV
Ga0070697_10210255113300005536Corn, Switchgrass And Miscanthus RhizosphereRPSWTDVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRQRPRDVPSAKHIEVLREVAIKRVEAGHRRVRAYALAWGKIPTQATGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRHQRLSAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQV
Ga0070672_10046489813300005543Miscanthus RhizosphereMKDYHVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRREGLIQPKRIGHDLRQAAEQRRQWLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETAL
Ga0070664_10170163113300005564Corn RhizosphereVSTQHADILREVALKRVEAGHRRVRAYAIAWGKLQADSAASSRMSCYRVLKSQGLLQPQRIGRDLREAAERRRQILKAPEKLNELLQADFTDYVTEDGEKYRIGCVTEYLSRFNLVSEVLDTETALDLI
Ga0068856_10254579113300005614Corn RhizosphereDPFFKTAVQELRPSWAEVEAARRAEGLTVKAAMDILRLSRTTYYRQVRGMIDYEAHPRKAVSTEHAEVLREVALKRVEAGHRRVRAYAIAWGKLKADSAASSRMSCYRVLKSQGLLQPQRIGRDLREAAERRRQMLKAPEKLNELLQADFTDYVTEDGEKYRIGCVTEYLSR
Ga0068861_10265531413300005719Switchgrass RhizosphereMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAAL
Ga0074470_1120268323300005836Sediment (Intertidal)MKLLGLSRTSYYRQVRGMKDYRQRPREVPSAKHLEVLREVAIKRVEAGHRRVRAYALAWGKLSTQATGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDAFVCAPS*
Ga0075290_104307413300005889Rice Paddy SoilSFFKTAVEGLRPSWAEIEAAREAEGLTVKQAMKLLGLSRTSYYRQVRGMKDYRARSCNVASHQHREVLREVALKRVEAGHRRVRADAVAWGKISTDAAGSSRMSCYRVLKGEGLIQRKRIGHDLRQAAEQRRQRLTAPDKFNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALREISDLG
Ga0081538_1017017933300005981Tabebuia Heterophylla RhizosphereVEVETARVAEGLTVNKAMKLLGLSRTSYYRRVRGMTDYTGRSRQGVSTQHSEALREVALKRVEAGHRRVRAYAMAWGKLSADAAGSSRMSCYRVLKRAGLIQPKRIGHDLREAAERRRQMLRAPEQLNEVLQGDFTDYVTGDGEK
Ga0075417_1023702423300006049Populus RhizosphereVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKY
Ga0070715_1031971613300006163Corn, Switchgrass And Miscanthus RhizosphereVEVEAARVAEGLTVKRAMKLLGLSRTSYYRQVRGMKDYTAQTRKVISIQHSEVLREVALKRVEAGHRRVRAYAVAWGKISVDAVGSSRMSCYRVLKSAGLIQSKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYVTEDGEKYRIGCVTE
Ga0079220_1129364723300006806Agricultural SoilVEAARVAEGLTVKQAMKLIGLSRTSYYRQVRGMKDYRKRAREIQSAKHTEVLREVAIKRAEAGHRRVRAYALAWGKITTDATGSSRMSCYRVLKSEGMIQPKRMGHNLREAAEQRRQRLTAPDTLNAVLQGDFTDYVTADGEKYHIGG
Ga0075431_10027852813300006847Populus RhizosphereVGEGLTVKAAMAVLGLSRSSYYRQVRGMIDYKCHGRKTESTQHAEVLREVALKRVEAGHRRVRAYAIAWGKLLQDRAASSRMSCYRVLKREGLIQPKRLGHDLRQAAERRRQMLTAPEKLNEVLQGDF
Ga0075433_1037047123300006852Populus RhizosphereVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLRQAAEQRRQRLPAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETGWT*
Ga0075420_10022183013300006853Populus RhizosphereVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGQDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAAL
Ga0075420_10130554213300006853Populus RhizosphereVEAARVAEGLTVKGVMEILGLSRSSYYRQVRGMTDYGAHPRKAESTQHAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQSKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGQKYRIGCITEYLSRFNLVSEVLDTETALDLIAVTELALKEIVELGHELKSQVILVTDNGPAMKSRR
Ga0075434_10222811923300006871Populus RhizosphereVKGVMEILGLSRSSYYRQVRGMTDYGAHPRKAESTQHAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQSKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGQKYRIGCI
Ga0075434_10223413713300006871Populus RhizosphereHRDPFFKTAVEELRPSWAEVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLREAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLD
Ga0075424_10130619723300006904Populus RhizosphereVAEGLTAKGVMEILGLSRSSYYRQVRGMTDYGAHPRKAESTQHAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQSKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGQKYRIGCI
Ga0079218_1040409633300007004Agricultural SoilVKAAMEILELSRSSYYRQVRGMADYSAHRRKVASTQHAEVLREVALKRVEAGHRRVRAYAMACGKISQDSVASSRMSCYRVLKREGLIQRKRIGHDLREAAERRRQMLKAPEKFNEVLQGDFTDYVTEDGEKYRIGCVTEYLSRFNLVSEVLDTETALDLIAVT
Ga0075435_10068523723300007076Populus RhizosphereVKGVMEILGLSRSSYYRQVRGMTDYGAHPRKAESTQHAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQSKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGQKYRIGCITEYLSRFNLVSEVLD
Ga0105098_1018747733300009081Freshwater SedimentVEAARVAEGLTVKAAMNLLGLSRTSYYRQVRGMTDYKAQPRKAESTQHAQILREVALKRVEAGHRRVRAYALAWGRLSQDSPASSRMSCYRVLKGEGLIQPKRIGQDLREAAERRRQMLRAPEKLNELLQGDFTDYVTEDGEKYRIGCVTEYLS
Ga0105240_1240698213300009093Corn RhizosphereMKDYRVRGRRAASTKHKEALREVALKRVEAGHRRVRAYALAWGKLTDGTTGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQWLQAPEKLNQVLQADFTDYVTEDNEKHRIGCVTEYLSRFNLISAVSDTETALDLIAVVEGALEEIVELGHGLAAQIILVTDNGPA
Ga0111539_1214422413300009094Populus RhizosphereVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYHVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKIPADAAGISRMSCYRVLRSEGLIQRKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLV
Ga0113563_1076847433300009167Freshwater WetlandsLPGDQSAKHTEALREVAIKRIEAGHRRVRAYAMAWGKISTDAAASSRMSCYRVLQSEGLIQPKRIGHNLREAAEQRRQRVKAPEKCNEVLEVLEGNFADYETEDLSPYQGGRIPRWRCSRKEL
Ga0114945_1017094913300009444Thermal SpringsVKAAMEILGLSRTSYYRQVRGMTDYKAHGRGTPSSQHIEALREVALKRVEAGHRRVRAYALAWGTLSRDTLGSSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYVTEDGEKYRIGCVTEYLSRFNLISEVLDTETA
Ga0126307_1052028523300009789Serpentine SoilVEAARVAEGLTVKRAMKLLGLSRSSYYRQVRGMKDYRERGRQAPSAQHREVLREVALKRVEAGHRRVRAYALAWGKMTVNLAGSSRMSCYRMLKSEGLLQPKGIGHDLRQAAEQRRQRLKTPEKINQVLQSDFTDYVSEDGEKHRIGCVTEYLSRFNLIS
Ga0126374_1015305033300009792Tropical Forest SoilVAEGLTVNKAMKLLGLSRSSYYRQVRGMKDYTAQSRKVISTQHSEVLREVALKRVEAGHRRVRAYAVAWGKISEDAAGSSRMSCYRVLKSAGLIQPKRIAHDLREAAERRREMLKAPEQLNEVLQGDFTDYVTEDGEKYRIGCITEYLSRFNLVSEVLDTETALDLIAVTETALKQITELGHELAAQVVLVTDNGPAMKS
Ga0105056_100967613300009801Groundwater SandVEVEGARVAEGLTVKKAMKLLGLSRTSYYRQVRGMKDYTTQPRKVISTQHSEVLREVALKRVEAGHRRVRAYAVAWGKISQDAAGSSRMSCYRVLKSAGLIQPKRIGHDLREAAERRRQMLKAPEQLNEVLQGDFTDYTTEDGEKYRIGCITEYLSRFNLVSKFLNRLDFIAGPLSVTRITCAASSWPSSV
Ga0105062_102278623300009817Groundwater SandVEVEGARVAEGLTVKKAMKLLGLSRTSYYRQVRGMKDYTTQPRKVISTQHSEVLREVALKRVEAGHRRVRAYAVAWGKISQDAAGSSRMSCYRVLKSAGLIQPKRIGHDLREAAERRRQMLKAPEQLNEVLQGDFTDYTTEDGEKYRIGCITEYLSRFNLVSEVLDTETALDLIAVTEAALKQITELGHELAAQVILVTD
Ga0126306_1062478613300010166Serpentine SoilVKEAMKLLGLSRSSYYRQVRGMKDYQVRGRQRASSKHKEALREVAIKRVEAGHRRVRAYALAWGKLSQEAAGSSRMSCYRVLKSEGLIQPKRIGQDLRQAAEQRRQRLKAPEKINEILQGDFTDYVTEDGEKYRIGGVTEYLSRFNL
Ga0126306_1074036223300010166Serpentine SoilMEVEAARVAEGLTVKGAMKILGLSRSSYYRQVRGMSDYQAHPRKAHSAQHAEVLREVALKRVEAGHRRVRAYAIAWGQLSESSPASSRMSCYRVLKSEGLMQPKRLGQELREASQRRRQMLKAPERFNEVLQGDFTDYTTEDGEKYRIGCITEYLSRFNLVSEVLDTE
Ga0134088_1025167413300010304Grasslands SoilVEAARVAEGLTVKGAMEILGLSRSSYYRQVRGMTDYGAHPRKAESTQHAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQSKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGQKYRIGCITE
Ga0134122_1142199213300010400Terrestrial SoilVEAARVAEGLTVKQVMKLLGLLRSSYYRQVRGMKDYRQRPRDVPSAKHIEVLREVAIKRVEAGHRRVRAYALAWGKIPTQATGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAAQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALEEITDLGHELAAQVILVTDNGPAMKSR
Ga0137425_105135613300011422SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYHVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQWLTAPDKLNAVLQGDFTDYV
Ga0137364_1075517813300012198Vadose Zone SoilMKDYGVGGRQAASAKHKEALREMAIKRVEAGHRRVRAYAVAWGKISAQAAGASRMSCYRVLKSEGLIQPKRIGYDLRQAAEQRRQRLKAPENTNEVLQGDFTDDGTEDGEKYRIGGVTEYRAGSI*
Ga0137370_1069839923300012285Vadose Zone SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRQRPRDVPSAKHIEVLREVAIKRVEAGHRRVRAYALAWGKIPTQATGRSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRHQRLTAPDKLNAVLQGDFTDYVTE
Ga0137369_1095753213300012355Vadose Zone SoilFFKAACQELKPSWAEVEAARVAEGLTVKGAMEILGLSRSSYYRQVRGRTDYGAHPRKAESTQHAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQSKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGQKYRIGCITEYLSRFNLVSEVLDTETALDLIAV
Ga0157356_100116433300012474Unplanted SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRI
Ga0157355_101539323300012493Unplanted SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYPVRGRQEASAKHKEVLREVALMRMEAGHRRVRAYAMAWGKISLDAAGISRMSCYRVLKSEGLIQPKRIGRNLREAAEQRRQRLTAPDKLNAMLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALR
Ga0157332_100233813300012511SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAVAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLREAAEQRRQRLPAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTGYLSRFNLVSQVLDTETALDLIAVVEAALTEMTSWAMSLQARSFW*
Ga0137358_1090842613300012582Vadose Zone SoilDPFFKTAVEELRPSWAEVEAARVAERLTVKQAMKLLGLSRTSYYRQVRGMKDYGVGGRQAASAKHKEALREVAIKRVEAGHRRVRAYAVAWGKISAQAVGASRMSCYRVLKSEGLIQPKRIGYDLRQAAEQRRQRLKAPENTNEVLQGDFTDYGTEDGEKYRIGGVTEYLSRFNLVSEVLDTETALDLIAV
Ga0137317_102057513300012672SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYHVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKITVDAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQWLTAPDKLNAVLQGDFTDYVTEDGEKYR
Ga0137394_1062830813300012922Vadose Zone SoilVEAARVAERLTVKQAMKLLGLSRTSYYRQVRGMKDYRVGGRQAASAKHKEALREVAIKRVEAGHRRVRAYAVAWGKISAQAAGASRMSCYRVLKSEGLIQPKRIGYDLRQAAEQRRQRLKASENTNEVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSEVLDTETALDLIAVVEAALREIAELGHEL
Ga0153915_1315708513300012931Freshwater WetlandsSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVAIKRVEAGHRRVRAYALAWSKITADAAGSSRMSCYRVLKSEGLIQPKRIGHNLRQAAEQRRQRLTAPDKLNAVLQGDFTDDVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALREIADLGHDLAAPVILVTDNGP
Ga0157374_1239083513300013296Miscanthus RhizosphereVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYQVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKITVDAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAGEQRRQWLTAPDKLNAVLQGDFTDYVTEDGEKYRAHLRR
Ga0157378_1021540613300013297Miscanthus RhizosphereVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLREAAEQRRQRLTAPDKLNAVLQGDFTDYVTEEGEKYRIGGVTEYLSR
Ga0157378_1022027023300013297Miscanthus RhizosphereVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYHLRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRREGLIQPKRIGHDLRQAAEQRRQWLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVARHRDGVGLDRGG*
Ga0163162_1110458133300013306Switchgrass RhizosphereVEAARVAEGLTGKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLRQAAEQRRQRLPAPDKLNAVLQGDFTDYVTED
Ga0075301_101252113300014262Natural And Restored WetlandsLTVKQAMKMLGLSRTSYYRQVRGMKDYRVRGRGAASAKHKEVLREVALKRVEAGHRRVRAYALAWGKLTDGTAGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPEKLNQVLQADF
Ga0182008_1045074123300014497RhizosphereVEAARVAEGLTVKQAMKLIGLSRTSYYRQVRGMKDYRKRAREIQSAKHTEVLREVAIKRAEAGHRRVRAYALAWGKITTDATGSSRMSCYRVLKSEGMIQPKRMGHNLREAAEQRRQRLTAPDTLNAVLQGDFTDYVTADGEKYHIGGVTEYLSRFNLISEVSDTETALDLIAVVE
Ga0182007_1003858923300015262RhizosphereVEAARVAEGLTVKQAMKLIGLSRTSYYRQVRGMKDYRKRAREIQSAKHTEVLREVAIKRAEAGHRRVRAYALAWGKITTDATGSSRMSCYRVLKSEGMIQPKRMGHNLREAAEQRRQRLTAPDTLNAVLQGDFTDYVTADGEKYHIGGVTEYLSRFNLISEVSDTETALDLIAVVEGALKEIAELRP*
Ga0132255_10148038913300015374Arabidopsis RhizosphereVEGARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRRREAQSAKHTEVLREVAIKRVEAGHRRVRAYALAWGKISTDAAGSSRMSCYRVLKGEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALREMTELGHELASQII
Ga0132255_10293431313300015374Arabidopsis RhizosphereVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYPVRGRQEASAKHKEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLD
Ga0187775_1036327013300017939Tropical PeatlandVEAARGAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYSARGRQAPSAQHQEVLREVALKRVEAGHRRVRAYALAWGKLTANAAGSSRMSCYRVLKSEGLLQPKRIGEDLREAAEQRRQRLKAPEKMNEVLQADFTDYVTEDSEK
Ga0187776_1063655523300017966Tropical PeatlandVEAARGAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYSARGRQAPSAQHQEVLREVALKRVEAGHRRVRAYALAWGKLTANAAGSSRMSCYRVLKSEGLLQPKRIGEDLREAAEQRRQRLKAPEKMNEVLQADFTDYVTEDSEKHRIGCVTEYLSRFNLISEVSNTETALDLIAVVEGALKEIIELGHELAG
Ga0184638_119586213300018052Groundwater SedimentVKGAMEILGLSRSSYYRQVRGMTDYSAHPRKGVSTQYAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTE
Ga0184626_1029282013300018053Groundwater SedimentMEILGLSRSSYYRQVRGMTDYSAHPRKGVSTQYAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGEKYRIGCITEYLSRFNL
Ga0184619_1019806423300018061Groundwater SedimentVKGAMEILGLSRSSYYRQVRGMTDYGAHPRKVESTQHAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGEKYRIGCITEYLSRFNLV
Ga0184618_1022509513300018071Groundwater SedimentVGEGLTVKKAMKLLGLSRTSYYRQIRGMKDYQARARKVVSANHEEILREVALKRVEAGHRRVRAYAIAWEKLSSDTAGSSRMSCYRVLKSAGLIQPQRVGQELREGAERRRRMLKAPENFNAVLQ
Ga0184632_1000963373300018075Groundwater SedimentVKGAMEILGLSRSSYYRQVRGMTDYSAHPRKGVSTQYAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGEKYRIGCITEYLSRFNLVSEVLIPRRRWI
Ga0184625_1023139313300018081Groundwater SedimentMEILGLSRSSYYRQVRGMTDYGSHPRKAQSTQHAEVLREVAFKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYVTEDGEKYRIGCITEYLSRFNLVSEVLDTETALDLIAVTELALKEIVELGHELKSQVILVTDNGPAMKS
Ga0184625_1055589813300018081Groundwater SedimentMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIERNLREAAEQRRQRLTAPDKLNAVLQGDFTDYLTEDGEKYRIGGVTEY
Ga0184625_1065809213300018081Groundwater SedimentVEAAVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRAREVQSAKHAEVLREVALMRMEAGHRRVRAYAVAWGKISTEAAGISRMSCYRVLKSEGLIQPKRIGRNLREAAEQRRQRLTAPDKLNAVLQGDF
Ga0184629_1052688623300018084Groundwater SedimentVEAARVAEGLTVKQAMKLLGLSRTSYYRQVRGMKDYRKRPREIQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLREAAEQRRQRLTAPDKLNAVLQGDFTDY
Ga0187774_1101356913300018089Tropical PeatlandAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYQVRGRQAASAQHKEVLREVALRRVESGHRRVRAYAMAWGKLAAGAAGSSRMSCYRVLKSEGLIQPKRIGQDLRQAAEHRRQRLTAPEQLNQVLQADFTDYVTEDNEKHRIGCVTEYLSRFNLISAVSDTETALDLIAVVEGALKEITELGHELTNQIVLVTDN
Ga0190265_1024313713300018422SoilRQVRGMTDYQAHRRTAPSTQHMEVWREVALKGVEAGHRRVRAYALAWAQLLPNALGSSRMSCYRMLESAGLIQPKRIGHDLRAAAERRRRMLKAPEKLNEVLQGDFTDYVTEDGEKYRIGCITEYLSRFNV
Ga0190265_1072546313300018422SoilMKLLGLSRTSYYREVHGMTDYRVHPRKASSRQHAEVLREVALKRVEAGHRRVRAYAMAWGKLSQDSPASSRMSCYRVLKSEGLIQPKRIGQDLREAAERRRQMLKAPEKFNEVLQGDFTDYVTEDGEKYRIGCITEYLSRFNLVSEVLDTETALDLIAVTDQALKEIVELGHELKLQMILVTDNGPAMKSRRF
Ga0190265_1269108513300018422SoilLTPSWAEVEAARVAEGLTVKQAMKLLGLSRSSYYRQVHGMKDYRKRPREVQSAKHTEVLREVALKRVEAGHRRVRAYALAWGKITTDAAGSSRMSCYRVLKSEGLIQPKRIGQDLRQAAEQRRQRLIAPDRLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVIESALREIAELGHELAS
Ga0190269_1076816023300018465SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRQRPRDVPSAKHIEVLREVAIKRVEAGHRRVRAYALAWGKIPTQATGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRHQRLTAPDKLNAVL
Ga0066662_1014258833300018468Grasslands SoilVKAAMEILGLSRSSYYRRVRGMSDYKAHPRKVGSTQHAEVLREVALKRVEAGHRRVRAYAVAWGKLAQDSPASSRMSCYRVLKSEGLIQPKRIGHDLRETAERRRQMLKAPEKINEVLQGDFTDYTTEDGEKYRIGCVTEYVSRFNLVSEVLDTETALDLIAVTEAALNEIIELGH
Ga0190264_1069962723300019377SoilMEILGLSRSSYYRQVRGMADYSAHRRKVASTQHAEVLREVALKRVEAGHRRVRAYAMAWGKLSQDSAASSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPEKFNEVLQGDFTDYVTEDGEKYRIGCVTEYLSRFNLVSEVF
Ga0193739_106774013300020003SoilVGEGLTVKKAMKLLGLSRTSYYRQIRGMKDYQARARKVVSANHEEILREVALKRVEAGHRRVRAYAIAWEKLSSDTAGSSRMSCYRVLKSAGLIQPQRVGQELREGAERRRRMLKAPENFNAVLQGDFTDYTTEDGERYPIGGITEYHSRFNLVSEVLDT
Ga0179592_1026118923300020199Vadose Zone SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRQRPRDVPSAKHIEVLREVAIKRVEAGHRRVRAYALAWGKIPTQATGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRHQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIG
Ga0222621_113971823300021510Groundwater SedimentMKLLGLSRSSYYRQVRGMKDYRKRAREVQSAKHAEVLREVALMRMEAGHRRVRAYAVAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLRQAAEQRRQRLTAPDKLNAVLQGDFTDYV
Ga0212128_1050257313300022563Thermal SpringsVKAAMEILGLSRTSYYRQVRGMTDYKAHGRGTPSSQHIEALREVALKRVEAGHRRVRAYALAWGTLSRDTLGSSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYVTEDGEKYRIGCVTEYLSRFNLISEVLD
Ga0210099_112933113300025562Natural And Restored WetlandsSAHPRKLASIEHAEVLREVALKRVEAGHRRVRAYALAWGRLSLEAAGSSRMSCYRVLKREGLIQARRVGRDLREAAEGRRQMLKAPEKLNELLQGDFTDYVTEDGEKYRIGCVTEYLSRFNLVSEVLDTETALDLIAVTEAALKEVVALGHELPDQVVLVTDNGPAMKS
Ga0207707_1061950823300025912Corn RhizosphereVKAAMEILGLSRSTYYRQVRGMIDYEAHPRKAGSIQHGEILREVALKRVEAGHRRVRAYALAWGKLAADSPASSRMSCYRVLKSQGLLQPQRIGRDLREAAERRRQMLKAPGKLNELLQGDFTDYVTEDGEKYRIGCVTEYLSRFNLVSEVLDT
Ga0207662_1052262913300025918Switchgrass RhizosphereWTQRERWPDSGLGAPGGRSSEVDRRASHRDPFFKTAVEELRPSWAEVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYHVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRREGLIQPKRIGHDLRQAAEQRRQRLTVPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVPSTPRRRRSFTRALPLRN
Ga0207657_1144250113300025919Corn RhizosphereVEELRPSWAEVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYHVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQWLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQ
Ga0207679_1158308213300025945Corn RhizosphereVSTQHADILREVALKRVEAGHRRVRAYAIAWGKLQADSAASSRMSCYRVLKSQGLLQPQRIGRDLREAAERRRQILKAPEKLNELLQADFTDYVTEDGEKYRIGCVTEYLSRFNLVSEVLDTETALDLIVVIEQALHEIRELGHELPAQVVLVTDN
Ga0207675_10254230713300026118Switchgrass RhizosphereTGKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALDAGA
Ga0209058_133224613300026536SoilVKGAMEILGLSRSSYYRQVRGMTDYGAHPRKAESTQHAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQSKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGQK
Ga0179587_1108948423300026557Vadose Zone SoilMKLLGLSRTSYYRQVRGMKDYGVGGRQAASAKHKEALREVAIKRVEAGHRRVRAYAVAWGKISAQAVGASRMSCYRVLKSEGLIQPKRIGYDLRQAAEQRRQRLKAPENTNEVFQGDFTDDVTEDGEK
Ga0209878_102997313300027163Groundwater SandVEVEGARVAEGLTVKKAMKLLGLSRTSYYRQVRGMKDYTTQPRKVISTQHSEVLREVALKRVEAGHRRVRAYAVAWGKISQDAAGSSRMSCYRVLKSAGLIQPKRIGHDLREAAERRRQMLKAPEQLNEVLQ
Ga0209842_102228623300027379Groundwater SandVEVEGARVAEGLTVKKAMKLLGLSRTSYYRQVRGMKDYTTQPRKVISTQHSEVLREVALKRVEAGHRRVRAYAVAWGKISQDAAGSSRMSCYRVLKSAGLIQPKRIGHDLREAAERRRQMLKAPEQLNEVLQGDFTDYTTEDGEKYRIGCITEYLSRFNLVSEVLDTETALDLIAVTEAALKQITELGHELAAQVILVTDN
Ga0209481_1042695613300027880Populus RhizosphereVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGQDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVT
Ga0207428_1090020813300027907Populus RhizosphereDRRASHRDPFFKTAVEELRPSWAEVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYHVRGRQAASAKHKDVLREVAIKRVEAGHRRVRAYAMAWGKIPADAAGISRMSCYRVLRSEGLIQRKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALDEITD
Ga0209820_106700533300027956Freshwater SedimentMNLLGLSRTSYYRQVRGMTDYKAQPRKAESTQHAQILREVALKRVEAGHRRVRAYALAWGRLSQDSPASSRMSCYRVLKGEGLIQPKRIGQDLREAAERRRQMLRAPEKLNELLQGDFTDYVTEDGEKYRIGCVTEYLS
Ga0247822_1069511923300028592SoilVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTE
Ga0307312_1061669623300028828SoilVGEGLTVKKAMKLLGLSRTSYYRQIRGMKDYQARARKVVSANHEEILREVALKRVEAGHRRVRAYAIAWEKLSSDTAGSSRMSCYRVLKSAGLIQPQRVGQELREGAERRRRMLKAPENFNAVLQGDFTDYTTE
Ga0307312_1061993813300028828SoilVKGAMEILGLSRSSYYRQVRGMTDYGAHPRKVESTQHAEVLREVALKRVEAGHWRVRAYALAWGKLSQDSPASSRMSCYRELKSEGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGEKYRI
Ga0307497_1035324413300031226SoilMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLHEVAFMRMEAGHRRVRAYAVAWGKISTEAAGISRMSCYRVLKSEGLIQPKRIGRNLREAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQ
(restricted) Ga0255312_106740323300031248Sandy SoilVEAARVGEGLTVKQAMKLLGLSRTSYYRQVRGMKDYRKRPREVQSAKHAEVLREVAIKRVEAGHRRVRAYALAWSKITADAAGISRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQ
Ga0310907_1077389413300031847SoilRTSYYRQVRGMKDYQVRERQAASAAHKEVLREVAIKRVEAGHRRVRAYALAWGKITADAAGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLISQVLDSETALDLIAVVEGVLTEIAELGHELAGQIILVTDN
Ga0307410_1173310113300031852RhizosphereVGEGLTVKAAMAVLGLSRSSYYRQVRGMIDYKCHGRKTESTQHAEVLREVALKRVEAGHRRVRAYAIAWGKLLQDRAASSRMSCYRVLKREGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYVTEDGEKYRIGCITEYLSRFNLVSEVL
Ga0310892_1012075913300031858SoilVEAARVAEGLTVKQAMKLLGLSRTSYYRQVRGMKDYQVRERQAASAAHKEVLREVAIKRVEAGHRRVRAYALAWGKITADAAGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLKERN
Ga0310892_1111732213300031858SoilAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRRREAQSAKHTEVLREVAIKRVEAGHRRVRAYALAWGKISTDAAGSSRMSCYRVLKGEGLIQPKRIGHDLRQAAEQRRQRLTAPNKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALREIAELGHE
Ga0310885_1090247313300031943SoilMKLLGLSRTSYYRQVRGMKDYQVRERQAASAAHKEVLREVAIKRVEAGHRRVRAYALAWGKITADAAGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLISQVLDSETALDLIAVVEGALTEIA
Ga0310884_1020930513300031944SoilMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGQDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALEEITD
Ga0310903_1025482413300032000SoilVEGARVAEGVTVKQAMKLLGLSRSSYYRQVRGMKDYRKRRREAQSAKHTEVLREVAIKRVEAGHRRVRAYALAWGKISTDAAGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPNKLNAVLQGDFTDYVTED
Ga0310903_1028581613300032000SoilVEAARVAEGLTVKQAMKLLGLSRTSYYRQVRGMKDYQVRERQAASAAHKEVLREVAIKRVEAGHRRVRAYALAWGKITADAAGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKY
Ga0310902_1006743323300032012SoilVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALEEITDLGHE
Ga0310899_1006397633300032017SoilVEAARVGEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKITADAAGISRMSCYRVLRSEGLIQPKRIGHDLRQAAEQRRQWLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALEEITDLGHELAAQVILVTDNGPAMKSR
Ga0310895_1076263813300032122SoilLGLSRSSYYRQVRGMKDYRKRRREAQSAKHTEVLREVAIKRVEAGHRRVRAYALAWGKISTDAAGSSRMSCYRVLKGEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALREIAELG
Ga0307472_10081573213300032205Hardwood Forest SoilVEAARVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREVQSAKHAEVLREVALMRMEAGHRRVRAYAMAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQ
Ga0310896_1054286413300032211SoilMKDYTAQTRKVISTQHSEVLREVALKRVEAGHRRVRAYAVAWGKISEDAAGSSRMSCYRVLKSAGLIQPKRIGHDLREAAERRRQMLKAPEQLNEVLQGDFTDYVTEDGEKYRIGCITEYLSRFNLVSEVLDTETALDLIAVTEAALKQITELGHELA
Ga0310896_1061070513300032211SoilSGARAPGGRSSEVNRRASHRDPFFKTAVEELRPSWAEVEAARVAEGLTVKQAMKLLGLSRTSYYRQVRGMKDYQVRERQAASAAHKEVLREVAIKRVEAGHRRVRAYALAWGKITADAAGSSRMSCYRVLKSEGLIQPKRIGHDLRQAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLISQVLDSE
Ga0310896_1070766313300032211SoilGVTVKQAMKLLGLSRSSYYRQVRGMKDYRKRRREAQSAKHTEVLREVAIKRVEAGHRRVRAYALAWGKISTDAAGSSRMSCYRVLKGEGLIQPKRIGHDLRQAAEQRRQRLTAPNKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEAALREMTELGHELASQIILVTD
Ga0310810_1040360033300033412SoilVEAARVAEGLTVKQAMKLIGLSRTSYYRQVRGMKDYRKRAREIQSAKHTEVLREVAIKRAEAGHRRVRAYALAWGKITTDATGSSRMSCYRVLKSEGMIQPKRMGHNLREAAEQRRQRLTAPDTLNAVLQGDFTDYVTADGEKYHIGGVTEYLSRFNLISEVSDTETALDLIAVVEGALKEIAELRP
Ga0316620_1192501013300033480SoilVEAARVGEGLTVKQAMKLLGLSRTSYYRQVRGMKDYRKRPREVQSAKHAEVLREVAIKRVEAGHRRVRAYALAWSKITADAAGSSRMSCYRVLKSEGLIQPKRIGHNLRQAAEQRRQRLTAPDKLNAVLQGDFTDDVTEDGEKYRIGGVTEYLSRFNLV
Ga0316624_1003784513300033486SoilVEAARIGEGLTVKQAMKLLGLSRGSYYRQVRGMKDYRKRPREVQSAKHAEVLREVAIKRVEAGHCRVRAYALAWSKITADAAGSSRMSCYRVLKSEGLIQPKRIGHNLRQAAEQRRQRLTAPDKLNAVLQG
Ga0364931_0017572_451_9633300034176SedimentVEAARVAEGLTVKGAMEILGLSRSSYYRQVRGMTDYSAHPRKGVSTQYAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPEKLNEVLQGDFTDYTTEDGEKYRIGCITEYLSRFNLVSEVLIPRRRWI
Ga0364941_046499_1_5283300034417SedimentVEAAVAEGLTVKQAMKLLGLSRSSYYRQVRGMKDYRKRPREIQSAKHAEVLREVALMRMEAGHRRVRAYAVAWGKISTDAAGISRMSCYRVLKSEGLIQPKRIGRNLREAAEQRRQRLTAPDKLNAVLQGDFTDYVTEDGEKYRIGGVTEYLSRFNLVSQVLDTETALDLIAVVEA
Ga0364941_058544_1_4803300034417SedimentVEAARVAEGLTVKGAMEILGLSRSSYYRQVRGMTDYSAHPRKGVSTQYAEVLREVALKRVEAGHRRVRAYALAWGKLSQDSPASSRMSCYRVLKSEGLIQPKRIGHDLREAAERRRQMLKAPKKLNEVLQGDFTDYTTEDGEKYRIGCITEYLSRFNLVS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice