NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F015466

Overview

Basic Information
Family ID F015466
Family Type Metagenome / Metatranscriptome
Number of Sequences 254
Average Sequence Length 95 residues
Representative Sequence MRLKGRHWLMLWLLIFLGVLVAITTRQTAGFRTARRLHDLREERLALEARRADLERRIRVGSSRQVLVPIVERALGLHEPADSEFVLFALPSAASGMP
Number of Associated Samples 185
Number of Associated Scaffolds 254

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.39 %
% of genes from short scaffolds (< 2000 bps) 0.39 %
Associated GOLD sequencing projects 173
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.34


Most Common Taxonomy
Group Unclassified (98.819 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(20.079 % of family members)
Environment Ontology (ENVO) Unclassified
(25.197 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(36.220 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 56.35%, β-sheet: 0.00%, Coil/Unstructured: 43.65%

Predicted 3D Structure

pTM-score: 0.34

Low Quality Model:

This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

| Pfam ID | Name | % Frequency in 254 Family Scaffolds |
|---|---|---|
| PF01795 | Methyltransf_5 | 57.48 |
| PF03717 | PBP_dimer | 21.26 |
| PF08245 | Mur_ligase_M | 1.57 |
| PF00905 | Transpeptidase | 1.18 |
| PF02492 | cobW | 0.39 |
| PF03975 | CheD | 0.39 |
| PF01225 | Mur_ligase | 0.39 |
| PF01584 | CheW | 0.39 |
| PF01740 | STAS | 0.39 |
| PF03793 | PASTA | 0.39 |
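
The frequencies above are percentages of the 254 family scaffolds, so they can be converted back to approximate scaffold counts. The following is a minimal sketch, assuming each percentage was computed as count / 254 and rounded to two decimals. The function name `scaffolds_from_percent` is illustrative and not part of NMPFamsDB.

```python
def scaffolds_from_percent(percent: float, total: int = 254) -> int:
    """Return the nearest whole number of scaffolds for a percentage of `total`.

    Assumes the percentage was derived as count / total * 100 and rounded.
    """
    return round(percent * total / 100)


# A few Pfam frequencies copied from the table above.
pfam_frequencies = {
    "PF01795": 57.48,
    "PF03717": 21.26,
    "PF08245": 1.57,
    "PF00905": 1.18,
    "PF02492": 0.39,
}

counts = {pfam: scaffolds_from_percent(pct) for pfam, pct in pfam_frequencies.items()}

for pfam, count in counts.items():
    print(f"{pfam}: about {count} of 254 scaffolds")
```

Under this assumption, a frequency of 0.39 corresponds to a single scaffold, so the rarest neighbors in the table are each supported by one observation.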

Neighboring Clusters of Orthologous Genes (COGs)

| COG ID | Name | Functional Category | % Frequency in 254 Family Scaffolds |
|---|---|---|---|
| COG0275 | 16S rRNA C1402 N4-methylase RsmH | Translation, ribosomal structure and biogenesis [J] | 57.48 |
| COG0768 | Cell division protein FtsI, peptidoglycan transpeptidase (Penicillin-binding protein 2) | Cell cycle control, cell division, chromosome partitioning [D] | 21.26 |
| COG1871 | Chemotaxis receptor (MCP) glutamine deamidase CheD | Signal transduction mechanisms [T] | 0.79 |
| COG0769 | UDP-N-acetylmuramyl tripeptide synthase | Cell wall/membrane/envelope biogenesis [M] | 0.39 |
| COG0770 | UDP-N-acetylmuramyl pentapeptide synthase | Cell wall/membrane/envelope biogenesis [M] | 0.39 |
| COG0773 | UDP-N-acetylmuramate-alanine ligase MurC and related ligases, MurC/Mpl family | Cell wall/membrane/envelope biogenesis [M] | 0.39 |



Phylogeny

NCBI Taxonomy

| Name | Rank | Taxonomy | Distribution |
|---|---|---|---|
| Unclassified | root | N/A | 98.82 % |
| All Organisms | root | All Organisms | 1.18 % |


Associated Scaffolds


| Scaffold | Taxonomy | Length |
|---|---|---|
| 3300026029\|Ga0208002_1002889 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1620 |
| 3300027787\|Ga0209074_10012049 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium | 2171 |
| 3300031965\|Ga0326597_10042733 | All Organisms → cellular organisms → Bacteria | 5713 |
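
Each scaffold identifier above appears to combine a numeric prefix and a scaffold name, separated by `|`. The prefix has the same form as the Taxon OID values listed under Associated Samples (for example, 3300026029 appears in both places). The following is a minimal sketch of splitting an identifier under that assumption. The names `ScaffoldId` and `parse_scaffold_id` are hypothetical helpers, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldId(NamedTuple):
    taxon_oid: str  # numeric prefix, assumed to be the sample's Taxon OID
    scaffold: str   # scaffold name within that sample


def parse_scaffold_id(identifier: str) -> ScaffoldId:
    """Split '<taxon_oid>|<scaffold>' into its two parts.

    Raises ValueError if the separator is missing, the prefix is not
    numeric, or the scaffold name is empty.
    """
    taxon_oid, sep, scaffold = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldId(taxon_oid, scaffold)


parsed = parse_scaffold_id("3300026029|Ga0208002_1002889")
print(parsed.taxon_oid, parsed.scaffold)
```

Keeping the prefix as a string avoids any loss of leading characters and lets it be compared directly with the Taxon OID column.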




Environmental Properties

Associated Habitat Types

| Habitat | Taxonomy | Distribution |
|---|---|---|
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 20.08% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 6.69% |
| Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 6.30% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 4.72% |
| Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 3.54% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 3.15% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.15% |
| Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 3.15% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 3.94% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.94% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.94% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.76% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 2.36% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.97% |
| Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 1.97% |
| Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 1.97% |
| Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.97% |
| Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 1.57% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.57% |
| Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 1.57% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.57% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.57% |
| Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.18% |
| Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.18% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.18% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.18% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.18% |
| Polar Desert Sand | Environmental → Aquatic → Freshwater → Ice → Unclassified → Polar Desert Sand | 0.79% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.79% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.79% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.79% |
| Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.79% |
| Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.79% |
| Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.79% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.79% |
| Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.39% |
| Wetland Sediment | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment | 0.39% |
| Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.39% |
| Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 0.39% |
| Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.39% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.39% |
| Soil | Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil | 0.39% |
| Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.39% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere | 0.39% |
| Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 0.39% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.39% |




Associated Samples

| Taxon OID | Sample Name | Habitat Type |
|---|---|---|
| 3300000033 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental |
| 3300000363 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental |
| 3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental |
| 3300001686 | Grasslands soil microbial communities from Hopland, California, USA | Environmental |
| 3300003267 | Sugarcane bulk soil Sample L1 | Environmental |
| 3300003321 | Sugarcane bulk soil Sample H1 | Environmental |
| 3300003324 | Sugarcane bulk soil Sample H2 | Environmental |
| 3300003987 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleB_D2 | Environmental |
| 3300003990 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Goodyear_PhragC_D2 | Environmental |
| 3300003992 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleB_D1 | Environmental |
| 3300003997 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailC_D1 | Environmental |
| 3300004013 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D2 | Environmental |
| 3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental |
| 3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated |
| 3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental |
| 3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental |
| 3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental |
| 3300004782 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare2Fresh | Environmental |
| 3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental |
| 3300005205 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D2 | Environmental |
| 3300005329 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG | Environmental |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental |
| 3300005340 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG | Environmental |
| 3300005341 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaG | Environmental |
| 3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental |
| 3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental |
| 3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental |
| 3300005530 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG | Environmental |
| 3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental |
| 3300005539 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 | Host-Associated |
| 3300005548 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG | Host-Associated |
| 3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental |
| 3300005563 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 | Host-Associated |
| 3300005564 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG | Host-Associated |
| 3300005577 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 | Host-Associated |
| 3300005616 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 | Host-Associated |
| 3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated |
| 3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental |
| 3300005841 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 | Host-Associated |
| 3300005873 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_301 | Environmental |
| 3300005884 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_80N_302 | Environmental |
| 3300005886 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_205 | Environmental |
| 3300006573 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLAC (Metagenome Metatranscriptome) | Environmental |
| 3300006576 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLPA (Metagenome Metatranscriptome) | Environmental |
| 3300006581 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLPB (Metagenome Metatranscriptome) | Environmental |
| 3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental |
| 3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental |
| 3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated |
| 3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated |
| 3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated |
| 3300006918 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS100 | Environmental |
| 3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental |
| 3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental |
| 3300009078 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 | Environmental |
| 3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated |
| 3300009168 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015 | Environmental |
| 3300009789 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot28 | Environmental |
| 3300010039 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot56 | Environmental |
| 3300010040 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55 | Environmental |
| 3300010042 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105B | Environmental |
| 3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental |
| 3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental |
| 3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental |
| 3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental |
| 3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental |
| 3300011414 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT266_2 | Environmental |
| 3300011431 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT157_2 | Environmental |
| 3300012045 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ449 (21.06) | Environmental |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental |
| 3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental |
| 3300012212 | Combined assembly of Hopland grassland soil | Host-Associated |
| 3300012226 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT400_2 | Environmental |
| 3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental |
| 3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental |
| 3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental |
| 3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental |
| 3300012668 | Arctic soils microbial communities. Combined Assembly of 23 SPs | Environmental |
| 3300012898 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S194-509B-1 | Environmental |
| 3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental |
| 3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental |
| 3300012941 | Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t4i015 | Environmental |
| 3300012955 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MG | Environmental |
| 3300012957 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG | Environmental |
| 3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental |
| 3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental |
| 3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental |
| 3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental |
| 3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental |
| 3300012988 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MG | Environmental |
| 3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental |
| 3300013770 | Permafrost microbial communities from Nunavut, Canada - A15_5cm_18M | Environmental |
| 3300014268 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailA_D1 | Environmental |
| 3300014325 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaG | Host-Associated |
| 3300015079 | Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-6b, vegetation/snow interface) | Environmental |
| 3300015086 | Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-5c, rocky medial moraine) | Environmental |
| 3300015209 | Arctic soil microbial communities from a glacier forefield, Russell Glacier, Kangerlussuaq, Greenland (Sample G3B, Proglacial river margin, by glacier terminus) | Environmental |
| 3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental |
| 3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated |
| 3300015374 | Col-0 rhizosphere combined assembly | Host-Associated |
| 3300017695 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ540 (21.06) (version 2) | Environmental |
| 3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental |
| 3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental |
| 3300018028 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex | Environmental |
| 3300018053 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1 | Environmental |
| 3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental |
| 3300018059 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex | Environmental |
| 3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental |
| 3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental |
| 3300018075 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1 | Environmental |
| 3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental |
| 3300018077 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1 | Environmental |
| 3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental |
| 3300018082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2 | Environmental |
| 3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental |
| 3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental |
| 3300018466 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 T | Environmental |
| 3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental |
| 3300018920 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 IS | Environmental |
| 3300019361 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2) | Environmental |
| 3300019362 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2) | Environmental |
| 3300019877 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1 | Environmental |
| 3300020060 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2 | Environmental |
| 3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental |
| 3300021090 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redo | Environmental |
| 3300021445 | Bulk soil microbial communities from the field in Mead, Nebraska, USA - 072115-187_1 MetaG | Environmental |
| 3300022534 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1 | Environmental |
| 3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental |
| 3300025313 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes) | Environmental |
| 3300025327 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1 (SPAdes) | Environmental |
| 3300025537 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailC_D1 (SPAdes) | Environmental |
| 3300025556 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Goodyear_PhragA_D2 (SPAdes) | Environmental |
| 3300025936 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes) | Environmental |
| 3300025945 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG (SPAdes) | Host-Associated |
| 3300025988 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_0N_101 (SPAdes) | Environmental |
| 3300026000 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_404 (SPAdes) | Environmental |
| 3300026003 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_80N_201 (SPAdes) | Environmental |
| 3300026014 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_205 (SPAdes) | Environmental |
| 3300026029 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailA_D2 (SPAdes) | Environmental |
| 3300026041 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes) | Host-Associated |
| 3300026088 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes) | Host-Associated |
| 3300026095 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes) | Host-Associated |
| 3300026905 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01A4-10 (SPAdes) | Environmental |
| 3300027682 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S AM (SPAdes) | Host-Associated |
| 3300027713 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 10-12cm March2015 (SPAdes) | Environmental |
| 3300027743 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015 (SPAdes) | Environmental |
| 3300027775 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes) | Environmental |
| 3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental |
| 3300027876 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S PM (SPAdes) | Host-Associated |
| 3300028592 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30 | Environmental |
| 3300028799 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123 | Environmental |
| 3300028803 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120 | Environmental |
| 3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental |
| 3300028809 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48 | Environmental |
| 3300028812 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48 | Environmental |
| 3300028824 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197 | Environmental |
| 3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental |
| 3300028875 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_143 | Environmental |
| 3300028884 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195 | Environmental |
| 3300031226 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_S | Environmental |
| 3300031229 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38 | Environmental |
| 3300031547 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4 | Environmental |
| 3300031548 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-3 | Host-Associated |
| 3300031716 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3 | Environmental |
3300031731Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-1Host-AssociatedOpen in IMG/M
3300031824Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-2Host-AssociatedOpen in IMG/M
3300031852Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-3Host-AssociatedOpen in IMG/M
3300031938Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031965Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185EnvironmentalOpen in IMG/M
3300031995Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-2Host-AssociatedOpen in IMG/M
3300032002Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3Host-AssociatedOpen in IMG/M
3300032004Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-3Host-AssociatedOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032126Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-2Host-AssociatedOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032516Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_0EnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300034817Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_1Host-AssociatedOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
ICChiseqgaiiDRAFT_242692123300000033SoilMRLKGRHWLMVWLLIFVGVLVAITARQSAGFRTARRLRDLREQRTTLEARRADLERQIRVASSRQVLIPIAERELGLHLPSDSEFTLLVLPTAPAPGR*
ICChiseqgaiiFebDRAFT_1434189023300000363SoilMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAIPPAGPEGR*
JGI10216J12902_10877762923300000956SoilMVWLLIFLGVLLAIAARQTAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPTPPVSGH*
C688J18823_1004105033300001686SoilMRLKGRHWLMVWLLVXLGVLIAIATRQTAGFRTARRLHDLREQRTTLEARRADLERQIRIASSRQVLVPIAERDLGLHLPSDSEFTLLVLPVPAAPGR*
soilL1_1009683543300003267Sugarcane Root And Bulk SoilMRLKGRHWLMVWLLIFVGVLVAITARQSAGFRTARRLHDLREQRTTLEARRADLERQIRVAGSRQVLIPIAERELGLHLPSDSEFTLLVLPTAPAPGR*
soilH1_1002760723300003321Sugarcane Root And Bulk SoilMVWLLIFVGVLVAITARQSAGFRTARRLHDLREQRTTLEARRADLERRIRVAGSRQVLIPIAERELGLHLPSDSEFTLLVLPTAPAPGR*
soilH2_1001381633300003324Sugarcane Root And Bulk SoilMVWLLIFVGVLVAITARQSAGFRTARRLHDLREQRTTLEARRADLERQIRVAGSRQVLIPIAERELGLHLPSDSEFTLLVLPTAPAPGR*
soilH2_1001381743300003324Sugarcane Root And Bulk SoilMVWLLIFVGVLVAITARQSAGFRTARRLHDLREERTTLEARRADLERQIRVAGSRQVLIPIAERELGLHLPSDSEFTLLVLPTAPVSGR*
soilH2_1011341923300003324Sugarcane Root And Bulk SoilMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFQTARRVRELREQRMALEAQRGDLERRIRAGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPGGR*
Ga0055471_1013199523300003987Natural And Restored WetlandsMRLKGRHWLMLWLVMFVVVLLAVATRQSAGFRTARRLGELREERTTLEARRAELERQIRVASSRQVLVPLAKRRLSLHEPSDSEFTLLPLPPLPESEP*
Ga0055471_1018590823300003987Natural And Restored WetlandsGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER*
Ga0055455_1001533823300003990Natural And Restored WetlandsMHLKGRHWLMLWLLVSLGVLVAITTRQTAGFRTARRLRDLREERLTLEARRGDLERRIRFASSRQVLVPIVERALGLHEPVDSEFVLFAVPAGPEER*
Ga0055470_1006996723300003992Natural And Restored WetlandsMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER*
Ga0055466_1005024623300003997Natural And Restored WetlandsMRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER*
Ga0055465_1029956923300004013Natural And Restored WetlandsMRLKGRHWLMLWLVMFVVVLLAVATRQSAGFRTARRLGELREERTTLEARRAELERQIRVASSRQVLVPLAKRRLSLHEPSDSE
Ga0062589_10229183113300004156SoilFVGVLLAIAARQTAGFRTARRLHDLREQRTNLEAQRADLERQIRVASSRQVLVPIAERELGLHLPSDSEFTLLVLPAPPAAGR*
Ga0063356_10070990613300004463Arabidopsis Thaliana RhizosphereMVWLLIFVGVLVAITARQSAGFRTARRLRDLREQRTTLEARRADLERQIRVASSRQVLIPIAERELGLHLPSDSEFTLLVLPTAPAPGR*
Ga0063356_10118039823300004463Arabidopsis Thaliana RhizosphereMQLKGRHWMLLWLLIFLGAAVAVVTRQTAALQTARRLEDLREERRSLEARRAEFERRIRLASSRQVLVPMAKRTFGLHEPADSEFVLFPEPPVPGEIEP*
Ga0063356_10297146113300004463Arabidopsis Thaliana RhizosphereMRLKGRHWLMLWLFIFLVVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIRVASSRQVLVPIAARVLGLHEPTDSEFVLFAVPASGPGER*
Ga0062595_10007414113300004479SoilGSSRRAMRLKGRHWLMLWLIVFLGVLVAITTRQTAGFRTARRVRELREQRLALEAQRGDLERRIRAGSSRQVLVPIVQRALGLHEPADSEFVLFAVPPAGPGNR*
Ga0062595_10016417623300004479SoilMVWLLIFVGVLLAIAARQTAGFRTARRLHDLREQRTNLEAQRADLERQIRVASSRQVLVPIAERELGLHLPSDSEFTLLVLPAPPAAGR*
Ga0062595_10017496423300004479SoilMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPEAR*
Ga0062595_10105104723300004479SoilMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFQTARRVRELREQRLALEAQRGDLERRIRAGSSRQVLVPIVQRALGLHEPADSEFVLFAVPPAGPGGR*
Ga0062592_10051425723300004480SoilMQPLKGRHWVLLWLLIFLGGAVAVVTRQTAALQTARRLHDLREERGSLEARRAELERRIRIASSRQVLVPMAKRTFGLHEPADSEFVLFPVPAAESVR*
Ga0062591_10032386623300004643SoilVLLWLLIFLGGAVAVVTRQTAALQTARRLHDLREERGSLEARRAELERRIRIASSRQVLVPMAKRTFGLHEPADSEFVLFPVPAAESVR*
Ga0062591_10140186023300004643SoilMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPEAR*
Ga0062591_10142029113300004643SoilMRLKGRHWLMLWLVIFLGVLVAITTRQAAGFRTARRLRELREERLALEARRGDLERRIRAGSSRQVLVPIVERALGLHEPADSEFVLFALPPAGPGGR*
Ga0062382_1057490813300004782Wetland SedimentKGRHWLMLWLLLFLVVLVAITTRQTEGFRTARRLRDLREERMALEARRGDLERRIRLASSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPAER*
Ga0066674_1029641223300005166SoilMRLKGRHWLMLWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLHEPGDSEFVLFAVPRSESGKR*
Ga0068999_1010359323300005205Natural And Restored WetlandsRHWLMLWLLIFLGVLVAITTRQTAGFRTARRLRELREERMALEARRGDLERRIRVGSSRQVLVPIVQQALGLHEPADSEFVLFVVPPAGSGSR*
Ga0070683_10099177923300005329Corn RhizosphereMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERLALEARRGDLERRIREGSSRQILVPIVERALGLHEPADSEFVLFTVPPAGP
Ga0066388_10166561213300005332Tropical Forest SoilMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFRTARRVRDLREQRMALEAQRGDLERRIRAGSSRQVLVPIVQRALDLHEPADSEFVLFAVPPAGPGNR*
Ga0066388_10333827223300005332Tropical Forest SoilMRLKGRHWLMLWLIIFLGVLVAITARQTAGFRTARRVRELREERMALEAQRGDLERRIRVGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGRANR*
Ga0066388_10721598713300005332Tropical Forest SoilMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFQTARRLRELREQRLALEAQRGDLERRIRAGSSRQVLVPIVQRALGLHEPADSEFVLFALPPAGPGGR*
Ga0070689_10138108723300005340Switchgrass RhizosphereLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER*
Ga0070689_10209167313300005340Switchgrass RhizosphereVLLWLLIFLGCAVAIVSRQTTALRTARRLHDLREQRGILEARRADLERRIRVASSREVLVPIARRNLGLHEPADSEFVLFAIPSETGHR*
Ga0070691_1080988113300005341Corn, Switchgrass And Miscanthus RhizosphereMRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARPADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER*
Ga0070703_1042708623300005406Corn, Switchgrass And Miscanthus RhizosphereMRLKGRHWLMLWLFIFLVVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREASSRQVLVPIAARVLGLHEPADSEFVLFAVPGSGPGER*
Ga0070708_10056675923300005445Corn, Switchgrass And Miscanthus RhizosphereMKVKGRHWLLLWLLLFLLVTAVVVGRQTAAFKVARRVGELREQRTALEARRADLERRIREASGRQVLVPKAVHDLGLHLPADNEFILFAVPSGPADRSKR*
Ga0070706_10032604113300005467Corn, Switchgrass And Miscanthus RhizosphereRAMRLKGRHWLMLWLLIFLCVLVAITTRQTAGFRTARRLHDLREERLALEARRADLERRIRVASSRQVLVPIVERALGLHEPSDSEFVLFALPPAGPVER*
Ga0070706_10167275923300005467Corn, Switchgrass And Miscanthus RhizosphereMLWLFIFLVVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREASSRQVLVPIAARVLGLHEPADSEFV
Ga0070707_10178648223300005468Corn, Switchgrass And Miscanthus RhizosphereMRLKGRHWLMLWLFIFLVVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREASSRQVLVPIAARVLGLHEPADSEFVLFA
Ga0070679_10162840723300005530Corn RhizosphereMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFRTAGHVRELREQRMALEAQRGELERRIRAGSSRQVLVPIVQRALGLHEPADSEFVLFTVPPAGPG
Ga0070697_10028459933300005536Corn, Switchgrass And Miscanthus RhizosphereMKVKGRHWLVLWLLLFLLVAAVVIARQTAAFKVARRVGELREQRTALEARRADLERRIREASGRQVLVPKAERDLGLHFPADNEFILFAVPAGPARRPKP*
Ga0070697_10035256913300005536Corn, Switchgrass And Miscanthus RhizosphereMLWLFIFLVVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIRVASSRQVLVPIAARVLGLHEPADSEFVLFAVPGSGPGER*
Ga0070697_10071666123300005536Corn, Switchgrass And Miscanthus RhizosphereMLWLLIFLCVLVAITTRQTAGFRTARRLHDLREERLALEARRADLERRIRVASSRQVLVPIVERALGLHEPSDSEFVLFALPPAGPVER*
Ga0068853_10058042523300005539Corn RhizosphereMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRATLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER*
Ga0070665_10127461423300005548Switchgrass RhizosphereMVWLLIFVGVLLAIAARQTAGFRTARRLHDLREQRTNLEAQRADLERQIRVASSRQVLVPIAERELGLHLPSDSEFTLLVL
Ga0070704_10081269013300005549Corn, Switchgrass And Miscanthus RhizosphereMRLKGRHWLMLWLFIFLVVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREASSRQVLVPIAARVLGLHEPADSEFVLFAVPASGPGER*
Ga0068855_10124169723300005563Corn RhizosphereMRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTA
Ga0070664_10139029023300005564Corn RhizosphereMRLKGRHWLMVWLLIFLGVLLAIAARQTAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPSAPGR*
Ga0068857_10062541013300005577Corn RhizosphereGSSRRAMRLKGRHWLMLWLVVFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFILFAVPPAGPEVR*
Ga0068857_10127591013300005577Corn RhizosphereLKGRHWLMVWLLIFLGVLLAIAARQTAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPSAPGR*
Ga0068852_10225018023300005616Corn RhizosphereMRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERELGLHLPSDSEFTLLVLPAPTAPER*
Ga0068864_10001334513300005618Switchgrass RhizosphereMRLKGRHWLMLWLVVFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFILFAVPPAGPEVR*
Ga0068864_10088563513300005618Switchgrass RhizosphereSSRRAMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERLALEARRGDLERRIREGSSRQILVPIVERALGLHEPADSEFVLFTVPPAGPEAR*
Ga0066903_10311040223300005764Tropical Forest SoilMRLKGRHWLMLWLLIFLCVLVAITTRQTAGFRTARSLRELREERMALEARRGDLERRIRVSSSRQVLVPIVERALGLHEPADSEFVLFTIPPAGGERR*
Ga0068863_10040669923300005841Switchgrass RhizosphereMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQILVPIVERALGLHEPADSEFVLFTVPPAGPEAR*
Ga0068863_10200264423300005841Switchgrass RhizosphereMVWLLIFLGVLLAIAARQTAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPSAPGR*
Ga0075287_100238943300005873Rice Paddy SoilMRLKGRHWLTFWLAIFLLVLVAITTRQTAGFQTAQHVRELREERLALEAQRAELERRIRMASSRQVLVPVAERLLGMHEPSDSEFVLFAVPSGPVKP*
Ga0075291_104434713300005884Rice Paddy SoilMHLKGRHWLMLWLVLLLCVLGAITARQTAGFRTARRLRELREQRMALEARRAELERRIRVASSRQVLVPIVERVLGLHEPSDSEFVLFALPPAGP
Ga0075286_101513733300005886Rice Paddy SoilMHLKGRHWLMLWLVLLLCVLGAITARQTAGFRTARRLRELREQRMALEARRAELERRIRVASSRQVLVPIVERALGLHEPSDSEFVLFALPAAGPVGR*
Ga0075286_103779623300005886Rice Paddy SoilMRLKGRHWLMVWLLIFLGVLVAITARQSAGFRTARHLRDLREQRTTLEARRADLERQIRVGSSRQVLIPIAERELGLHLPSDSEFTLLVLPTAPPPGR*
Ga0074055_1034827423300006573SoilMRLKGRHWLMLWLFFFLCVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIRLASSRQVLVPIVERALGLHEPADSEFVLFAVPAAGPPER*
Ga0074047_1187006213300006576SoilMRLKGRHWLMLWLFFFLCVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIRLASSRQVLVPIVERALGLHEPADSEFVLFAVPSAGPPAR*
Ga0074048_1319159623300006581SoilMLWLFFFLCVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIRIASSRQVLVPIVERALGLHEPADSEFVLFAVPSAGPPAR*
Ga0079222_1000403623300006755Agricultural SoilMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFQTARRVRELREQRLALEAQRGDLERRIRAGSSRQVLVPIVQRALGLHEPADSEFVLFAMPPAGPGGR*
Ga0079222_1001277223300006755Agricultural SoilVFLWLLIFLGCAVAIVSRQTTALRTARRLHDLREQRGILEARRADLERRIRVASSREVLVPIARRNLGLHEPADSEFVLFAIPSESGHR*
Ga0079221_1065376923300006804Agricultural SoilMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFQTARRVRELREQRMALEAQRGDLERRIRAGSSRQVLVPIVQRALGLHEPAD
Ga0079220_1021795723300006806Agricultural SoilMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFRTAGHVRELREQRMALEAQRGDLERRIRAGSSRQVLAPIVQRALGLHEPADSEFVLFAVPPAGPDHR*
Ga0075433_1005642343300006852Populus RhizosphereMRLKGRHWLMLWLFIFLVVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREASSRQVLVPIAARLLGLHEPADSEFVLFAVPASGPGER*
Ga0075425_10010977423300006854Populus RhizosphereMRLKGRHWMVIWLVVMLCVLVAITTRQTAGFRTARHVRELREERLALEARRADLERRIRIASSRQVLVPQAQRALGLHEPSDSEFVLFALPAISPDKP*
Ga0075425_10023688223300006854Populus RhizosphereMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERLALEARRGDLERRIREGSSRQILVPIVERALGLHEPADSEFVLFTVPPAGPEAR*
Ga0075434_10054481613300006871Populus RhizosphereMRLKGRHWLMLWLVVFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFILFAVPPAGP
Ga0075434_10102950813300006871Populus RhizosphereVLLAIAARQTAGFRTARRLHDLREQRTNLEAQRADLERQIRVASSRQVLVPIAERELGLHLPSDSEFTLLVLPAPPAAGR*
Ga0075426_1064431023300006903Populus RhizosphereVLLWLLIFLGCAVAIVSRQTTALRTARRLHDLREQRGILEARRADLERRIRVASSREVLVPIARRNLGLHEPADSEFVLFAIPSESGHR*
Ga0079216_1003196923300006918Agricultural SoilMQLKGRHWMLLWLLIFLGGAVAVVTRQTAALQTARRLEDLREERRSLEARRAELERRIRLASSRQVLVPMAKRTFGLHEPADSEFVLFPEPPVPGETEP*
Ga0079219_1050639123300006954Agricultural SoilVFLWLLIFLGGALAIVSRQTSALRTARRLHDLREQRSVLEARRADLERRIRIASSREVLVPLAKRTLGLHEPADSEFVLFAVPSRAGQP*
Ga0079219_1051567623300006954Agricultural SoilMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFQTARRVRELREQRMALEAQRGDLERRIRAGSSRQVLVPIVQRALGLHEPADSEFVLFAVPRAG
Ga0079218_1108151523300007004Agricultural SoilAVAVVTRQTAALQTARRLEDLREERRSLEARRAELERRIRLASSRQVLVPMAKRTFGLHEPADSEFVLFPEPPVPGESEP*
Ga0079218_1212774833300007004Agricultural SoilIVFVCVLLAIATRQSSGFRTARRLGELREERAALEARRGELERQIRVASSRQVLVPLAKRRLGLHEPSDSEFTLLPLPAAREERP*
Ga0066710_10364642123300009012Grasslands SoilMRLKGRHWLMLWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLHEPGDSEFVLFAVPRSESGKR
Ga0105106_1002901923300009078Freshwater SedimentMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPAAPGVSVP*
Ga0114129_1328406013300009147Populus RhizosphereMKRLKGRHWVFLWLLIFLGGALAVVSRQTTALRTARRLHDLREQRGILEARRADLERRIRVASSREVLVPIAKRSLGLHEPADSEFVLFAVPSAPDRP*
Ga0075423_1009448223300009162Populus RhizosphereMLWLFIFLVVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREASSRQVLVPIAARLLGLHEPADSEFVLFAVPASGPGER*
Ga0075423_1062649123300009162Populus RhizosphereMLWLIIFLGVLVAITTRQTEGFRTATRVRELREERMALEARRGDLERRIRVGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGAGSR*
Ga0075423_1089117423300009162Populus RhizosphereSPMRLKGRHWMVIWLVVMLCVLVAITTRQTAGFRTARHVRELREERLALEARRADLERRIRIASSRQVLVPQAQRALGLHEPSDSEFVLFALPAISPDKP*
Ga0105104_1025741013300009168Freshwater SedimentMRLKGRHWLMLWLVVFVVVLLAITTRQSAGFRTARRLGELREERTTLEARRAELERQIRVASSRQVLVPLVERRLSLHEPSDSEFTLLPLPPLPQGEP*
Ga0126307_1016677123300009789Serpentine SoilMQLKGRHWMLLWLLIFLGGAVAVVTRQTAALQTARRLDDLRQERRSLEARRAELERRIRLGSSRQVLVPMAKRTFGLHEPADSEFVLFPEPAAPGQGEP*
Ga0126309_1044836223300010039Serpentine SoilMQLKGRHWVLLWLFIFLGGAVAVVARQPAALQTAPRLHDLREERRSLEAQRADLERRIRLASSREVLVPMARRTFGLHEPADSEFVLFPAPLAPGEIQP*
Ga0126309_1061527613300010039Serpentine SoilMRLKGRHWMILWLLIFLGGALAVVSRQTAAFRTARRLHDLRDERSSLESRRAELERRIRLASSRQVLVPVAERSLGLHEPADSEFVLFVVPA
Ga0126308_1088551723300010040Serpentine SoilMLWLLLFLVVLVAITARQTEGFRTARRLRDLREERLALEARRGDLERRIRLASSRQVLVPIVARALGLHEPADSEFVLFAVPPAGPVEER*
Ga0126314_1069825613300010042Serpentine SoilTRARGARTYGSSRRAMHLKGRHWLMLWLLLSLVVLVAITTRQTEGFRTARRLRDLREDRMSLEARRGDLERRIRLASSRQVLVPIVERALGLHEPVDSEFVLFALPDGPPEAG*
Ga0134125_1219647923300010371Terrestrial SoilMLWLLIFLIVLFAITTRQTAGFRTARRLRELREERLALEARRADLERRIRVASSRQVLVPIVERALGLHEPADSEFVLFVLPSQQRGRR*
Ga0134126_1234994213300010396Terrestrial SoilMRLKGRHWLMLWLIIFLGVLVAITTRQTAGFQTARRVRELREERMALEAQRGDLERRIRAGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPGSR*
Ga0134127_1037443723300010399Terrestrial SoilMQPLKGRHWVLLWLLIFLGGAVAVVTRQTAALQTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPAAESVR*
Ga0134121_1227046813300010401Terrestrial SoilLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPEAR*
Ga0134123_1170167423300010403Terrestrial SoilMQPLKGRHWVLLWLLIFLGGAVAVVTRQTAALQTARRLHDLREERGGLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPAAESVR*
Ga0137442_102314123300011414SoilMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALGTARRLHDLREERASLEARRADLERRIRVASSRQVLVPMAERTFGLHEPADSEFVLFPIPTAQDESVP*
Ga0137438_111600913300011431SoilMKHLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREEGGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFV
Ga0136623_1028555013300012045Polar Desert SandMKSLKGRHWVLLWLVIFLGGAVAVVTRQTAGLRTARRIHDLREERGSLEARRAELERRIRVASSRQVLVPIAKRNFGLHEPADSEFVLFPVPAAPSLSVP*
Ga0137380_1129171923300012206Vadose Zone SoilMRLKGRHWLMLWLLIFLSALLAITTRQAAGFRTARRLRDLRDQRLALEAQQGDLERRIRMGSSRQVLVPTARALGLHEPGDSEFVLFAVPPTEPGKR*
Ga0137377_1085726513300012211Vadose Zone SoilLLVFLSVLVAITARQAAGFRTARRLHDLREERLTLEARRADLERRIRAGSSRQVLVPIVERALGLHEPADSEFVLFTLPPVGPAER*
Ga0150985_10264920123300012212Avena Fatua RhizosphereVLLWLLIFLGGALIVVGRQTTAFKTARQLRDLREIRGSLEARRAELERRIRVASSREVLVPLARRALGLHEPADSEFVLFALPSPAAGVSR*
Ga0137447_111778113300012226SoilMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREEGGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGESVP*
Ga0137372_1095925123300012350Vadose Zone SoilMRLKGRHWLMLWLLIFLSALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLQEPADSEFILFAVPRSESGKR*
Ga0137371_1117199823300012356Vadose Zone SoilMLWLLIFLSALLAITTRQTAGFRTARRLRDLRDQRLALEAQQGDLERRIRMGSSRQVLVPMARGLGLHEPADSEFVLFAVPPTEPGKR*
Ga0137368_1029392623300012358Vadose Zone SoilMRLKGRHWLMLWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLHEPADSEFVLFAIPLSESGKR*
Ga0137375_1029162723300012360Vadose Zone SoilMRLKGRHWLMLWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLHEPADSEFVLFAVPRSESGKR*
Ga0157216_1003186133300012668Glacier Forefield SoilMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPKAKRTFGLHEPADSEFVLFPVPTAPSVSVP*
Ga0157293_1018854923300012898SoilMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHPPSDSEFTLLVLPAPTAPER*
Ga0137407_1085984723300012930Vadose Zone SoilMLWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLHEPGDSEFVLFAVPRSESGKR*
Ga0153915_1058749913300012931Freshwater WetlandsMLWLLIFLGVLVAITTRQTAGFRTARRLHDLREERLALEARRADLERRIRVGSSRQVLVPIVERALGLHEPADSEFVLFALPSAASGMP*
Ga0162652_10002418623300012941SoilMKRLKGRHWVFLWLLIFLGGALAVVSRQTTALRTARRLHDLREQRGILEARRADLERRIRVASSREVLVPIAKRSLGLHEPADSEFVLFAVPSAPERP*
Ga0164298_1031428423300012955SoilMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFILLAVPPAGPEVR*
Ga0164298_1036165923300012955SoilVFLWLLIFLGGALAIVSRQTSALRTARRLHDLREQRSVLEARRADLERRIRIASSREVLVPLAKRTLGLHEPADSEFVLFAVPSPAGQP*
Ga0164298_1097135223300012955SoilLKGRHWLMLWLIIFLGVLVAITTRQTEGFRTATRVRELREERMALEARRGDLERRIRVGSSRQVLVPIVERTLGLHEPADSEFVLFALPPAGPGSR*
Ga0164298_1100333323300012955SoilMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPEVR*
Ga0164303_1022121413300012957SoilMRLKGRHWLMVWLIIFLCVLVAITTRQTEGFRTARSLRELREARLALEARRGDLERRIRAGSSRQVLVPIVERSLGLHEPADSEFVLFAVPPAGGERR*
Ga0164299_1025376123300012958SoilMRLKGRHWLMVWLIIFLCVLVAITTRQTEGFRTARSLRELREARLALEARRGDLERRIRAGSSRQVLVPIVERALGLHEPVDSEFVLFAVPAGPEPR*
Ga0164299_1171080723300012958SoilVFLWLLIFLGGALAIVSRQTSALRTARRLHDLREQRSVLEARRADLERRIRLGSSREVLVPLAKRTLGLHEPADSEFVLFAVPSRAGQP*
Ga0164301_1068172723300012960SoilMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERLALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPEAR*
Ga0164302_1063326913300012961SoilEARTSGSSRRAMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFILLAVPPAGPEVR*
Ga0164302_1069996513300012961SoilMRLKGRHWLMVWLLIFLCVLVAITTRQTEGFRTARSLRELREARLALEARRGDLERRIRAGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGGERR*
Ga0164302_1148691723300012961SoilMLWLFFFLCVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIRIASSRQILVPIVERALGLHEPADSEFVLFAVPSAGPPER*
Ga0164309_1034423623300012984SoilMLWLFFFLCVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIRIASSRQVLVPIVERALGLHEPADSEFVLFAVPSAGPPER*
Ga0164309_1054105613300012984SoilHWLMVWLLIFVGVLLAIAARQTAGFRTARRLHDLREQRTNLEAQRADLERQIRVASSRQVLVPIAERELGLHLPSDSEFTLLVLPAPPAAGR*
Ga0164309_1105534223300012984SoilMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPEVR*
Ga0164304_1001579563300012986SoilMRLKGRHWLMVWLIIFLCVLVAITTRQTEGFRTARSLRELREARLALEARRGDLERRIRAGSSRQVLVPIVERSLGLHEPADSEFVLFAV
Ga0164304_1097372013300012986SoilVFLWLLIFLGGALAIVSRQTSALRTARRLHDLREQRSVLEARRADLERRIRLGSSREVLVPLAKRTLGLHEPADSEF
Ga0164306_1103839823300012988SoilWLIIFLCVLVAITTRQTEGFRTARSLRELREARLALEARRGDLERRIRAGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGGERR*
Ga0164305_1017679913300012989SoilARTSGSSRRLMRLKGRHWLMLWLFFFLCVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIRIASSRQILVPIVERALGLHEPADSEFVLFAVPSAGPPER*
Ga0164305_1054946013300012989SoilMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLF
Ga0120123_110745213300013770PermafrostMRLKGRHWLMLWLLIFLCALLAITTRQTAGFRTARRLRDLREQRLSLEAQRGDLEQRIRMGSSRQVLVPAARVLGLHEPADSELVLFAVPPTEPGKR*
Ga0075309_111117323300014268Natural And Restored WetlandsMLLWLLIFLGGAVAVVTRQTAALQTARRLEDLREERRSLEARRAELERRIRLASSRQVLVPMAKRTFGLHEPADSEFVLFPEPPVPGESEP*
Ga0163163_1232114423300014325Switchgrass RhizosphereMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERLALEARRGDLERRIREGSSRQILVPIVERALGLHEPADSEFVLFTVPPAGPEAR*
Ga0167657_100238623300015079Glacier Forefield SoilMKQLKGRHWVLLWLLIFLGGAVAVVTRQTTALHTARRLHDLREERSSLEARRAELERRIRVASSRQVLVPKAERTFGLHEPADSEFVLFLVPPLPGESVP*
Ga0167655_105940423300015086Glacier Forefield SoilMKQLKGRHWVLLWLLIFLGGAVAVVTRQTTALHTARRLHDLREERSSLEARRAELERRIRVASSRQVLVPKAERTFGLHEPADSEFVLFL
Ga0167629_101841623300015209Glacier Forefield SoilMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALRTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPKAKRTFGLHEPADSEFVLFPIPDK*
Ga0137403_1082945723300015264Vadose Zone SoilMRLKGRHWLMLWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLALEAQRGDLERRIRMGSSRQVLVPTARALGLHEPADSEFVLFAVPPTESGKR*
Ga0132258_10008675153300015371Arabidopsis RhizosphereMVWLLISRGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER*
Ga0132258_1017733143300015371Arabidopsis RhizosphereMLWLVIFLGVLVAITPRQTEGFRTARRLRELREERLALEARRGDLERRIREGSSRQILVPIVERALGLHEPADSEFVLFTVPPAGPEAR*
Ga0132255_10115542723300015374Arabidopsis RhizosphereMRLKGRHWLMVWLIIFLCVLVAITTRQTEGFRTARSLRELREARLALEARRGDLERRIRAGSSRQVLVPIVEPSLGLHEPADSEFVLFAVPPAGGERR*
Ga0180121_1040831323300017695Polar Desert SandMLWLLLFLCMLVAITTRQTAGFRTARRLRDLQEERLSMEARRADLERRIRAGSSRQVLVPIVQRSLGLHEPADSEFVLFALPSATPGGR
Ga0184610_103313823300017997Groundwater SedimentIFLGGAVAVVTRQTAALRTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGESVP
Ga0184610_132157413300017997Groundwater SedimentMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERVSLEARRADLERRIRVASSRQVLVPMSERTFGLHEPADSEFVLFPIPAARSESAP
Ga0184604_1001028323300018000Groundwater SedimentMKHLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREERVSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPAESVP
Ga0184608_1001786323300018028Groundwater SedimentMKHLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREERVSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFAVPALPAESVP
Ga0184626_1008182623300018053Groundwater SedimentMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALRTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGETAP
Ga0184623_1004508823300018056Groundwater SedimentMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGETVP
Ga0184615_1018522413300018059Groundwater SedimentMKQLKGRHWVVLWLSIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRIASSRQVLVPMAERTFGLHEPADSEFVLFPVPAAPGVSVP
Ga0184637_1003287223300018063Groundwater SedimentMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALRTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGETVP
Ga0184637_1015054123300018063Groundwater SedimentMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAKRAFGLHEPADSEFVLFPVPAAVGDSIP
Ga0184637_1055428623300018063Groundwater SedimentMKRLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGESAP
Ga0184618_1045364213300018071Groundwater SedimentMRLKGRHWLMLWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLSLEAQLGDLERRIRMGSSRQVLVPTARALGLHEPADSEFVLFAVPRSESGKR
Ga0184632_1016352413300018075Groundwater SedimentGAVAVVTRQTAALRTARRLHDLREERGSLEARRAELERRIRIASSRQVLVPMAKRTFGLHEPADSEFVLFPIPASPGESVP
Ga0184632_1022332323300018075Groundwater SedimentMKHLKGRHWVLLWLLIFLGGAVAVVTRQTAALRTERRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGESVP
Ga0184609_1006521433300018076Groundwater SedimentMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALRTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPASPGESVP
Ga0184633_1025552923300018077Groundwater SedimentMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGESVP
Ga0184627_1014472223300018079Groundwater SedimentMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALRTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGESVP
Ga0184639_1014791523300018082Groundwater SedimentMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPALPGESAP
Ga0190265_1003095733300018422SoilMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPKAKRTFGLHEPADSEFVLFPVPVAPEASVP
Ga0190265_1033292223300018422SoilMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERSSLEARRAELERRIRVASSRQVLVPKARRTFGLHEPADSEFVLFPVPMAPGVSVP
Ga0190265_1122558923300018422SoilMKQLKGRHWVLLWLMIFLGGAVAVVTRQTAALQTARRLHDLREERSSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPIPAARSESAP
Ga0190265_1155360923300018422SoilMKPLKGRHWVLLWLFLFLGGAVAVVTRQTAALHTARRLHYLREERGSLEARRAELERRIRVASSREVLVPMAKRTFGLHEPADSEFVLFPVPAA
Ga0190272_1000205173300018429SoilMKQLKGRHWVLLWLFIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAERTFGLHEPADSEFVLFPVPAAESVP
Ga0190272_1005367023300018429SoilMKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGNLEARRADLERRIGVASSRQVLVPMAERTFGLHEPADSEFVLFPIPAVRGESVP
Ga0190272_1139639423300018429SoilMHFKGRHWLMLWLLIFLCVLVGITTRQTAGFRTARRLRDLREERMTLEARRGDLERRIRFASSRQVLVPIVERALGLHEPVDSEFVLFTVPAGPEKR
Ga0190272_1205107723300018429SoilMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGHLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFSIPAVGRESIP
Ga0190272_1226959423300018429SoilMRVKGRHWLLLWLLLFLLVAVVVIARQTAAFKVARRLRDLRDQRAALEARRADLERRIREASGRQVLVPKAERDLGLHLPADNEFILFAAPTGPVPKR
Ga0190268_1018692223300018466SoilMQLKGRHWMLLWLLIFLGSAVAVVTRQTAALQTARRLEDLREERRSLEARRAELERRIRLASSRQVLVPMAKRTFGLHEPADSEFVLFPEPPVPGEIEP
Ga0190270_1063302923300018469SoilMLLWLLIFLGGAVAVVTRQTAALQTARRLDDLRQERRSLEARRAELERRIRLASSRQVLVPMAKRTFGLHEPADSEFVLFPEPPAPAQGEP
Ga0066669_1160661023300018482Grasslands SoilMRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPEVR
Ga0190273_1226134623300018920SoilLCVLVAITTRQTAGFRTARRLRDLREERMTLEARRGDLERRIRFASSRQVLVPIVERALGLHEPVDSEFVLFAVPAAPEER
Ga0173482_1008878323300019361SoilMRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER
Ga0173479_1050738013300019362SoilGAPNSASSRSRMRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER
Ga0193722_114787523300019877SoilMRLKGRHWLMLWLLIFVCALLAITTRQAAGFRTARRLRDLRDQRLSLEAQRGDLERRIRMGSSRQVLVPMARALGLHEPADSEFILFAVPPSESGKR
Ga0193717_104793923300020060SoilMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPKARRTFGLHEPADSEFVLFPVPVAPEASVP
Ga0210382_1003727723300021080Groundwater SedimentMRLKGRHWLMLWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLHEPADSEFVLFAVPRSESGKR
Ga0210382_103272742	3300021080	Groundwater Sediment	MKQLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPIPASPGESVP
Ga0210377_106468361	3300021090	Groundwater Sediment	MKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLRDERGSLEARRSELERRIRVASSRQVLVPMAERTFGLHEPADSEFVLFPVPAAPVESVP
Ga0182009_100264732	3300021445	Soil	MRLKGRHWLMLWLIVFLGVLVAITTRQTAGFRTARRVRELREQRLALEAQRGDLERRIRAGSSRQVLVPIVQRALGLHEPADSEFVLFAVPPAGPGGR
Ga0224452_11776862	3300022534	Groundwater Sediment	MKHLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPIPTSPGESVP
Ga0222622_105818682	3300022756	Groundwater Sediment	MKHLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPIPASPGESVP
Ga0209431_101601822	3300025313	Soil	MKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGNLEARRADLERRIRVASSRQILVPMAERTFGLHEPADSEFVLFPIPAVRGESVP
Ga0209751_109339472	3300025327	Soil	FRSMKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPTAPGVSVP
Ga0210061_10127062	3300025537	Natural And Restored Wetlands	MRLKGRHWLMLWLVMFVVVLLAVATRQSAGFRTARRLGELREERTTLEARRAELERQIRVASSRQVLVPLAKRRLSLHEPSDSEFTLLPLPPLPESEP
Ga0210120_10020713	3300025556	Natural And Restored Wetlands	MHLKGRHWLMLWLLVSLGVLVAITTRQTAGFRTARRLRDLREERLTLEARRGDLERRIRFASSRQVLVPIVERALGLHEPVDSEFVLFAVPAGPEER
Ga0207670_111649011	3300025936	Switchgrass Rhizosphere	GVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER
Ga0207679_115694331	3300025945	Corn Rhizosphere	MVWLLIFLGVLLAIAARQTAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPSAPGR
Ga0208141_10053232	3300025988	Rice Paddy Soil	MRLKGRHWLTFWLAIFLLVLVAITTRQTAGFQTAQHVRELREERLALEAQRAELERRIRMASSRQVLVPVAERLLGMHEPSDSEFVLFAVPSGPVKP
Ga0208143_1034932	3300026000	Rice Paddy Soil	MRLKGRHWLTFWLAIFLLVLVAITTRQTAGFQTAQHVRELREERLALEAQRAELERRIRMASSRQVLVPVAERLLGMHEPSDSEFVLFAVPSGPV
Ga0208284_10129542	3300026003	Rice Paddy Soil	MHLKGRHWLMLWLVLLLCVLGAITARQTAGFRTARRLRELREQRMALEARRAELERRIRVASSRQVLVPIVERALGLHEPSDSEFVLFALPAAGPVGR
Ga0208776_10250161	3300026014	Rice Paddy Soil	MHLKGRHWLMLWLVLLLCVLGAITARQTAGFRTARRLRELREQRMALEARRAELERRIRVASSRQVLVPIVERALGLHEPSDSEFVLFALPAAG
Ga0208002_10028891	3300026029	Natural And Restored Wetlands	MRLKGRHWLMLWLVMFVVVLLAVATRQSAGFRTARRLGELREERTTLEARRAELERQIRVASSRQVLVPLAKRRLSLHEPSDSEFTLLPLPPLPE
Ga0207639_101535782	3300026041	Corn Rhizosphere	MRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRATLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER
Ga0207641_100753414	3300026088	Switchgrass Rhizosphere	MRLKGRHWLMLWLVVFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFILFAVPPAGPEVR
Ga0207676_101933532	3300026095	Switchgrass Rhizosphere	MRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERLALEARRGDLERRIREGSSRQILVPIVERALGLHEPADSEFVLFTVPPAGPEAR
Ga0207676_120731732	3300026095	Switchgrass Rhizosphere	MRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPAAPDS
Ga0207516_10208081	3300026905	Soil	GRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER
Ga0209971_11195091	3300027682	Arabidopsis Thaliana Rhizosphere	LLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER
Ga0209286_11536861	3300027713	Freshwater Sediment	MKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPVPAAPGVSVP
Ga0209593_102135742	3300027743	Freshwater Sediment	MRLKGRHWLMLWLVVFVVVLLAITTRQSAGFRTARRLGELREERTTLEARRAELERQIRVASSRQVLVPLVERRLSLHEPSDSEFTLLPLPPLPEGEP
Ga0209177_101284981	3300027775	Agricultural Soil	IFLGVLVAITTRQTAGFQTARRVRELREQRMALEAQRGDLERRIRAGSSRQVLVPIVQRALGLHEPADSEFVLFAVPRAGPSGR
Ga0209074_100120492	3300027787	Agricultural Soil	MRLKGRHWLMLWLIIFLGVLVAITTRQTAGFQTARRVRELREQRMALEAQRGDLERRIRAGSSRQVLVPIVQRALGLHEPADSEFVLFAVPRAGPGGR
Ga0209074_102217642	3300027787	Agricultural Soil	VFLWLLIFLGCAVAIVSRQTTALRTARRLHDLREQRGILEARRADLERRIRVASSREVLVPIARRNLGLHEPADSEFVLFAIPSESGHR
Ga0209974_100992482	3300027876	Arabidopsis Thaliana Rhizosphere	MQPLKGRHWVLLWLLIFLGGAVAVVTRQTAALQTARRLHDLREERGSLEARRAELERRIRIASSRQVLVPMAKRTFGLHEPADSEFVLFPVPAAESVR
Ga0247822_111122532	3300028592	Soil	ISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER
Ga0247822_115992342	3300028592	Soil	GARTSGSSRRLMRFKGRHWLMLWLLLFLLVLLAITTRQTEGFRTARRLRDLREERIALEARRGDLERRIRLASSRQVLVPIVQRALGLHEPADSEFVLFAVPPAGPQEER
Ga0307284_102054632	3300028799	Soil	MKRVKGRHWVLLWLLIFLGGALAVVSRQTTALRTARRLHDLREQRGILEARRAELERRIRVASSREVLVPIARRNLGLHEPVDSEFVLFAVPPREGRP
Ga0307284_103042162	3300028799	Soil	MRLKGRHWLMLWLLIFVCALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLHEPTDSEFILFAVPRSESDKR
Ga0307281_100172861	3300028803	Soil	MKQLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLRDERGSLEARRSELERRIRVASSRQVLVPMAERTFGLHEPADSEFVLFPIPAAESGP
Ga0307281_100731322	3300028803	Soil	LIFLGGAVAVVTRQTAGLRTARRLHDLREEGGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFPIPASPGESVP
Ga0307305_101551201	3300028807	Soil	MKHLKGRHWVLLWLLIFLGGAVAVVTRQTAGLRTARRLHDLREERGSLEARRAELERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFAVPALQ
Ga0247824_104290871	3300028809	Soil	MRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFT
Ga0247824_111420902	3300028809	Soil	MHLKGRHWLMLWLLLFLLVLLAITTRQTEGFRTARRLRDLREERMALEARRGDLERRIRLASSRQVLVPIVQRALGLHEPADSEFVLFAVPPAG
Ga0247825_102642182	3300028812	Soil	SLGILIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPAAPDR
Ga0307310_101068902	3300028824	Soil	MRLKGRHWLMLWLLIFVCALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLHEPTDSEFILFAVPRSESGKR
Ga0307310_101635352	3300028824	Soil	ESFRLMKHLKGRHWVLLWLLIFLGGAVAVVTRQTAALRTARRLHDLREERGSLEARRADLERRIRVASSRQVLVPMAKRTFGLHEPADSEFVLFAVPALPAESVP
Ga0307312_109419561	3300028828	Soil	KGRHWLMLWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLALEAQRGDLERRIRMGSSRQVLVPAARVLGLHEPADSELVLFAVPLTDPGKR
Ga0307289_104231791	3300028875	Soil	MRLKGRHWLMLWLLIFVCALLAITTRQAAGFRTARRLRDLREQRLSLEAQRGDLERRIRMGSSRQVLVPTARALGLHEPTDSEFILFAVPRSESGE
Ga0307308_104691161	3300028884	Soil	LWLLIFLCALLAITTRQAAGFRTARRLRDLREQRLALEAQRGDLERRIRMGSSRQVLVPAARVLGLHEPADSELVLFAVPPDDPSKR
Ga0307497_104783842	3300031226	Soil	MVWLLIFLAVLLAIAARQTAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHQPSDSEFTLLVLPTPPAPGH
Ga0299913_115793322	3300031229	Soil	MQLKGRHWVLLWLFIFLGGAVAVVSRQTAALQTARRLHDLREERRSLEARKAELERRIRLASSRQVLVPLAKRSFGLHEPADSEFVLLPMPAAPGEIGP
Ga0310887_102918912	3300031547	Soil	MRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPE
Ga0307408_1001537942	3300031548	Rhizosphere	MHLKGRHWLMLWLLLFLVVLVAITARQTEGFRTARRLRDLREERLALEARRGELERRIRLASSRQVLVPIVARALGLHEPADSEFVLFAVPPAGPVEER
Ga0307408_1002171012	3300031548	Rhizosphere	MRLKGRHWLMVWLLIALGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLDRQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER
Ga0307408_1007792101	3300031548	Rhizosphere	MRLKGRHWLMLWLVLLLCVLGAITARQTAGFRTARRLRELREERMALEARRAELERRIRVGSSRQVLVPIVEQSLGLHEPSDSEFVLFAMPPAGPVGR
Ga0310813_101670132	3300031716	Soil	MVWLLIFVGVLLAIAARQTAGFRTARRLHDLREQRTNLEAQRADLERQIRVASSRQVLVPIAERELGLHLPSDSEFTLLVLPAPPAAGR
Ga0310813_112333741	3300031716	Soil	MRLKGRHWLMLWLVIFLGVLVAITTRQTEGFRTARRLRELREERLALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPE
Ga0307405_101781142	3300031731	Rhizosphere	MHLKGRHWLMLWLLLFLVVLVAITARQTEGFRTARRLRDLREERLALEARRGELERRIRLASSRQVLVPIVARALGLHEPADSEFVLFAVPPAGSVEER
Ga0307413_106172672	3300031824	Rhizosphere	MVRLKGRHWVLLWLLIFLGGALVVVGRQTAAFSTARRLRDLREERSNLEARRADLERRIRVASSRQVLVPIARRSLGLHEPADSEFVLFAVPDPASPGRP
Ga0307413_110397282	3300031824	Rhizosphere	GRHWLMLWLVLLLGVLGGITARQTAGFRTARRLRELREERMALEARRGELERRIRAGSSRQVLVPIVERALGLHEPSDSEFVLFAMPPAGPVGR
Ga0307410_100913532	3300031852	Rhizosphere	MRLKGRHWLMLWLVVFLCALLAVAWRQTSGLRTARRLGELREERMALEARRGELERRIRVASSRQVLVPLAQRTLGLHQPSDSEFVLLVVPAGGDRR
Ga0307410_102884682	3300031852	Rhizosphere	MRLKGRHWLMLWLVLLLGVLGGITARQTEGFRTARRLRELREERMALEARRGELERRIRAGSSRQVLVPIVERALGLHEPSDSEFVLFAMPPAGPVGR
Ga0307410_112608932	3300031852	Rhizosphere	MRLKGRHWLMVWLLIFLGVLLTIAARQTAGFRTARRLHDLREQRTTLEARRADLERQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPTPPAPGH
Ga0308175_10000103314	3300031938	Soil	MRLKGRHWLMLWLIIFLGVLVAITTRQTAGFRTAARVRELREQRLALEAQRGDLERRIRAGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPGHR
Ga0214473_114741391	3300031949	Soil	MKPLKGRHWVLLWLLIFLGGAVAVVTRQTAALHTARRLHDLREERGSLEARRAELERRIRVGSSRQVLVPMAKRTFGLHEPADSEFVLFPVPAAPGVSVP
Ga0326597_100172158	3300031965	Soil	MRLKGRHWMVIWLLIFLCVLVAITTRQTAGFRTARRVRELREERLALEARRADLERRIRVASSRQVLVPVAERLLGLHEPSDSEFVLFVLPASGPGKR
Ga0326597_100427334	3300031965	Soil	MAWLLLFLCVLLAVTARQAEGFRVARRVRDLRDERAALEARRAELERRIRAGASRQVLVPKVERTLGLHEPADTEFVLFTVPHSGQGTP
Ga0307409_1001154502	3300031995	Rhizosphere	MRLKGRHWLMLWLVLLLGVLGGITARQTAGFRTARRLRELREERMALEARRGELERRIRAGSSRQVLVPIVERALGLHEPSDSEFVLFAMPPAGPVGR
Ga0307409_1013050771	3300031995	Rhizosphere	LLLFLVVLVAITTRQTEGFRTARRLRDLREERLALEARRGDLERRIRLASSRQVLVPIVARALGLHEPADSEFVLFAVPPAGPVEER
Ga0307416_1005763422	3300032002	Rhizosphere	MRLKGRHWLMVWLLISLGVLIAIAARQAAGFRTARRLHDLREQRTTLEARRADLDRQIRVASSRQVLVPIAERDLGLHLPSDSEFTLLVLPAPTAPER
Ga0307416_1013662462	3300032002	Rhizosphere	MKPLKGRHWVFLWLLIFLGGALAVVSRQTTALRTARRLHDLREQHGILEARRADLERRIRVASSREVLVPIAKRSLGLHEPADSEFVLFAVPSADGRP
Ga0307416_1017754762	3300032002	Rhizosphere	MILWLLIFLGGALAVVSRQTAAFRTARRLHDLRDERSSLESRRAELERRIRLASSRQVLVPIAERSLGLHEPADSEFVLFVVPAPMTRGPGRP
Ga0307414_101370682	3300032004	Rhizosphere	MRLKGRHWLMLWLVLLLGVLGGITARQTAGFRTARRLRELREERMALEARRGELERRIRAGSSRQVLVPIVERALGLHEPTDSEFVLFAMPPAGPVGR
Ga0310890_112845421	3300032075	Soil	RGARTSGSSRRLMRFKGRHWLMLWLLLFLLVLLAITTRQTEGFRTARRLRDLREERMALEARRGDLERRIRLASSRQVLVPIVQRALGLHEPADSEFVLFAVPPAGPQEER
Ga0307415_1002051032	3300032126	Rhizosphere	AMRLKGRHWLMLWLVLLLCVLGAITARQTAGFRTARRLRELREERMALEARRAELERRIRVGSSRQVLVPIVEQSLGLHEPSDSEFVLFAMPPAGPVGR
Ga0307471_1025733802	3300032180	Hardwood Forest Soil	MKVKGRHWLLLWLLLFLLVTAVVVGRQTAAFKVARRVGELREQRTALEARRADLERRIREASGRQVLVPKAVHDLGLHLPADNEFILFAVPSGPADRSKR
Ga0307471_1038632061	3300032180	Hardwood Forest Soil	MVWLLIFLCVLVAITTRQAEGFRTARSLRELREARLALEARRGDLERRIRAGSSRQVLVPIVERALGLHEPADSEFVLFPVPPAGGERR
Ga0307472_1004652502	3300032205	Hardwood Forest Soil	MKVKGRHWLVLWLLLFLLVAAVVIARQTAAFKVARRVGELREQRTALEARRADLERRIREASGRQVLVPKAERDLGLHFPADNEFILFAVPAGPAQRPKP
Ga0315273_105055824	3300032516	Sediment	MVIWLLIFLCVLVAITTRQTAGFRTARRVRELREERLALEARRADLERRIRVASSRQVLVPVAQRLLGLHEPGDSEFV
Ga0316628_1002878862	3300033513	Soil	MRLKGRHWLMLWLLIFLGVLVAITTRQTAGFRTARRLHDLREERLALEARRADLERRIRVGSSRQVLVPIVERALGLHEPADSEFVLFALPSAASGMP
Ga0373948_0038237_347_643	3300034817	Rhizosphere Soil	MRLKGRHWLMLWLVVFLGVLVAITTRQTEGFRTARRLRELREERMALEARRGDLERRIREGSSRQVLVPIVERALGLHEPADSEFVLFAVPPAGPGSR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice