NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F063011

Metagenome / Metatranscriptome Family F063011

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F063011
Family Type Metagenome / Metatranscriptome
Number of Sequences 130
Average Sequence Length 182 residues
Representative Sequence MTISTFGFALMTMASVGACLGQDPASPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGAVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVKVKGVRYPVATTSEPPGAGGLGLERNGPKWVDGRAAQGEVLTSGPRIKVPARTLLAFQIAEPIRLTGFR
Number of Associated Samples 80
Number of Associated Scaffolds 130

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 64.62 %
% of genes near scaffold ends (potentially truncated) 49.23 %
% of genes from short scaffolds (< 2000 bps) 76.92 %
Associated GOLD sequencing projects 70
AlphaFold2 3D model prediction Yes
3D model pTM-score0.43

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (56.923 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(47.692 % of family members)
Environment Ontology (ENVO) Unclassified
(47.692 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(69.231 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Fibrous Signal Peptide: Yes Secondary Structure distribution: α-helix: 3.21%    β-sheet: 27.06%    Coil/Unstructured: 69.72%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.43
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 130 Family Scaffolds
PF00106adh_short 12.31
PF02776TPP_enzyme_N 11.54
PF00210Ferritin 6.15
PF13561adh_short_C2 3.85
PF07077DUF1345 2.31
PF02775TPP_enzyme_C 2.31
PF00702Hydrolase 2.31
PF00122E1-E2_ATPase 1.54
PF02492cobW 0.77
PF00486Trans_reg_C 0.77
PF03306AAL_decarboxy 0.77
PF13426PAS_9 0.77
PF01979Amidohydro_1 0.77
PF11154DUF2934 0.77
PF00378ECH_1 0.77
PF05194UreE_C 0.77
PF01730UreF 0.77

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 130 Family Scaffolds
COG4291Uncharacterized membrane proteinFunction unknown [S] 2.31
COG0474Magnesium-transporting ATPase (P-type)Inorganic ion transport and metabolism [P] 1.54
COG2216K+ transport ATPase, ATPase subunit KdpBInorganic ion transport and metabolism [P] 1.54
COG2217Cation-transporting P-type ATPaseInorganic ion transport and metabolism [P] 1.54
COG0830Urease accessory protein UreFPosttranslational modification, protein turnover, chaperones [O] 0.77
COG2371Urease accessory protein UreEPosttranslational modification, protein turnover, chaperones [O] 0.77
COG3527Alpha-acetolactate decarboxylaseSecondary metabolites biosynthesis, transport and catabolism [Q] 0.77


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A56.92 %
All OrganismsrootAll Organisms43.08 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300003505|JGIcombinedJ51221_10117484All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Solibacter → Candidatus Solibacter usitatus1065Open in IMG/M
3300003505|JGIcombinedJ51221_10121193Not Available1049Open in IMG/M
3300005434|Ga0070709_11128931Not Available628Open in IMG/M
3300005467|Ga0070706_100126329All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2385Open in IMG/M
3300005468|Ga0070707_100998988Not Available802Open in IMG/M
3300005538|Ga0070731_10120830All Organisms → cellular organisms → Bacteria → Proteobacteria1735Open in IMG/M
3300005541|Ga0070733_10012911All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium5242Open in IMG/M
3300005591|Ga0070761_10081890All Organisms → cellular organisms → Bacteria1841Open in IMG/M
3300005602|Ga0070762_10217745Not Available1175Open in IMG/M
3300005610|Ga0070763_10130169All Organisms → cellular organisms → Bacteria1297Open in IMG/M
3300005712|Ga0070764_10060678All Organisms → cellular organisms → Bacteria1956Open in IMG/M
3300005921|Ga0070766_10014160All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria4146Open in IMG/M
3300005921|Ga0070766_10050569All Organisms → cellular organisms → Bacteria2334Open in IMG/M
3300005921|Ga0070766_10069238All Organisms → cellular organisms → Bacteria2027Open in IMG/M
3300006173|Ga0070716_100614022Not Available820Open in IMG/M
3300006173|Ga0070716_100759729Not Available747Open in IMG/M
3300006176|Ga0070765_100804024Not Available889Open in IMG/M
3300006796|Ga0066665_11479318Not Available529Open in IMG/M
3300006854|Ga0075425_100651936Not Available1210Open in IMG/M
3300007265|Ga0099794_10344102Not Available775Open in IMG/M
3300009012|Ga0066710_102368248Not Available771Open in IMG/M
3300009176|Ga0105242_12724814Not Available544Open in IMG/M
3300009553|Ga0105249_12597283Not Available579Open in IMG/M
3300010373|Ga0134128_11758551Not Available682Open in IMG/M
3300010379|Ga0136449_100131008All Organisms → cellular organisms → Bacteria → Proteobacteria4996Open in IMG/M
3300010401|Ga0134121_10596684Not Available1032Open in IMG/M
3300011120|Ga0150983_11078832Not Available610Open in IMG/M
3300012205|Ga0137362_10026014All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria4582Open in IMG/M
3300012207|Ga0137381_10404159All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea1191Open in IMG/M
3300013296|Ga0157374_12335315Not Available562Open in IMG/M
3300013297|Ga0157378_12491873Not Available569Open in IMG/M
3300014501|Ga0182024_10099073All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales4252Open in IMG/M
3300020579|Ga0210407_10008013All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales7958Open in IMG/M
3300020579|Ga0210407_10256967Not Available1361Open in IMG/M
3300020580|Ga0210403_10282004All Organisms → cellular organisms → Bacteria → Proteobacteria1362Open in IMG/M
3300020580|Ga0210403_10464009All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Vibrionales → Vibrionaceae → Photobacterium → Photobacterium phosphoreum1031Open in IMG/M
3300020580|Ga0210403_11039953Not Available639Open in IMG/M
3300020581|Ga0210399_10188985All Organisms → cellular organisms → Bacteria1711Open in IMG/M
3300020581|Ga0210399_10267150All Organisms → cellular organisms → Bacteria1427Open in IMG/M
3300020581|Ga0210399_10368499All Organisms → cellular organisms → Bacteria1199Open in IMG/M
3300020581|Ga0210399_11262525Not Available583Open in IMG/M
3300020583|Ga0210401_10043443All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales4254Open in IMG/M
3300020583|Ga0210401_10389505All Organisms → cellular organisms → Bacteria → Proteobacteria1256Open in IMG/M
3300020583|Ga0210401_10732040Not Available849Open in IMG/M
3300021088|Ga0210404_10083511Not Available1578Open in IMG/M
3300021088|Ga0210404_10194992Not Available1084Open in IMG/M
3300021088|Ga0210404_10255752Not Available954Open in IMG/M
3300021088|Ga0210404_10665291Not Available593Open in IMG/M
3300021168|Ga0210406_10021693All Organisms → cellular organisms → Bacteria5971Open in IMG/M
3300021168|Ga0210406_10036779All Organisms → cellular organisms → Bacteria4381Open in IMG/M
3300021168|Ga0210406_10313294All Organisms → cellular organisms → Bacteria → Proteobacteria1273Open in IMG/M
3300021168|Ga0210406_10484337Not Available980Open in IMG/M
3300021170|Ga0210400_10135289All Organisms → cellular organisms → Bacteria1976Open in IMG/M
3300021171|Ga0210405_10017229All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria5917Open in IMG/M
3300021171|Ga0210405_10049599All Organisms → cellular organisms → Bacteria → Proteobacteria3324Open in IMG/M
3300021171|Ga0210405_10111524All Organisms → cellular organisms → Bacteria2159Open in IMG/M
3300021171|Ga0210405_11017581Not Available624Open in IMG/M
3300021178|Ga0210408_10443613All Organisms → cellular organisms → Bacteria → Proteobacteria1033Open in IMG/M
3300021178|Ga0210408_10446290Not Available1030Open in IMG/M
3300021178|Ga0210408_10596707Not Available875Open in IMG/M
3300021178|Ga0210408_11019415Not Available640Open in IMG/M
3300021180|Ga0210396_10145823All Organisms → cellular organisms → Bacteria2128Open in IMG/M
3300021181|Ga0210388_10358382Not Available1282Open in IMG/M
3300021181|Ga0210388_10567362Not Available994Open in IMG/M
3300021181|Ga0210388_10944145Not Available742Open in IMG/M
3300021401|Ga0210393_11444639Not Available548Open in IMG/M
3300021401|Ga0210393_11525146Not Available531Open in IMG/M
3300021402|Ga0210385_11502053Not Available515Open in IMG/M
3300021403|Ga0210397_10506228Not Available915Open in IMG/M
3300021405|Ga0210387_10149132All Organisms → cellular organisms → Bacteria1999Open in IMG/M
3300021406|Ga0210386_10029922All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria4271Open in IMG/M
3300021407|Ga0210383_11080044Not Available678Open in IMG/M
3300021407|Ga0210383_11335000Not Available598Open in IMG/M
3300021420|Ga0210394_10033853All Organisms → cellular organisms → Bacteria4565Open in IMG/M
3300021420|Ga0210394_10130862All Organisms → cellular organisms → Bacteria2174Open in IMG/M
3300021420|Ga0210394_10366356All Organisms → cellular organisms → Bacteria1267Open in IMG/M
3300021432|Ga0210384_10448785All Organisms → cellular organisms → Bacteria → Proteobacteria1161Open in IMG/M
3300021432|Ga0210384_10734894All Organisms → cellular organisms → Bacteria882Open in IMG/M
3300021432|Ga0210384_10801296Not Available840Open in IMG/M
3300021432|Ga0210384_10830713Not Available822Open in IMG/M
3300021477|Ga0210398_10136740All Organisms → cellular organisms → Bacteria → Proteobacteria1992Open in IMG/M
3300021477|Ga0210398_10194394Not Available1653Open in IMG/M
3300021478|Ga0210402_10146123All Organisms → cellular organisms → Bacteria2152Open in IMG/M
3300021478|Ga0210402_10738998All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea908Open in IMG/M
3300021479|Ga0210410_10706967Not Available890Open in IMG/M
3300021559|Ga0210409_10044242All Organisms → cellular organisms → Bacteria4211Open in IMG/M
3300021559|Ga0210409_10064599All Organisms → cellular organisms → Bacteria3412Open in IMG/M
3300021559|Ga0210409_10092331All Organisms → cellular organisms → Bacteria2795Open in IMG/M
3300021559|Ga0210409_10262225All Organisms → cellular organisms → Bacteria → Acidobacteria1563Open in IMG/M
3300022529|Ga0242668_1098616Not Available590Open in IMG/M
3300022533|Ga0242662_10215769Not Available610Open in IMG/M
3300022724|Ga0242665_10255097Not Available598Open in IMG/M
3300025905|Ga0207685_10631114Not Available578Open in IMG/M
3300025906|Ga0207699_10456755Not Available917Open in IMG/M
3300025910|Ga0207684_10404146Not Available1174Open in IMG/M
3300025939|Ga0207665_10095116Not Available2070Open in IMG/M
3300026555|Ga0179593_1034434Not Available1433Open in IMG/M
3300027853|Ga0209274_10706536Not Available520Open in IMG/M
3300027855|Ga0209693_10177789Not Available1049Open in IMG/M
3300027879|Ga0209169_10010692All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales5137Open in IMG/M
3300027884|Ga0209275_10182155All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1128Open in IMG/M
3300027884|Ga0209275_10215449Not Available1043Open in IMG/M
3300027889|Ga0209380_10080450All Organisms → cellular organisms → Bacteria1870Open in IMG/M
3300028906|Ga0308309_10021681All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria4253Open in IMG/M
3300029636|Ga0222749_10055753All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1757Open in IMG/M
3300029636|Ga0222749_10275915All Organisms → cellular organisms → Bacteria → Proteobacteria862Open in IMG/M
3300031057|Ga0170834_104457231Not Available745Open in IMG/M
3300031057|Ga0170834_105669977Not Available754Open in IMG/M
3300031057|Ga0170834_107223424All Organisms → cellular organisms → Bacteria3278Open in IMG/M
3300031231|Ga0170824_122312223All Organisms → cellular organisms → Bacteria2222Open in IMG/M
3300031231|Ga0170824_127773259Not Available717Open in IMG/M
3300031708|Ga0310686_111540920All Organisms → cellular organisms → Bacteria → Proteobacteria1384Open in IMG/M
3300031715|Ga0307476_10178021Not Available1538Open in IMG/M
3300031715|Ga0307476_10918527Not Available646Open in IMG/M
3300031718|Ga0307474_10300950Not Available1236Open in IMG/M
3300031718|Ga0307474_10902933Not Available699Open in IMG/M
3300031720|Ga0307469_10056038All Organisms → cellular organisms → Bacteria2513Open in IMG/M
3300031720|Ga0307469_11460751Not Available654Open in IMG/M
3300031740|Ga0307468_100298641Not Available1168Open in IMG/M
3300031754|Ga0307475_10571231Not Available906Open in IMG/M
3300031754|Ga0307475_11114402Not Available617Open in IMG/M
3300031754|Ga0307475_11405939Not Available538Open in IMG/M
3300031820|Ga0307473_10211331Not Available1160Open in IMG/M
3300031962|Ga0307479_11051547Not Available782Open in IMG/M
3300031962|Ga0307479_11145834Not Available743Open in IMG/M
3300032160|Ga0311301_12181377Not Available637Open in IMG/M
3300032174|Ga0307470_10646903Not Available798Open in IMG/M
3300032180|Ga0307471_100104803All Organisms → cellular organisms → Bacteria2580Open in IMG/M
3300032205|Ga0307472_100106997Not Available1941Open in IMG/M
3300032205|Ga0307472_100658191Not Available935Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil47.69%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil13.08%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil11.54%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.92%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil3.85%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil3.08%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.31%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.54%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.54%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil1.54%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.54%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.77%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil0.77%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.77%
PermafrostEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost0.77%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.77%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.77%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.77%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300003505Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924)EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005538Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1EnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300005712Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010379Sb_50d combined assemblyEnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300014501Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021402Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-OEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021477Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022529Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022533Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300027853Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 (SPAdes)EnvironmentalOpen in IMG/M
3300027855Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027879Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ51221_1011748413300003505Forest SoilMTVSTLGFALMTMASIGACLGQDLANPADXPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQIAEPIRLTGFR*
JGIcombinedJ51221_1012119313300003505Forest SoilMTISTFAFALMTMASVGACLGQDPXGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPIELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPAKTLLAFQIVEPIRLTGFP*
Ga0070709_1112893113300005434Corn, Switchgrass And Miscanthus RhizosphereMTVSTLGFALMTLASVGACLGQDLASPADPPPLGPVRIVAEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATASEPPGAGGLGLERNGPKWVGRRAAQGEVLTS
Ga0070706_10012632933300005467Corn, Switchgrass And Miscanthus RhizosphereMTIPTFCFALMTMASLGACPGKDLFSPADVLPPGPVRTVAEEPGVVSAGSSLVVQTDDAITAVRAMRATIYTANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASEKPGAGGVGLERNGPKRVGGRAAQGEVLTSGPRINVPAKTLLAFQIVEPIRLTGFRR*
Ga0070707_10099898823300005468Corn, Switchgrass And Miscanthus RhizosphereVPPPGPVRTIAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILELRGVTVKGLRYPVATASENPGAGGLGLERNGPKRVGGRAAQGEVPTSGPRINVPAKTLLAFQIVEPIRLTGFRR*
Ga0070731_1012083023300005538Surface SoilMTVSTLGFALMTMASIGACLGQDLANPADPPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQIAEPIRLTGFR*
Ga0070733_1001291123300005541Surface SoilMTISTFGFALMIIASVGACFGQDPADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDDAVTTVRAMRATIYSASVAEDVVDQNGTVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR*
Ga0070761_1008189033300005591SoilGQDPADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDDAVTTVRAMRATIYSANVAEDVVDQNGTVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR*
Ga0070762_1021774523300005602SoilMTISTFGFALMTMASVRACLGQDLASPADPLLLGPLRIVAEEPGVLPTGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGTLLIPKDSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVFGPPGAGGLGLERNAPKWVGGRAAKGEVLTSGPRINVPAKTLLAFQIVEPIRLTGFRR*
Ga0070763_1013016923300005610SoilMTISTFGFALMTIASAGACFGQDRADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGMVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPRWVGGRAAQAEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR*
Ga0070764_1006067823300005712SoilMTISTFGFALMIIASVAACFGQDPADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGMVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGRARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR*
Ga0070766_1001416023300005921SoilMTISTFAFALMTMASVGACLGQDPAGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPMELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPAKTLLAFQIVEPIRLTGFP*
Ga0070766_1005056913300005921SoilLLGPLRIVAEEPGVLPTGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGTLLIPKDSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVFGPPGAGGLGLERNAPKWVGGRAAKGEVLTSGPRINVPAKTLLAFQIVEPIRLTGFRR*
Ga0070766_1006923823300005921SoilMTISTFGFALMTIASAGACFGQDRADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGMVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR*
Ga0070716_10061402213300006173Corn, Switchgrass And Miscanthus RhizosphereMTISTFAFALMTMASVGACLGQDPAGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNATVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATASEPPGAGGLGLERNGPKWVGGRAAQGEVLTSGPRIKVPAKT
Ga0070716_10075972913300006173Corn, Switchgrass And Miscanthus RhizosphereMTIPTFGFALMTMASLATCTRKDLSSQADLSPPGPVRTVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTGNVAEDVVDQNGTVLIPKDSPVEFGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRAAKGEVLTSGPRIKVPA
Ga0070765_10080402413300006176SoilMTIPTFGFALMTMASLGACPGKDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELIVELRAVTVKDVRYPIATASEPPGAGGLGLERNGPKWVGGRATQREVLTNGPRIKVPARTLLAFQIVEPIRL
Ga0066665_1147931813300006796SoilMTIPTFCFALMTMASLGACPGKDLSSPADVLPPGPIRTVAEEPGVVPAGSSLVVQTEDAITAVRAMRTTIYTANVAEDVVDQDGTVLIPKDSPLELGVRSLPYLGPGGVGMSGLILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKRVGGRAVQGEVLTSG
Ga0075425_10065193623300006854Populus RhizosphereMTIPTFCFALMTMASLGACPGKDLSSPADVLPPGPVRTVAEEPGVVSAGSSLVVQTDDAITAVRAMRATIYTANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGLGMTELILEVRAVTVKGVRYPVATASEKPGAGGVGLERNGPKRVGGRAAQGEVLTSGPRIKVPARTLLAFQIVEPIRLIRFRR*
Ga0099794_1034410223300007265Vadose Zone SoilRTVTDEPGVVPPGTSLVVRTNDTVSTLKAMRGTIYFANVAEDVLDQDGTVLIPKESSVELVVRSLSYLGPGGVGMTELTLDIRAVTVNGVRYPVATKAGKPGAGGLGLDQNAPKGIGGGAAAGDVLTRGRRIHVPARTLIAFQIEDPIRMSGFRR*
Ga0066710_10236824813300009012Grasslands SoilFCFALMTMASLGACPGKDLSSPAEVLPTGPVRTVAEEPGVVPAGSSLVVQTDDAITAVRAMRATIYSANVGEDMVDQDGTVLIPKDSPLELGVRSLPYLGPGGVGMSELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKRVGGRAVQGEVLTSGPRINVPAKTLLAFQIVEPIRLTGFRR
Ga0105242_1272481413300009176Miscanthus RhizosphereSVGACLGQDPAGPADPPPLGPVHIVAETPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVAENVVDQNGTVLIPKQSPIELGVRSVPYLGPGGVGMTELILELRAVTVKGVRYPVASASEPPGAGGLRLGRNAPKWVGGPAAQGEALTSGPRIKVPARTLLAFQIAEPIRLAGFP*
Ga0105249_1259728313300009553Switchgrass RhizosphereVGPAHPAPLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVVEDVVDQNGTVLIPKDSHVDLGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRDAPKLVGGRAAQGEVLMSGPRIKVPARTLLAFQIAEPIRLTGFP*
Ga0134128_1175855113300010373Terrestrial SoilMTISTFGFALMTMASVGACLGQDLASPADPPPLGPVRIVDEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLALERNAPKWVGGRAAQGKILTSGPRIKVPAKT
Ga0136449_10013100813300010379Peatlands SoilLGQDLANPADPPPLGPVRIVAEEPGVVPTGSSLVVQTDDEVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPAATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQIAEPIRLTGFR*
Ga0134121_1059668413300010401Terrestrial SoilNRSIQSRPPNCTAQRESNIMTISTFDFALMTMASVGVCLGQDLASPADPLPLGPVRIVAEEPSVVAAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNATVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATASEPPGAGGLGLERNGPKWVGNRAAQGEVLTSGPRIKVPAKTLLAFQIVEPIRLTGFRR*
Ga0150983_1107883213300011120Forest SoilMTISTFGFALMTVAGVGACLGQDLASPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPMELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQIAEPIRLTGFR*
Ga0137362_1002601463300012205Vadose Zone SoilMTMASLGACPGKDLSSPADVLPPGPIRTVAEEPGVVPAGSSLVVQTDDAITAVRAMRATIYSANVGEDMVDQDGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILEVRAVTVKRIRYPVATASENPGAGGLGLERNGPKRVGGRAAQGEVLTSGPRINVPAKTLLAFQIVEPIRLTGFRR*
Ga0137381_1040415923300012207Vadose Zone SoilMTMASLGACPGKELSSPVDVLAPGPVRTVAEEPGVVPVGSSLVVQTDDAITAFRAMRATIYTANVAEDVADQNGTVLIPKDSHVELGVRSLPYLGPGGVGMTELILELRAVTVKGERYPVATASERPGAGGLGLERNGPKRVGGRAVQGEVLTSGPRINVPAKTLLAFQIVEPIR
Ga0157374_1233531513300013296Miscanthus RhizosphereREDNLMTISTFGFALMTMASLGACPRKDLSSPADVSAPGPVRIVAEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNATVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGARYPVATASEPPGAGGLGLERNGPKWVGNRAAQGEVLTSGPRIKVPARTLLAFQIV
Ga0157378_1249187313300013297Miscanthus RhizosphereLVTMASVGACLGQDPARAADPPPLGPVRIVAEKPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLALERNAPKWVGGRAAQGKILTSGPRIKVPAKTLLAFHIVEPIRLTGFP*
Ga0182024_1009907313300014501PermafrostMTISTFGFALMTMASVGACLGQDVASPADPRPLGPVRIVAEEPGVVPTGSSLVVQADDAVTAVRAMRATIYFANVAEDVADQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKLVGGRAARGKVLTSGPRIKVPARTLLAFQIVEPIRLAGFR*
Ga0210407_1000801353300020579SoilMTIPTFSFALMTMASLATCNRKDLSRPADLSPPGPVRTVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTGNVAEDVVDQNGTVLIPKDSPVEFGVRSLPYLGPGGVGMTELILELRAVTVKGGRYPVATASENPGAGGLGLERNSPKWVGGRAAQREVLTSGPRIKVPAGTLLAFQIVEPIRLIGFRR
Ga0210407_1025696723300020579SoilMASLGARPGKDLSSPADVSAPGPVRTVAEEPGVVRAGSSPVVQTDDAVTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKDVRYPIATASEHPGAGGLGLERNGPKWVGGRATQVEVLTNGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0210403_1028200423300020580SoilMTISTFAFALMTMASVGACLGQDPVGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPIELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPAKTLLAFQIVEPIRLTGFP
Ga0210403_1046400923300020580SoilMTVSTLGFALMTMASIGACLGQDLANPADPPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSAKLAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQVAEPIRLTGFR
Ga0210403_1103995313300020580SoilISTFGFALITMASLGACPEKDLSSPADASPPRPVRTVAEEPGVVSAGSYIVVQTDDTVTAVRAMGATIYTANVTEDVVDQNGTVLIPKDAPVELGVRSLPYLGPGGVGMTELILELRSVTVKGLRYPVATASENPGAGGLGLERDSPKWVGGRAAKGAVLASGPRINVPAKTLLAFQIVEPIRLTGFRR
Ga0210399_1018898533300020581SoilMTISTFGFALMTMASVGACLGQDPASPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGAVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVKVKGVRYPVATTSEPPGAGGLGLERNGPKWVDGRAAQGEVLTSGPRIKVPARTLLAFQ
Ga0210399_1026715013300020581SoilMTISTFAFALMTMASVGACLGQDPVGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPIELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPARTLLAFQIVEPIRLTGFP
Ga0210399_1036849913300020581SoilMTISTLGFALMTMASLGACPGKDLSSPADISPPGPVRTVAEEPDVVPAGSSLIIQTDDAVTAVKAMRATIYTANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATSSEPPGAGGLGLERNGPKWVGGRATQGEVLTSGPGIKARARALLAFQI
Ga0210399_1126252513300020581SoilMTVSTLGFALMTMASIGACLGQDLANPADRPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSAKLAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQIAEPIRLT
Ga0210401_1004344353300020583SoilMTIPAFGFALMTMASLGACPGRDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAMTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMAELILKLRAVTIKGVRYPVATAPEKPGAGSLELERNSRKWVGGDPAKGEVLMSDPRINVPAKTLLAFQIVEPIPLTGFRR
Ga0210401_1038950523300020583SoilSTFAFALMTMASVGACLGQDPAGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPIELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPAKTLLAFQIVEPIRLTGFP
Ga0210401_1073204023300020583SoilASLGACLGQNLAGPADPPPLGPVRIVAEEPGVVPTGGSLVVQTDDAVTAVRAKGATIYSAKVAEDVVDQNGTVLIPKDSPAELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPKSVGGRAAQGAVLTSGPRIKVPARTLLAFQIAEPLRLTGFR
Ga0210404_1008351113300021088SoilMTISTFAFALMTMASVGACLGQDPAGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPIELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPARTLLAFQIVEPIRLTGFP
Ga0210404_1019499213300021088SoilMTIPTFGFALMTMASLATCTGKDLSSPADVPPPGPVRTIAEEPGVVPTGSSLVVQTDDAVTAVKAMRATIYTANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRVAQGEVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0210404_1025575213300021088SoilMTISTFGFALMTMASLGACPGRDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAMTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEPPGAGGLGLERNGPKWVGGRAAQGEVLTSGPRIKVPAKTLLAFQIVEPIRLTGFRR
Ga0210404_1066529113300021088SoilMTISTFGFALMTMASVGACLGQDPASPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGAVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVKVKGVRYPVATTSEPPGAGGLGLERNGPKWVDGRAAQGEVLTSGPRIKVPARTL
Ga0210406_1002169313300021168SoilSLATCNRKDLSRPADLSPPGPVRTVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTGNVAEDVVDQNGTVLIPKDSPVEFGVRSLPYLGPGGVGMTELILELRAVTVKGGRYPVATASENPGAGGLGLERNSPKWVGGRAAQREVLTSGPRIKVPAGTLLAFQIVEPIRLIGFRR
Ga0210406_1003677913300021168SoilMTIPTFGFALMTMASLGACPGRDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAMTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRALTVKDVRYPIATASEPPGAGGLGLERNAPSELVGTRRKATFSRAAPIKVPARTLLAFQIVEPIRLSGFRR
Ga0210406_1031329423300021168SoilMTISTFGFALMTMASVGACLGQDPASPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGAVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVKVKGVRYPVATTSEPPGAGGLGLERNGPKWVDGRAAQGEVLTSGPRIKVPARTLLAFQIAEPIRLTGFR
Ga0210406_1048433713300021168SoilHRLKCEDNLMTISTFGFALITMASLGACPEKDLSSPADASPPRPVRTVAEEPGVVSAGSYIVVQTDDTVTAVRAMGATIYTANVTEDVVDQNGTVLIPKDAPVELGVRSLPYLGPGGVGMTELILELRSVTVKGLRYPVATASENPGAGGLGLERDSPKWVGGRAAKGAVLASGPRINVPAKTLLAFQIVEPIRLTGFRR
Ga0210400_1013528923300021170SoilMTIPTFGFALMTMASLGACPGRDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAMTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKDVRYPIATASEPPGAGGLGLERNGPKWVGGRATQREVLTNGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0210405_1001722923300021171SoilMTISTFGFALMTMASLGACPGRDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAMTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKDVRYPIATASEPPGAGGLGLERNGPKWVGGDAAKG
Ga0210405_1004959943300021171SoilMTISTFAFALMTMASVGACLGQDPAGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPMELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPAKTLLAFQIVEPIRLTGFP
Ga0210405_1011152443300021171SoilMTISTFGFALMTMASVGACLGQDPACPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGAVLIPKDSPVELAVRSLPYLGPGGVGMTELILELRAVKVRGVRYPVATTSEPPGAGGLGLERNGPKWVDGRAAQGEVLTSGPRIKVPARTLLAFQIAEPIRLTGFR
Ga0210405_1101758113300021171SoilMTISTFGFALMTVASVGACLGQDLASPADLPPLGPVRIVAEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNATVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATGSEPPGAGGLGLERNGPKWVDGRAAQ
Ga0210408_1044361323300021178SoilPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPIELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPARTLLAFQIVEPIRLTGFP
Ga0210408_1044629023300021178SoilMTIPTFSFALMTMASLATCNRKDLSRPADLSPPGPVRTVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTGNVAEDVVDQNGTVLIPKDSPVEFGVRSLPYLGPGGVGMTELILELRAVTVKGGRYPVATASENPGAGGLGLERNSPKWVGGRAAQGEVLTS
Ga0210408_1059670713300021178SoilMTIPAFGFALMTMASLGACPGTDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAMTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKDVRYPIATASEPPGAGGLGLEPRIKVPARTLLAFQIVEPI
Ga0210408_1101941513300021178SoilMTIPTFGFALMTMASLATCTGKDLSSPADVPPPGPVRTIAEEPGVVPTGSSLVVQTDDAVTAVKAMRATIYTANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRVAQGKVLTSGPRIKVPARTLLAFQIVEPIR
Ga0210396_1014582333300021180SoilMTIPTFGFALMTMASLATCTGKDLSSPADVPPPGPVRTIAEEPGVVPTGSSLVVQTDDAVTAVKATRATIYTANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRVAQGEVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0210388_1035838213300021181SoilMTISTFGFALMTIASAGACFGQDRADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGMVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGRARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR
Ga0210388_1056736223300021181SoilMTISTFGFALMTMASVGAYLEQNPVGPADPPLGPVRIAAEEPGVVPAGSSLIVQTDDAVTAVRAMRATIYSANVADDIVDQNGTVLIPKDSPVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYSVATASEPPGAGGLGLARSGPKWVGGRAAKGEVLTSGRRIKVPARTLLAFQTAEPIRLTGFR
Ga0210388_1094414513300021181SoilMTIPTFGFALMTMASLGACPGKDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILDLRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKLVGGRAARGKVLTSGPRIKVPARTLLAFQIVEPIRLTGFR
Ga0210393_1144463913300021401SoilMTVSTLGFALMTMASIGACLGQDLANPADPPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLL
Ga0210393_1152514613300021401SoilDPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPIELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPAKTLLAFQIVEPIRLTGFP
Ga0210385_1150205313300021402SoilPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR
Ga0210397_1050622813300021403SoilIMTVSTLGFALMTMASIGACLGQDLANPADPPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQVAEPIRLTGFR
Ga0210387_1014913233300021405SoilMTISTFGFALMTMASLGACPGRDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAMTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTGKDVRYAIATASEPPGAGGLGLERNGPKWVGGDAAKG
Ga0210386_1002992223300021406SoilMTIPAFGFALMTMASLGACPGRDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAMTAAKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKDVRYPIATASEPPGAGGLGLERNGPKWVGGDAVKG
Ga0210383_1108004413300021407SoilASVGACFGQDPADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTGDAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPKWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR
Ga0210383_1133500013300021407SoilMTISTFGFALMTMASVGACLGQDAAGPADPPPLGPVRIVAEKPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKKSPIELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEPPGAGGLGLGRNGPKWVGGRAAQGEVLTSGPRIKVPARTLLAFQIAEPIRLAGFP
Ga0210394_1003385333300021420SoilMTISTFGFALMTVASVGACLGQDLASPADLPPLGPVRIVAEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNATVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATASEPPGTGGLGLERNGPRWVGGRAAQGEVLTSGPRIKVPAKTLLAFQIVEPIRLTGFRR
Ga0210394_1013086213300021420SoilMTVSTLGFALMTMASIGACLGQDLANPADPPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQIAEPIRLTGF
Ga0210394_1036635623300021420SoilMTIPTFGFALMTMASLGACLGQNLASPADPPPLGPVRIVAEEPGVVPTGGSLVVQTDDAVTAIRAMGATIYSAKVTEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPKSVGGRVAQGEVLTSGPRIKVPARTLLAFQTVEPIRLAGFRR
Ga0210384_1044878513300021432SoilNCTAQREDNVMTISTFAFALMTMASVGACLGQDPVGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPMELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPAKTLLAFQIVEPIRLTGFP
Ga0210384_1073489413300021432SoilMTIPTFCFALMTMASLGACQGKDLSNPVDVLPPGPVRTVAEEPGVVPAGSSLVVETDDAITAVRAMRATIYTANVAEDIVDQDGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRGVAVNGVHYPVASASEKPGAGGLGLGRNGPKRVGGRAAQGEVLTSG
Ga0210384_1080129613300021432SoilRRAKPLTAQWQLDDSSLKCRQTMAGKSLQSKDPEQAAKLHRLKCEDNLMTISTFGFALITMASLGACPEKDLSSPADASPPRPVRTVAEEPGVVSAGSYIVAQTDDTVTAVRAMGATIYTANVTEDVVDQNGTVLIPKDAPVELGVRSLPYLGPGGVGMTELILELRSVTVKGLRYPVATASENPGAGGLGLERDSPKWVGGRAAKGAVLASGPRINVPAKTLLSFQIVEPIRLTGFRR
Ga0210384_1083071313300021432SoilMTIPTFGFALMTMASLATCTGKDLSSPADVPPPGPVRTIAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRVAQGEVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0210398_1013674023300021477SoilMTVSTLGFALMTMASIGACLGLDLANPADPPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQIAEPIRLTGFR
Ga0210398_1019439413300021477SoilQREDNIMTISTYGFALMTIASAGACFGQDRADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR
Ga0210402_1014612333300021478SoilMTISTFGFALITMASLGACPEKDLSSPADASPPRPVRTVAEEPGVVSAGSYIVVQTDDTVTAVRAMGATIYTANVTEDVVDQNGTVLIPKDAPVELGVRSLPYLGPGGVGMTELILELRSVTVKGLRYPVATASENPGAGGLGLERDSPKWVGGRAAKGAVLASGPRINVPAKTLLAFQIVEPIRLTGFRR
Ga0210402_1073899813300021478SoilMTVATLGFALMTLASVGACLGQDLANPADPPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVIAVRAMRATIYFANLAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKLVGG
Ga0210410_1070696713300021479SoilMTISTFGFALMTMASLGACPGRDLSSPAEVAAPGPVRTVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTANVAKDVVNQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKDVRYPIATASEPPGAGGLGLERNGPKWVGG
Ga0210409_1004424213300021559SoilMTISTFGFALITMASLGACPEKDLSSPADASPPRPVRTVAEEPGVVSAGSYIVVQTDDTVTAVRAMGATIYTANVTEDVVDQNGTVLIPKDAPVELGVRSLPYLGPGGVGMTELILELRSVTVKGLRYPVATASENAGAGGLGLERDSPKWVGGRAAKGAVLASGPRINVPAKTLLSFQIVEPIRLTGFRR
Ga0210409_1006459923300021559SoilMTISTFGFALMTVASVGACLGQDLASPADLPPLGPVRIVAEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNATVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATASEPPGAGGLGLERNGPKSVGGRAAQGEVLTSGPRIKVPAKTLLAFQIVEPIRLTGFRR
Ga0210409_1009233133300021559SoilMTVSTLGFALMTMASIGACLGQDLANPADPPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQIAEPIRLTGFR
Ga0210409_1026222523300021559SoilMTIPTFGFALMTMAGLGACPGKDLSSPADVSAPGPVRTVAEEPGVVRAGSSPVVQTDDAVTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKDVRYPIATASEHPGAGGLGLERNGPKWVGGRATQVEVLTNGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0242668_109861613300022529SoilSTFAFALMTMASVGACLGQDPVGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPIELGVRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRSARGKVLTSGPRIKVPAKTLLAFQIVEPIRLTGFP
Ga0242662_1021576913300022533SoilTISTFGFALMTVASVGACLGKDLASPADLPPLGPVRIVAEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSPVELVVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATASEPPGAGGLGLERNGPKWVGGRAAQGEVLTSGPRIKVPAKTLLAFQIVEPIRLIGFRR
Ga0242665_1025509713300022724SoilTIPTFGFALMTMASLATCTGKDLSSPADVPPPGPVRTIAEEPGVVPTGSSLVVQTDDAVTAVKAMRATIYTANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRVAQGEVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0207685_1063111413300025905Corn, Switchgrass And Miscanthus RhizosphereTDLGVYGQVSFTVVVDPSGNFVDERSAKFGTHRCSAQREGNIMTISTFAFALMTMASVGACLGQDPAGLVDPPPLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTGNVAEDVVDQNGTVLIPKDSPVEFGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVTASEPPGAGGLGLERNGPNW
Ga0207699_1045675513300025906Corn, Switchgrass And Miscanthus RhizosphereMTISTFGFALMTMASVGACLGQDPAGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATASEPPGAGGLGLERNGPKWVGGRAAQGEVLMSGPRIKVPAKTLLAFQIVEPIRLTGFRH
Ga0207684_1040414623300025910Corn, Switchgrass And Miscanthus RhizosphereACPGKDLFSPADVLPPGPVRTVAEEPGVVSAGSSLVVQTDDAITAVRAMRATIYTANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASEKPGAGGVGLERNGPKRVGGRAAQGEVLTSGPRINVPAKTLLAFQIVEPIRLTGFRR
Ga0207665_1009511623300025939Corn, Switchgrass And Miscanthus RhizosphereMTIPTFSFALMTMASLATCNRKDLSRPADLSPPGPVRTVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTGNVAEDVVDQNGTVLIPKDSPVEFGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATASEPPGAGGLGLERNGPKWVGGRAAQGEVLTSGPRIKVPAKTLLAFQIVEPIRLTGFRH
Ga0179593_103443423300026555Vadose Zone SoilMTIPTFCFALTTMASLGACPGKDLSSPADVLPPGPVRTVAGEPGVVPAGSSLVVQTDDAITAVRAMRATIYTANVAEDVADQNGTVLIPKDSHVELGVRSLPYLGPGGVGMTELILELRAVTVKGERYPVATASERPGAGGLGLDRNGPKRVGGRAVQGEVLTSGPRINVPAKTLLAFQIVEPIRLTGFRR
Ga0209274_1070653613300027853SoilCLGQDLASPADPLLLGPLRIVAEEPGVLPTGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGTLLIPKDSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVFGPPGAGGLGLERNAPKWVGGRAAKGEVLTSGPRINVPAKTLLAFQIVEPIRLTGFRR
Ga0209693_1017778913300027855SoilMTISTFGFALMTIASAGACFGQDRADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVRGVRYPVATASEPPGAGGLGLERNGPKWVGGRA
Ga0209169_1001069233300027879SoilMTISTFGFALMIIASVAACFGQDPADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGMVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGRARSGPRWVGGRAAQGEVLTSGSRIKVPAGTLLAFQIAEPVRLTGSR
Ga0209275_1018215523300027884SoilMTISTFGFALMIIASVGACFGQDPADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGMVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR
Ga0209275_1021544913300027884SoilMTISTFGFALMTMASVRACLGQDLASPADPLLLGPLRIVAEEPGVLPTGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGTLLIPKDSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVFGPPGAGGLGLERNAPKWVGGRAAKGEVLTSGPRINVPAKTLLAFQIVEPIRLTGFRR
Ga0209380_1008045023300027889SoilMTISTFGFALMTIASAGACFGQDRADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR
Ga0308309_1002168123300028906SoilMTISTFGFALMTIASAGACFGQDRADPTGPPSLGPVRIVAEEPGVVPAGSSLVVQTDAAVTAVRAMRATIYSANVAEDVVDQNGMVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPRWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR
Ga0222749_1005575323300029636SoilMTIPTFGFALITMASLATCTGKDLSSPADVPPPGPVRTIAEEPGVVPTGSSLVVQTDDAVTAVKAMRATIYTANVAEDVVDQNGTLLIPKDSHVDLGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRVAQGEVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0222749_1027591513300029636SoilLGQDLASPADLPPLGPVRIVAEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSPVELGVPSLSYLGPGGVGMTELILELRAVTVKGGRYPVATASENPGAGGLGLERNSPKWVGGRAAQGEVLTSGPRIKVPARSLLAFQIVEPIRLIGFRR
Ga0170834_10445723113300031057Forest SoilMRISTFGFALMVVANAGACFGQNLASPADPTAPGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRAMRATIYSANAAEDVVDQNGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILEFRAVTVKGIRYPVATVSGPPGAGGLGLDRNAPKWVGGRAAQGKVLTSGPRIEVPAK
Ga0170834_10566997713300031057Forest SoilGQDLTSPADLLPLGPLRIVAEEPGVVPTGSSLVVQTDEAVIAVRAMRATIYSASVAKDVLDQSGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGIRYPVATVSGPPGAGGLGLDRNAPKWGGGRAAQGKILTSGLRIEVPAKTLLAFHIVEPIRLTGFR
Ga0170834_10722342423300031057Forest SoilMTVSTLGFALMTMASIGACLGQDLANPADPPPLGPVRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPAATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQIAEPIRLTGFR
Ga0170824_12231222323300031231Forest SoilMTISTFGFTLMTMASFGACLGQDLTSPADLPPLGPLRIVAEEPGVVPTGSSLVVQTDEAVIAVRAMRATIYSASVAKDVLDQSGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGIRYPVATVSGPPGAGGLGLDRNAPKWGGGRAAQGKILTSGLRIEVPAKTLLAFHIVEPIRLTGFR
Ga0170824_12777325913300031231Forest SoilPSGPLRIVAEEAGVVPTGSALVVQTDDAVTAVRAMGATIYSAKVAEDVVDQNGTVLIPKDSPVELGLRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPKWVGGRAAQGKVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0310686_11154092023300031708SoilMTVSTLGFALMTMASIGACLGQDLANPADPPPLGPLRIVAEEPGVVPTGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPAATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLTSGPRIKVPARTLLAFQVAEPIRLTGFR
Ga0307476_1017802123300031715Hardwood Forest SoilMTVSTLGFALMTMASIGACLGQDLANPADRPPLGPQRIVAEEPGVVPAGSSLVVQTDDAVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNAPTWVGGRAAQGKVLT
Ga0307476_1091852713300031715Hardwood Forest SoilQHEDNIMTISTFGFALMTMASVGACLGQDPASPADPPPLGPVRIVAEEPGVVPAGSSLVVQTGDAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSAVELGVRWLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEAPGAGGLGLARSGPGWVGGRAAQGEVLTSGSRIKVPARTLLAFQIAEPVRLTGSR
Ga0307474_1030095013300031718Hardwood Forest SoilDNIVTISTFGFALMTMASVGACLGQDPASPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVRAMRATIYSANVAEDVVDQNGAVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVKVKGVRYPVATTSEPPGAGGLGLERNGPKWVDGRAAQGEVLTSGPRIKVPARTLLAFQIAEPIRLTGF
Ga0307474_1090293313300031718Hardwood Forest SoilMTISTFGFALMTMASLGACPGRDLSSPADVSAPGPVRTVAEEPGVVPAGSSLVVQTDDAMTAVKAMRATIYTANVAKDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKDVRYPIATASEPPGAFGLGLERNGPKWVGGDAAKG
Ga0307469_1005603823300031720Hardwood Forest SoilMTISTFAFALMTMASVGACLGQDPAGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAVRATIYSANVAEDVVDQNGTVLIPKDSPIELGIRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARGKVLTSGPRIKVPARTLLAFQIVEPIRLTGFP
Ga0307469_1146075113300031720Hardwood Forest SoilMTIPTFGFALMTMASLATCTGKDLSSPADVPPPGPVRTIAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASESPGAGGLGLERNGPKWVGGRVAQGEVLTSGPRIKVP
Ga0307468_10029864123300031740Hardwood Forest SoilMTISTFGFALMTMASVGACLGQDLASPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDDAVTAVRAMRATIYFANVAQDVVDQNGAVLIPRDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLERNGPKWVGRRAAQGEVLTSGPRIKVPAKTLLAFQIVEPIRLTGFRR
Ga0307475_1057123123300031754Hardwood Forest SoilMTIPTFSFALMTMASLATCNRKDLSRPADLSPPGPVRTVAEEPGVVPAGSSLVVQTDDAVTAVKAMRATIYTGNVAEDVVDQNGTVLIPKDSPVEFGVRSLPYLGPGGVGMTELILELRAVTVKGGRYPVATASENPGAGGLGLERNSPKWVGGRAAQGEVLTSGPRIKVPARTLLAFQIVEPIRLIRFRR
Ga0307475_1111440213300031754Hardwood Forest SoilMTISTFAFALMTMASVGACLGQDPAGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPIELGIRLLPYLGPGGVGMTELILELRAVTVKGVRYPVATVSGPPGAGGLGLDRNAPKWVGGRAARG
Ga0307475_1140593913300031754Hardwood Forest SoilRWLGNRCQSKDPEQAAKLHRLKCEDNLMTISTFGFALITMASLGACPEKDLSSPADASPPRPVRTVAEEPGVVSAGSYIVVQTDDTVTAVRAMGATIYTANVTEDVVDQNGTVLIPKDAPVELGVRSLPYLGPGGVGMTELILELRSVTVKGLRYPVATASENPGAGGLGLERDSPKWV
Ga0307473_1021133123300031820Hardwood Forest SoilMTIPTFGFALMTMASLATCTGKDLSSPADVPPPGPVRTIAEEPGVVPTGSSLVVRTDDAVTAVKAMRATIYSANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRGAQGEVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0307479_1105154713300031962Hardwood Forest SoilMTIPTFGFALMTMASLGACPGKDLSRPADVSPPGHVRTVAEEPGVVPAGSSLVVQTDDAVAAVKAMRATIYTANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRAVTVKGVRYPVATASEPPGAGGLGLERNGPKWVRGRAAQGEVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0307479_1114583423300031962Hardwood Forest SoilMTISTFGFALVTMASLGACPGKDLSSPADASPPRPVRTVAEEPGVVSAGSYIVVQTDDTVTAVRAMGATIYTANVTEDVVDQNGTVLIPKDAPVELGVRSLPYLGPGGVGMTELILELRSVTVKGLRYPVATASENPGAGGLGLERDSPKWVGGRAAKGEVLASGPRINVPAKTLLVFQIVEPIRLTGFRR
Ga0311301_1218137713300032160Peatlands SoilMTVSTLGFALMTMTSIGACLGQDLANPADPPPLGPVRIVAEEPGVVPTGSSLVVQTDDEVTAVRPMRATIYSARVAEDVVDQSGTVLIPKGSPVELGVRSLSYLGPGGAGMTELILELRAVTVKGVRYPAATVSGPPGAGGLGLERNAPTLVGGRAAQGKVLTSGPRIKVPARTLLAFQV
Ga0307470_1064690313300032174Hardwood Forest SoilNAVKRWLGNHCNRSIQSRPPNCTAQREDNIMTISTFGFALMTMASVGACLGQDLASLADPPPLGPVRIVAEEPGVVPAGSSLVVQTNDAVTAVRAMRATIYSANVAEDVVDQNGTVLIPKDSPVELGVRSLSYLGPGGAGMRELILELRAVTVMGVRYPVATVSGPPGAGGLALERNAPKWVGGRAAQGMILTSGPRIKGAR
Ga0307471_10010480333300032180Hardwood Forest SoilMTIPTFGFALMTMASLATCTGKDLSSPADVPPPGPVRTIAEEPGVVPTGSSLVVRTDDAVTAVKAMRATIYTANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRVAQGEVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0307472_10010699713300032205Hardwood Forest SoilPPGPVRTIAEEPGVVPTGSSLVVRTDDAVTAVKAMRATIYSANVAEDVVDQNGTLLIPKDSHVELGVRSLPYLGPGGVGMTELILEVRAVTVKGVRYPVATASENPGAGGLGLERNGPKWVGGRVAQGEVLTSGPRIKVPARTLLAFQIVEPIRLTGFRR
Ga0307472_10065819113300032205Hardwood Forest SoilMTISTSAFALMTMASVGACLGQDPAGPADPPPLGPVRIVAEEPGVVPAGSSLVVQTDYAVTAARAMRATIYSANVAEDVVDQNGTVLIPKDSPVELGVRSLPYLGPGGVGMTELILELRGVTVNGVRYPVATASEKPGAGGLGLGRNGPKRVGGRAAQGEVPTSGHRINVPAKTLLAFQIVEPIRLTGFRR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.