NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F072640

Metagenome Family F072640

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F072640
Family Type Metagenome
Number of Sequences 121
Average Sequence Length 144 residues
Representative Sequence MPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR
Number of Associated Samples 98
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 17.65 %
% of genes near scaffold ends (potentially truncated) 33.88 %
% of genes from short scaffolds (< 2000 bps) 62.81 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (98.347 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(31.405 % of family members)
Environment Ontology (ENVO) Unclassified
(55.372 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(56.198 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 78.68%    β-sheet: 0.00%    Coil/Unstructured: 21.32%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 121 Family Scaffolds
PF03167UDG 38.02
PF07238PilZ 22.31
PF13424TPR_12 3.31
PF13660DUF4147 2.48
PF08369PCP_red 1.65
PF00196GerE 1.65
PF00923TAL_FSA 1.65
PF08281Sigma70_r4_2 1.65
PF01464SLT 1.65
PF02358Trehalose_PPase 0.83
PF01343Peptidase_S49 0.83
PF02470MlaD 0.83
PF00275EPSP_synthase 0.83
PF00005ABC_tran 0.83
PF02518HATPase_c 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 121 Family Scaffolds
COG0692Uracil-DNA glycosylaseReplication, recombination and repair [L] 38.02
COG1573Uracil-DNA glycosylaseReplication, recombination and repair [L] 38.02
COG3663G:T/U-mismatch repair DNA glycosylaseReplication, recombination and repair [L] 38.02
COG0176Transaldolase/fructose-6-phosphate aldolaseCarbohydrate transport and metabolism [G] 1.65
COG0616Periplasmic serine protease, ClpP classPosttranslational modification, protein turnover, chaperones [O] 1.65
COG1877Trehalose-6-phosphate phosphataseCarbohydrate transport and metabolism [G] 0.83


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms98.35 %
UnclassifiedrootN/A1.65 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002558|JGI25385J37094_10031881All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae1856Open in IMG/M
3300002558|JGI25385J37094_10037926All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae1670Open in IMG/M
3300002560|JGI25383J37093_10030770All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae1796Open in IMG/M
3300002912|JGI25386J43895_10019924All Organisms → cellular organisms → Bacteria → Nitrospirae1970Open in IMG/M
3300005166|Ga0066674_10017169All Organisms → cellular organisms → Bacteria3083Open in IMG/M
3300005167|Ga0066672_10868321All Organisms → cellular organisms → Bacteria → Nitrospirae561Open in IMG/M
3300005171|Ga0066677_10502556All Organisms → cellular organisms → Bacteria → Nitrospirae696Open in IMG/M
3300005172|Ga0066683_10391257All Organisms → cellular organisms → Bacteria → Nitrospirae860Open in IMG/M
3300005174|Ga0066680_10022565All Organisms → cellular organisms → Bacteria3462Open in IMG/M
3300005178|Ga0066688_10059438All Organisms → cellular organisms → Bacteria → Nitrospirae2237Open in IMG/M
3300005180|Ga0066685_10487246All Organisms → cellular organisms → Bacteria → Nitrospirae854Open in IMG/M
3300005186|Ga0066676_10170447All Organisms → cellular organisms → Bacteria → Nitrospirae1381Open in IMG/M
3300005445|Ga0070708_100893327All Organisms → cellular organisms → Bacteria → Nitrospirae834Open in IMG/M
3300005446|Ga0066686_10078659All Organisms → cellular organisms → Bacteria → Nitrospirae2083Open in IMG/M
3300005467|Ga0070706_101521415All Organisms → cellular organisms → Bacteria → Nitrospirae611Open in IMG/M
3300005518|Ga0070699_101332430All Organisms → cellular organisms → Bacteria → Nitrospirae658Open in IMG/M
3300005536|Ga0070697_100056170All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii3202Open in IMG/M
3300005540|Ga0066697_10170857All Organisms → cellular organisms → Bacteria → Nitrospirae1288Open in IMG/M
3300005540|Ga0066697_10222027All Organisms → cellular organisms → Bacteria → Nitrospirae1122Open in IMG/M
3300005554|Ga0066661_10090488All Organisms → cellular organisms → Bacteria → Nitrospirae1816Open in IMG/M
3300005555|Ga0066692_10663183All Organisms → cellular organisms → Bacteria → Nitrospirae650Open in IMG/M
3300005556|Ga0066707_10802165All Organisms → cellular organisms → Bacteria → Nitrospirae582Open in IMG/M
3300005598|Ga0066706_10313745All Organisms → cellular organisms → Bacteria → Nitrospirae1237Open in IMG/M
3300005598|Ga0066706_11175205All Organisms → cellular organisms → Bacteria → Nitrospirae584Open in IMG/M
3300006174|Ga0075014_100230685All Organisms → cellular organisms → Bacteria → Nitrospirae947Open in IMG/M
3300006791|Ga0066653_10111861All Organisms → cellular organisms → Bacteria → Nitrospirae1265Open in IMG/M
3300006794|Ga0066658_10276556All Organisms → cellular organisms → Bacteria → Nitrospirae899Open in IMG/M
3300006797|Ga0066659_10452674All Organisms → cellular organisms → Bacteria → Nitrospirae1020Open in IMG/M
3300007265|Ga0099794_10091283All Organisms → cellular organisms → Bacteria → Nitrospirae1511Open in IMG/M
3300009012|Ga0066710_100450126All Organisms → cellular organisms → Bacteria → Nitrospirae1930Open in IMG/M
3300009012|Ga0066710_100586835All Organisms → cellular organisms → Bacteria → Nitrospirae1689Open in IMG/M
3300009012|Ga0066710_101457347All Organisms → cellular organisms → Bacteria → Nitrospirae1059Open in IMG/M
3300009012|Ga0066710_102685209All Organisms → cellular organisms → Bacteria → Nitrospirae711Open in IMG/M
3300009038|Ga0099829_10209718All Organisms → cellular organisms → Bacteria → Nitrospirae1579Open in IMG/M
3300009089|Ga0099828_10130621All Organisms → cellular organisms → Bacteria → Nitrospirae2205Open in IMG/M
3300009089|Ga0099828_10671547All Organisms → cellular organisms → Bacteria → Nitrospirae931Open in IMG/M
3300009090|Ga0099827_10005367All Organisms → cellular organisms → Bacteria → Nitrospirae7685Open in IMG/M
3300009090|Ga0099827_10283384All Organisms → cellular organisms → Bacteria → Nitrospirae1399Open in IMG/M
3300009137|Ga0066709_100193747All Organisms → cellular organisms → Bacteria → Nitrospirae2660Open in IMG/M
3300009137|Ga0066709_100336458All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae2068Open in IMG/M
3300009137|Ga0066709_100602597All Organisms → cellular organisms → Bacteria → Nitrospirae1565Open in IMG/M
3300009162|Ga0075423_10770764All Organisms → cellular organisms → Bacteria → Nitrospirae1017Open in IMG/M
3300009444|Ga0114945_10000226All Organisms → cellular organisms → Bacteria35109Open in IMG/M
3300010333|Ga0134080_10016109All Organisms → cellular organisms → Bacteria → Nitrospirae2724Open in IMG/M
3300010333|Ga0134080_10613290All Organisms → cellular organisms → Bacteria → Nitrospirae531Open in IMG/M
3300010335|Ga0134063_10352375All Organisms → cellular organisms → Bacteria → Nitrospirae715Open in IMG/M
3300010336|Ga0134071_10069181All Organisms → cellular organisms → Bacteria → Nitrospirae1635Open in IMG/M
3300010391|Ga0136847_10641062All Organisms → cellular organisms → Bacteria → Nitrospirae2533Open in IMG/M
3300011269|Ga0137392_10154759All Organisms → cellular organisms → Bacteria → Nitrospirae1853Open in IMG/M
3300011270|Ga0137391_10732644All Organisms → cellular organisms → Bacteria → Nitrospirae819Open in IMG/M
3300011271|Ga0137393_10158165All Organisms → cellular organisms → Bacteria → Nitrospirae1896Open in IMG/M
3300012096|Ga0137389_10263618All Organisms → cellular organisms → Bacteria → Nitrospirae1451Open in IMG/M
3300012198|Ga0137364_10253340All Organisms → cellular organisms → Bacteria → Nitrospirae1301Open in IMG/M
3300012204|Ga0137374_10001725All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira25151Open in IMG/M
3300012206|Ga0137380_10029277All Organisms → cellular organisms → Bacteria → Nitrospirae5077Open in IMG/M
3300012206|Ga0137380_10059529All Organisms → cellular organisms → Bacteria → Nitrospirae3495Open in IMG/M
3300012207|Ga0137381_10280568All Organisms → cellular organisms → Bacteria → Nitrospirae1449Open in IMG/M
3300012285|Ga0137370_10005390All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii5802Open in IMG/M
3300012350|Ga0137372_10698088All Organisms → cellular organisms → Bacteria → Nitrospirae735Open in IMG/M
3300012351|Ga0137386_10013201All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira5538Open in IMG/M
3300012351|Ga0137386_10263291All Organisms → cellular organisms → Bacteria → Nitrospirae1240Open in IMG/M
3300012355|Ga0137369_10455263All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium912Open in IMG/M
3300012359|Ga0137385_10034406All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira4545Open in IMG/M
3300012927|Ga0137416_10090811All Organisms → cellular organisms → Bacteria → Nitrospirae2272Open in IMG/M
3300012972|Ga0134077_10061512All Organisms → cellular organisms → Bacteria → Nitrospirae1402Open in IMG/M
3300017656|Ga0134112_10429171All Organisms → cellular organisms → Bacteria → Nitrospirae550Open in IMG/M
3300017659|Ga0134083_10005487All Organisms → cellular organisms → Bacteria → Nitrospirae4038Open in IMG/M
3300018052|Ga0184638_1019382All Organisms → cellular organisms → Bacteria → Nitrospirae2397Open in IMG/M
3300018056|Ga0184623_10000533All Organisms → cellular organisms → Bacteria16181Open in IMG/M
3300018063|Ga0184637_10102097All Organisms → cellular organisms → Bacteria → Nitrospirae1766Open in IMG/M
3300018079|Ga0184627_10073345All Organisms → cellular organisms → Bacteria → Nitrospirae1791Open in IMG/M
3300018082|Ga0184639_10090047All Organisms → cellular organisms → Bacteria → Nitrospirae1616Open in IMG/M
3300018431|Ga0066655_11307368All Organisms → cellular organisms → Bacteria → Nitrospirae521Open in IMG/M
3300018468|Ga0066662_10152846All Organisms → cellular organisms → Bacteria → Nitrospirae1749Open in IMG/M
3300018468|Ga0066662_11349369All Organisms → cellular organisms → Bacteria → Nitrospirae734Open in IMG/M
3300021357|Ga0213870_1137000All Organisms → cellular organisms → Bacteria → Nitrospirae742Open in IMG/M
3300022563|Ga0212128_10000315All Organisms → cellular organisms → Bacteria35008Open in IMG/M
3300025173|Ga0209824_10107649All Organisms → cellular organisms → Bacteria → Nitrospirae1024Open in IMG/M
3300025312|Ga0209321_10535062All Organisms → cellular organisms → Bacteria → Nitrospirae520Open in IMG/M
3300026296|Ga0209235_1004631All Organisms → cellular organisms → Bacteria → Nitrospirae7691Open in IMG/M
3300026296|Ga0209235_1056888All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira1848Open in IMG/M
3300026296|Ga0209235_1148103All Organisms → cellular organisms → Bacteria → Nitrospirae938Open in IMG/M
3300026297|Ga0209237_1000957All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira16882Open in IMG/M
3300026297|Ga0209237_1016559All Organisms → cellular organisms → Bacteria → Nitrospirae4290Open in IMG/M
3300026298|Ga0209236_1058528All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii1882Open in IMG/M
3300026309|Ga0209055_1002796All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira10840Open in IMG/M
3300026309|Ga0209055_1049856All Organisms → cellular organisms → Bacteria → Nitrospirae1812Open in IMG/M
3300026313|Ga0209761_1077543All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira1735Open in IMG/M
3300026313|Ga0209761_1238636All Organisms → cellular organisms → Bacteria → Nitrospirae738Open in IMG/M
3300026318|Ga0209471_1046569All Organisms → cellular organisms → Bacteria → Nitrospirae2032Open in IMG/M
3300026324|Ga0209470_1044549All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira2173Open in IMG/M
3300026325|Ga0209152_10043779All Organisms → cellular organisms → Bacteria → Nitrospirae1558Open in IMG/M
3300026326|Ga0209801_1000635All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira22871Open in IMG/M
3300026327|Ga0209266_1001433All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira15866Open in IMG/M
3300026331|Ga0209267_1044746All Organisms → cellular organisms → Bacteria → Nitrospirae2025Open in IMG/M
3300026332|Ga0209803_1246925All Organisms → cellular organisms → Bacteria → Nitrospirae618Open in IMG/M
3300026333|Ga0209158_1002510All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira11224Open in IMG/M
3300026335|Ga0209804_1188664All Organisms → cellular organisms → Bacteria → Nitrospirae880Open in IMG/M
3300026342|Ga0209057_1025126All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira moscoviensis3258Open in IMG/M
3300026532|Ga0209160_1052285All Organisms → cellular organisms → Bacteria → Nitrospirae2351Open in IMG/M
3300026532|Ga0209160_1296496All Organisms → cellular organisms → Bacteria → Nitrospirae549Open in IMG/M
3300026536|Ga0209058_1121914All Organisms → cellular organisms → Bacteria → Nitrospirae1283Open in IMG/M
3300026538|Ga0209056_10029639All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira moscoviensis5185Open in IMG/M
3300026540|Ga0209376_1177595All Organisms → cellular organisms → Bacteria → Nitrospirae993Open in IMG/M
3300026548|Ga0209161_10012514All Organisms → cellular organisms → Bacteria → Nitrospirae6324Open in IMG/M
3300026548|Ga0209161_10016919All Organisms → cellular organisms → Bacteria → Nitrospirae5323Open in IMG/M
3300026552|Ga0209577_10030139All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira4719Open in IMG/M
3300027835|Ga0209515_10067283All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira2595Open in IMG/M
3300027846|Ga0209180_10201907All Organisms → cellular organisms → Bacteria → Nitrospirae1146Open in IMG/M
3300027862|Ga0209701_10216780All Organisms → cellular organisms → Bacteria → Nitrospirae1134Open in IMG/M
3300027882|Ga0209590_10008155All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira4742Open in IMG/M
3300028536|Ga0137415_10113334All Organisms → cellular organisms → Bacteria → Nitrospirae2559Open in IMG/M
3300031820|Ga0307473_10135653All Organisms → cellular organisms → Bacteria → Nitrospirae1376Open in IMG/M
3300031820|Ga0307473_11232030All Organisms → cellular organisms → Bacteria → Nitrospirae557Open in IMG/M
3300032163|Ga0315281_10912378All Organisms → cellular organisms → Bacteria → Nitrospirae897Open in IMG/M
3300032180|Ga0307471_100898753All Organisms → cellular organisms → Bacteria → Nitrospirae1053Open in IMG/M
3300032256|Ga0315271_10004860All Organisms → cellular organisms → Bacteria8612Open in IMG/M
3300032275|Ga0315270_10138164All Organisms → cellular organisms → Bacteria → Nitrospirae1457Open in IMG/M
3300034165|Ga0364942_0235210All Organisms → cellular organisms → Bacteria → Nitrospirae598Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil31.41%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil22.31%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil18.18%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil5.79%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment4.96%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.31%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment2.48%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.48%
Thermal SpringsEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs1.65%
FreshwaterEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater0.83%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment0.83%
WastewaterEnvironmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater0.83%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.83%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.83%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.83%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.83%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.83%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006174Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009444Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021357Freshwater microbial communities from subterranean cave lake in Wind Cave National Park, South Dakota, United States - WICALVC2017EnvironmentalOpen in IMG/M
3300022563OV2_combined assemblyEnvironmentalOpen in IMG/M
3300025173Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water (SPAdes)EnvironmentalOpen in IMG/M
3300025312Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 4 - CSP-I_5_4EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027835Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032163Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G07_0EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032256Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_topEnvironmentalOpen in IMG/M
3300032275Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C1_bottomEnvironmentalOpen in IMG/M
3300034165Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1003188133300002558Grasslands SoilAARLIALAFLLLAQTACMPNYRAFTKQADSLGAACNQAAAQFIVTSTTEARREVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGVQSR*
JGI25385J37094_1003792633300002558Grasslands SoilLIALAFLLLAQTACMPNYRAFTKQADSLGAVCNQAAAQFIVTPTMEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR*
JGI25383J37093_1003077013300002560Grasslands SoilAARLIALAFLLLAQTACMPNYRAFTKQADSLGAVCNQAAAQFIVTPTMEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR*
JGI25386J43895_1001992433300002912Grasslands SoilMPNYRAFTKQADSLGAVCNQAAAQFIVTPTMEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAQSR*
Ga0066674_1001716943300005166SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR*
Ga0066672_1086832113300005167SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR*
Ga0066677_1050255623300005171SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQ
Ga0066683_1039125713300005172SoilRALTNDASALGAACNQAAAQFAITLSPEARQEVLSQLTALNAALIETAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR*
Ga0066680_1002256543300005174SoilMPNYRAFTKQADSLGAVCNQAAAQFIVTPTMEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR*
Ga0066688_1005943833300005178SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDSAQSR*
Ga0066685_1048724613300005180SoilEGLWHNTAVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGICSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDGATSR*
Ga0066676_1017044723300005186SoilLWHNTAVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGICSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDGATSR*
Ga0070708_10089332723300005445Corn, Switchgrass And Miscanthus RhizosphereAAAQFAVTLTPEARRDVLSQLTALNDALIQTAGYERKARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYQGLLSRLTFPQFVAERRQIQLALDELDGAKNP*
Ga0066686_1007865923300005446SoilLWHNTAVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDGATSR*
Ga0070706_10152141523300005467Corn, Switchgrass And Miscanthus RhizosphereFAPTACAPNYRALTNEASALGAACNQAAAQFAVTLTPLARRDVLSQLTALNDALIQTAGYERKARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYQGLLSRLTFPQFVAERRQIQLALDELDGAKNP*
Ga0070699_10133243023300005518Corn, Switchgrass And Miscanthus RhizosphereAPTACAPNYRALTNEASALGAACNQAAAQFAVTLTPEARRDVLSQLTALNDALIQTAGYERKARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYQGLLSRLTFPQFVAERRQIQLALDELDGAKNP*
Ga0070697_10005617043300005536Corn, Switchgrass And Miscanthus RhizosphereLWHNTAVGKWAAARLIMFAILLFAPTACAPNYRALTNEASALGAACNQAAAQFAVTLTPEARREVLSQLTALNDALIQTAGYERKARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYQGLLSRLTFPQFVAERRQIQLALDELDGAKNP*
Ga0066697_1017085723300005540SoilVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDGATSR*
Ga0066697_1022202723300005540SoilMHEASALGAACNQATAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGGCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR*
Ga0066661_1009048823300005554SoilMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR*
Ga0066692_1066318313300005555SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR*
Ga0066707_1080216513300005556SoilVGSLPAVRLIALAVLLAAPTACTPNYRVLTQQADSLSAACTQAAAQFAVTPTMESRQEVLGKLKDLNDALIKTAGYERKARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDAARYNYEGLLSRLNA
Ga0066706_1031374523300005598SoilVGSLPAVRLIALAVLLAAPTACTPNYRVLTQQADSLSAACTQAAAQFAVTPTMESRQEVLGKLKDLNDALIKTAGYERKARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDAARYNYEGLLSRLDAPQFVAERRQVQKALDELDGAKSP*
Ga0066706_1117520523300005598SoilFAFLLCASTACAPNYRALMHEASALGAACNQATAQFAVTLTPEARREVLSQLKALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGGCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR*
Ga0075014_10023068523300006174WatershedsLRLLPAIGLILLALASCAPTALNLRALTKQAGSLGAACNQAASRFAAMPTAETRREALGQLKDLNEALIETAGYEQDARSTNSVELIDANRAFLETGRAWANCSLQYNRTLVATGERETARHNYQGVLVRLSGPQFIAERRLVQAALNELGPAPSSP*
Ga0066653_1011186133300006791SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLARLAAPQFVAERRQIQQALDELDGAPSR*
Ga0066658_1027655623300006794SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGGRDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR*
Ga0066659_1045267423300006797SoilMHEASALGAACNQATAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR*
Ga0099794_1009128313300007265Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAQSR*
Ga0066710_10045012643300009012Grasslands SoilVGSLPAARLIALAFLLLAPTACTPNYRALTQQADLLGAACTQAAAQFAVTPTTESRQEVLGKLKDLNDALIKTAGYERKARRTNSVDLIDANRAFRETGRAWGSCSLEYNRVLVAIGERDAARYNYEGLLSRLDAPQFVAERRQVQHALDELDGAKSP
Ga0066710_10058683533300009012Grasslands SoilVGSLPAARFIALAFLLLALTACTPNYRALTQQADSLGAACTQAAAQFAVTPTTESRQEVLGKLKDLNDALIKTAGYERKARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDAARYNYEGLLSRLNAPQFVAERRHVQEALDELDGAKSP
Ga0066710_10145734733300009012Grasslands SoilLAQTACMPNYRAFTKQADSLGAVCNQAAAQFIVTPTMEARREVLSKLKELNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR
Ga0066710_10268520923300009012Grasslands SoilLAQTACMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR
Ga0099829_1020971833300009038Vadose Zone SoilVGKWAAARLIAFAFLLCASTACAPNYHALTNETSALGAACNQATAHFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAAQHNYQGLLSRLTAPQFVAERRQIQQALDELDGAKS
Ga0099828_1013062133300009089Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKNLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGSQSR*
Ga0099828_1067154713300009089Vadose Zone SoilHSAKDQESVRTGREKIIPAARLIALAFLLLATTACMPNYRAFTKQADSLGAACNQAAAQFTVTPTTEARQEVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR*
Ga0099827_1000536763300009090Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFTVTPTTEARQEVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR*
Ga0099827_1028338423300009090Vadose Zone SoilVGKWAAARLIAFAFLLCASTACAPNYHALTNETSALGAACNQATAHFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAAQHNYQGLLSRLTAPQFVAERRQIQQALDELDGAKSR*
Ga0066709_10019374743300009137Grasslands SoilVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGICSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDGATSR*
Ga0066709_10033645833300009137Grasslands SoilMHEASALGAACNQATAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDIIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR*
Ga0066709_10060259723300009137Grasslands SoilLPAARFIALAFLLLALTACTPNYRALTQQADSLGAACTQAAAQFAVTPTTESRQEVLGKLKDLNDALIKTAGYERKARRTNSVDLIDANRAFRETGRAWGSCSLEYNRVLVAIGERDAARYNYEGLLSRLNAPQFVAERRHVQEALDELDGAKSP*
Ga0075423_1077076423300009162Populus RhizosphereSKRKQGDEGLWHNTAVGKWAAARLIMFAILLFAPTACAPNYRALTNEASALGAACNQAAAQFAVTLTPEARRDVLSQLTALNDALIQTAGYERKARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYQGLLSRLTFPQFVAERRQIQLALDELDGAKNP*
Ga0114945_1000022673300009444Thermal SpringsVGSCLLAVTSCAPTAPNLRALTKQASAVGSACHQSAARFAAAPTREARQEVLGKLTELNEALIQAAEYEHEARRSNSVDLVDANRAFLEAGRAWGRCSLQYNRVLVAIGERKAARQNYQGLLARLTGPQFVAERRQVQTALEELDR*
Ga0134080_1001610953300010333Grasslands SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR*
Ga0134080_1061329013300010333Grasslands SoilRGIWRHTSSKRKQGDEGLWHNTAVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGICSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDG
Ga0134063_1035237523300010335Grasslands SoilAPNYRALMHEASALGAACNQATAQFAVTLTPEARREVLSQLKALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGGCSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDGATSR*
Ga0134071_1006918113300010336Grasslands SoilVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVA
Ga0136847_1064106213300010391Freshwater SedimentMQKQRLRRCLLIPLSVCRLLTLGVVLLELTACAPSLPALTKQAVTFGAACDQAAARFAAAPGGETRQDVLGQLKELNAALIETARYEKEARRTNSVDLIDANRAFLETGRAWGTCSLKYNRALVAIGEREAARHNYQGL
Ga0137392_1015475953300011269Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFTVTPTTEARQEVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAQSR*
Ga0137391_1073264423300011270Vadose Zone SoilVFLLCASTACAPNYRALTNEASALGAACNQAAAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGSCSFEYNKVLVAIGERDAARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAKSR*
Ga0137393_1015816533300011271Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKNLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARRNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR*
Ga0137389_1026361833300012096Vadose Zone SoilMPNYRVFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANSAFLETGRGWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGSQSR*
Ga0137364_1025334023300012198Vadose Zone SoilMPNYHAFTKQADSLGATCNQAAAQFAVTPTTEARQEVLSKLKDLNDALIKTARYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGEWDTARHNYQGLLTRLTDPQFVAERRQIQQALDELDGALSRQ*
Ga0137374_10001725103300012204Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVVERRQIQQALDELDGAQSR*
Ga0137380_1002927763300012206Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGALSR*
Ga0137380_1005952933300012206Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSVEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVVERRQIQQALDELDGAQSR*
Ga0137381_1028056813300012207Vadose Zone SoilMHEASALGAACNQATAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLLSRLTAPQFVAERRQIQQALDELDGTKSR*
Ga0137381_1038188213300012207Vadose Zone SoilQFIVTPTTEARREVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVVERRQIQQALDELDGAQSR*
Ga0137370_1000539023300012285Vadose Zone SoilMPNYHAFTKQADSLGATCNQAAAQFAVTPTTEARQEVLSKLKDLNDALIKTARYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGEWDTARHNYQGLLARLTDPQFVAERRQIQQALDELDGALSRQ*
Ga0137372_1069808813300012350Vadose Zone SoilMKRATTLNAACNQAAARFAAAPIRETRLEVLGRLKELNAAMVETAEYEREARRSNSVDLVDANRAFLETGRAWGSCSLQYNRTLVAIGEVDAARYNYQGLLNRLAGPQFVSERRQIQTALHELER*
Ga0137386_1001320133300012351Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVVERRQIQQALDELDGAQSR*
Ga0137386_1026329133300012351Vadose Zone SoilNQATAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAKSR*
Ga0137369_1045526313300012355Vadose Zone SoilMKRATALSAACNQAAVRFAAAPIRETRLEVLGRLKELNAAMVETAEYEREARRSNSVDLVDANRAFLETGRAWGSCSLQYNRTLVAIGEVDAARYNYQGLLNRLAGPQFVSERRQIQTALHELER
Ga0137385_1003440643300012359Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDPARHNYQGLLSRLAAPQFVAERRQIQQALDELDGALSR*
Ga0137416_1009081123300012927Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFTVTPTTEARQEVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARRNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR*
Ga0134077_1006151233300012972Grasslands SoilMPNYRAFTKQADALGAVCNQAAAQFIVTPTMEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR*
Ga0134112_1042917113300017656Grasslands SoilEGLWHNTAVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRINSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDGATSR
Ga0134083_1000548733300017659Grasslands SoilMPNYRAFTKQADSLGAVCNQAAAQFIVTPTMEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLARLAAPQFVAERRQIQQALDELDGAPSR
Ga0184638_101938223300018052Groundwater SedimentMAACAPTPPTGSTDPNLRALTKQAISLGAACDTAAARFAAAPTKEMRREVLARLKELNAVLIEAAGYEREARRSNSVDLVDANRAFHETGRAWGGCSLKYNRMLVAIGEVDTARYNYQGLLDRLAGPQFVSERRQIQTALRELER
Ga0184626_1025562513300018053Groundwater SedimentDTAAARFAAAPTKEMRREVLARLKELNAVLIEAAGYEREARRSNSVDLVDANRAFHETGRAWGGCSLKYNRTLVAMGEVDTARYNYQGLLDRLAGPQFVSERRQIQTALRELER
Ga0184623_10000533183300018056Groundwater SedimentMAACAPTPPTGSTDPNLRALTKQAISLGAACDTAAARFAAAPTKEMRREVLARLKELNAVLIEAAGYEREARRSNSVDLVDANRAFHETGRAWGGCSLKYNRTLVAIGEVDTARYNYQGLLDRLTGPQFVSERRQIQTALRELER
Ga0184637_1010209723300018063Groundwater SedimentMRRMQKRRLRQCLLIPLSVCRLLTLGVVLLELTACAPSLPALTKQAVTFGAACDQAAARFAAAPGGETRQDVLGQLKELNAALIETARYEKEARRTNSVDLIDANRAFLETGRAWGTCSLQYNRALVAIGEREAARHNYQGLLARLTGPQFVAARRLVQAALNELESASSSP
Ga0184627_1007334513300018079Groundwater SedimentMRRMQKRRLRQCLLIPLSVCRLLTLGVVLLELTACAPSLPALTKQAVTFGAACDQAAARFAAAPGGETRQDVLGQLKELNAALIETARYEKEARRTNSVDLIDANRAFLETGRAWGTCSLQYNRALVAIGEREAARHNYQGLLARLTGPQFVAERRLVQAALNELESASSSP
Ga0184639_1009004743300018082Groundwater SedimentMRRMQKRRLRRCPLIPLSVCRLLTLGVVLLELTACAPSLPALTKQAVTFGAACDQAAARFAAAPGGETRQDVLGQLKELNAALIETARYEKEARRTNSVDLIDANRAFLETGRAWGTCSLKYNRALVAIGEREAARHNYQGLLARLTGPQFVAERRLVQAALNELESASSSP
Ga0066655_1130736823300018431Grasslands SoilASTACAPNYRALMHEASALGAACNQATAQFAVTLTPEARREVLSQLKALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAKSR
Ga0066662_1015284633300018468Grasslands SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR
Ga0066662_1134936913300018468Grasslands SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSR
Ga0213870_113700013300021357FreshwaterMRTVYAPTATTSRLLMYGFLLWGLAACAPSPRVLTEQAVSLGTACEQAVARFAAAPSRERRQNVLGQLRELNAVLIEAAEYEQQARRSNSIDLPDANRAFLETGRAWGACSLRYNKVLVAIGERGTARHNYQGLLARLTGPQFIAERHQIQAALDALERAERH
Ga0212128_10000315443300022563Thermal SpringsVGSCLLAVTSCAPTAPNLRALTKQASAVGSACHQSAARFAAAPTREARQEVLGKLTELNEALIQAAEYEHEARRSNSVDLVDANRAFLEAGRAWGRCSLQYNRVLVAIGERKAARQNYQGLLARLTGPQFVAERRQVQTALEELDR
Ga0209824_1010764913300025173WastewaterATCAPTAPNYRALTKQSAARGAVCAQAVARFAAAPGGETRQDILGQLKELNAALIETAEYEKEARRTNSVDLIDANRAFLETGRAWANCSLKYNKVLVAIGDRDVARHNYMGLLVRLTGPQFVSERRLVQAALNELGPAPSLP
Ga0209321_1053506223300025312SoilASILSVPNYRALTKQAAARGAACGQAVARFAAAPGGEMRQDVLERLKELNAALIETAGYEQEARRENSVDLIDANRAFLETGRAWTNCSLKYNRVLAAIGERDAARHNYTGLLARLEGPQFVTERRRIQTALEELGR
Ga0209235_100463163300026296Grasslands SoilMPNYRAFTKQADSLGAACNQAAAQFIVTSTTEARREVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGVQSR
Ga0209235_105688813300026296Grasslands SoilRTGREKIIPAARLIALAFLLLAQTACMPNYRAFTKQADSLGAVCNQAAAQFIVTPTMEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR
Ga0209235_114810323300026296Grasslands SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAQSR
Ga0209237_100095793300026297Grasslands SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGVQSR
Ga0209237_101655983300026297Grasslands SoilMPNYRAFTKQADSLGAVCNQAAAQFIVTPTMEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR
Ga0209236_105852813300026298Grasslands SoilRTGREKIIPAARLIALAFLLLAQTACMPNYRAFTKQADSLGAACNQAAAQFIVTSTTEARREVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGVQSR
Ga0209055_100279693300026309SoilMHEASALGAACNQATAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR
Ga0209055_104985633300026309SoilMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR
Ga0209761_107754313300026313Grasslands SoilSLGAVCNQAAAQFIVTPTMEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR
Ga0209761_123863623300026313Grasslands SoilIIPAARLIALAFLLLAQTACMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAQIR
Ga0209471_104656913300026318SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVA
Ga0209470_104454933300026324SoilNYRALTHEASALGAACNQATAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAKSR
Ga0209152_1004377923300026325SoilVGKWAAARLIAFAFLLCASTACAPNYRALTNEASALGTACNQATAQFAVTLTTEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFMETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR
Ga0209801_1000635213300026326SoilMHEASALGAACNQATAQFAVTLTPEARREVLSQLKALNDALIETAGYESEARRTNSVDLIDANRAFLETGRAWGGCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR
Ga0209266_100143363300026327SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR
Ga0209267_104474633300026331SoilMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDSAQSR
Ga0209803_124692523300026332SoilSTACAPNYRALMHEASALGAACNQATAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR
Ga0209158_1002510133300026333SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR
Ga0209804_118866413300026335SoilAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR
Ga0209057_102512623300026342SoilMHEASALGAACNQATAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGGCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR
Ga0209160_105228553300026532SoilMPNYRAFTKQADSLGAACNQAAAQFTVTPTTEARQEVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR
Ga0209160_129649623300026532SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVA
Ga0209058_112191423300026536SoilVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDGATSR
Ga0209056_1002963913300026538SoilVLTQQADSLSAACTQAAAQFAVTPTMESRQEVLGKLKDLNDALIKTAGYERKARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDAARYNYEGLLSRLDAPQFVAERRQVQKALDELDGAKSP
Ga0209376_117759523300026540SoilLWHNTAVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACIQAAAQFAVTLTPEARREVLNQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGICSLEYNKVLVAIGERDAARHNYLGLLSRLTFPQFVAERRQIQLALDELDGATSR
Ga0209161_1001251433300026548SoilMHEASALGAACNQATAQFAVTLTPEARREVLSQLKALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGGCSLEYNKVLVAIGERDAARHNYQGLMSRLTAPQFVAERRQIQLALDELDGAKSR
Ga0209161_1001691933300026548SoilMPNYRAFTKQADSLGAVCNQAAAQFIVTPTMEARREVLNKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRALVAIGERDSARHNYQGLLSRLAAPQFVAERRQIQQALDELDGAPSR
Ga0209577_1003013923300026552SoilVGRKKIIPAARLIALAFLLVAPTACMPNYRAFMKQADSLGAACNQAAAQFAVTPTTEARQEALSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSLEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR
Ga0209515_1006728343300027835GroundwaterALTKQAMTFGAACDQAAARFAAAPGGETRQDVLGQLKELNAALIETAGYEKEARRTNSVDLIDANRAFLETGRAWGTCSLKYNRALVAIGEREAARHNYQGLLARLTGPQFVAERRLVQAALNELESASFSP
Ga0209180_1020190733300027846Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFIVTPTTEARREVLSKLKDLNDALIRTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLTAPQFVAERRQIQQALDELDGAQSR
Ga0209701_1021678033300027862Vadose Zone SoilLWHNTAVGKWAAARLIAFAFLLCVSTACAPNYRALRNEASALGAACNQATAQFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVNLIDANRAFLETGRAWGSCSLEYNKVLVAIGERDAARHNYQGLLSRLTAPQFVAELRQIQQALDELDGAKSR
Ga0209590_1000815593300027882Vadose Zone SoilMHCRRRGIWQPTSSKRKHGGEGLWHNTAVGKWAAARLIAFAFLLCASTACAPNYHALTNETSALGAACNQATAHFAVTLTPEARREVLSQLTALNDALIETAGYEREARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAAQHNYQGLLSRLTAPQFVAERRQIQQALDELDGAKSR
Ga0137415_1011333433300028536Vadose Zone SoilMPNYRAFTKQADSLGAACNQAAAQFTVTPTTEARQEVLSKLKDLNDALIKTAGYEREARRTNSVDLIDANRAFLETGRAWGSCSIEYNRVLVAIGERDTARHNYQGLLSRLAAPQFVAERRQIQQALDELDGVQSR
Ga0307473_1013565323300031820Hardwood Forest SoilLFLLSVVSCAPPAENYRTLTKQAGTLGAACNEAAVRFAAEPTKEMRREVLAKLKELNAALIEAAGHEREARRSNSVDLVDANRAFHETGRAWGGCSLKYNRTLVAIGEVDAARYNYQGLLNRLAGPQFVSERRQIQAALQEIER
Ga0307473_1123203013300031820Hardwood Forest SoilLTNEASALGAACNQAAAQFAVTLTPEARRDVLSQLTALNDALIQTAGYERKARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYQGLLSRLTFPQFVAERRQIQLALDELDGAKNP
Ga0315281_1091237813300032163SedimentMIGLVLLALASCAPTAPNLRALTKQAGSLGAACSQAASRFVVTPTAEARREVLGKLKDLNEALVKTAGYEQEARRTNSVDLIDANRAFLETGRAWTSCSLKYNKVLMAAGEHEAARHNYQGLLARLTGPQFVAERHLVQAAL
Ga0307471_10089875323300032180Hardwood Forest SoilLWHNTAVGKWAAARLIVFAFLLFAPTACAPNYRALTNEASALGAACNQAAAQFAVTLTPEARRDVLSQLTALNDALIQTAGYERKARRTNSVDLIDANRAFLETGRAWGNCSLEYNKVLVAIGERDAARHNYQGLLSRLTFPQFVAERRQIQLALDELDGAKNP
Ga0315271_1000486043300032256SedimentMIGLVLLALASCAPTAPNLRALTKQAGSLGAACSQAASRFVVTPTAEARREVLGKLTDLNEALVKTAGYEQEARRTNSVDLIDANRAFLETGRAWTSCSLKYNKVLMAAGEHEAARHNYQGLLARLTGPQFVAERRLVQAALNELGPAPSSP
Ga0315270_1013816423300032275SedimentMIGLVLLALASCAPTAPNLRALTKQAGSLGAACSQAASRFVVTPTAEARREVLGKLKDLNEALVKTAGYEQEARRTNSVDLIDANRAFLETGRAWTSCSLKYNKVLMAAGEHEAARHNYQGLLARLTGPQFVAERRLVQAALNELGPAPSSP
Ga0364942_0235210_76_4563300034165SedimentVTFGAACDQAAARFAAAPGGETRQDVLGQLKELNAALIETARYEKEARRTNSVDLIDANRAFLETGRAWGTCSLKYNRALVAIGEREAARHNYQGLLARLTGPQFVAERRLVQAALNELESASSSP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.