NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F088592

Metagenome / Metatranscriptome Family F088592

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F088592
Family Type Metagenome / Metatranscriptome
Number of Sequences 109
Average Sequence Length 119 residues
Representative Sequence MNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRNLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Number of Associated Samples 96
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 88.99 %
% of genes near scaffold ends (potentially truncated) 41.28 %
% of genes from short scaffolds (< 2000 bps) 75.23 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(40.367 % of family members)
Environment Ontology (ENVO) Unclassified
(42.202 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(50.459 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 71.09%    β-sheet: 0.00%    Coil/Unstructured: 28.91%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 109 Family Scaffolds
PF00440TetR_N 23.85
PF02776TPP_enzyme_N 12.84
PF12680SnoaL_2 7.34
PF11066DUF2867 6.42
PF00392GntR 1.83
PF00072Response_reg 0.92
PF14520HHH_5 0.92
PF01804Penicil_amidase 0.92
PF04392ABC_sub_bind 0.92
PF01436NHL 0.92
PF13610DDE_Tnp_IS240 0.92
PF00848Ring_hydroxyl_A 0.92
PF00005ABC_tran 0.92
PF00067p450 0.92
PF01066CDP-OH_P_transf 0.92
PF13450NAD_binding_8 0.92
PF13683rve_3 0.92
PF02518HATPase_c 0.92
PF01370Epimerase 0.92
PF01814Hemerythrin 0.92
PF13335Mg_chelatase_C 0.92

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 109 Family Scaffolds
COG4638Phenylpropionate dioxygenase or related ring-hydroxylating dioxygenase, large terminal subunitInorganic ion transport and metabolism [P] 1.83
COG0558Phosphatidylglycerophosphate synthaseLipid transport and metabolism [I] 0.92
COG1183Phosphatidylserine synthaseLipid transport and metabolism [I] 0.92
COG2124Cytochrome P450Defense mechanisms [V] 0.92
COG2366Acyl-homoserine lactone (AHL) acylase PvdQSecondary metabolites biosynthesis, transport and catabolism [Q] 0.92
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 0.92
COG5050sn-1,2-diacylglycerol ethanolamine- and cholinephosphotranferasesLipid transport and metabolism [I] 0.92


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005167|Ga0066672_10167036All Organisms → cellular organisms → Bacteria1388Open in IMG/M
3300005171|Ga0066677_10022484All Organisms → cellular organisms → Bacteria2932Open in IMG/M
3300005172|Ga0066683_10090208All Organisms → cellular organisms → Bacteria1853Open in IMG/M
3300005174|Ga0066680_10557461All Organisms → cellular organisms → Bacteria720Open in IMG/M
3300005177|Ga0066690_10200520All Organisms → cellular organisms → Bacteria1328Open in IMG/M
3300005177|Ga0066690_10965678All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300005178|Ga0066688_10082195All Organisms → cellular organisms → Bacteria1936Open in IMG/M
3300005187|Ga0066675_10160180All Organisms → cellular organisms → Bacteria1552Open in IMG/M
3300005436|Ga0070713_100583913All Organisms → cellular organisms → Bacteria1060Open in IMG/M
3300005445|Ga0070708_100810597All Organisms → cellular organisms → Bacteria880Open in IMG/M
3300005450|Ga0066682_10043578All Organisms → cellular organisms → Bacteria2698Open in IMG/M
3300005467|Ga0070706_100045983All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia4030Open in IMG/M
3300005468|Ga0070707_100149408All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria2275Open in IMG/M
3300005471|Ga0070698_100010796All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia9719Open in IMG/M
3300005536|Ga0070697_100036590All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae3965Open in IMG/M
3300005540|Ga0066697_10049383All Organisms → cellular organisms → Bacteria2381Open in IMG/M
3300005552|Ga0066701_10289404All Organisms → cellular organisms → Bacteria1015Open in IMG/M
3300005557|Ga0066704_10945367All Organisms → cellular organisms → Bacteria532Open in IMG/M
3300005561|Ga0066699_10913677All Organisms → cellular organisms → Bacteria612Open in IMG/M
3300005574|Ga0066694_10522455All Organisms → cellular organisms → Bacteria553Open in IMG/M
3300005598|Ga0066706_11199538All Organisms → cellular organisms → Bacteria576Open in IMG/M
3300006034|Ga0066656_10932312All Organisms → cellular organisms → Bacteria556Open in IMG/M
3300006796|Ga0066665_11073387All Organisms → cellular organisms → Bacteria614Open in IMG/M
3300006844|Ga0075428_100306304All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales1708Open in IMG/M
3300007258|Ga0099793_10198623All Organisms → cellular organisms → Bacteria961Open in IMG/M
3300007265|Ga0099794_10244374All Organisms → cellular organisms → Bacteria925Open in IMG/M
3300009012|Ga0066710_100188811All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2919Open in IMG/M
3300009012|Ga0066710_100660191All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1592Open in IMG/M
3300009038|Ga0099829_10358703All Organisms → cellular organisms → Bacteria1201Open in IMG/M
3300009038|Ga0099829_10754534All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria808Open in IMG/M
3300009038|Ga0099829_11368205All Organisms → cellular organisms → Bacteria585Open in IMG/M
3300009038|Ga0099829_11467146All Organisms → cellular organisms → Bacteria563Open in IMG/M
3300009088|Ga0099830_10476609All Organisms → cellular organisms → Bacteria1016Open in IMG/M
3300009088|Ga0099830_10941140All Organisms → cellular organisms → Bacteria715Open in IMG/M
3300009090|Ga0099827_10088452All Organisms → cellular organisms → Bacteria2436Open in IMG/M
3300009147|Ga0114129_10966570All Organisms → cellular organisms → Bacteria1075Open in IMG/M
3300009810|Ga0105088_1013249All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1242Open in IMG/M
3300009816|Ga0105076_1002511All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia2713Open in IMG/M
3300009819|Ga0105087_1037120All Organisms → cellular organisms → Bacteria760Open in IMG/M
3300010154|Ga0127503_10894016All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia923Open in IMG/M
3300010320|Ga0134109_10164621All Organisms → cellular organisms → Bacteria804Open in IMG/M
3300010336|Ga0134071_10100085All Organisms → cellular organisms → Bacteria1375Open in IMG/M
3300011269|Ga0137392_10004841All Organisms → cellular organisms → Bacteria8403Open in IMG/M
3300011270|Ga0137391_11275331All Organisms → cellular organisms → Bacteria582Open in IMG/M
3300011270|Ga0137391_11338444All Organisms → cellular organisms → Bacteria563Open in IMG/M
3300012199|Ga0137383_10101011All Organisms → cellular organisms → Bacteria2095Open in IMG/M
3300012201|Ga0137365_10012195All Organisms → cellular organisms → Bacteria6780Open in IMG/M
3300012202|Ga0137363_10147941All Organisms → cellular organisms → Bacteria1842Open in IMG/M
3300012203|Ga0137399_10376095All Organisms → cellular organisms → Bacteria1182Open in IMG/M
3300012204|Ga0137374_10030488All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia5904Open in IMG/M
3300012204|Ga0137374_10031757All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia5763Open in IMG/M
3300012205|Ga0137362_10273628All Organisms → cellular organisms → Bacteria1460Open in IMG/M
3300012207|Ga0137381_10109994All Organisms → cellular organisms → Bacteria2344Open in IMG/M
3300012207|Ga0137381_11339000All Organisms → cellular organisms → Bacteria609Open in IMG/M
3300012211|Ga0137377_11216418All Organisms → cellular organisms → Bacteria683Open in IMG/M
3300012350|Ga0137372_10115977All Organisms → cellular organisms → Bacteria2225Open in IMG/M
3300012353|Ga0137367_10155992All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia1664Open in IMG/M
3300012355|Ga0137369_10317295All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1150Open in IMG/M
3300012357|Ga0137384_11474786All Organisms → cellular organisms → Bacteria529Open in IMG/M
3300012359|Ga0137385_10348204All Organisms → cellular organisms → Bacteria1268Open in IMG/M
3300012360|Ga0137375_10016129All Organisms → cellular organisms → Bacteria8761Open in IMG/M
3300012360|Ga0137375_10037487All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia5422Open in IMG/M
3300012362|Ga0137361_10165719All Organisms → cellular organisms → Bacteria1985Open in IMG/M
3300012362|Ga0137361_11946350All Organisms → cellular organisms → Bacteria504Open in IMG/M
3300012582|Ga0137358_10909081All Organisms → cellular organisms → Bacteria576Open in IMG/M
3300012917|Ga0137395_10445047All Organisms → cellular organisms → Bacteria930Open in IMG/M
3300012923|Ga0137359_10120154All Organisms → cellular organisms → Bacteria2331Open in IMG/M
3300012927|Ga0137416_10243523All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1465Open in IMG/M
3300012929|Ga0137404_10489650All Organisms → cellular organisms → Bacteria1096Open in IMG/M
3300012944|Ga0137410_10231899All Organisms → cellular organisms → Bacteria1440Open in IMG/M
3300012972|Ga0134077_10216999All Organisms → cellular organisms → Bacteria783Open in IMG/M
3300012976|Ga0134076_10019323All Organisms → cellular organisms → Bacteria2403Open in IMG/M
3300017654|Ga0134069_1221764All Organisms → cellular organisms → Bacteria651Open in IMG/M
3300018053|Ga0184626_10053617All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1690Open in IMG/M
3300018076|Ga0184609_10558373All Organisms → cellular organisms → Bacteria516Open in IMG/M
3300018433|Ga0066667_10086835All Organisms → cellular organisms → Bacteria2035Open in IMG/M
3300018482|Ga0066669_10077998All Organisms → cellular organisms → Bacteria2218Open in IMG/M
3300025910|Ga0207684_10523667All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1015Open in IMG/M
3300025922|Ga0207646_10582153All Organisms → cellular organisms → Bacteria1005Open in IMG/M
3300025928|Ga0207700_10575902All Organisms → cellular organisms → Bacteria1001Open in IMG/M
3300026301|Ga0209238_1154434All Organisms → cellular organisms → Bacteria695Open in IMG/M
3300026309|Ga0209055_1060551All Organisms → cellular organisms → Bacteria1602Open in IMG/M
3300026310|Ga0209239_1121917All Organisms → cellular organisms → Bacteria1079Open in IMG/M
3300026315|Ga0209686_1145671All Organisms → cellular organisms → Bacteria736Open in IMG/M
3300026326|Ga0209801_1014172All Organisms → cellular organisms → Bacteria → Proteobacteria4032Open in IMG/M
3300026342|Ga0209057_1128603All Organisms → cellular organisms → Bacteria918Open in IMG/M
3300026360|Ga0257173_1014200All Organisms → cellular organisms → Bacteria944Open in IMG/M
3300026469|Ga0257169_1037664All Organisms → cellular organisms → Bacteria738Open in IMG/M
3300026498|Ga0257156_1098853All Organisms → cellular organisms → Bacteria607Open in IMG/M
3300026507|Ga0257165_1003301All Organisms → cellular organisms → Bacteria2182Open in IMG/M
3300026529|Ga0209806_1063935All Organisms → cellular organisms → Bacteria1678Open in IMG/M
3300026532|Ga0209160_1114957All Organisms → cellular organisms → Bacteria1334Open in IMG/M
3300026536|Ga0209058_1296613All Organisms → cellular organisms → Bacteria556Open in IMG/M
3300027511|Ga0209843_1000986All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria6590Open in IMG/M
3300027846|Ga0209180_10145011All Organisms → cellular organisms → Bacteria1367Open in IMG/M
3300027846|Ga0209180_10191207All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1181Open in IMG/M
3300027862|Ga0209701_10086360All Organisms → cellular organisms → Bacteria1973Open in IMG/M
3300027862|Ga0209701_10139206All Organisms → cellular organisms → Bacteria1490Open in IMG/M
3300027875|Ga0209283_10125165All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1696Open in IMG/M
3300027882|Ga0209590_10410474All Organisms → cellular organisms → Bacteria875Open in IMG/M
3300027949|Ga0209860_1010151All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1281Open in IMG/M
3300028041|Ga0247719_1054067All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1085Open in IMG/M
3300028536|Ga0137415_10063881All Organisms → cellular organisms → Bacteria3546Open in IMG/M
3300028673|Ga0257175_1111601All Organisms → cellular organisms → Bacteria541Open in IMG/M
(restricted) 3300031197|Ga0255310_10158171All Organisms → cellular organisms → Bacteria625Open in IMG/M
(restricted) 3300031248|Ga0255312_1097755All Organisms → cellular organisms → Bacteria716Open in IMG/M
3300032180|Ga0307471_102410871All Organisms → cellular organisms → Bacteria665Open in IMG/M
3300033407|Ga0214472_10253408All Organisms → cellular organisms → Bacteria1687Open in IMG/M
3300034178|Ga0364934_0049597All Organisms → cellular organisms → Bacteria1551Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil40.37%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil20.18%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere8.26%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.50%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil4.59%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.59%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand4.59%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.83%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.83%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.83%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.83%
SoilEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil0.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.92%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.92%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.92%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009810Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30EnvironmentalOpen in IMG/M
3300009816Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10EnvironmentalOpen in IMG/M
3300009819Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_40_50EnvironmentalOpen in IMG/M
3300010154Soil microbial communities from Willow Creek, Wisconsin, USA - WC-WI-TBF metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300026498Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-AEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300027511Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027949Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300028041Soil microbial communities from hillslope of Landscape Evolution Observatory, University of Arizona, Oracle, AZ, United States - 4-1-E_DEnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
Ga0066672_1016703623300005167SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0066677_1002248433300005171SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITILCWLNT*
Ga0066683_1009020823300005172SoilMNVGLVIAGSLCLVLAGGHTLVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0066680_1055746113300005174SoilSVRLSRASAFLWPAMSPALRYRGPNEALQLGRCVCDMDRHGAAIRAGGVRRMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0066690_1020052013300005177SoilMNIGLVVAGSLCLVLAGGHTFVGRRLLDRQPRHLDPDGALTRGAVVFTWYALSVMLTTMGAVLIALARGTHAHDRGEVVFLVGAAYAAAT
Ga0066690_1096567813300005177SoilFLWPAMSPALRYRGPNEALQLGRCVCDMDRHGAAIRAGGVRRMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0066688_1008219523300005178SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWCSRSSSPSCAG*
Ga0066675_1016018013300005187SoilMNVGLVIAGSLCLVLAGGHTLVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIAITVLCWLNT*
Ga0070713_10058391323300005436Corn, Switchgrass And Miscanthus RhizosphereMNVELVIAGSLCLVLAGGHTLTGRLVLDRLPRNFQPTRFGDGAHTRGLLVFTWHALSLMLTTTGVVLIAIAQREPADERGAVVFLVGAAYAAAALLLAWMTRKRPSDLLRIPVWAPFIVIIVLCWLNT*
Ga0070708_10081059723300005445Corn, Switchgrass And Miscanthus RhizosphereMNVGLVIAGSLCLVLAGGHTLVGRLVLDRLPRDLDPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0066682_1004357823300005450SoilMNVGLVIAGSLCLVLAGGHTLVGRLVLDRPPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIAITVLCWLNT*
Ga0070706_10004598343300005467Corn, Switchgrass And Miscanthus RhizosphereMNVGLVIAGSLCLVLALGHTLVGRGVLDSLPRTLQPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0070707_10014940823300005468Corn, Switchgrass And Miscanthus RhizosphereMNLGLVVAGSLCLVLAGGHTVVGRRVLDRLPRHLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0070698_10001079633300005471Corn, Switchgrass And Miscanthus RhizosphereMNVELVIAGSLCLVLAGGHTLTGRLVLDRLPRNFQPTRFGDGAHTRGLFVFTWHALSLMLTTTGVVLIAIAQREPADERGAVVFLVGAAYAAAALLLAWMTRKRPSDLLRIPVWAPFIVIIVLCWLNT*
Ga0070697_10003659013300005536Corn, Switchgrass And Miscanthus RhizosphereMNLGLVVAGSLCLVLAGGHTVVGRRVLDRLPRHLDPTRFGDGALTRGAVVYTWHGLGLMLTTTGAILIALASSAPADDPSGVLLLVGAAYAAATVLLVWRSRRRPSDLLQAP
Ga0066697_1004938323300005540SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIAITVLCWLNT*
Ga0066701_1028940413300005552SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDL
Ga0066704_1094536723300005557SoilMNIGLGIAGSLCLVLAGGHAFVGRAVLDLLPRNLPATRFGDGAFTRGLFVFTWHALTVMQITTGAVLLALSRGEPADERGQVVFLVGAAYAAATLLLVWRTRRRPSDLVRAPLWAPFIAITILCWLYR*
Ga0066699_1091367723300005561SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIAITVLCWL
Ga0066694_1052245523300005574SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSR
Ga0066706_1119953813300005598SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAW
Ga0066656_1093231213300006034SoilNEALQLGRCVCDMDRHGAAIRAGGVRRMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0066665_1107338723300006796SoilVGRAVLDLLPRNLPATRFGDGAFTRGLFVFTWHALTVMQITTGAVLLALSRGEPADERGQVVFLVGAAYAAATLLLVWRTRRRPSDLVRAPLWAPFIAITILCWLYR*
Ga0075428_10030630423300006844Populus RhizosphereMNIGLVIAGSLCLVLAIGHTLVGLRVVDRFARTFQPTQFGDGALTRGAVVFTWYALSVMLTTTGAILLALAQGEPADDRGEVVLLIGASYAAATVLLALRNRRRPSDLLRTPVWALMIVITVLCWLNT*
Ga0099793_1019862323300007258Vadose Zone SoilMNIGLVVAGSLCLVLAGGHTFVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0099794_1024437423300007265Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRAPVWVLQIVITVLCWLNT*
Ga0066710_10018881123300009012Grasslands SoilVGRAVLDLLPRNLPATRFGDGAFTRGLFVFTWHALTVMQITTGAVLLALSRGEPADERGQVVFLVGAAYAAATLLLVWRTRRRPSDLVRAPLWAPFIAITILCWLYR
Ga0066710_10066019123300009012Grasslands SoilMNVGLVIAGSLCLVLAGGHTFVGRRLLDRQPRHLDPDGALTRGAVVFTWYALSVMLTTMGAVLIALARGTHAHDRGEVVFLVGAGYAAATVLLIWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0099829_1035870323300009038Vadose Zone SoilLVLAGGHTFVGRRVLDRLPRNLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGKHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRAPVWVLQIVITVLCWLNT*
Ga0099829_1075453413300009038Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGARTRGAVVFTWYALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAAT
Ga0099829_1136820513300009038Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTLVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0099829_1146714613300009038Vadose Zone SoilMNVGLVLAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAAT
Ga0099830_1047660933300009088Vadose Zone SoilMNVGLVLAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAA
Ga0099830_1094114023300009088Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGARTRGAVVFTWYALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0099827_1008845213300009090Vadose Zone SoilMNLGLVIAGSLCLVLAGGHTFVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQ
Ga0114129_1096657013300009147Populus RhizosphereMNVGLVVAGSLCLVLAGGHTLVGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWHTLSLMLTTAGVVLIALAQRQPADQRGAVVFLVGAAYAAAAVLLAWMSRKRPSDLLRIPVWAPFIVIVILCWLNT*
Ga0105088_101324913300009810Groundwater SandMNVGLVVAGSLCLVLAGGHTLVGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWYALSLMLTTTGLVLIALAQREPADGRGAVVFLVGAAYAAAVLLLAWMSRKRPSDLLRIPVWAPL
Ga0105076_100251143300009816Groundwater SandMNVGLVVAGSLCLVLAGGHTLVGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWYALSLMLTTTGLVLIALAQREPADGRGAVVFLVGAAYAAAVLLLAWMSRKRPSDLLRIPVWAPLIVVIVLCWLNT*
Ga0105087_103712023300009819Groundwater SandMNVGLVVAGSLCLVLAGGHTLVGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWHALTLWLTTSGAVLIALARAEPADHRGLVVFLVGAAY
Ga0127503_1089401633300010154SoilMNLGLVIAGSLCFVLAAGHTFVGRLVLDRLPRNFQPTRFGDGAYTRGLVRFTWHALSLMLTTTGAVLIALAQGESADDRGEVVFLIGTAYAATAVVLAWMTRRRPSDLLRIPVWAPFIAITVLCWLNT*
Ga0134109_1016462113300010320Grasslands SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTQFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGA
Ga0134071_1010008513300010336Grasslands SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSD
Ga0137392_1000484113300011269Vadose Zone SoilMNVGLVLAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLL
Ga0137391_1127533113300011270Vadose Zone SoilMNLGLVVAGSLCLVLAGGHTFVGRQVLDRLPRKLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLV
Ga0137391_1133844413300011270Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGTRTRGAVVFTWYALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAAT
Ga0137383_1010101123300012199Vadose Zone SoilVLAGGHTLVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVAAAYAAATVLRAWRSRRRPSDLLRTPVWVLQIAITVLCWLNT*
Ga0137365_1001219583300012201Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTIVGRRVLDRLPRILDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRAPVWVLQIVITVLCWLNT*
Ga0137363_1014794123300012202Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTLVGRRVLDRLPRNLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRAPVWVLQIVITVLCWLNT*
Ga0137399_1037609523300012203Vadose Zone SoilMNVGLVVAGSLCLVLAGGHTFVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0137374_1003048833300012204Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTLTGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWHALSLMLTTTGVVLIALAQREPADERGTVVFLVGAAYAAAALLLAWMTRKRPSDLLRIPVWAPFIVVIVLCWLNT*
Ga0137374_10031757123300012204Vadose Zone SoilMNVGLVVAGSLCLVLAAGHTLTGRLVLDRLPRNFLPTRFGDGAYTRGLVRFTWHALSLMLTTTGVVLIALAQRAPADGRGAVVFLVGAAYAAAALLLAWMTRKRPSDLLRIPVLAPYIVIIVLCWLNT*
Ga0137362_1027362813300012205Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAIVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0137381_1010999443300012207Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTIVGRRVLDRLPRHLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRAPVWVLQIVITVLCWLNT*
Ga0137381_1133900013300012207Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQI
Ga0137377_1121641823300012211Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTIVGRRVLDRLPRILDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGA
Ga0137372_1011597723300012350Vadose Zone SoilMNVELVIAGSLCLVLAGGHTLTGRLVLDRLPRNFQPTRFGDGAHTRGLLVFTWHALSLMLTTTGVVLIALAQREPADERGTVVFLVGAAYAAAALLLAWMTRKRPSDLLRIPVWAPFIVIIVLCWLNT*
Ga0137367_1015599223300012353Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTLTGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWHALSLMLTTTGVVLIAIAQREPADERGAVVFLVGAAYAAAALLLAWMTRKRPSDLLRIPVWAPFIVVIVLCWLNT*
Ga0137369_1031729523300012355Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTLIGRLVLDRLPRNFQPTRFGGGAYTRGLLVFTWHTLSLMLITTGVVLIALAQREPADERGALVFLVGAAYAATALLLAWMTRKRPSDLLRIPVWAPFIVIVVLCWLNT*
Ga0137384_1147478613300012357Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVVTVLCWLNT*
Ga0137385_1034820413300012359Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTIVGRRVLDRLPRRLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRAPVWVLQIVITVLCWLNT*
Ga0137375_10016129143300012360Vadose Zone SoilMNLGLVVAGSLSFVLAAGHALVGRAVLDSLPRNLPATRFGDRAFTRGLFVLTWHALTLWLTTAGAVLIALARSEPTDDRGLVVFLVGAAYIAATVLMAWRSPGGHQIYSAYRYGSSSSSSSDSAG*
Ga0137375_1003748773300012360Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTLTGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWHALSLMLTTTGVVLIALAQREPADERGTVVFLVGAAYAAAAL
Ga0137361_1016571913300012362Vadose Zone SoilMNVGLVIASSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIAITVLCWLNT*
Ga0137361_1194635013300012362Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTFVGRRVLDRLPRSLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGEHAHDRGE
Ga0137358_1090908113300012582Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRNLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0137395_1044504723300012917Vadose Zone SoilMNLGLVIAGSLCLVLAGGHTLVGRRVLDRLPRHLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGKHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRAPVWVLQIVITVLCWLNT*
Ga0137359_1012015423300012923Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0137416_1024352313300012927Vadose Zone SoilMNVGLVVAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRAPVWVLQIVITVLCWLNT*
Ga0137404_1048965023300012929Vadose Zone SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRAPVWVLQIVITVLCWLNT*
Ga0137410_1023189913300012944Vadose Zone SoilMNVELVIAGSLCLVLAGGHTLAGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT*
Ga0134077_1021699923300012972Grasslands SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIGI
Ga0134076_1001932313300012976Grasslands SoilLVLAGGHAFVGRAVLDLLPRNLPATRFGDGAFTRGLFVFTWHALTVMQITTGAVLLALSRGEPADERGQVVFLVGAAYAAATLL
Ga0134069_122176423300017654Grasslands SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTSHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIAITVLCWLNT
Ga0184626_1005361713300018053Groundwater SedimentMNVGLVVAGSLCLVLAGGHTLVGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWHALTLMLTTTGVVLIALAQREPAEERGAVVVLVGAAYAAAALLLAWMSRKRPSDLLRIPVWAPFIVIVVLCWLNT
Ga0184609_1055837313300018076Groundwater SedimentMNLRLVIAGSFCFVLAAGHTFVGRLVLDRLPGNFQPTRFGDGAYTRGLVRFTWHALSLMLTMTGAVLIALAQGAPADDRGQVVFLLGTVYAAAAVVLAWMTRRRPSDLLRIPVWAPFIAIIVLCWLNT
Ga0066667_1008683543300018433Grasslands SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0066669_1007799813300018482Grasslands SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEAGFLVGAACAAATVLLPRRSRRWPSDWRRTPVEVRQIARTLQRWLNT
Ga0207684_1052366713300025910Corn, Switchgrass And Miscanthus RhizosphereMNVGLVIAGSLCLVLALGHTLVGRGVLDSLPRTLQPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0207646_1058215313300025922Corn, Switchgrass And Miscanthus RhizosphereLAGGHTVVGRRVLDRLPRHLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0207700_1057590223300025928Corn, Switchgrass And Miscanthus RhizosphereMNVELVIAGSLCLVLAGGHTLTGRLVLDRLPRNFQPTRFGDGAHTRGLLVFTWHALSLMLTTTGVVLIAIAQREPADERGAVVFLVGAAYAAAALLLAWMTRKRPSDLLRIPVWAPFIVIIVLCWLNT
Ga0209238_115443413300026301Grasslands SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLIWRSRRRPSDLLRTPVWVLQIAI
Ga0209055_106055133300026309SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLL
Ga0209239_112191713300026310Grasslands SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTGFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0209686_114567123300026315SoilAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITILCWLNT
Ga0209801_101417213300026326SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWLLQIVITVLCWLNT
Ga0209057_112860313300026342SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIAITVLCWLNT
Ga0257173_101420013300026360SoilMNVGLVLAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0257169_103766423300026469SoilMNVGLVIAGSLCLVLAGGHTLVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLVWRSRR
Ga0257156_109885323300026498SoilMNVGLVIAGSLCLVLAGGHTVVGRLVLDRLPRHLDPTRFGDGALTRGAVVFTWYALSLMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0257165_100330123300026507SoilMNVGLVIAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGGHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0209806_106393533300026529SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRR
Ga0209160_111495723300026532SoilMNVGLVIAGSLCLVLAGGHTFVGRLVLDRLPRDLHPTGFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWCSRSSSPSCAG
Ga0209058_129661323300026536SoilVLAGGHTFVGRLVLDRLPRDLHPTRFGDGALTRGAVVFTWHALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0209843_1000986133300027511Groundwater SandMNVGLVVAGSLCLVLAGGHTLVGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWYALSLMLTTTGLVLIALAQREPADGRGAVVFLVGAAYAAAVLLLAWMSRKRPSDLLRIPVWAPLIVVIVLCWLNT
Ga0209180_1014501123300027846Vadose Zone SoilMNVGLVLAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0209180_1019120723300027846Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGARTRGAVVFTWYALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYA
Ga0209701_1008636043300027862Vadose Zone SoilMNVGLVIAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGARTRGAVVFTWYALSLMLTTTGAVLIALARGEHAHDRGEVVFLV
Ga0209701_1013920643300027862Vadose Zone SoilMNVGLVLAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLIALARGEHAHDRGEVVFLV
Ga0209283_1012516513300027875Vadose Zone SoilMNLGLVIAGSLCLVLAGGHTFVGRLVLDRLPRNLDPTRFGDGARTRGAVVFTWYALSLMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0209590_1041047413300027882Vadose Zone SoilMNLGLVIAGSLCLVLAGGHTFVGRLVLDRLPRNLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGTHAHDRGEVVFLVGAAYAAATVLLVWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0209860_101015123300027949Groundwater SandMNVGLVVAGSLCLVLAGGHTLVGRLVLDRLPRNFQPTRFGDGAYTRGLLVFTWYALSLMLTTTGLVLIALAQREPADGRGAVVFLVFDNENALVHAASHAFLSAARR
Ga0247719_105406723300028041SoilMALVNLQLFLAGATCLTLALGHALTGRAVLTALPRTLPSTRFGDGAYTRGLLRFTWHALTLMLTVTGAVLIAVALGNPQDARGDIAFLIGSAYAAALVVLLWMTRRRPSDLLRIPVWAGFVVISVLCWLNV
Ga0137415_1006388143300028536Vadose Zone SoilMNVGLVVAGSLCLVLAGGHTFVGRLVLDGLPRNLDPTRFGDGALTRGAVVFTWYALSVMLTTTGAVLIALARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0257175_111160113300028673SoilMNVGLVIAGSLCLVLAAGHTLVGRLVLDRLPRNLDPTRFGDGARTRGAVVFTWYALSLMLTTTGAVLIALARGKHAHDRGEVVFLVGAAYAAATVLLAW
(restricted) Ga0255310_1015817123300031197Sandy SoilMNVGLVIAGSLCLVLAAGHTLVGRLVLDRLPRDLDPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLITLARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
(restricted) Ga0255312_109775513300031248Sandy SoilMNVGLVLAGSLCLVLAAGHTLVGRLVLDRLPRDLDPTRFGDGALTRGAVVFTWHALSLMLTTTGAVLITLARGEHAHDRGEVVFLVGAAYAAATVLLAWRSRRRPSDLLRTPVWVLQIVITVLCWLNT
Ga0307471_10241087123300032180Hardwood Forest SoilMNVGLVFAGSLCLVLALGHTLVGRGVLDSLPRTLQPTRFGDGALTRGAIVFTWHGLGLMLTTTGAILIALASGAPADDRSEVPLLVGAAYAAAT
Ga0214472_1025340843300033407SoilLLLALGHTLVGRLVVDRLPPTLHPTRFGDGARTRGALVFTYYALSLMLTITGAILFALAQGEPADDRGEVVFLVGAVYAAAAVLLAWRVRRTPSLLVRTPVWGLMILVAVLCWTNTQGS
Ga0364934_0049597_201_5873300034178SedimentMNVGLVFAGSLCFVLAAGHAFVGRAVLDLLPRNLHATRFGDGAFTRGLFVFTWHALTLWLTTSGAVLIALARGEPADYRGPVVFLVGAAYAAATVLLAWRSRRRPSDLLRIPVWVLVIVIIVICLLNT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.