NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F100678



Overview

Basic Information
Family ID F100678
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 55 residues
Representative Sequence DGARVQPSALGDTQAFIRLIAPDSLSPATRAALAETDDRQAAALLLAAPEFQRR
Number of Associated Samples 98
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 1.96 %
% of genes near scaffold ends (potentially truncated) 98.04 %
% of genes from short scaffolds (< 2000 bps) 95.10 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.54
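
The percentages in this block are consistent with whole-number counts over the 102 family members. The page does not state those counts, so the following is only a minimal sketch, assuming each value is a count divided by 102 and rounded to two decimals; `implied_count` is a hypothetical helper, not part of NMPFamsDB.

```python
def implied_count(percent: float, total: int = 102) -> int:
    """Return the whole-number count whose share of `total` rounds to `percent`."""
    count = round(percent / 100 * total)
    # Verify that the recovered count reproduces the reported percentage.
    if round(count / total * 100, 2) != round(percent, 2):
        raise ValueError(f"{percent} % is not a whole count out of {total}")
    return count


# Values copied from the Quality Assessment block.
reported = {
    "genes with valid RBS motifs": 1.96,
    "genes near scaffold ends": 98.04,
    "genes from short scaffolds": 95.10,
}
for label, pct in reported.items():
    print(f"{label}: {implied_count(pct)} of 102")
```

Under that assumption the three values correspond to 2, 100 and 97 genes respectively.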

Hidden Markov Model: sequence logo (Skylign)

Most Common Taxonomy
Group Bacteria (85.294 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (12.745 % of family members)
Environment Ontology (ENVO) Unclassified (37.255 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (50.980 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 45.12%, β-sheet: 0.00%, Coil/Unstructured: 54.88%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.54

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
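
The two confidence measures above can be applied programmatically. This is a minimal sketch, assuming the four pLDDT bins shown in the viewer legend and the pTM cut-off of 0.7 quoted in the note. The function names are hypothetical, and assigning a non-integer score such as 50.5 to the next bin up is an assumption the page does not address.

```python
from bisect import bisect_left

# Upper bounds and labels of the pLDDT bins listed in the legend.
PLDDT_UPPER_BOUNDS = [50, 70, 90, 100]
PLDDT_LABELS = ["0-50", "51-70", "71-90", "91-100"]


def plddt_bin(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to its legend bin."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    # bisect_left returns the first bin whose upper bound is >= score.
    return PLDDT_LABELS[bisect_left(PLDDT_UPPER_BOUNDS, score)]


def is_low_confidence(ptm: float, threshold: float = 0.7) -> bool:
    """True when a model falls below the pTM threshold quoted in the note."""
    return ptm < threshold


print(plddt_bin(85.0), is_low_confidence(0.54))
```

With the pTM-score of 0.54 reported for F100678, `is_low_confidence` returns `True`, in line with the note above.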



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF07394 | DUF1501 | 44.12
PF08811 | DUF1800 | 3.92
PF00664 | ABC_membrane | 0.98
PF02347 | GDC-P | 0.98
PF00005 | ABC_tran | 0.98
PF00355 | Rieske | 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG5267 | Uncharacterized conserved protein, DUF1800 family | Function unknown [S] | 3.92
COG0403 | Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain | Amino acid transport and metabolism [E] | 0.98
COG1003 | Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain | Amino acid transport and metabolism [E] | 0.98



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 85.29 %
Unclassified | root | N/A | 14.71 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2088090005|LWSO_GFMCQXZ02IDWZT | All Organisms → cellular organisms → Bacteria | 510
2209111006|2214609143 | All Organisms → cellular organisms → Bacteria | 521
3300000364|INPhiseqgaiiFebDRAFT_103817142 | All Organisms → cellular organisms → Bacteria | 535
3300003324|soilH2_10217682 | All Organisms → cellular organisms → Bacteria | 1130
3300003996|Ga0055467_10059191 | All Organisms → cellular organisms → Bacteria | 1011
3300003997|Ga0055466_10049059 | All Organisms → cellular organisms → Bacteria | 1039
3300004052|Ga0055490_10114028 | All Organisms → cellular organisms → Bacteria | 772
3300004463|Ga0063356_104703067 | All Organisms → cellular organisms → Bacteria | 587
3300005295|Ga0065707_10493950 | All Organisms → cellular organisms → Bacteria | 763
3300005334|Ga0068869_100852625 | All Organisms → cellular organisms → Bacteria | 786
3300005336|Ga0070680_101808175 | All Organisms → cellular organisms → Bacteria | 529
3300005340|Ga0070689_100326911 | Not Available | 1282
3300005345|Ga0070692_10055283 | All Organisms → cellular organisms → Bacteria | 2075
3300005441|Ga0070700_100660582 | All Organisms → cellular organisms → Bacteria | 826
3300005535|Ga0070684_101604659 | All Organisms → cellular organisms → Bacteria | 614
3300005543|Ga0070672_101067705 | All Organisms → cellular organisms → Bacteria | 717
3300005544|Ga0070686_100503826 | All Organisms → cellular organisms → Bacteria | 940
3300005560|Ga0066670_10507636 | All Organisms → cellular organisms → Bacteria | 740
3300005577|Ga0068857_102529021 | All Organisms → cellular organisms → Bacteria | 504
3300005614|Ga0068856_100504814 | Not Available | 1230
3300005616|Ga0068852_101696073 | All Organisms → cellular organisms → Bacteria | 654
3300005618|Ga0068864_100142726 | All Organisms → cellular organisms → Bacteria | 2162
3300005718|Ga0068866_11432596 | All Organisms → cellular organisms → Bacteria | 506
3300005843|Ga0068860_100222075 | All Organisms → cellular organisms → Bacteria | 1835
3300005890|Ga0075285_1004377 | All Organisms → cellular organisms → Bacteria | 1480
3300006058|Ga0075432_10314293 | All Organisms → cellular organisms → Bacteria | 654
3300006804|Ga0079221_10654959 | All Organisms → cellular organisms → Bacteria | 721
3300006845|Ga0075421_102675699 | All Organisms → cellular organisms → Bacteria | 516
3300006852|Ga0075433_10267915 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1514
3300006880|Ga0075429_100671077 | All Organisms → cellular organisms → Bacteria | 908
3300006880|Ga0075429_100841048 | All Organisms → cellular organisms → Bacteria | 803
3300006894|Ga0079215_10126121 | Not Available | 1178
3300006904|Ga0075424_101025355 | All Organisms → cellular organisms → Bacteria | 880
3300009094|Ga0111539_10072685 | All Organisms → cellular organisms → Bacteria | 4055
3300009137|Ga0066709_100427796 | All Organisms → cellular organisms → Bacteria | 1845
3300009156|Ga0111538_14006609 | All Organisms → cellular organisms → Bacteria | 509
3300009157|Ga0105092_10181058 | Not Available | 1175
3300009162|Ga0075423_10315455 | Not Available | 1639
3300009162|Ga0075423_12494744 | All Organisms → cellular organisms → Bacteria | 564
3300009174|Ga0105241_11884587 | All Organisms → cellular organisms → Bacteria | 586
3300009177|Ga0105248_10656768 | All Organisms → cellular organisms → Bacteria | 1183
3300009444|Ga0114945_10734104 | All Organisms → cellular organisms → Bacteria | 603
3300009691|Ga0114944_1087342 | Not Available | 1180
3300010399|Ga0134127_11706465 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 705
3300011436|Ga0137458_1113749 | All Organisms → cellular organisms → Bacteria | 787
3300012022|Ga0120191_10049601 | All Organisms → cellular organisms → Bacteria | 727
3300012198|Ga0137364_10681562 | All Organisms → cellular organisms → Bacteria | 775
3300012200|Ga0137382_10874329 | All Organisms → cellular organisms → Bacteria | 648
3300012207|Ga0137381_11190037 | All Organisms → cellular organisms → Bacteria | 655
3300012210|Ga0137378_10296353 | Not Available | 1505
3300012359|Ga0137385_11418437 | All Organisms → cellular organisms → Bacteria | 558
3300012360|Ga0137375_10177504 | All Organisms → cellular organisms → Bacteria | 2047
3300012478|Ga0157328_1009962 | All Organisms → cellular organisms → Bacteria | 649
3300012495|Ga0157323_1022455 | All Organisms → cellular organisms → Bacteria | 612
3300012918|Ga0137396_11281573 | All Organisms → cellular organisms → Bacteria | 510
3300012925|Ga0137419_11925458 | All Organisms → cellular organisms → Bacteria | 508
3300012958|Ga0164299_10672630 | All Organisms → cellular organisms → Bacteria | 719
3300012985|Ga0164308_10218577 | Not Available | 1465
3300013297|Ga0157378_10156757 | All Organisms → cellular organisms → Bacteria | 2126
3300013297|Ga0157378_11772401 | All Organisms → cellular organisms → Bacteria | 665
3300014300|Ga0075321_1009446 | Not Available | 1353
3300015373|Ga0132257_102189022 | All Organisms → cellular organisms → Bacteria | 715
3300017994|Ga0187822_10043266 | Not Available | 1244
3300018481|Ga0190271_13302724 | All Organisms → cellular organisms → Bacteria | 541
3300019362|Ga0173479_10274967 | All Organisms → cellular organisms → Bacteria | 754
3300019377|Ga0190264_12240734 | All Organisms → cellular organisms → Bacteria | 510
3300019879|Ga0193723_1133046 | All Organisms → cellular organisms → Bacteria | 681
3300021859|Ga0210334_10710571 | All Organisms → cellular organisms → Bacteria | 562
3300025149|Ga0209827_10826973 | All Organisms → cellular organisms → Bacteria | 508
3300025160|Ga0209109_10416240 | All Organisms → cellular organisms → Bacteria | 624
3300025167|Ga0209642_10330016 | All Organisms → cellular organisms → Bacteria | 863
3300025315|Ga0207697_10106224 | All Organisms → cellular organisms → Bacteria | 1200
3300025791|Ga0210115_1067027 | All Organisms → cellular organisms → Bacteria | 750
3300025927|Ga0207687_11846792 | All Organisms → cellular organisms → Bacteria | 517
3300025942|Ga0207689_10804596 | All Organisms → cellular organisms → Bacteria | 793
3300025992|Ga0208775_1015587 | All Organisms → cellular organisms → Bacteria | 587
3300026014|Ga0208776_1011691 | All Organisms → cellular organisms → Bacteria | 729
3300026044|Ga0208287_1005528 | Not Available | 1185
3300026062|Ga0208654_1007131 | Not Available | 1426
3300026066|Ga0208290_1015598 | All Organisms → cellular organisms → Bacteria | 780
3300026095|Ga0207676_10919217 | All Organisms → cellular organisms → Bacteria | 859
3300027665|Ga0209983_1041414 | All Organisms → cellular organisms → Bacteria | 997
3300027907|Ga0207428_10215192 | Not Available | 1442
3300027909|Ga0209382_12221156 | All Organisms → cellular organisms → Bacteria | 518
3300028536|Ga0137415_11181213 | All Organisms → cellular organisms → Bacteria | 580
3300028711|Ga0307293_10209002 | All Organisms → cellular organisms → Bacteria | 621
3300028807|Ga0307305_10316810 | All Organisms → cellular organisms → Bacteria | 709
3300030006|Ga0299907_10727935 | All Organisms → cellular organisms → Bacteria | 756
3300030006|Ga0299907_10865375 | All Organisms → cellular organisms → Bacteria | 675
3300030620|Ga0302046_10734229 | All Organisms → cellular organisms → Bacteria | 798
3300031184|Ga0307499_10266223 | All Organisms → cellular organisms → Bacteria | 553
(restricted) 3300031197|Ga0255310_10186877 | All Organisms → cellular organisms → Bacteria | 577
3300031198|Ga0307500_10192982 | All Organisms → cellular organisms → Bacteria | 604
3300031229|Ga0299913_10618501 | All Organisms → cellular organisms → Bacteria | 1065
3300031547|Ga0310887_10508166 | All Organisms → cellular organisms → Bacteria | 727
3300031562|Ga0310886_10910332 | All Organisms → cellular organisms → Bacteria | 560
3300031720|Ga0307469_11324015 | All Organisms → cellular organisms → Bacteria | 685
3300031890|Ga0306925_10874974 | All Organisms → cellular organisms → Bacteria | 926
3300031947|Ga0310909_10194969 | Not Available | 1682
3300031954|Ga0306926_10509587 | Not Available | 1478
3300033406|Ga0316604_10684258 | All Organisms → cellular organisms → Bacteria | 564
3300034354|Ga0364943_0285879 | All Organisms → cellular organisms → Bacteria | 622

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
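
Each scaffold identifier in the table joins an IMG/M taxon OID and a scaffold name with a `|` character, and the scaffold length can be compared against the 2000 bp cut-off used under Quality Assessment. This is a minimal sketch, assuming the rows have already been extracted as (identifier, length) pairs. `Scaffold`, `parse_scaffold` and `percent_short` are hypothetical names, and only four rows from the table are used as sample input.

```python
from typing import NamedTuple, Sequence


class Scaffold(NamedTuple):
    taxon_oid: str  # IMG/M taxon (sample) identifier, the part before "|"
    name: str       # scaffold name within that sample
    length: int     # scaffold length in bp


def parse_scaffold(identifier: str, length: int) -> Scaffold:
    """Split a 'taxon_oid|scaffold_name' identifier taken from the table."""
    # Rows from restricted datasets carry a "(restricted)" prefix; drop it.
    cleaned = identifier.replace("(restricted)", "").strip()
    taxon_oid, name = cleaned.split("|", 1)
    return Scaffold(taxon_oid, name, length)


def percent_short(scaffolds: Sequence[Scaffold], cutoff: int = 2000) -> float:
    """Percentage of scaffolds shorter than `cutoff` bp, to two decimals."""
    short = sum(1 for s in scaffolds if s.length < cutoff)
    return round(100 * short / len(scaffolds), 2)


# Four rows copied from the Associated Scaffolds table.
rows = [
    ("2088090005|LWSO_GFMCQXZ02IDWZT", 510),
    ("3300003324|soilH2_10217682", 1130),
    ("3300005345|Ga0070692_10055283", 2075),
    ("3300009094|Ga0111539_10072685", 4055),
]
scaffolds = [parse_scaffold(identifier, length) for identifier, length in rows]
print(percent_short(scaffolds))
```

Across all 102 rows of the table, five scaffolds are 2000 bp or longer (2075, 2162, 4055, 2047 and 2126 bp), which leaves 97 of 102 short scaffolds and matches the 95.10 % reported in the Quality Assessment block.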




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 12.75%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 11.76%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 8.82%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 3.92%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 3.92%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 2.94%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 2.94%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 2.94%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 2.94%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 1.96%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.96%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.96%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.96%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.96%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.96%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Rhizosphere | 1.96%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.96%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.98%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Sediment | 0.98%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.98%
Estuarine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine | 0.98%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.98%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.98%
Terrestrial | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial | 0.98%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.98%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.98%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.98%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.98%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.98%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.98%
Corn, Switchgrass And Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.98%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.98%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.98%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.98%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
2088090005 | Sediment microbial communities from Lake Washington, Seattle, for Methane and Nitrogen Cycles, original sample replicate 1 | Environmental
2209111006 | Arabidopsis rhizosphere microbial communities from the University of North Carolina - sample Wild type Col-0 | Host-Associated
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300003996 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailB_D2 | Environmental
3300003997 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailC_D1 | Environmental
3300004052 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental
3300005334 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 | Host-Associated
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005340 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG | Environmental
3300005345 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaG | Environmental
3300005441 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG | Environmental
3300005535 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.2-3L metaG | Environmental
3300005543 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG | Host-Associated
3300005544 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG | Environmental
3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental
3300005577 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 | Host-Associated
3300005614 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 | Host-Associated
3300005616 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 | Host-Associated
3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated
3300005718 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 | Host-Associated
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated
3300005890 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104 | Environmental
3300006058 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 | Host-Associated
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 | Host-Associated
3300006894 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Control | Environmental
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009157 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009177 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG | Host-Associated
3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental
3300009691 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300011436 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT642_2 | Environmental
3300012022 | Terrestrial microbial communites from a soil warming plot in Okalahoma, USA - C6 | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012478 | Arabidopsis rhizosphere microbial communities from North Carolina - M.Oy.9.old.080610 | Host-Associated
3300012495 | Arabidopsis rhizosphere microbial communities from North Carolina - M.Oy.5.old.040610 | Host-Associated
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental
3300012985 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG | Environmental
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300014300 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D1 | Environmental
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300017994 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2 | Environmental
3300018481 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 T | Environmental
3300019362 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2) | Environmental
3300019377 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 T | Environmental
3300019879 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2 | Environmental
3300021859 | Metatranscriptome of estuarine sediment microbial communities from the Columbia River estuary, Oregon, United States - S.306 (Metagenome Metatranscriptome) | Environmental
3300025149 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 (SPAdes) | Environmental
3300025160 | Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2 | Environmental
3300025167 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_2 (SPAdes) | Environmental
3300025315 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes) | Host-Associated
3300025791 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleB_D2 (SPAdes) | Environmental
3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated
3300025942 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes) | Host-Associated
3300025992 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104 (SPAdes) | Environmental
3300026014 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_205 (SPAdes) | Environmental
3300026044 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailB_D1 (SPAdes) | Environmental
3300026062 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailC_D1 (SPAdes) | Environmental
3300026066 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailB_D1 (SPAdes) | Environmental
3300026095 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes) | Host-Associated
3300027665 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 S PM (SPAdes) | Host-Associated
3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated
3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028711 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150 | Environmental
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300030620 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 | Environmental
3300031184 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 13_S | Environmental
3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental
3300031198 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 14_S | Environmental
3300031229 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38 | Environmental
3300031547 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4 | Environmental
3300031562 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031890 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2) | Environmental
3300031947 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000H | Environmental
3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental
3300033406 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day20_CT | Environmental
3300034354 | Sediment microbial communities from East River floodplain, Colorado, United States - 23_s17 | Environmental

Geographical Distribution
[Interactive map, powered by OpenStreetMap]

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
LWSO_04215500 | 2088090005 | Freshwater Sediment | LPGSRVRSDAVGSADAVARLIAPDSLSSATRAALAETKEREAVALLLAAPEFQRR
2213644735 | 2209111006 | Arabidopsis Rhizosphere | LGDTQAFIRLIAPDSLSPATRAALAETDDRQAAALLLAAPEFQRR
INPhiseqgaiiFebDRAFT_1038171421 | 3300000364 | Soil | SPDMLLTRMNFAADLVGNRLDGARVPPNALRDTQALVRLIAPDSLSPVTRQALAEADEKEAPALLLAAPEFQRR*
soilH2_102176821 | 3300003324 | Sugarcane Root And Bulk Soil | GALEDAQAFVRLIAPDSLSSETRQALAETDDRQAAALLLAAPEFQRR*
Ga0055467_100591912 | 3300003996 | Natural And Restored Wetlands | GDTQPLIGLIAPGSLSSATQAALSETAGSDTLALLLAAPEFQRR*
Ga0055466_100490592 | 3300003997 | Natural And Restored Wetlands | FIRLVAPDSLSPSTLAALAETRGTDTLALLLAAPEFQRR*
Ga0055490_101140282 | 3300004052 | Natural And Restored Wetlands | KDALSRLIAPDALAPATRSALAETEGGQAVALLLAAPEFQRR*
Ga0063356_1047030671 | 3300004463 | Arabidopsis Thaliana Rhizosphere | SWISPDMLLTRMNFVSDLVANRLAGARVPKDTVGDPEAIIPLIAPDSISSSTRAALNQTKGAESVALLLAAPEFQRR*
Ga0065707_104939502 | 3300005295 | Switchgrass Rhizosphere | GNRLEGARVAPAALSDTQALIRLVAPDSLSPATRRALAETDEKEAPAMLLAAPEFQRR*
Ga0068869_1008526251 | 3300005334 | Miscanthus Rhizosphere | WISPDMLLTRMNFAADLVGNRLDGARVEPAALRDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0070680_1018081752 | 3300005336 | Corn Rhizosphere | SWISPDMLLTRMNFAADLVGNRLDGARVEPAALRDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0070689_1003269111 | 3300005340 | Switchgrass Rhizosphere | DLVGNRLDGARVEPAALRDTQGLIRLVAPDSLSPATRRALAETDEKEAPAMLLAAPEFQRR*
Ga0070692_100552833 | 3300005345 | Corn, Switchgrass And Miscanthus Rhizosphere | RDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0070700_1006605822 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | GLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0070684_1016046591 | 3300005535 | Corn Rhizosphere | APAALSDTQALIRLVAPDSLSPATRRALAETDEKEAPAILLAAPEFQRR*
Ga0070672_1010677051 | 3300005543 | Miscanthus Rhizosphere | LLTRMNFAADLVSNRLDGARVQPSALGDTQAFIRLIAPDSLSPATREALAETDDRQAAALLLAAPEFQRR*
Ga0070686_1005038261 | 3300005544 | Switchgrass Rhizosphere | LIRLVAPDSLSPTTRRALAETDEKEAPAILLAAPEFQRR*
Ga0066670_105076362 | 3300005560 | Soil | RLDGARVQADALRDTQAFVRLIAPDSLSPATRAALAETDAREAPALLLAAPEFQRR*
Ga0068857_1025290212 | 3300005577 | Corn Rhizosphere | NRLEGARVAPVALSDTQALIRLVAPDSLSPATRRALAETDEKEAPAMLLAAPEFQRR*
Ga0068856_1005048141 | 3300005614 | Corn Rhizosphere | IRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0068852_1016960732 | 3300005616 | Corn Rhizosphere | LVGNRLDGARVEPAALRDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR
Ga0068864_1001427263 | 3300005618 | Switchgrass Rhizosphere | MLLTRMNFAADLVGNRLDGARVEPAALRDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0068866_114325961 | 3300005718 | Miscanthus Rhizosphere | SPDMLLTRMNFAADLVGNRLDGARVEPAALRDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0068860_1002220753 | 3300005843 | Switchgrass Rhizosphere | RVNFAADLVSNRLDGARVQPSALGDTQAFIRLIAPDSLSPATRAALAETDDRQAAALLLAAPEFQRR*
Ga0075285_10043772 | 3300005890 | Rice Paddy Soil | TDLAANRLDGARVQSRDPDNVQALARLVAPDSLSQATQSALAESAASEKLALLLAAPEFQRR*
Ga0075432_103142931 | 3300006058 | Populus Rhizosphere | QALIRLVAPDSLSPATRRVLADIDEKEAPAVLLAAPEFQRR*
Ga0079221_106549592 | 3300006804 | Agricultural Soil | RLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0075421_1026756991 | 3300006845 | Populus Rhizosphere | RVPPNALRDTQALVRLIAPDSLSPVTRQALAEADEKEAPALLLAAPEFQRR*
Ga0075433_102679151 | 3300006852 | Populus Rhizosphere | DSQALIRLVAPDSLSPASRRVLADIDEKEAPAMLLAAPEFQRR*
Ga0075429_1006710771 | 3300006880 | Populus Rhizosphere | IAPDSLSTATRAALAETAGSDALALLLAAPEFQRR*
Ga0075429_1008410482 | 3300006880 | Populus Rhizosphere | DLVGNRLDGARVPPNALRDTQALVRLIAPDSLSPVTRQALAEADEKEAPALLLAAPEFQRR*
Ga0079215_101261211 | 3300006894 | Agricultural Soil | PDVQTFIRLIAPDSLSTATRAALAETTGSDTLALLLAAPEFQRR*
Ga0075424_1010253551 | 3300006904 | Populus Rhizosphere | SALGDTQAFIRLIAPDSLSPATRAALAETDDRQAAALLLAAPEFQRR*
Ga0111539_100726851 | 3300009094 | Populus Rhizosphere | PDMLLTRMNFAADLVGNRLEGARVAPAALSDTQALIRLVAPDSLSPTTRRALAETDEKEAPAILLAAPEFQRR*
Ga0066709_1004277963 | 3300009137 | Grasslands Soil | VSNRLDGARVQADALRDTQAFVRLIAPDSLSPATRAALAETDAREAPALLLAAPEFQRR*
Ga0111538_140066091 | 3300009156 | Populus Rhizosphere | LGDTQAFIRLIAPDSLSAATRQALAETDDRQAAALLLAAPEFQRR*
Ga0105092_101810582 | 3300009157 | Freshwater Sediment | LIAPDTLSAGTRSALAETQDSQAFALLLAAPEFQRR*
Ga0075423_103154551 | 3300009162 | Populus Rhizosphere | PEALRDSQALIRLVAPDSLSPTTRRALAETDEKEAPAILLAAPEFQRR*
Ga0075423_124947442 | 3300009162 | Populus Rhizosphere | VQPSALGDTQAFIRLIAPDSLSPATRQALAETDDRQAAALLLAAPEFQRR*
Ga0105241_118845871 | 3300009174 | Corn Rhizosphere | SSDMLLTRMNFAIDLTANRIDGVRVQAQAPRDTQAFIRMVAPDSLSPATRAALAESADSDRLALLLAAPEFQRR*
Ga0105248_106567682 | 3300009177 | Switchgrass Rhizosphere | ARVEPAALRDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0114945_107341041 | 3300009444 | Thermal Springs | IAPESLSSSARAAIAETEDKEAVALLLAAPAFQRR*
Ga0114944_10873421 | 3300009691 | Thermal Springs | IAPESLSSSTRVVLAQTEGPEAVALLLAAPEFQRR*
Ga0134127_117064652 | 3300010399 | Terrestrial Soil | FAADLVANRLDGARVQTQPARDNQAFIRLIAPDSLSTATRAALAETTGSDALALLLAAPEFQRR*
Ga0137458_11137491 | 3300011436 | Soil | RRIAPDSLSPATQAALAETVGHQAIALLLAAPEFQRR*
Ga0120191_100496012 | 3300012022 | Terrestrial | MLLTRMNFAADLVANRIDGARVQSTALRDTQQFVRVIAPDSLSSGTRAALAETEGLETLALLLAAPEFQRR*
Ga0137364_106815621 | 3300012198 | Vadose Zone Soil | NRLDGAHVQPDALRDPQAFVRLIAPDSLSPATRAALAETDAREAPALLLAAPEFQRR*
Ga0137382_108743292 | 3300012200 | Vadose Zone Soil | PQAFVRLIAPDSLSPATRAALAETDAREAPALLLAAPEFQRR*
Ga0137381_111900372 | 3300012207 | Vadose Zone Soil | APDALSTTTRSVLAETEGSQAIALLMAAPEFQRR*
Ga0137378_102963532 | 3300012210 | Vadose Zone Soil | DGARVQPSALGDPQAFIRLIAPDSLSPATRQALAETDDRQAAALLLAAPEFQRR*
Ga0137385_114184371 | 3300012359 | Vadose Zone Soil | TRMNFAADLVSNRLDGTRVQADALRDTQAFVRLIAPDSLSPATRAALAETDAREAPALLLAAPEFQRR*
Ga0137375_101775041 | 3300012360 | Vadose Zone Soil | MNFAADLVSNRLDGARVQSTALHDSQQFVRLIAPDSLSAATRAALAETEGADALALLLAAPEFQRR*
Ga0157328_10099622 | 3300012478 | Arabidopsis Rhizosphere | ADLISNRLDGARVQPSALSDTQAFIRLIAPDSLSPATRQALAETDDRQAAALLLAAPEFQRR*
Ga0157323_10224551 | 3300012495 | Arabidopsis Rhizosphere | RMNFAADLVSNRLDGARVQPSALGDTQAFIRLIAPDSLSPATRAALAETDDRQAAALLLAAPEFQRR*
Ga0137396_112815731 | 3300012918 | Vadose Zone Soil | RLDGARVQADALRDTQAFVRLIAPDSLSPATRAAWAETDAREAPALLLAAPEFQRR*
Ga0137419_119254582 | 3300012925 | Vadose Zone Soil | IAPDSLSPATRQALAETDDRQAAALLLAAPEFQRR*
Ga0164299_106726302 | 3300012958 | Soil | DGARVQPSALGDTQAFIRLIAPDSLSPATRAALAETDDRQAAALLLAAPEFQRR*
Ga0164308_102185772 | 3300012985 | Soil | ARVAPAALSDTQALIRLVAPDSLSPTTRRALAETDEKEAPAILLAAPEFQRR*
Ga0157378_101567573 | 3300013297 | Miscanthus Rhizosphere | VQPSALSDTQAFIRLIAPDSLSPATRQALAETDDRQAAALLLAAPEFQRR*
Ga0157378_117724012 | 3300013297 | Miscanthus Rhizosphere | VEPAALRDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR*
Ga0075321_10094462 | 3300014300 | Natural And Restored Wetlands | DGLRDPGSLIRLVAPDSLSPSTRAALADTAGADTLALLMAAPEFQRR*
Ga0132257_1021890222 | 3300015373 | Arabidopsis Rhizosphere | EGARVAPAALSDTQALIRLVAPDSLSPTTRRALAETDEKEAPAILLAAPEFQRR*
Ga0187822_100432661 | 3300017994 | Freshwater Sediment | GARVELDGLRDRGRYIRLVAPDSLSPSTRAALADTDGNDALALLLAAPEFQRR
Ga0190271_133027241 | 3300018481 | Soil | IRRIAPDSLSPATQTALAETDGNQAIALLLAAPEFQRR
Ga0173479_102749672 | 3300019362 | Soil | LLGARVEPTALRDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR
Ga0190264_122407341 | 3300019377 | Soil | ASWVSPDMLLTRINFVSDLVGNRLPGSRVQKDAVGDLESIIRLIAPDSISPATRAALNENKGAESVALLLAAPEFQRR
Ga0193723_11330462 | 3300019879 | Soil | NFAADLVSNRLDGARVQADALRDTQAFVRLIAPDSLSPATRAALAETDAREAPALLLAAPEFQRR
Ga0210334_107105712 | 3300021859 | Estuarine | IAPDSLSAATRAALAETKEREVVALLLAAPEFQRR
Ga0209827_108269732 | 3300025149 | Thermal Springs | IAPESLSSSTRVVLAQTEGPEAVALLLAAPEFQRR
Ga0209109_104162402 | 3300025160 | Soil | IAPDSLSSATQAALAETEGRDAVALLLAAPEFQRR
Ga0209642_103300161 | 3300025167 | Soil | MLLTRMNFASDLVSNRLPGSRVRNDTVGGADAVARLIAPDSLSAATRAALAETDGRDAVALLLAAPEFQRR
Ga0207697_101062242 | 3300025315 | Corn, Switchgrass And Miscanthus Rhizosphere | AALSDTQALIRLVAPDSLSPTTRRALAETDEKEAPAILLAAPEFQRR
Ga0210115_10670271 | 3300025791 | Natural And Restored Wetlands | PDNVQALARLVAPDSLSQATQSALAESAASEKLALLLAAPEFQRR
Ga0207687_118467922 | 3300025927 | Miscanthus Rhizosphere | VGNRLDGARVEPAALRDAQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR
Ga0207689_108045962 | 3300025942 | Miscanthus Rhizosphere | SWISPDMLLTRMNFAADLVGNRLDGARVEPAALRDTQGLIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR
Ga0208775_10155872 | 3300025992 | Rice Paddy Soil | AANRLDGARVQSRDPDNVQALARLVAPDSLSQATQSALAESAASEKLALLLAAPEFQRR
Ga0208776_10116912 | 3300026014 | Rice Paddy Soil | ANRLDGARVQSRDPNNVQALARLVAPDSLSQATQSALAESAASEKLALLLAAPEFQRR
Ga0208287_10055282 | 3300026044 | Natural And Restored Wetlands | QPLIGLIAPGSLSSATQAALSETAGSDTLALLLAAPEFQRR
Ga0208654_10071311 | 3300026062 | Natural And Restored Wetlands | DMLLTRMNFASDLAANRIDGARVPMQVGDTQPLIGLIAPGSLSSATQAALSETAGSDTLALLLAAPEFQRR
Ga0208290_10155982 | 3300026066 | Natural And Restored Wetlands | TDLAANRLDGARVQSRDPDNVQALARLVAPDSLSQATQSALAESAASEKLALLLAAPEFQRR
Ga0207676_109192171 | 3300026095 | Switchgrass Rhizosphere | LTRMNFAADLVGNRLEGARVAPAALSDTQALIRLVAPDSLSPATRRALAETDEKEGPAMLLAAPEFQRR
Ga0209983_10414141 | 3300027665 | Arabidopsis Thaliana Rhizosphere | TTDLVANRVDGARVQSHGPDDIQALARLIAPDSLSQATQAALAESASSERLALLLAAPEFQRR
Ga0207428_102151922 | 3300027907 | Populus Rhizosphere | PDMLLTRMNFAADLVGNRLEGARVAPAALSDTQALIRLVAPDSLSPTTRRALAETDEKEAPAILLAAPEFQRR
Ga0209382_122211562 | 3300027909 | Populus Rhizosphere | RVPPNALRDTQALVRLIAPDSLSPVTRQALAEADEKEAPALLLAAPEFQRR
Ga0137415_111812132 | 3300028536 | Vadose Zone Soil | PDMLLTRMNFAADLVSNHLDGARVQADALRDTQAFVRLIAPDSLSPATRAALAETDAREAPALLLAAPEFQRR
Ga0307293_102090021 | 3300028711 | Soil | QPFVRLIAPDSLSSATRAALAETEGPDALALLLATPEFQRR
Ga0307305_103168102 | 3300028807 | Soil | LDGARVQADALRDTQAFVRLIAPDSLSPATRAALAETDAREAPALLLAAPEFQRR
Ga0299907_107279351 | 3300030006 | Soil | NRLDGARVRTQSAPDVQAFIRLIAPDSLSTATQAALAETTGSDALALLLAAPEFQRR
Ga0299907_108653751 | 3300030006 | Soil | ALSDAQTFIRLIAPDSLDPATRAALSEMAGPDALAILLAGPEFQRR
Ga0302046_107342292 | 3300030620 | Soil | DGARMQSEVTGHTQAFIRLIAPDSLSSATQAALSETAGSDTLALLLAAPEFQRR
Ga0307499_102662232 | 3300031184 | Soil | SPDMLLTRMNFAADLVNNRLDGARVQPSALGDTQAFIRLIAPDSLSPATRQALAETDDRQAAALLLAAPEFQRR
(restricted) Ga0255310_101868771 | 3300031197 | Sandy Soil | ISSDMLLTRMNFAIDLVANRIDGARVQSQAPRDIQAFARMVAPDSLSSATRVALTESAESDKLALLLAAPEFQRR
Ga0307500_101929821 | 3300031198 | Soil | LIAPDSLSPATRQALAETDDRQSVALLLAAPEFQRR
Ga0299913_106185011 | 3300031229 | Soil | MLLTRMNFASDLVANRLDGARVQTQSAPDVQAFIRLIAPDSLSTATQAALAETTGSDTLALLLAAPEFQRR
Ga0310887_105081661 | 3300031547 | Soil | AFIHLIAPDSLSPATRQALAEADDRQAAALLLAAPEFQRR
Ga0310886_109103322 | 3300031562 | Soil | ARVQPSALGDTQAFIRLIAPDSLSAATRQALAETDDRQAAALLLAAPEFQRR
Ga0307469_113240151 | 3300031720 | Hardwood Forest Soil | ISPDMLLTRMNFAADLVSNRLDGARVQPSALGDTQAFIRLIAPDSLSPATRAALAETDDRQAAALLLAAPEFQRR
Ga0306925_108749741 | 3300031890 | Soil | LDGARMTAEGLRDTQALIRLIAPDSLSPATRQALAQTDQKDAPALLLAAPEFQRR
Ga0310909_101949693 | 3300031947 | Soil | AEGLRDTQALIRLIAPDSLSPATRQALAQTDQKDAPALLLAAPEFQRR
Ga0306926_105095871 | 3300031954 | Soil | LLTRMNFAADLVGNRLDGARVAAEGLRDTQALIRLIAPDSLSPATRQALAQTDQKDAPALLLAAPEFQRR
Ga0316604_106842582 | 3300033406 | Soil | GLMRRIAPGSLSPATQAGLAEADGNQAIALLLAAPEFQRR
Ga0364943_0285879_434_622 | 3300034354 | Sediment | ADLVSNRLDGARVQPSALGDTQAFIRLIAPDSLSPATRQALAETDDRQAAALLLAAPEFQRR
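
For downstream use, rows of the table above can be written out as FASTA records. The following is a minimal Python sketch, not a tool provided by the database. The helper name `to_fasta` is hypothetical, the two rows are transcribed by hand on the assumption that sample taxon IDs are 10 digits long, and the trailing `*` on some sequences is assumed (the page does not say) to be an end-of-sequence marker that can be dropped.

```python
def to_fasta(rows, width=60):
    """Format (protein_id, taxon_id, habitat, sequence) rows as FASTA text.

    Assumption: a trailing '*' is an end-of-sequence marker, not a residue,
    so it is stripped before writing.
    """
    records = []
    for protein_id, taxon_id, habitat, sequence in rows:
        seq = sequence.rstrip("*")
        header = f">{protein_id} taxon={taxon_id} habitat={habitat}"
        # Wrap the sequence at a fixed line width, as is conventional in FASTA.
        lines = [seq[i:i + width] for i in range(0, len(seq), width)]
        records.append("\n".join([header, *lines]))
    return "\n".join(records) + "\n"


# Two rows transcribed from the table (field boundaries are an assumption).
rows = [
    ("Ga0105092_101810582", "3300009157", "Freshwater Sediment",
     "LIAPDTLSAGTRSALAETQDSQAFALLLAAPEFQRR*"),
    ("Ga0364943_0285879_434_622", "3300034354", "Sediment",
     "ADLVSNRLDGARVQPSALGDTQAFIRLIAPDSLSPATRQALAETDDRQAAALLLAAPEFQRR"),
]

fasta = to_fasta(rows)
print(fasta)
```

Restricted rows are subject to the JGI data usage note above, so a script like this would need to skip or flag them before any redistribution.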


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice