NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F040299



Overview

Basic Information
Family ID: F040299
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 162
Average Sequence Length: 72 residues
Representative Sequence: MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK
Number of Associated Samples: 145
Number of Associated Scaffolds: 162

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 70.99 %
% of genes near scaffold ends (potentially truncated): 33.33 %
% of genes from short scaffolds (< 2000 bps): 72.22 %
Associated GOLD sequencing projects: 132
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.15


Most Common Taxonomy
Group: Bacteria (79.630 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere (9.259 % of family members)
Environment Ontology (ENVO): Unclassified (30.864 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (45.062 % of family members)
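
As a rough cross-check, the percentages reported above can be converted back into member counts. The sketch below assumes that each percentage uses the 162 family sequences as its denominator, which the page does not state explicitly; `FAMILY_SIZE` and `percent_to_count` are hypothetical names introduced only for illustration.

```python
# Assumption: every "% of family members" value is relative to the
# 162 sequences listed under Basic Information.
FAMILY_SIZE = 162


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage back into a whole number of members."""
    return round(percent * total / 100)


# Percentages copied from the taxonomy and ecosystem summaries above.
reported = {
    "Bacteria": 79.630,
    "GOLD: Corn, Switchgrass And Miscanthus Rhizosphere": 9.259,
    "ENVO: Unclassified": 30.864,
    "EMPO: Plant rhizosphere": 45.062,
}

for label, percent in reported.items():
    print(f"{label}: about {percent_to_count(percent)} of {FAMILY_SIZE} members")
```

Under that assumption the four values correspond to 129, 15, 50 and 73 members respectively, each of which reproduces the reported percentage when divided by 162.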





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Mixed
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 0.00%, Coil/Unstructured: 100.00%

Predicted 3D Structure

pTM-score: 0.15

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 162 Family Scaffolds
PF04972 | BON | 36.42
PF00171 | Aldedh | 11.11
PF11752 | DUF3309 | 5.56
PF03631 | Virul_fac_BrkB | 1.85
PF01435 | Peptidase_M48 | 1.23
PF12840 | HTH_20 | 1.23
PF00753 | Lactamase_B | 1.23
PF02776 | TPP_enzyme_N | 0.62
PF13416 | SBP_bac_8 | 0.62
PF01869 | BcrAD_BadFG | 0.62
PF00210 | Ferritin | 0.62
PF03704 | BTAD | 0.62

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 162 Family Scaffolds
COG0014 | Gamma-glutamyl phosphate reductase | Amino acid transport and metabolism [E] | 11.11
COG1012 | Acyl-CoA reductase or other NAD-dependent aldehyde dehydrogenase | Lipid transport and metabolism [I] | 11.11
COG4230 | Delta 1-pyrroline-5-carboxylate dehydrogenase | Amino acid transport and metabolism [E] | 11.11
COG1295 | Uncharacterized membrane protein, BrkB/YihY/UPF0761 family (not an RNase) | Function unknown [S] | 1.85
COG3629 | DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain | Transcription [K] | 0.62
COG3947 | Two-component response regulator, SAPR family, consists of REC, wHTH and BTAD domains | Transcription [K] | 0.62



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 79.63 %
Unclassified | root | N/A | 20.37 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000364|INPhiseqgaiiFebDRAFT_101385699 | Not Available | 1349
3300000550|F24TB_10114372 | All Organisms → cellular organisms → Bacteria | 1602
3300000955|JGI1027J12803_107293283 | Not Available | 984
3300001431|F14TB_100326264 | All Organisms → cellular organisms → Bacteria | 1903
3300002245|JGIcombinedJ26739_101053199 | Not Available | 699
3300003319|soilL2_10044016 | All Organisms → cellular organisms → Bacteria | 1352
3300003324|soilH2_10055608 | All Organisms → cellular organisms → Bacteria | 11202
3300003349|JGI26129J50193_1000663 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2199
3300003371|JGI26145J50221_1003242 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1292
3300003503|JGI26141J51220_1002215 | All Organisms → cellular organisms → Bacteria | 1186
3300004803|Ga0058862_11815676 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1006
3300005176|Ga0066679_10253589 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1134
3300005329|Ga0070683_100336019 | All Organisms → cellular organisms → Bacteria | 1438
3300005334|Ga0068869_100001241 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 15051
3300005336|Ga0070680_101960409 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 507
3300005338|Ga0068868_100207719 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1635
3300005354|Ga0070675_100156490 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1957
3300005364|Ga0070673_100009631 | All Organisms → cellular organisms → Bacteria | 6497
3300005434|Ga0070709_10930400 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 689
3300005439|Ga0070711_100161737 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1698
3300005444|Ga0070694_100193135 | All Organisms → cellular organisms → Bacteria | 1513
3300005444|Ga0070694_100672877 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 839
3300005445|Ga0070708_100285529 | All Organisms → cellular organisms → Bacteria | 1553
3300005455|Ga0070663_100012077 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 5450
3300005457|Ga0070662_100097325 | All Organisms → cellular organisms → Bacteria | 2221
3300005467|Ga0070706_100005052 | All Organisms → cellular organisms → Bacteria | 12609
3300005468|Ga0070707_100699549 | All Organisms → cellular organisms → Bacteria | 976
3300005471|Ga0070698_100390000 | All Organisms → cellular organisms → Bacteria | 1325
3300005518|Ga0070699_100024825 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5167
3300005542|Ga0070732_10495268 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 740
3300005546|Ga0070696_100152609 | All Organisms → cellular organisms → Bacteria | 1697
3300005564|Ga0070664_101988410 | Not Available | 552
3300005586|Ga0066691_10047029 | All Organisms → cellular organisms → Bacteria | 2299
3300005618|Ga0068864_100235825 | All Organisms → cellular organisms → Bacteria | 1693
3300005713|Ga0066905_100256483 | All Organisms → cellular organisms → Bacteria | 1347
3300005875|Ga0075293_1041820 | Not Available | 640
3300005879|Ga0075295_1027011 | Not Available | 702
3300005937|Ga0081455_10000654 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 44913
3300005937|Ga0081455_10665906 | Not Available | 668
3300005985|Ga0081539_10205914 | Not Available | 905
3300006028|Ga0070717_11274685 | Not Available | 668
3300006041|Ga0075023_100321604 | Not Available | 644
3300006050|Ga0075028_100614633 | Not Available | 647
3300006173|Ga0070716_100005549 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6120
3300006755|Ga0079222_10022691 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2549
3300006755|Ga0079222_10501932 | All Organisms → cellular organisms → Bacteria | 889
3300006804|Ga0079221_10102533 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1409
3300006852|Ga0075433_10005678 | All Organisms → cellular organisms → Bacteria | 9807
3300006854|Ga0075425_100341409 | All Organisms → cellular organisms → Bacteria | 1724
3300006854|Ga0075425_100526138 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1361
3300006903|Ga0075426_10015830 | All Organisms → cellular organisms → Bacteria | 5381
3300006903|Ga0075426_10047105 | All Organisms → cellular organisms → Bacteria | 3081
3300006904|Ga0075424_101408373 | Not Available | 740
3300006954|Ga0079219_10019095 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2499
3300006954|Ga0079219_10095594 | All Organisms → cellular organisms → Bacteria | 1443
3300007076|Ga0075435_100088370 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2554
3300009038|Ga0099829_10278981 | All Organisms → cellular organisms → Bacteria | 1367
3300009098|Ga0105245_12297378 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 593
3300009147|Ga0114129_10196070 | All Organisms → cellular organisms → Bacteria | 2738
3300009147|Ga0114129_11903123 | Not Available | 721
3300009148|Ga0105243_10065521 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2918
3300009162|Ga0075423_10089228 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3220
3300010110|Ga0126316_1051398 | All Organisms → cellular organisms → Bacteria | 2366
3300010145|Ga0126321_1088906 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1397
3300010154|Ga0127503_10029344 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 599
3300010154|Ga0127503_10403450 | All Organisms → cellular organisms → Bacteria | 1356
3300010154|Ga0127503_10443201 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 538
3300010371|Ga0134125_10035847 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 5533
3300010397|Ga0134124_10003707 | All Organisms → cellular organisms → Bacteria | 12293
3300010403|Ga0134123_12209664 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 612
3300012202|Ga0137363_10662041 | Not Available | 883
3300012205|Ga0137362_10382480 | Not Available | 1219
3300012208|Ga0137376_11203223 | Not Available | 647
3300012362|Ga0137361_10094364 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 2588
3300012582|Ga0137358_10490075 | Not Available | 828
3300012896|Ga0157303_10140403 | All Organisms → cellular organisms → Bacteria | 636
3300012916|Ga0157310_10286085 | All Organisms → cellular organisms → Bacteria | 640
3300012923|Ga0137359_10341096 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1332
3300012931|Ga0153915_10369630 | All Organisms → cellular organisms → Bacteria | 1617
3300012958|Ga0164299_10002216 | All Organisms → cellular organisms → Bacteria | 5866
3300012960|Ga0164301_10327612 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1043
3300012986|Ga0164304_10006246 | All Organisms → cellular organisms → Bacteria | 4907
3300013306|Ga0163162_10929199 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 982
3300013307|Ga0157372_10000347 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 51034
3300015371|Ga0132258_10736957 | All Organisms → cellular organisms → Bacteria | 2482
3300015374|Ga0132255_101345541 | Not Available | 1078
3300017927|Ga0187824_10296411 | Not Available | 571
3300017930|Ga0187825_10046952 | All Organisms → cellular organisms → Bacteria | 1462
3300017936|Ga0187821_10047585 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1534
3300017936|Ga0187821_10108813 | Not Available | 1026
3300017959|Ga0187779_10131714 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1530
3300017993|Ga0187823_10354402 | Not Available | 523
3300017994|Ga0187822_10010072 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2244
3300019254|Ga0184641_1102390 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1933
3300019259|Ga0184646_1555473 | All Organisms → cellular organisms → Bacteria | 892
3300019885|Ga0193747_1092110 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 740
3300020021|Ga0193726_1183926 | Not Available | 890
3300020022|Ga0193733_1179186 | Not Available | 557
3300020069|Ga0197907_10735597 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1692
3300020070|Ga0206356_10077220 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1422
3300020075|Ga0206349_1009063 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1677
3300020077|Ga0206351_10633146 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1586
3300020080|Ga0206350_11153280 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1186
3300020082|Ga0206353_10755087 | All Organisms → cellular organisms → Bacteria | 2561
3300020199|Ga0179592_10515593 | Not Available | 511
3300020579|Ga0210407_10016278 | All Organisms → cellular organisms → Bacteria | 5516
3300020579|Ga0210407_10213315 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1501
3300020583|Ga0210401_11419592 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 552
3300021432|Ga0210384_10350369 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1330
3300022467|Ga0224712_10006922 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 3269
3300025906|Ga0207699_10081822 | All Organisms → cellular organisms → Bacteria | 2004
3300025910|Ga0207684_10024561 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 5138
3300025912|Ga0207707_10000677 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 33771
3300025925|Ga0207650_10644685 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 893
3300025939|Ga0207665_10002259 | All Organisms → cellular organisms → Bacteria | 13001
3300025940|Ga0207691_10097945 | All Organisms → cellular organisms → Bacteria | 2621
3300025944|Ga0207661_10269197 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1520
3300026023|Ga0207677_10103472 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2102
3300026095|Ga0207676_11623264 | Not Available | 644
3300026469|Ga0257169_1053258 | Not Available | 639
3300026480|Ga0257177_1007341 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1393
3300026499|Ga0257181_1024341 | Not Available | 919
3300026508|Ga0257161_1066177 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 738
3300026514|Ga0257168_1074549 | Not Available | 750
3300026873|Ga0207620_1003271 | All Organisms → cellular organisms → Bacteria | 1098
3300027617|Ga0210002_1017075 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1153
3300027655|Ga0209388_1001004 | All Organisms → cellular organisms → Bacteria | 6378
3300027765|Ga0209073_10029619 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1685
3300027765|Ga0209073_10120922 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 943
3300027775|Ga0209177_10093605 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 941
3300027775|Ga0209177_10261636 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 644
3300027787|Ga0209074_10060528 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1185
3300027787|Ga0209074_10133208 | All Organisms → cellular organisms → Bacteria | 876
3300027842|Ga0209580_10346443 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 740
3300027846|Ga0209180_10572862 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 626
3300027862|Ga0209701_10041515 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2971
3300027910|Ga0209583_10366767 | Not Available | 675
3300028047|Ga0209526_10007119 | All Organisms → cellular organisms → Bacteria | 7680
3300028587|Ga0247828_11044779 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 537
3300028596|Ga0247821_11158268 | Not Available | 524
3300028784|Ga0307282_10348700 | All Organisms → cellular organisms → Bacteria | 715
3300031114|Ga0308187_10284612 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 614
(restricted) 3300031197|Ga0255310_10014520 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2026
3300031716|Ga0310813_10244399 | All Organisms → cellular organisms → Bacteria | 1492
3300031720|Ga0307469_10698606 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 919
3300031720|Ga0307469_10932858 | Not Available | 806
3300031740|Ga0307468_100516165 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 952
3300031820|Ga0307473_10679629 | All Organisms → cellular organisms → Bacteria | 720
3300031949|Ga0214473_10183380 | All Organisms → cellular organisms → Bacteria | 2427
3300031949|Ga0214473_10219388 | All Organisms → cellular organisms → Bacteria | 2194
3300031962|Ga0307479_10052876 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → Solirubrobacterales → Solirubrobacteraceae → environmental samples → uncultured Solirubrobacteraceae bacterium | 3909
3300032012|Ga0310902_10567079 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 750
3300032017|Ga0310899_10102598 | All Organisms → cellular organisms → Bacteria | 1157
3300032180|Ga0307471_100505160 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1357
3300032180|Ga0307471_100570950 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1287
3300032205|Ga0307472_100854473 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 837
3300033433|Ga0326726_11835862 | Not Available | 590
3300033480|Ga0316620_11719069 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 622
3300033486|Ga0316624_11404127 | Not Available | 640
3300033513|Ga0316628_100765237 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1271
3300034818|Ga0373950_0120487 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 580
3300034820|Ga0373959_0193546 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 534

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
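
Each scaffold identifier in the table above appears to join a numeric taxon OID and a scaffold name with a vertical bar, and one entry carries a leading `(restricted)` marker. The following is a minimal sketch for splitting such identifiers under that assumption; `ScaffoldRef` and `parse_scaffold_id` are hypothetical helpers, not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple

RESTRICTED_MARKER = "(restricted)"


class ScaffoldRef(NamedTuple):
    taxon_oid: str       # numeric dataset identifier, e.g. "3300000364"
    scaffold_name: str   # scaffold name within that dataset
    restricted: bool     # True when the entry carried the "(restricted)" marker


def parse_scaffold_id(raw: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its parts.

    Assumes the layout seen in the table above; raises ValueError otherwise.
    """
    text = raw.strip()
    restricted = text.startswith(RESTRICTED_MARKER)
    if restricted:
        text = text[len(RESTRICTED_MARKER):].strip()
    taxon_oid, sep, scaffold_name = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldRef(taxon_oid, scaffold_name, restricted)


if __name__ == "__main__":
    print(parse_scaffold_id("3300000364|INPhiseqgaiiFebDRAFT_101385699"))
    print(parse_scaffold_id("(restricted) 3300031197|Ga0255310_10014520"))
```

Keeping the restricted flag alongside the parsed identifier makes it easy to filter out entries that fall under the data usage note before any downstream processing.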




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 9.26%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 6.79%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 6.79%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 6.79%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 6.17%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.56%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.94%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 4.32%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 3.70%
Soil | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil | 3.09%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 2.47%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 2.47%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 2.47%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 2.47%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.85%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.85%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.85%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 1.85%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.85%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.85%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.23%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.23%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 1.23%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.23%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 1.23%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.23%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.23%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.23%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.23%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.23%
Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 1.23%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.23%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 1.23%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.62%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.62%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.62%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.62%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.62%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.62%
Host-Associated | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated | 0.62%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.62%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.62%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300001431 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemly | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300003319 | Sugarcane bulk soil Sample L2 | Environmental
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300003349 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM | Host-Associated
3300003371 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S PM | Host-Associated
3300003503 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S AM | Host-Associated
3300004803 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-2 (Metagenome Metatranscriptome) | Host-Associated
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005329 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG | Environmental
3300005334 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 | Host-Associated
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005354 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG | Host-Associated
3300005364 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG | Host-Associated
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005455 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaG | Host-Associated
3300005457 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG | Host-Associated
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005564 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG | Host-Associated
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005875Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101EnvironmentalOpen in IMG/M
3300005879Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_301EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300005985Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S4T2R2Host-AssociatedOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010110Soil microbial communities from Illinois, USA to study soil gas exchange rates - BV-IL-AGR metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010145Soil microbial communities from Hawaii, USA to study soil gas exchange rates - KP-HI-INT metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010154Soil microbial communities from Willow Creek, Wisconsin, USA - WC-WI-TBF metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012896Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S118-311C-2EnvironmentalOpen in IMG/M
3300012916Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S213-509R-2EnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300013307Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaGHost-AssociatedOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017927Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4EnvironmentalOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300017959Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MGEnvironmentalOpen in IMG/M
3300017993Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3EnvironmentalOpen in IMG/M
3300017994Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2EnvironmentalOpen in IMG/M
3300019254Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019259Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2EnvironmentalOpen in IMG/M
3300020021Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s2EnvironmentalOpen in IMG/M
3300020069Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-2 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300020070Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-1 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300020075Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5pm-5 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300020077Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5pm-1 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300020080Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5pm-4 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300020082Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-4 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300022467Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5pm-2 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025925Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025940Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025944Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026023Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026095Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026873Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A5-12 (SPAdes)EnvironmentalOpen in IMG/M
3300027617Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028587Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3EnvironmentalOpen in IMG/M
3300028596Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glycerol_Day14EnvironmentalOpen in IMG/M
3300028784Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121EnvironmentalOpen in IMG/M
3300031114Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032012Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D3EnvironmentalOpen in IMG/M
3300032017Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D4EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033480Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_BEnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300034818Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_3Host-AssociatedOpen in IMG/M
3300034820Populus rhizosphere microbial communities from soil in West Virginia, United States - WV94_WV_N_2Host-AssociatedOpen in IMG/M


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10138569933300000364SoilMKELRRKTAPPIGRKDQQDPDRGTRSPREGSVDRGPERSDRDAGAGRPVQLDVEGRARRPESDSPXXIRXDTPPAK*
F24TB_1011437223300000550SoilMKEPKGKPAPRIGRKDQQDPDRGSRSPREGSVEPGPERSDRDAGAGRPVQLDVEGRARRPESDSPRPTDDTPPAK*
JGI1027J12803_10729328313300000955SoilMKELRRKTAPPIGRKDQQDPDRGTRSPREGSVDRGPERSDRDAGAGRPVQLDVEGRARRPES
F14TB_10032626423300001431SoilMKELRRKTAPPIGRKDQQDPDRGTRSPREGSVDRGPERSDRDAGAGRPVQLDVEGRARRPESDSPRPTDDTPPAK*
JGIcombinedJ26739_10105319913300002245Forest SoilMKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDREAGAGRPVQLDDRPTEGQERPK*
soilL2_1004401633300003319Sugarcane Root And Bulk SoilMSKEPKKETPRIGRKDQQDPDRATSGPREGAWDDDAERSDREAGAGRPVRLDEEADRRRPGSDSS
soilH2_1005560863300003324Sugarcane Root And Bulk SoilMSKEPKKETPRIGRKDQQDPDRATSGPREGAWDDDAERSDREAGAGRPVRLDEEADRRRPGSDSSRPTPAK*
JGI26129J50193_100066353300003349Arabidopsis Thaliana RhizosphereMKEPRGKAAPRIGRKDQQDPDRGNRTPREGVVDHEPERSDRDAGAGRPVQLDVEDRPESDSPRRTDDTPPAK*
JGI26145J50221_100324223300003371Arabidopsis Thaliana RhizosphereMKEPRGKAAPRIGRKDQQDPDRGNRTPREGVVDHEPERSDRDAGAGRPVQLDVEXRPESDSPRRTDDTPPAK*
JGI26141J51220_100221513300003503Arabidopsis Thaliana RhizosphereRGKAAPRIGRKDQQDPDRGNRTPREGVVDHEPERSDRDAGAGRPVQLDVEDRPESDSPRRTDDTPPAK*
Ga0058862_1181567633300004803Host-AssociatedMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0066679_1025358913300005176SoilMKEPRQKPPPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQHDEEGRSRRPGTDSARPSEEQEIPK*
Ga0070683_10033601943300005329Corn RhizosphereMKKEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK*
Ga0068869_100001241143300005334Miscanthus RhizosphereMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK*
Ga0070680_10196040913300005336Corn RhizosphereGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0068868_10020771933300005338Miscanthus RhizosphereMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPSTDSPRPAPAK*
Ga0070675_10015649033300005354Miscanthus RhizosphereMKKEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPSTDSPRPAPAK*
Ga0070673_100009631103300005364Switchgrass RhizosphereMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPVTDSPRPAPAK*
Ga0070709_1093040013300005434Corn, Switchgrass And Miscanthus RhizospherePRQKPAPRIGRKDLQDPDRDVRTPREGGVDREPDRTERSDRDAGAGRPVQLDEEGRSRRPGTDSARPSEEQEIPK*
Ga0070711_10016173723300005439Corn, Switchgrass And Miscanthus RhizosphereMKEPRQKPPPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPETDSARPSEEQEIPK*
Ga0070694_10019313543300005444Corn, Switchgrass And Miscanthus RhizosphereMKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDARAGRPVQLDEEGRSRRSDSPRPSEGQETSK*
Ga0070694_10067287713300005444Corn, Switchgrass And Miscanthus RhizosphereMKKEPKEKQTPRIGRKDLQDPDRDIRGPREGRVDEPERSDREAGAGRPVQLDEEGRSRRPVTDSPRPSDDGAPETTK*
Ga0070708_10028552913300005445Corn, Switchgrass And Miscanthus RhizosphereAMKEPRQKTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK*
Ga0070663_10001207713300005455Corn RhizosphereMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPG
Ga0070662_10009732583300005457Corn RhizosphereAAPRIGRKDQQDPDRGNRTPREGVVDHEPERSDRDAGAGRPVQLDVEDRPESDSPRRTDDTPPAK*
Ga0070706_10000505273300005467Corn, Switchgrass And Miscanthus RhizosphereMKEPRQKPAPRIGRKDLQDPDRDVRTPREGGVDREPDRTERSDRDAGAERPVQLDEEGRSRRPGTDSTRPSEEQEIPK*
Ga0070707_10069954933300005468Corn, Switchgrass And Miscanthus RhizosphereGDTMKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDARAGRPVQLDEEGRSRRSDSPRPSEGQETSK*
Ga0070698_10039000033300005471Corn, Switchgrass And Miscanthus RhizosphereMKEPKQKTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK*
Ga0070699_10002482543300005518Corn, Switchgrass And Miscanthus RhizosphereMKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETSK*
Ga0070732_1049526813300005542Surface SoilKEPKEKQTSRIGRKDLQDPDRDIRGPREGRVDREPERSDREAGAGRPVQLDEEERSRRPTTDSPRPRDDGAAETTK*
Ga0070696_10015260923300005546Corn, Switchgrass And Miscanthus RhizosphereMKEPRPKQTPRIGRKDLQDPDRDVRSPREGGLDREPERSDRDAGAGRPVQLDEEGRSRRANSPRPPEGQETAK*
Ga0070664_10198841013300005564Corn RhizosphereMKKEPRETPRIGRKDQQDPDRETRMPREGAFDHDAERSDREAGAGRPVQLDEEGHSRRPGTDSPRPAPAK*
Ga0066691_1004702963300005586SoilMKEPKQRTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK*
Ga0068864_10023582523300005618Switchgrass RhizosphereMKKEPRETPRIGRKDQQDPDRETRMPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPSTDSPRPAPAK*
Ga0066905_10025648323300005713Tropical Forest SoilMKEPRKPAPRIGRKDQQDPDRGNRSPREGSVEPGPERSDRDAGAGRPVQLDVEGRARRPESDSPRPTEDTPPAK*
Ga0075293_104182023300005875Rice Paddy SoilMKKEPKEKQTPRIGRKDLQDPDRDIRGPRKGRVDEPERSDREAGAGRPVQLDEEGRSRRPVTDSPRPSDDGAPETTK*
Ga0075295_102701123300005879Rice Paddy SoilMKEPREKKPTPRVGRKDLQDPDRDIREPERSDRDAGAGRPVQLDEEGRSHRPGTDSPRPSNDGAPETSK*
Ga0081455_10000654203300005937Tabebuia Heterophylla RhizosphereMKEPRGKPAPRIGRKDQQDPDRGTRDPRKGVDREPERSDRDAGAGRPVQLDVEGRPARPESDSARPADDTPPAK*
Ga0081455_1066590623300005937Tabebuia Heterophylla RhizosphereMKEPRKKSAPRIGRKDLQDPDRGVRDPRQGRVDREPERSDRDAGAGRPVQLDVEGRSRRPESDSPRPIDDSPPAK*
Ga0081539_1020591413300005985Tabebuia Heterophylla RhizosphereKKEPKETPRIGRKDQQDPDRETHGPREGSFDQDAERSDREAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0070717_1127468513300006028Corn, Switchgrass And Miscanthus RhizosphereMKEPRQKTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK*
Ga0075023_10032160423300006041WatershedsMKEPKEKQAPRIGRKDLQDPDRDIRGPREGRVDREPERSDREAGAGRPVQLDEEGRSHRPGTDLPRPADDGAPQTAK*
Ga0075028_10061463323300006050WatershedsMKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDREAGAGRPVQLDDRPTEGQETPK*
Ga0070716_10000554943300006173Corn, Switchgrass And Miscanthus RhizosphereMKQPRQKPAPRIGRKDLQDPDRDVRTPREGGVDREPDRTERSDRDAGAGRPVQLDEEGRSRRPGTDSARPSEEQEIPK*
Ga0079222_1002269113300006755Agricultural SoilKGTPRIGRKDQQDPDRETGAPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0079222_1050193213300006755Agricultural SoilDRATSGPREGAWDDDAERSDREAGAGRPVRLDEEADRARRPGRDSSRPTPAK*
Ga0079221_1010253313300006804Agricultural SoilMKKEPRGTPRIGRKDQQDPDREPRTAREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0075433_1000567863300006852Populus RhizosphereMKEPRQKPAPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPGTDSARPSEEQEIPK*
Ga0075425_10034140923300006854Populus RhizosphereMKKEPKGTPRIGRKDQQDPDRETRTTREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0075425_10052613813300006854Populus RhizosphereMKEPRQKPSPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPGT
Ga0075426_1001583063300006903Populus RhizosphereMKEPRQKPAPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPGT
Ga0075426_1004710523300006903Populus RhizosphereMKEPRQKPPPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPGTDSARPSEEQEIPK*
Ga0075424_10140837323300006904Populus RhizosphereMKKEPKGTPRIGRKDQQDPDRETGAPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK*
Ga0079219_1001909513300006954Agricultural SoilMKKEPRETQRIGRKDQQDPDRETRMPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPSTDSPRPAPAK*
Ga0079219_1009559443300006954Agricultural SoilMKKEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0075435_10008837013300007076Populus RhizosphereMKEEPKGTPRIGRKDQQDPDRETGAPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK*
Ga0099829_1027898133300009038Vadose Zone SoilMKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK*
Ga0105245_1229737813300009098Miscanthus RhizosphereRSRCRIREEHMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0114129_1019607053300009147Populus RhizosphereMKKEPRETQRIGRKDQQDPDRETRMPREGAFDHDAERSDREAGAGRPVQLDEEGHSRRPGTDSPRPAPAK*
Ga0114129_1190312323300009147Populus RhizosphereMKELRRKTAPPIGRKDQQVPDRGTRSPREGSVDRGPERSDRDAGAGRPVQLDVEGRARRPESDSPRPTDDTPPAK*
Ga0105243_1006552143300009148Miscanthus RhizosphereMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPSTDSPRPAPAK*
Ga0075423_1008922883300009162Populus RhizosphereMKEPRQKPSPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPGTDSARPSEEQEIPK*
Ga0126316_105139813300010110SoilSRCRIREEHMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0126321_108890613300010145SoilAADAAPGRNDMKKDPKETPRIGRKDQQDPDRETRGPREGATETDSERSDREAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0127503_1002934413300010154SoilQLHDATGETTMKEPRQKPSPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPGTDSARPSEEQEIPK*
Ga0127503_1040345013300010154SoilASSRDREDTMKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDREAGAGRPVQLDDRPTEGQETPK*
Ga0127503_1044320113300010154SoilMKEPRQKQTPRIGRKDLQDPDRDVRSPREGGMDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK*
Ga0134125_1003584713300010371Terrestrial SoilMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDREAGAGRPVQLDEEGHSRRPGTDSPRP
Ga0134124_10003707113300010397Terrestrial SoilMKEEPKGTPRIGRKDQQDPDRETGAPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0134123_1220966413300010403Terrestrial SoilQKPTPRIGRKDLQDPDRDVRTPREGGVDREPERSDREAGAGRPVQLDEEGRSRRSDSPRPTEGQETPK*
Ga0137363_1066204113300012202Vadose Zone SoilMKEPREKQPPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK*
Ga0137362_1038248023300012205Vadose Zone SoilMKEPRQKQTPRIGRKDLQDPDRDVRSPREGGREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRLTEGQETPK*
Ga0137376_1120322313300012208Vadose Zone SoilMKEPRQKQTPRIGRKDLQDSDRDVRSPRGGGVDREPERSDREAGAGRPVQLDEEGRSRRSDSPRPTEGQETPK*
Ga0137361_1009436433300012362Vadose Zone SoilMNEPRQKQTPRIGRKDLQDPDRDVRSPREGGREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRLTE
Ga0137358_1049007523300012582Vadose Zone SoilMKEPRQKQTPRIGRKDLQDPDRDVHSPRGGGVDREPERSDREAGAGRPVQLDEEGRSRRSDSPRPTEGQETPK*
Ga0157303_1014040313300012896SoilIGRKDQQDPDRGNRTPREGVVDHEPERSDRDAGAGRPVQLDVEDRPESDSPRRTDDTPPAK*
Ga0157310_1028608513300012916SoilPGGRSMKEPRGKAAPRIGRKDQQDPDRGNRTPREGVVDHEPERSDRDAGAGRPVQLDVEDRPESDSPRRTDDTPPAK*
Ga0137359_1034109623300012923Vadose Zone SoilMNEPRQKQTPRIGRKDLQDPDRDVRSPREGGREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRLTEGQETPK*
Ga0153915_1036963043300012931Freshwater WetlandsMNESRDKKPAPRVGRKDLQDPDRDIRGPREGHVIREPERSDRDAGAGRPVQLDEEGRSHRPGTDSPRPGNDGAPETSK*
Ga0164299_1000221623300012958SoilMKKEPKGTPRIGRKDQQDPDRETGAPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPSTDSPRPAPAK*
Ga0164301_1032761233300012960SoilDQQDPDRETRMPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPSTDSPRPAPAK*
Ga0164304_10006246113300012986SoilMKNEPRETPRIGRKDQQDPDRETRMPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPSTDSPRPAPAK*
Ga0163162_1092919933300013306Switchgrass RhizospherePRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK*
Ga0157372_10000347243300013307Corn RhizosphereMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPGTDSPRPAPAK*
Ga0132258_1073695793300015371Arabidopsis RhizosphereKDQQDPDRETVTPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAKSRPAPAK*
Ga0132255_10134554113300015374Arabidopsis RhizosphereMKKEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK
Ga0187824_102964112 3300017927 Freshwater Sediment MKEPKEEQTPRIGRKHLQDPDRDIRAPREGRVDREPERSDRDAGAGRPVQLDEEGRTHRPGTDSPRRGHDEAPETTK
Ga0187825_100469524 3300017930 Freshwater Sediment MKERNEKQPPRIGRKDLQDPDRDLRGPREGRVDREPERSDRDAGAGRPVQLDEERRGHRPGTDSPRPGHDGAPETSK
Ga0187821_100475852 3300017936 Freshwater Sediment MKEPKEEQTPRIGRKDLQDPDRDIRAPREGRVDREPERSDRDAGAGRPVQLDEERRGHRPGTDSPRPGHDGAPETSK
Ga0187821_101088132 3300017936 Freshwater Sediment MKEPKDQQTPRIGRKDLQDPDRDLRGPREGRVDPEPERSDRDAGAGRPVQLDEEARPHRPGTDSPRSGHDGAPETTK
Ga0187779_101317142 3300017959 Tropical Peatland MSRETRKNPTPRIGRKDLQDPDRDVREKKDEPEPQRSDRDAGGGRPVQLDEEGGSRRPEPASPRPSVDK
Ga0187823_103544022 3300017993 Freshwater Sediment MKEPKEEQTPRIGRKDLQDPDRDIRAPREGRVDREPERSDRDAGAGRPVQLDEEGRTHRPGTDSPRPGHDGAPETTK
Ga0187822_100100723 3300017994 Freshwater Sediment MNEPKEEQTPRIGRKDLQDPDRDIRAPREGRVDREPERSDRDAGAGRPVQLDEEGRTHRPGTDSPRRGHDGAPETTK
Ga0184641_11023901 3300019254 Groundwater Sediment MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDREAGAGRPVQLDEEGRSRRSDSPRPTEGQETPK
Ga0184646_15554731 3300019259 Groundwater Sediment ASSRDREDTMKEPRQKQPPRIGRKDLQDPDRDVRSPREGGVDREPERSDREAGAGRPVHFDEEGRSRRSDSPRPTEGQETPK
Ga0193747_10921102 3300019885 Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK
Ga0193726_11839261 3300020021 Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDREAGAGRPVQLDEEGRPRRSDSPRPTEGQETPK
Ga0193733_11791861 3300020022 Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPRGGGVDREPERSDREAGAGRPVQLDEEGRPTEGQETPK
Ga0197907_107355975 3300020069 Corn, Switchgrass And Miscanthus Rhizosphere RSRCRIREEHMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK
Ga0206356_100772201 3300020070 Corn, Switchgrass And Miscanthus Rhizosphere MKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK
Ga0206349_10090631 3300020075 Corn, Switchgrass And Miscanthus Rhizosphere RIREEHMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK
Ga0206351_106331461 3300020077 Corn, Switchgrass And Miscanthus Rhizosphere SRCRIREEHMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK
Ga0206350_111532801 3300020080 Corn, Switchgrass And Miscanthus Rhizosphere ARRSRCRIREEHMKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK
Ga0206353_107550879 3300020082 Corn, Switchgrass And Miscanthus Rhizosphere RSRCRIREEHMKEEPKGTPRIGRKDQQDPDRETGAPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK
Ga0179592_105155931 3300020199 Vadose Zone Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRSTEGQETPK
Ga0210407_100162784 3300020579 Soil MKEPRQKPSPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPGTDSARPSEEQEIPK
Ga0210407_102133152 3300020579 Soil MNRETRRDPTPRIGRKDLQDPDRDVRDRQDRGNEREPERGDRDAGAGRPVPLDEEGHSRRPESDSPRPSDDGSK
Ga0210401_114195922 3300020583 Soil MNRETRRDPTPRIGRKDLQDPDRDGRDRQDRGNEREPERGDRDAGAGRPVPLDEEGHSRRPESDSPRPSDDGSK
Ga0210384_103503692 3300021432 Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPRGGGVDREPERSDREAGAGRPVQLDEEGRSRRSDSPRPTEGQETPK
Ga0224712_100069221 3300022467 Corn, Switchgrass And Miscanthus Rhizosphere EPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK
Ga0207699_100818222 3300025906 Corn, Switchgrass And Miscanthus Rhizosphere MKEPRQKPPPRIGRKDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPGTDSARPSEEQEIPE
Ga0207684_1002456111 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDARAGRPVQLDEEGRSRRSDSPRPSEGQETSK
Ga0207707_1000067724 3300025912 Corn Rhizosphere MKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK
Ga0207650_106446852 3300025925 Switchgrass Rhizosphere MKKEPRETPRIGRKDQQDPDRETRMPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPSTDSPRPAPAK
Ga0207665_100022591 3300025939 Corn, Switchgrass And Miscanthus Rhizosphere MKQPRQKPAPRIGRKDLQDPDRDVRTPREGGVDREPDRTERSDRDAGAGRPVQLDEEGRSRRSGTDSAR
Ga0207691_100979451 3300025940 Miscanthus Rhizosphere MKKEPKETPRIGRKDQQDPDRETRTPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPGTDSPRPAPAK
Ga0207661_102691971 3300025944 Corn Rhizosphere MKKEPKGTPRIGRKDQQDPDRETGAPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK
Ga0207677_101034722 3300026023 Miscanthus Rhizosphere MKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPGTDSPRPAPAK
Ga0207676_116232641 3300026095 Switchgrass Rhizosphere MKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSP
Ga0257169_10532581 3300026469 Soil MKEPREKQPPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRQSDSPRPSEGQETPK
Ga0257177_10073412 3300026480 Soil MKEPREKQPPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQEIPK
Ga0257181_10243412 3300026499 Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDS
Ga0257161_10661773 3300026508 Soil RQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPTEGQETPK
Ga0257168_10745492 3300026514 Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDREAGAGRPVQLDEEGRSRRSDSPRSSEGQETPK
Ga0207620_10032711 3300026873 Soil MKEPRGKAAPRIGRKDQQDPDRGNRTPREGVVDHEPERSDRDAGAGRPVQLDVEDRPESDSPRRTDDTPPAK
Ga0210002_10170752 3300027617 Arabidopsis Thaliana Rhizosphere MKEPRGKAAPRIGRKDQQDPDRGNRTPREGVVDHEPERSDRDAGAGRPVQLDVEDRPESDSPRRTDD
Ga0209388_100100413 3300027655 Vadose Zone Soil MKEPREKQPPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK
Ga0209073_100296192 3300027765 Agricultural Soil MKEEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAQ
Ga0209073_101209221 3300027765 Agricultural Soil MKERNEKQPPRIGRKDLQDPDRDLRGPREGRVDREPERSDRDAGAGRPVQLDEERRGHRPGTDSPRPGHNGAPETSK
Ga0209177_100936053 3300027775 Agricultural Soil MKKEPKGTPRIGRKDQQDPDRETGAPREGAFDHDAERSDHEAGAGRPVQLDEEGRSRRPGTDSPRPAPAK
Ga0209177_102616363 3300027775 Agricultural Soil DLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPGTDSARPSEEQEIPK
Ga0209074_100605281 3300027787 Agricultural Soil TPRIGRKDQQDPDRETGAPREGAFDHDAERSDHEAGAGRPVQLEEEGRSRRPGTDSPRPAPAK
Ga0209074_101332081 3300027787 Agricultural Soil KDQQDPDRATSGPREGAWDDDAERSDREAGAGRPVRLDEEADRARRPGRDSSRPTPAK
Ga0209580_103464431 3300027842 Surface Soil KEPKEKQTSRIGRKDLQDPDRDIRGPREGRVDREPERSDREAGAGRPVQLDEEERSRRPTTDSPRPRDDGAAETTK
Ga0209180_105728621 3300027846 Vadose Zone Soil QKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETPK
Ga0209701_100415152 3300027862 Vadose Zone Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQEIPK
Ga0209583_103667671 3300027910 Watersheds MKEPKEKQAPRIGRKDLQDPDRDIRGPREGRVDREPERSDREAGAGRPVQLDEEGRSHRPGTDLPRPADDGAPQTAK
Ga0209526_1000711914 3300028047 Forest Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPTEGQETPK
Ga0247828_110447792 3300028587 Soil HMKKEPRETPRIGRKDQQDPDRETRMPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPSTDSPRPAPAK
Ga0247821_111582682 3300028596 Soil MKKEPKGTPRIGRKDQQDPDRETGTPREGAFDHDAERSDREAGAGRPVQLDEEGRSRRPGTDSPRPA
Ga0307282_103487002 3300028784 Soil EPRIGRKDLQDPDRDTRGPRKGSVESEPERSDRDAGAGRPVQLDEEGRSRPAGSDSPRPADDAAVPK
Ga0308187_102846122 3300031114 Soil MKEPRRKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPTEGQETPK
(restricted) Ga0255310_100145202 3300031197 Sandy Soil MKEPQDKQTPRIGRKDLQDPDRDIRGPREGRAAPEPERSDRDAGAGRPVQLDEEGRAHRPGTDSPRPGHDGAPETTK
Ga0310813_102443992 3300031716 Soil MKEPKDKQTPRIGRKDLQDPDRDIRGPREGRVDREPERSDREAGAGRPVQLDEEGRSRRPGTDSPRPAPETTK
Ga0307469_106986063 3300031720 Hardwood Forest Soil MKEPRQKPSPRIGRRDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRSGTDSARPSEEQEIPK
Ga0307469_109328581 3300031720 Hardwood Forest Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDHEPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETSK
Ga0307468_1005161653 3300031740 Hardwood Forest Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPIQLDEEGRSRRSDSPRPSEGQETPK
Ga0307473_106796292 3300031820 Hardwood Forest Soil RIGRKDLQDPDRDVRDRLGRGNDREPERGDRDAGAGRPVQLDEEGRSRRPESDSPRPSDDGSSRRNGS
Ga0214473_101833806 3300031949 Soil MKKEPRKKEPRGKPAPRIGRKDLQDRDRDVRNPREARVEREPDRSDRDAGAGRPVQLDEEGRSRRPEMDSPRPSDDVPPAK
Ga0214473_102193882 3300031949 Soil MKKDPRGKPAPRIGRKDLQDPDRDVRNPREGRVECEPERSDRGAGAGRPVQLDEEGRSRRPETESPRASDDVPPAK
Ga0307479_100528764 3300031962 Hardwood Forest Soil MKEPRQKPAPRIGRKDLQDPDRDVRTPREGGVDREPDRTERSDRDAGAERPVQLDEEGRSRRPGTDSARPSEEQEIPK
Ga0310902_105670791 3300032012 Soil QTPRIGRKDLQDPDRDIRGPREGRVDEPERSDREAGAGRPVQLDEEGRSRRPVTDSPRPSDDGAPETTK
Ga0310899_101025983 3300032017 Soil DIMKEPRGKAAPRIGRKDQQDPDRGNRTPREGVVDHEPERSDRDAGAGRPVQLDVEDRPESDSPRPTDDTPPAK
Ga0307471_1005051604 3300032180 Hardwood Forest Soil MKEPRQKQTPRIGRKDLQDPDRDVRSPREGGVDREPERSDRDAGAGRPVQLDEEGRSRRSDSPRPSEGQETSK
Ga0307471_1005709504 3300032180 Hardwood Forest Soil MNRETRRDPTPRIGRKDLQDPDRDVRDRLGRGNDREPERGDRDAGAGRPVQLDEEGRSRRPESDSPRPSDDGSSRRNGS
Ga0307472_1008544731 3300032205 Hardwood Forest Soil MKEPRQKPSPRIGRRDLQDPDRDVRTPREGGVDREPDRNERSDRDAGAGRPVQLDEEGRSRRPETDSARPSEEQEIPK
Ga0326726_118358621 3300033433 Peat Soil MKEPKEKQTPRIGRKDLQDPDRDIRGPREGRVNREPERSDRKAGAGRPVQLDEEGRSRRPGMDSPRPAPETTK
Ga0316620_117190692 3300033480 Soil MKEPTEKQTPRIGRKDLQDPDRDTRDPRESRVDREPERSDRGAGAGRPVQLDEEARSRRPGTDSPRPGDDGAPETSK
Ga0316624_114041271 3300033486 Soil MNESRDKKPAPRVGRKDLQDPDRDIRGPREGRADREPERSDRDAGAGRPVQLDEEGRTTRPGTDSPRPGHDGAPETTK
Ga0316628_1007652373 3300033513 Soil MNEPRDKKPAPRVGRKDLQDPDRDIRGPREGRVDREPERSDRDAGAGRPVQLDEEGRSHRRGTDSPRPGNDGAPETSK
Ga0373950_0120487_387_578 3300034818 Rhizosphere Soil RIGRKDLQDPDRDVRSPREGGLDREPERSDRDAGAGRPVQLDEVGRSRRANSPRPPEGQETAK
Ga0373959_0193546_293_514 3300034820 Rhizosphere Soil MKEPRPKQTPRIGRKDLQDPDRDVRSPREGGLDREPERSDRDAGAGRPVQLDEEGRSRRANSPRPPEGQETAK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice