NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F045958


Overview

Basic Information
Family ID F045958
Family Type Metagenome
Number of Sequences 152
Average Sequence Length 78 residues
Representative Sequence MPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQD
Number of Associated Samples 103
Number of Associated Scaffolds 152
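The representative sequence above can be checked quickly before it is used downstream. The following is a minimal sketch, not part of NMPFamsDB itself: the function name `summarize` and the choice of the 20 standard one-letter amino acid codes as the validity check are assumptions made for illustration. Note that the reported average length (78 residues) is a family-wide mean, so the representative's own length need not match it.

```python
from collections import Counter

# Representative sequence copied from the Basic Information block.
REPRESENTATIVE = (
    "MPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQD"
)

# The 20 standard amino acids in one-letter code (assumed validity check).
STANDARD_AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")


def summarize(seq: str) -> dict:
    """Return length, residue composition, and any non-standard letters."""
    counts = Counter(seq)
    return {
        "length": len(seq),
        "composition": dict(counts.most_common()),
        "non_standard": sorted(set(seq) - STANDARD_AMINO_ACIDS),
    }


summary = summarize(REPRESENTATIVE)
print("length:", summary["length"])
print("most common residues:", list(summary["composition"].items())[:5])
print("non-standard letters:", summary["non_standard"])
```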

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 67.76 %
% of genes near scaffold ends (potentially truncated) 27.63 %
% of genes from short scaffolds (< 2000 bps) 75.66 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.77
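The percentages in the Quality Assessment block can be turned back into approximate gene counts. This is a small sketch under the assumption that each percentage is taken over all 152 family sequences (the page does not state the denominator explicitly); the helper name `approx_count` is hypothetical.

```python
FAMILY_SIZE = 152  # "Number of Sequences" from Basic Information (assumed denominator)

# Percentages as reported in the Quality Assessment block.
percentages = {
    "valid RBS motif": 67.76,
    "near scaffold end": 27.63,
    "short scaffold (< 2000 bp)": 75.66,
}


def approx_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage into the nearest whole number of genes."""
    return round(pct / 100 * total)


counts = {label: approx_count(pct) for label, pct in percentages.items()}
for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Under that assumption the three figures correspond to about 103, 42, and 115 genes respectively.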


Most Common Taxonomy
Group Bacteria (94.737 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(23.026 % of family members)
Environment Ontology (ENVO) Unclassified
(35.526 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.632 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 22.33% | β-sheet: 17.48% | Coil/Unstructured: 60.19%

Predicted 3D Structure

pTM-score: 0.77

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.110.2.1: GAF domain | d1mc0a1 | 1mc0 | 0.60616
d.110.2.0: automated matches | d6p58a_ | 6p58 | 0.54669
d.110.2.0: automated matches | d7ckva1 | 7ckv | 0.54014
d.110.2.1: GAF domain | d1mc0a2 | 1mc0 | 0.53618
d.145.1.4: CorC/HlyC domain-like | d2p3ha1 | 2p3h | 0.52935
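The structural matches above can be screened by TM-score. The sketch below assumes the commonly used rule of thumb that a TM-score above about 0.5 suggests the same general fold; that cutoff is a convention, not something this page states, and the function name `likely_same_fold` is hypothetical.

```python
# (SCOP family ID, family name, SCOP domain, representative PDB, TM-score)
# Rows transcribed from the "Structural matches with SCOPe domains" table.
matches = [
    ("d.110.2.1", "GAF domain", "d1mc0a1", "1mc0", 0.60616),
    ("d.110.2.0", "automated matches", "d6p58a_", "6p58", 0.54669),
    ("d.110.2.0", "automated matches", "d7ckva1", "7ckv", 0.54014),
    ("d.110.2.1", "GAF domain", "d1mc0a2", "1mc0", 0.53618),
    ("d.145.1.4", "CorC/HlyC domain-like", "d2p3ha1", "2p3h", 0.52935),
]

SAME_FOLD_THRESHOLD = 0.5  # assumed rule-of-thumb cutoff, not from the source


def likely_same_fold(rows, threshold=SAME_FOLD_THRESHOLD):
    """Keep matches above the threshold, best TM-score first."""
    kept = [row for row in rows if row[4] > threshold]
    return sorted(kept, key=lambda row: row[4], reverse=True)


hits = likely_same_fold(matches)
# SCOP fold = first two fields of the family ID, e.g. "d.110".
folds = sorted({".".join(row[0].split(".")[:2]) for row in hits})
print(len(hits), "matches above threshold; folds:", folds)
```

All five listed matches clear this cutoff, and four of them fall in the same SCOP fold (d.110), which is why the GAF-like fold is the more plausible structural neighbor than the single d.145 hit.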



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 152 Family Scaffolds
PF01061 | ABC2_membrane | 6.58
PF00753 | Lactamase_B | 5.92
PF02515 | CoA_transf_3 | 5.26
PF13436 | Gly-zipper_OmpA | 3.29
PF00691 | OmpA | 3.29
PF12698 | ABC2_membrane_3 | 3.29
PF05685 | Uma2 | 3.29
PF13488 | Gly-zipper_Omp | 2.63
PF12705 | PDDEXK_1 | 2.63
PF02668 | TauD | 2.63
PF04519 | Bactofilin | 2.63
PF12695 | Abhydrolase_5 | 1.97
PF13744 | HTH_37 | 1.97
PF13361 | UvrD_C | 1.97
PF00296 | Bac_luciferase | 1.97
PF07690 | MFS_1 | 1.97
PF03631 | Virul_fac_BrkB | 1.32
PF13533 | Biotin_lipoyl_2 | 1.32
PF12697 | Abhydrolase_6 | 1.32
PF01734 | Patatin | 0.66
PF16576 | HlyD_D23 | 0.66
PF00486 | Trans_reg_C | 0.66
PF12146 | Hydrolase_4 | 0.66
PF13304 | AAA_21 | 0.66
PF00990 | GGDEF | 0.66
PF09350 | DJC28_CD | 0.66
PF00529 | CusB_dom_1 | 0.66
PF13441 | Gly-zipper_YMGG | 0.66
PF02082 | Rrf2 | 0.66
PF13711 | DUF4160 | 0.66
PF01244 | Peptidase_M19 | 0.66
PF00575 | S1 | 0.66
PF00326 | Peptidase_S9 | 0.66
PF10518 | TAT_signal | 0.66
PF01425 | Amidase | 0.66

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 152 Family Scaffolds
COG1804 | Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferases | Lipid transport and metabolism [I] | 5.26
COG4636 | Endonuclease, Uma2 family (restriction endonuclease fold) | General function prediction only [R] | 3.29
COG1664 | Cytoskeletal protein CcmA, bactofilin family | Cytoskeleton [Z] | 2.63
COG2175 | Taurine dioxygenase, alpha-ketoglutarate-dependent | Secondary metabolites biosynthesis, transport and catabolism [Q] | 2.63
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 1.97
COG1295 | Uncharacterized membrane protein, BrkB/YihY/UPF0761 family (not an RNase) | Function unknown [S] | 1.32
COG0154 | Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 0.66
COG0640 | DNA-binding transcriptional regulator, ArsR family | Transcription [K] | 0.66
COG1414 | DNA-binding transcriptional regulator, IclR family | Transcription [K] | 0.66
COG1725 | DNA-binding transcriptional regulator YhcF, GntR family | Transcription [K] | 0.66
COG1752 | Predicted acylesterase/phospholipase RssA, contains patatin domain | General function prediction only [R] | 0.66
COG1959 | DNA-binding transcriptional regulator, IscR family | Transcription [K] | 0.66
COG2186 | DNA-binding transcriptional regulator, FadR family | Transcription [K] | 0.66
COG2188 | DNA-binding transcriptional regulator, GntR family | Transcription [K] | 0.66
COG2355 | Zn-dependent dipeptidase, microsomal dipeptidase homolog | Posttranslational modification, protein turnover, chaperones [O] | 0.66
COG2378 | Predicted DNA-binding transcriptional regulator YobV, contains HTH and WYL domains | Transcription [K] | 0.66
COG2524 | Predicted transcriptional regulator, contains C-terminal CBS domains | Transcription [K] | 0.66
COG3621 | Patatin-like phospholipase/acyl hydrolase, includes sporulation protein CotR | General function prediction only [R] | 0.66
COG4667 | Predicted phospholipase, patatin/cPLA2 family | Lipid transport and metabolism [I] | 0.66



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 94.74 %
Unclassified | root | N/A | 5.26 %


Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000364|INPhiseqgaiiFebDRAFT_101591877All Organisms → cellular organisms → Bacteria4072Open in IMG/M
3300000955|JGI1027J12803_108243037All Organisms → cellular organisms → Bacteria601Open in IMG/M
3300002558|JGI25385J37094_10029760All Organisms → cellular organisms → Bacteria1929Open in IMG/M
3300002562|JGI25382J37095_10000502All Organisms → cellular organisms → Bacteria9898Open in IMG/M
3300002908|JGI25382J43887_10042861All Organisms → cellular organisms → Bacteria2460Open in IMG/M
3300003324|soilH2_10003947All Organisms → cellular organisms → Bacteria → Proteobacteria8958Open in IMG/M
3300005166|Ga0066674_10046809All Organisms → cellular organisms → Bacteria → Proteobacteria1944Open in IMG/M
3300005167|Ga0066672_10218023All Organisms → cellular organisms → Bacteria1220Open in IMG/M
3300005174|Ga0066680_10092221All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1842Open in IMG/M
3300005174|Ga0066680_10172343All Organisms → cellular organisms → Bacteria1361Open in IMG/M
3300005176|Ga0066679_10226307All Organisms → cellular organisms → Bacteria1199Open in IMG/M
3300005176|Ga0066679_10299923All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1044Open in IMG/M
3300005176|Ga0066679_10904834All Organisms → cellular organisms → Bacteria555Open in IMG/M
3300005177|Ga0066690_10137280All Organisms → cellular organisms → Bacteria1599Open in IMG/M
3300005181|Ga0066678_10544908All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium770Open in IMG/M
3300005184|Ga0066671_10140458All Organisms → cellular organisms → Bacteria1395Open in IMG/M
3300005186|Ga0066676_10005666All Organisms → cellular organisms → Bacteria → Proteobacteria5782Open in IMG/M
3300005187|Ga0066675_10010503All Organisms → cellular organisms → Bacteria4842Open in IMG/M
3300005187|Ga0066675_11260100All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium547Open in IMG/M
3300005332|Ga0066388_100089638All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3492Open in IMG/M
3300005332|Ga0066388_100482217All Organisms → cellular organisms → Bacteria → Proteobacteria1878Open in IMG/M
3300005332|Ga0066388_100626223All Organisms → cellular organisms → Bacteria1695Open in IMG/M
3300005332|Ga0066388_100823309All Organisms → cellular organisms → Bacteria1518Open in IMG/M
3300005332|Ga0066388_107069103All Organisms → cellular organisms → Bacteria564Open in IMG/M
3300005406|Ga0070703_10525579All Organisms → cellular organisms → Bacteria536Open in IMG/M
3300005445|Ga0070708_100016878All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6071Open in IMG/M
3300005445|Ga0070708_100098962All Organisms → cellular organisms → Bacteria → Proteobacteria2667Open in IMG/M
3300005445|Ga0070708_100115321All Organisms → cellular organisms → Bacteria → Proteobacteria2473Open in IMG/M
3300005446|Ga0066686_10036361All Organisms → cellular organisms → Bacteria2921Open in IMG/M
3300005447|Ga0066689_10001830All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria7712Open in IMG/M
3300005450|Ga0066682_10017859All Organisms → cellular organisms → Bacteria3945Open in IMG/M
3300005454|Ga0066687_10392629All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium801Open in IMG/M
3300005467|Ga0070706_101870070All Organisms → cellular organisms → Bacteria545Open in IMG/M
3300005468|Ga0070707_100796734All Organisms → cellular organisms → Bacteria909Open in IMG/M
3300005518|Ga0070699_100303432All Organisms → cellular organisms → Bacteria → Proteobacteria1433Open in IMG/M
3300005518|Ga0070699_101846738All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium553Open in IMG/M
3300005536|Ga0070697_100161532All Organisms → cellular organisms → Bacteria → Proteobacteria1893Open in IMG/M
3300005536|Ga0070697_100814907All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria826Open in IMG/M
3300005536|Ga0070697_101222324All Organisms → cellular organisms → Bacteria670Open in IMG/M
3300005540|Ga0066697_10070460All Organisms → cellular organisms → Bacteria2006Open in IMG/M
3300005546|Ga0070696_100530765All Organisms → cellular organisms → Bacteria → Proteobacteria940Open in IMG/M
3300005546|Ga0070696_101155495All Organisms → cellular organisms → Bacteria → Proteobacteria653Open in IMG/M
3300005549|Ga0070704_100716150All Organisms → cellular organisms → Bacteria889Open in IMG/M
3300005554|Ga0066661_10460017All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium776Open in IMG/M
3300005557|Ga0066704_10389198All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria928Open in IMG/M
3300005559|Ga0066700_10157977All Organisms → cellular organisms → Bacteria1541Open in IMG/M
3300005559|Ga0066700_10268674All Organisms → cellular organisms → Bacteria1196Open in IMG/M
3300005559|Ga0066700_10582757All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium780Open in IMG/M
3300006049|Ga0075417_10013196All Organisms → cellular organisms → Bacteria → Proteobacteria3129Open in IMG/M
3300006049|Ga0075417_10177300All Organisms → cellular organisms → Bacteria1001Open in IMG/M
3300006049|Ga0075417_10222493All Organisms → cellular organisms → Bacteria899Open in IMG/M
3300006173|Ga0070716_100526073All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium877Open in IMG/M
3300006804|Ga0079221_10877511All Organisms → cellular organisms → Bacteria655Open in IMG/M
3300006844|Ga0075428_100872278All Organisms → cellular organisms → Bacteria955Open in IMG/M
3300006845|Ga0075421_101363608All Organisms → cellular organisms → Bacteria782Open in IMG/M
3300006852|Ga0075433_10218209All Organisms → cellular organisms → Bacteria1694Open in IMG/M
3300006854|Ga0075425_101451745All Organisms → cellular organisms → Bacteria775Open in IMG/M
3300006871|Ga0075434_101062681All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria823Open in IMG/M
3300006871|Ga0075434_102073331All Organisms → cellular organisms → Bacteria573Open in IMG/M
3300006904|Ga0075424_101488157All Organisms → cellular organisms → Bacteria719Open in IMG/M
3300006904|Ga0075424_102270427All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300006954|Ga0079219_10006638All Organisms → cellular organisms → Bacteria3601Open in IMG/M
3300006969|Ga0075419_10015319All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria4723Open in IMG/M
3300007076|Ga0075435_101840867All Organisms → cellular organisms → Bacteria531Open in IMG/M
3300007265|Ga0099794_10061459All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium1825Open in IMG/M
3300009012|Ga0066710_101412016All Organisms → cellular organisms → Bacteria → Proteobacteria1078Open in IMG/M
3300009012|Ga0066710_104283744All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium533Open in IMG/M
3300009038|Ga0099829_10902998Not Available733Open in IMG/M
3300009089|Ga0099828_10056531All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Methylococcales → Methylococcaceae → Methyloterricola → Methyloterricola oryzae3277Open in IMG/M
3300009090|Ga0099827_10687186All Organisms → cellular organisms → Bacteria → Proteobacteria884Open in IMG/M
3300009090|Ga0099827_10749189All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria845Open in IMG/M
3300009100|Ga0075418_10033895All Organisms → cellular organisms → Bacteria5550Open in IMG/M
3300009137|Ga0066709_102170469All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium766Open in IMG/M
3300009137|Ga0066709_102646892All Organisms → cellular organisms → Bacteria670Open in IMG/M
3300009137|Ga0066709_103179771Not Available599Open in IMG/M
3300009147|Ga0114129_10017055All Organisms → cellular organisms → Bacteria10339Open in IMG/M
3300009147|Ga0114129_11094726Not Available998Open in IMG/M
3300009147|Ga0114129_11971710All Organisms → cellular organisms → Bacteria707Open in IMG/M
3300009162|Ga0075423_12344325All Organisms → cellular organisms → Bacteria581Open in IMG/M
3300009162|Ga0075423_12361525All Organisms → cellular organisms → Bacteria579Open in IMG/M
3300009777|Ga0105164_10014662All Organisms → cellular organisms → Bacteria → Proteobacteria4484Open in IMG/M
3300010046|Ga0126384_10005032All Organisms → cellular organisms → Bacteria8063Open in IMG/M
3300010046|Ga0126384_10155419All Organisms → cellular organisms → Bacteria1766Open in IMG/M
3300010046|Ga0126384_10434612All Organisms → cellular organisms → Bacteria1117Open in IMG/M
3300010048|Ga0126373_10355286All Organisms → cellular organisms → Bacteria1478Open in IMG/M
3300010301|Ga0134070_10001545All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria6903Open in IMG/M
3300010304|Ga0134088_10016499All Organisms → cellular organisms → Bacteria3224Open in IMG/M
3300010320|Ga0134109_10505155All Organisms → cellular organisms → Bacteria500Open in IMG/M
3300010359|Ga0126376_10110860All Organisms → cellular organisms → Bacteria → Proteobacteria2116Open in IMG/M
3300010359|Ga0126376_10705491All Organisms → cellular organisms → Bacteria971Open in IMG/M
3300010359|Ga0126376_10983257All Organisms → cellular organisms → Bacteria842Open in IMG/M
3300010359|Ga0126376_13203018All Organisms → cellular organisms → Bacteria506Open in IMG/M
3300010360|Ga0126372_10292782All Organisms → cellular organisms → Bacteria1425Open in IMG/M
3300010376|Ga0126381_101248644All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1074Open in IMG/M
3300010398|Ga0126383_10996962All Organisms → cellular organisms → Bacteria926Open in IMG/M
3300012096|Ga0137389_10758812Not Available833Open in IMG/M
3300012199|Ga0137383_10512261All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium879Open in IMG/M
3300012205|Ga0137362_10170175All Organisms → cellular organisms → Bacteria1869Open in IMG/M
3300012208|Ga0137376_10701193All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium873Open in IMG/M
3300012285|Ga0137370_10157146All Organisms → cellular organisms → Bacteria1317Open in IMG/M
3300012362|Ga0137361_10004456All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales9279Open in IMG/M
3300012362|Ga0137361_10769225All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria877Open in IMG/M
3300012362|Ga0137361_11802456All Organisms → cellular organisms → Bacteria530Open in IMG/M
3300012685|Ga0137397_10202145All Organisms → cellular organisms → Bacteria1478Open in IMG/M
3300012922|Ga0137394_10164453All Organisms → cellular organisms → Bacteria1891Open in IMG/M
3300012922|Ga0137394_10267921All Organisms → cellular organisms → Bacteria → Proteobacteria1461Open in IMG/M
3300012922|Ga0137394_11094682All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium660Open in IMG/M
3300012922|Ga0137394_11473437All Organisms → cellular organisms → Bacteria540Open in IMG/M
3300012923|Ga0137359_10450882All Organisms → cellular organisms → Bacteria1138Open in IMG/M
3300012948|Ga0126375_10577871Not Available854Open in IMG/M
3300012971|Ga0126369_11738523All Organisms → cellular organisms → Bacteria713Open in IMG/M
3300017959|Ga0187779_10596704All Organisms → cellular organisms → Bacteria739Open in IMG/M
3300017966|Ga0187776_10558745Not Available791Open in IMG/M
3300018431|Ga0066655_10039906All Organisms → cellular organisms → Bacteria2355Open in IMG/M
3300018433|Ga0066667_10063934All Organisms → cellular organisms → Bacteria2292Open in IMG/M
3300018433|Ga0066667_11031034All Organisms → cellular organisms → Bacteria710Open in IMG/M
3300018433|Ga0066667_11132360All Organisms → cellular organisms → Bacteria678Open in IMG/M
3300018468|Ga0066662_10496430All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1110Open in IMG/M
3300018482|Ga0066669_10000603All Organisms → cellular organisms → Bacteria → Proteobacteria11926Open in IMG/M
3300018482|Ga0066669_10135140All Organisms → cellular organisms → Bacteria1785Open in IMG/M
3300018482|Ga0066669_11020865All Organisms → cellular organisms → Bacteria → Proteobacteria747Open in IMG/M
3300019883|Ga0193725_1000121All Organisms → cellular organisms → Bacteria → Proteobacteria26296Open in IMG/M
3300024284|Ga0247671_1062759All Organisms → cellular organisms → Bacteria599Open in IMG/M
3300025173|Ga0209824_10012899All Organisms → cellular organisms → Bacteria → Proteobacteria3574Open in IMG/M
3300025910|Ga0207684_10177258All Organisms → cellular organisms → Bacteria → Proteobacteria1838Open in IMG/M
3300026298|Ga0209236_1035065All Organisms → cellular organisms → Bacteria2640Open in IMG/M
3300026313|Ga0209761_1066557All Organisms → cellular organisms → Bacteria1932Open in IMG/M
3300026318|Ga0209471_1176960All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium847Open in IMG/M
3300026318|Ga0209471_1228565All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium672Open in IMG/M
3300026324|Ga0209470_1127388All Organisms → cellular organisms → Bacteria1127Open in IMG/M
3300026332|Ga0209803_1002182All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria12358Open in IMG/M
3300026524|Ga0209690_1045778All Organisms → cellular organisms → Bacteria1989Open in IMG/M
3300026532|Ga0209160_1098556All Organisms → cellular organisms → Bacteria1486Open in IMG/M
3300026536|Ga0209058_1030419All Organisms → cellular organisms → Bacteria3342Open in IMG/M
3300027748|Ga0209689_1037671All Organisms → cellular organisms → Bacteria → Proteobacteria2811Open in IMG/M
3300027748|Ga0209689_1228505All Organisms → cellular organisms → Bacteria790Open in IMG/M
3300027748|Ga0209689_1236881All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium766Open in IMG/M
3300027873|Ga0209814_10014169All Organisms → cellular organisms → Bacteria3176Open in IMG/M
3300027875|Ga0209283_10240142Not Available1202Open in IMG/M
3300027882|Ga0209590_10307041All Organisms → cellular organisms → Bacteria → Proteobacteria1019Open in IMG/M
3300027909|Ga0209382_10369446All Organisms → cellular organisms → Bacteria1603Open in IMG/M
(restricted) 3300031248|Ga0255312_1090443All Organisms → cellular organisms → Bacteria744Open in IMG/M
3300031720|Ga0307469_10615303Not Available973Open in IMG/M
3300031720|Ga0307469_10800339All Organisms → cellular organisms → Bacteria865Open in IMG/M
3300031720|Ga0307469_11871790All Organisms → cellular organisms → Bacteria581Open in IMG/M
3300031720|Ga0307469_12520860All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium503Open in IMG/M
3300031740|Ga0307468_100371774All Organisms → cellular organisms → Bacteria → Proteobacteria1077Open in IMG/M
3300031820|Ga0307473_10164466All Organisms → cellular organisms → Bacteria1279Open in IMG/M
3300032180|Ga0307471_100046943All Organisms → cellular organisms → Bacteria3524Open in IMG/M
3300032180|Ga0307471_102929839All Organisms → cellular organisms → Bacteria → Proteobacteria606Open in IMG/M
3300032180|Ga0307471_103558115All Organisms → cellular organisms → Bacteria551Open in IMG/M
3300032205|Ga0307472_100328815All Organisms → cellular organisms → Bacteria → Proteobacteria1244Open in IMG/M
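Each scaffold identifier in the table above has two parts separated by a pipe. The prefix appears to match the Taxon OID column of the Associated Samples table, so scaffolds can be grouped by sample. The following is an illustrative sketch only: `parse_scaffold_id` is a hypothetical helper, the "prefix is the Taxon OID" reading is an inference from this page, and the three identifiers are a small subset copied from the table.

```python
from collections import Counter

RESTRICTED_PREFIX = "(restricted)"


def parse_scaffold_id(raw: str):
    """Split 'TAXON_OID|SCAFFOLD' into parts; flag rows marked as restricted."""
    raw = raw.strip()
    restricted = raw.startswith(RESTRICTED_PREFIX)
    if restricted:
        raw = raw[len(RESTRICTED_PREFIX):].strip()
    taxon_oid, scaffold = raw.split("|", 1)
    return taxon_oid, scaffold, restricted


# A few identifiers copied from the Associated Scaffolds table.
sample_ids = [
    "3300005174|Ga0066680_10092221",
    "3300005174|Ga0066680_10172343",
    "(restricted) 3300031248|Ga0255312_1090443",
]

parsed = [parse_scaffold_id(s) for s in sample_ids]
per_sample = Counter(taxon_oid for taxon_oid, _, _ in parsed)
print(per_sample)
```

Applied to the full table, the same grouping would show which samples contribute more than one scaffold to the family.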

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 23.03%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 13.82%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 13.82%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 11.84%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 10.53%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 8.55%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.58%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.29%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.97%
Wastewater | Environmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater | 1.32%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.32%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.32%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.66%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.66%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.66%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.66%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300006969Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009777Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking waterEnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300017959Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MGEnvironmentalOpen in IMG/M
3300017966Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2EnvironmentalOpen in IMG/M
3300024284Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK12EnvironmentalOpen in IMG/M
3300025173Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Powered by OpenStreetMap




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI) and are marked "(restricted)" below. Using any of their features requires obtaining a license from the corresponding author(s) of the datasets.

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10159187783300000364SoilVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGRDWALPEAVREPLRALLEPLGFDVDRSITVRELADENTLEFRQDESP*
JGI1027J12803_10824303723300000955SoilAVRPRTRRVATPVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGRDWALPEAVREPLRALLEPLGFDVDRSITVRELADENT
JGI25385J37094_1002976023300002558Grasslands SoilMLVTRIPFWRLRQHGVVEEAVRGGSRCRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD*
JGI25382J37095_10000502103300002562Grasslands SoilMPVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDGD*
JGI25382J43887_1004286123300002908Grasslands SoilMLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD*
soilH2_1000394783300003324Sugarcane Root And Bulk SoilMPVARIPFWRLRAQGVVEEAVRGGSRRRVTDREWPLPDAVRDKMRGMLEPLGFDLARVVTVREPAGEEALEFHQD*
Ga0066674_1004680923300005166SoilMLVTRIPFWRLRQHGVVEEAVRGGSRRRIGGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD*
Ga0066672_1021802323300005167SoilMPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQDG*
Ga0066680_1009222133300005174SoilMALARIPFWRLRAHGVVEEAVRGGSRRRLLGHEWPLPDGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQDA*
Ga0066680_1017234323300005174SoilMPVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDG
Ga0066679_1022630713300005176SoilARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQDVQEGNNGPNHG*
Ga0066679_1029992323300005176SoilMPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV*
Ga0066679_1090483423300005176SoilMPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDVQGEQ*
Ga0066690_1013728013300005177SoilMPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDVQGEQ*
Ga0066678_1054490823300005181SoilMPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEF
Ga0066671_1014045833300005184SoilMPVTRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPEGVRERMRELLAPLGFDLGRPIFVREPEGEDALEFRQDA*
Ga0066676_1000566653300005186SoilVTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVFERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV*
Ga0066675_1001050333300005187SoilVTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVLERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV*
Ga0066675_1126010023300005187SoilMPITRIPFWQLRRHGVVEEAIRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGDPAVV*
Ga0066388_10008963833300005332Tropical Forest SoilVRPWRGSVATCRRYSERMPVTRIPFWRLRAQGVVEEAVRGGSRRRATDHDWPLPDAVREKMRGVLEPLGFDLARAVTVREPSGEDALEFQQD*
Ga0066388_10048221743300005332Tropical Forest SoilVSVATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPESVREPLRALLEPLGFDVGRSISVREPEGENALEFRQE*
Ga0066388_10062622333300005332Tropical Forest SoilMPVTRIPFWQLRRHGVVEEAVRGGHRRRIVGRDWLLPDAVRDRVREMLEPLGFDVGRPILVREPDGEDALEFRQDDT
Ga0066388_10082330913300005332Tropical Forest SoilVSVATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPESVREPLRALLEPLGFDVGRSISVREPEGENALEFRQD*
Ga0066388_10706910313300005332Tropical Forest SoilVSVATAVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADENTLEFRQD*
Ga0070703_1052557923300005406Corn, Switchgrass And Miscanthus RhizosphereVSVATPVPITRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDSVRERMRDLLEPLGFDVARSIIVRELEDENALEFRQD*
Ga0070708_10001687833300005445Corn, Switchgrass And Miscanthus RhizosphereMPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRALLEPLGFELDRPILVREPEGEDALEFRQDGV*
Ga0070708_10009896233300005445Corn, Switchgrass And Miscanthus RhizosphereVPIALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPEPVRERVRPLLEPLGFDLTRPISVREPANQDALEFTQPE*
Ga0070708_10011532113300005445Corn, Switchgrass And Miscanthus RhizosphereVSVTPAVPITRIPFWELRRHGVVEEAVRGGSRRRVVGRDWPLPDSVRDRMRDLLEPLGFDVARAIMVRELEDQNALEFRQD*
Ga0066686_1003636133300005446SoilVTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPEAVLERVRELLEPLGFDLGRPIFVREPEGEDERT*
Ga0066689_1000183023300005447SoilMLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGADGD*
Ga0066682_1001785933300005450SoilVTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVFERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDVIDG*
Ga0066687_1039262913300005454SoilMPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGLELGRPILVREPEGEDALEFRQDVQGEQ*
Ga0070706_10187007023300005467Corn, Switchgrass And Miscanthus RhizosphereVSVTPAVPITRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDSVRERMRDLLEPLGFDVARPIMVRELEDQNALE
Ga0070707_10079673423300005468Corn, Switchgrass And Miscanthus RhizosphereMPVALIPFWRLRQHAVSEEAIRGGSRRRALDRTWPLPDAVKERMRDLLEPLGFDLDRPISVNEPANQDALEFTQPE*
Ga0070699_10030343223300005518Corn, Switchgrass And Miscanthus RhizosphereVPVALIPFWRLRQHAASEEAVRGGSRRRVLDRTWPLPDAVKERLRPLLEPLGFDVDQPVSVSEPVGQDALEFTQP*
Ga0070699_10184673813300005518Corn, Switchgrass And Miscanthus RhizosphereMPVTRIPFWQLRRHGVAEEAVRGGSRRRILGRDWPLPEPVRERLRALLEPLGFDLQRPVSVREPEGEDALEFSQS
Ga0070697_10016153213300005536Corn, Switchgrass And Miscanthus RhizosphereARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRTLLEPLGFELDRPILVREPEGEDALEFRQDGV*
Ga0070697_10081490723300005536Corn, Switchgrass And Miscanthus RhizosphereMTALARIPFWRLRAHGVVEEAVRGGSRRRQIGHEWLLPAGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQDA*
Ga0070697_10122232423300005536Corn, Switchgrass And Miscanthus RhizosphereVSVTPAVPITRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDSVRERMRDLLEPLGFDVARPIMVRELEDQNALEFRQD*
Ga0066697_1007046043300005540SoilPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV*
Ga0070696_10053076523300005546Corn, Switchgrass And Miscanthus RhizosphereVPVTRIPFWRLRAHGVVEEAVRGGTRRRLVGRDWTLPDAVHERVRGLLEPLGFDLGRPVSVREPEGEDALEFRQD*
Ga0070696_10115549513300005546Corn, Switchgrass And Miscanthus RhizosphereMPVTRIPFWQLRRHGVAEEAVRGGSRRRILGRDWPLPEPVRERLRALLEPLGFDLQRPVSVREPEGEDALEFSQS*
Ga0070704_10071615023300005549Corn, Switchgrass And Miscanthus RhizosphereVPVALIPFWRLRQHAASEEAVRGGSRRRVLDRTWPLPDAVKERLRPLLEPLGFDVDQPVSVSEPANQDALEFTQP*
Ga0066661_1046001713300005554SoilMPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPGVV*
Ga0066704_1038919823300005557SoilVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDGN*
Ga0066700_1015797723300005559SoilMPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVCEPEGEDALEFRQDVQGEQ*
Ga0066700_1026867423300005559SoilVTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVLERVRELLEPLGFDLGRPIFVREPEGEEPSAVSNSRH*
Ga0066700_1058275723300005559SoilMPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQD
Ga0075417_1001319633300006049Populus RhizosphereVPATRIPFWQLRRHGVAEEAVRGGSRRRIVGRDWPLPDGVRERLRGLLEPIGFDLARPIVVREPHGEDALEFQQD*
Ga0075417_1017730043300006049Populus RhizosphereVSVATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPASVREPLRALLEPLGFDVGRSISVREPEDENALEFRQD*
Ga0075417_1022249333300006049Populus RhizosphereGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD*
Ga0070716_10052607313300006173Corn, Switchgrass And Miscanthus RhizosphereMPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV*
Ga0079221_1087751113300006804Agricultural SoilMARARGRGAACRSRTSRVTRVPIARIPFWQLRRHGVVEEAVRGGTRRRIVGRDWPLPDAVRETMRGLLEPLGFDLGRAISVREPDGEEALE
Ga0075428_10087227833300006844Populus RhizosphereAAGPRPRGLAAPVPATRIPFWQLRRHGVAEEAVRGGSRRRIVGRDWPLPDGVRERLRGLLEPIGFDLARPIVVREPHGEDALEFQQD*
Ga0075421_10136360833300006845Populus RhizosphereLATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVTRSIIVRELGDDNALEFRQD*
Ga0075433_1021820933300006852Populus RhizosphereVSVATAVPSARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSINVRELADDNTLEFRQD*
Ga0075425_10145174533300006854Populus RhizosphereWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSINVRELADDNTLEFRQD*
Ga0075434_10106268123300006871Populus RhizosphereMPVTRVPFWRLRAQGVVEEAVRGGTRRRVRGHDWPLPEAVRDRMREVLEPLGFDLARPVSVREPEGEDALEFSQDEPGPGGECRGS*
Ga0075434_10207333133300006871Populus RhizosphereWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD*
Ga0075424_10148815713300006904Populus RhizosphereVPLARIPFWQLRRHGVVEEAVRGGTRRRVVGRDWALPEAVREPLRALLEPLGFDVDRSISVRELADENTLEFRQDESG*
Ga0075424_10227042733300006904Populus RhizosphereLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD*
Ga0079219_1000663873300006954Agricultural SoilPIARIPFWQLRRHGVVEEAVRGGTRRRIVGRDWPLPDAVRETMRGLLEPLGFDLGRAISVREPDGEEALEFRQDAGGH*
Ga0075419_1001531943300006969Populus RhizosphereVSVATSVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPASVREPLRALLEPLGFDVGRSISVREPEDENALEFRQD*
Ga0075435_10184086713300007076Populus RhizosphereSRIPFWQLRRHGVVEEAVRGGARRRIVGRDWPLPDAVRERMRGLLEPVGFDFARPILVREPDGEDALEFRQD*
Ga0099794_1006145933300007265Vadose Zone SoilMPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEG
Ga0066710_10141201623300009012Grasslands SoilMPVTRIPFWQLRRHGVAEEAVRGGSRRRILGRDWPLPEAVHERVRTLLEPLGFDLERPVSVREPEDEDALEFRQDDGQN
Ga0066710_10428374413300009012Grasslands SoilMPITRIPFWQLRRHGVVEEAIRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGDPAVV
Ga0099829_1090299823300009038Vadose Zone SoilMPIARIPFWELRRHGVAEEAVRGGVRTRILDREWPLPDATRERLRELLEPLGFDLARPVSVREPAGEDALEFRQEEPPA*
Ga0099828_1005653143300009089Vadose Zone SoilVPIALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRNLLEPLGFDLDRPISVREPANQDALEFIQPE*
Ga0099827_1068718623300009090Vadose Zone SoilVPIALIPFWRLRQHAVSEEAVRGGSRRRALDRTWSLPEPVRERMRPLLEPLGFDLDRPISVREPANQDALEFIQPE*
Ga0099827_1074918933300009090Vadose Zone SoilMTALARIPFWRLRAHGVFEEAVRGGSRRRQIGHEWPLPDGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQ
Ga0075418_10033895103300009100Populus RhizosphereVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPASVREPLRALLEPLGFDVGRSISVREPEDENALEFRQD*
Ga0066709_10217046923300009137Grasslands SoilMALARIPFWRLRAHGVVEEAVRGGSRRRLVGREWPLPAGVRERMRGLLEPFGFDLARPVAVREPEGEDALEFSQDA*
Ga0066709_10264689223300009137Grasslands SoilMPVARIPFWQLRRQGVVEEAIRGGNRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVCEPEGEDALEFRQDVQGEQ*
Ga0066709_10317977113300009137Grasslands SoilMALARIPFWRLRAHGVVEEAVRGGSRRRLIGHEWPLPEGVREGLRGLLEPLGFDLGRTVAVREPEGEDALE
Ga0114129_1001705573300009147Populus RhizosphereVSVATAVPSARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD*
Ga0114129_1109472623300009147Populus RhizosphereMPVTRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPEAVRERMRELLAPLGFDLGRPIFVREPEGEDALEFRQDA*
Ga0114129_1197171033300009147Populus RhizosphereVSLATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVTRSIIVRELGDDNALEFRQD*
Ga0075423_1234432533300009162Populus RhizosphereFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSINVRELADDNTLEFRQD*
Ga0075423_1236152533300009162Populus RhizosphereFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD*
Ga0105164_1001466233300009777WastewaterVPTALIPFWRLRQHAVTEEAVRGGSRRRALDQSWPLPATVTERLRPLLEPLGFDVDRPVSVREPAGQDALEFTQDQ*
Ga0126384_10005032113300010046Tropical Forest SoilMPITRIPFWQLRRHGVVEEAVRGGHRRRIVGRDWPLPDAVRERVRELLEPLGFDVGRPILVREPDGEDALEFRQD*
Ga0126384_1015541943300010046Tropical Forest SoilVSVATAVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD*
Ga0126384_1043461233300010046Tropical Forest SoilVVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRDLLEPLGFDVARSITVRELGDDNALEFQQD*
Ga0126373_1035528623300010048Tropical Forest SoilVVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVARSITVRELGDDNALEFQQD*
Ga0134070_1000154573300010301Grasslands SoilMPTMLVTRIPFWRLRQHGVVEEAVRGGSRRRIGGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD*
Ga0134088_1001649933300010304Grasslands SoilMPTMLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD*
Ga0134109_1050515513300010320Grasslands SoilMPTMLVTRIPFWRLRQHGVVEEAVRGGSRRRIGGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDVGGDGD*
Ga0126376_1011086043300010359Tropical Forest SoilVPSARIPFWQLRSHGVVEEAVRGGTRRRVVGRDWALPEAVREPLRALLEPLGFDVERSISVRELADENTLEFRQD*
Ga0126376_1070549123300010359Tropical Forest SoilVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVTRSIIVRELGDDNALEFRQD*
Ga0126376_1098325743300010359Tropical Forest SoilVSVATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPESVREPLRALLEPLGFDVGRSISVREPEGGNA
Ga0126376_1320301813300010359Tropical Forest SoilVSVATAVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRSFLEPLGFDFDRSISVRELADDNTLEFRQD*
Ga0126372_1029278223300010360Tropical Forest SoilVVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVARSIMVRELGDDNALEFRQD*
Ga0126381_10124864413300010376Tropical Forest SoilVSVATAVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTL
Ga0126383_1099696223300010398Tropical Forest SoilVSVATAVPSARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVSRSISVRELADENTLEFRQD*
Ga0137389_1075881223300012096Vadose Zone SoilVPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERVRPLLEPLGFDLDKPISVSEPTNEDALEFTQPE*
Ga0137383_1051226113300012199Vadose Zone SoilMPTMLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDG
Ga0137362_1017017523300012205Vadose Zone SoilMPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGDPAVV*
Ga0137376_1070119323300012208Vadose Zone SoilMPVARVPFWQLRRHGVVEEAVRGGSRRRLVGREWPLPEAVREGLRALLEPLGFDLGRPISVREPDGEDALEFLQADVPEPPQAP*
Ga0137370_1015714633300012285Vadose Zone SoilMPVARVPFWQLRRHGVVEEAVRGGSRRRLVGREWPLPEAVREGLRALLEPLGFDLGRPISVREPDGEDALEFLQADVPEPPEAP*
Ga0137361_1000445683300012362Vadose Zone SoilMPTMPVTRVPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD*
Ga0137361_1076922513300012362Vadose Zone SoilMTALARIPFWRLRAHGVVEEAVRGGSRRRQIGHEWPLPDGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQ
Ga0137361_1180245623300012362Vadose Zone SoilMPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV*
Ga0137397_1020214533300012685Vadose Zone SoilMPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGGT*
Ga0137394_1016445333300012922Vadose Zone SoilPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGGT*
Ga0137394_1026792123300012922Vadose Zone SoilVPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRPLLEPLGFDLARPVSVSEPAHQDALEFTQAE*
Ga0137394_1109468223300012922Vadose Zone SoilMPVIRVPFWQLRRHGVVEEAVRGGTRRRIVGRDWLLPEAVRERMRELLEPLGFDLARPVSVREPEGEDALEFRQDDSPA*
Ga0137394_1147343723300012922Vadose Zone SoilMPVARFPFWQLRRHGVVEEAIRGGSRRRIVGRDWMLPEAVRERMRELLEPLGFELGRPIVVREPEGEDALEFRQDGDPAVV*
Ga0137359_1045088233300012923Vadose Zone SoilMPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQDGDPAVV*
Ga0126375_1057787123300012948Tropical Forest SoilMPVTRIPFWRLRAQGVVEEAVRGGTRRRVTGHDWPLPNAVRDRLRAMLEPLGFDLGRPVSVREPEGEDTLEFSQD*
Ga0126369_1173852313300012971Tropical Forest SoilVSVATAVPSARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADENTLEFRQD*
Ga0187779_1059670423300017959Tropical PeatlandMPVARIPFWRLRAQGVVEEAVRGGTRRRLTGQSWPLPETVRAGLRAVLEPLGFDLARPVSVGEPADEDALEFSQD
Ga0187776_1055874513300017966Tropical PeatlandVPVARLAFWELRRHNVAEEAVRGGVRRRIHDRAWPLPEAVRERLRVVLEPLGFDLARPVSVREPEGADALEFSQDEPRA
Ga0066655_1003990623300018431Grasslands SoilMLVTRIPFWRLRQHGVVEEAVRGGSRRRIGGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD
Ga0066667_1006393433300018433Grasslands SoilMLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD
Ga0066667_1103103413300018433Grasslands SoilMPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV
Ga0066667_1113236023300018433Grasslands SoilMALARIPFWRLRAHGVVEEAVRGGSRRRLIGHEWPLPEGVREGLRGLLEPLGFDLGRTVAVREPEGE
Ga0066662_1049643013300018468Grasslands SoilAERIITAMALARIPFWRLRAHGVVEEAVRGGSRRRQLGHEWPLPDGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQDA
Ga0066669_1000060353300018482Grasslands SoilVTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVLERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV
Ga0066669_1013514023300018482Grasslands SoilMLVTRIPFWRLRQNGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD
Ga0066669_1102086523300018482Grasslands SoilMARARIPFWRLRAHGVVEEAVRGGSRRRLIGHEWPLPEGVREGLRGLLEPLGFDLGRTVAVREPEGEDALEFSQDA
Ga0193725_100012163300019883SoilVPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPEPVRERLRPLLEPLGFDLARPVSVSEPAGQDALRFDQQDAAG
Ga0247671_106275913300024284SoilMPVTRVPFWRLRAQGVVEEAVRGGTRRRVRGHDWPLPEAVRDRMREVLEPLGFDLARPVSVREPEGEDALEFSQDEPGPGGECRG
Ga0209824_1001289933300025173WastewaterVPTALIPFWRLRQHAVTEEAVRGGSRRRALDQSWPLPATVTERLRPLLEPLGFDVDRPVSVREPAGQDALEFTQDQ
Ga0207684_1017725823300025910Corn, Switchgrass And Miscanthus RhizosphereVPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVKERMRDLLEPLGFDLNRPISVSEPANQDALEFTQPE
Ga0209236_103506523300026298Grasslands SoilMLVTRIPFWRLRQHGVVEEAVRGGSRCRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD
Ga0209761_106655723300026313Grasslands SoilMLVTRIPFWRLRQNGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDGD
Ga0209471_117696023300026318SoilMPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQDG
Ga0209471_122856523300026318SoilMPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQ
Ga0209470_112738813300026324SoilVTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVFERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDVIDG
Ga0209803_100218223300026332SoilMLVTRIPFWRLRQHGVVEEAVRGGSRCRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGADGD
Ga0209690_104577833300026524SoilMRRSSFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPEAVLERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV
Ga0209160_109855633300026532SoilMPPMPVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDGN
Ga0209058_103041923300026536SoilVTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVFERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV
Ga0209689_103767143300027748SoilIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD
Ga0209689_122850523300027748SoilMPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVCEPEGEDALEFRQDVQGEQ
Ga0209689_123688123300027748SoilMPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDP
Ga0209814_1001416943300027873Populus RhizosphereVPATRIPFWQLRRHGVAEEAVRGGSRRRIVGRDWPLPDGVRERLRGLLEPIGFDLARPIVVREPHGEDALEFQQD
Ga0209283_1024014223300027875Vadose Zone SoilVPIALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRNLLEPLGFDLDRPISVREPANQDALEFIQPE
Ga0209590_1030704113300027882Vadose Zone SoilWPRPARSELASGSDRVPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRNLLEPLGFDLDRPISVREPANQDALEFIQPE
Ga0209382_1036944643300027909Populus RhizosphereVPLATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVTRSIIVRELGDDNALEFRQD
(restricted) Ga0255312_109044333300031248Sandy SoilMPRALIPFWRLRQHAASEEAVRGGSRRRAVDRSWPLPEAVTERLRPLLEPLGFDLARPITVSEPAGQDALQFDQPDPD
Ga0307469_1061530323300031720Hardwood Forest SoilVPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRDVLEPLGFDLDRPISVSEPANKDALEFTQPE
Ga0307469_1080033923300031720Hardwood Forest SoilMPVTRVPFWRLRAQGVVEEAVRGGTRRRVRGHDWPLPEAVRDRMREVLEPLGFDLARPVSVREPEGEDALEFSQDEPGPGGECRGS
Ga0307469_1187179013300031720Hardwood Forest SoilVPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPEPVRERLRPLLEPLGFDLARPISVSEPPGQDALQFDQP
Ga0307469_1252086023300031720Hardwood Forest SoilARIPYWRLRAQAVAEEAVRGGTRRRLTEREWLLPEAVRDRLRDVLEPLGFDLARPVSVREAEGEDALEFSQDEGHKE
Ga0307468_10037177423300031740Hardwood Forest SoilVPLARIPYWRLRAQAVAEEAVRGGTRRRLTEREWLLPEAVRDRLRDVLEPLGFDLARPVSVREAEGEDALEFSQDEGNKE
Ga0307473_1016446623300031820Hardwood Forest SoilMPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWVLPEAVRERMRDLLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV
Ga0307471_10004694343300032180Hardwood Forest SoilMPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRTLLEPLGFELDRPILVREPEGEDALEFRQDGV
Ga0307471_10292983923300032180Hardwood Forest SoilMPTTRIPFWELRRHGVVEEAVRGGSRRRIVGRDWPLPEAVRERMGALLAPLGFDLGRPIVVREPDGEDALEFSQE
Ga0307471_10355811513300032180Hardwood Forest SoilAVPITRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDSVRERMRDLLEPLGFDVARPIMVRELEDQNALEFRQD
Ga0307472_10032881523300032205Hardwood Forest SoilMPVTRIPFWRLRSQGVAEEAVRGGSRRRLTGRDWPLPDEVRDRMRGVLEPLGFDLGRPVSVREPEGEDAREFSQD
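
The rows above print the four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) with no delimiter between them. The following is a minimal sketch of one way to split such a row, not the database's own export format. It rests on assumptions inferred only from the rows shown here: the sample taxon ID is exactly ten digits beginning with 33, the habitat is mixed-case text ending in a lowercase letter, and the sequence is the trailing run of uppercase letters with an optional final `*`. The names `FamilyRow` and `parse_family_row` are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    """One row of the Family Sequences table (field names are illustrative)."""
    restricted: bool
    protein_id: str
    sample_taxon_id: str
    habitat: str
    sequence: str
    ends_with_star: bool


# Assumed layout, inferred from the rows on this page only:
#   [(restricted) ]<protein id><10-digit taxon id starting with 33><Habitat><SEQUENCE>[*]
_ROW = re.compile(
    r"^(?P<restricted>\(restricted\)\s*)?"
    r"(?P<protein_id>\S+?)"                # shortest prefix that lets the rest match
    r"(?P<taxon_id>33\d{8})"               # assumption: ten digits beginning with 33
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"  # mixed case, ends at the last lowercase letter
    r"(?P<sequence>[A-Z]+)"                # residues are uppercase one-letter codes
    r"(?P<star>\*)?$"
)


def parse_family_row(line: str) -> Optional[FamilyRow]:
    """Split one fused table row into fields; return None if it does not fit."""
    match = _ROW.match(line.strip())
    if match is None:
        return None
    return FamilyRow(
        restricted=match.group("restricted") is not None,
        protein_id=match.group("protein_id"),
        sample_taxon_id=match.group("taxon_id"),
        habitat=match.group("habitat"),
        sequence=match.group("sequence"),
        ends_with_star=match.group("star") is not None,
    )


if __name__ == "__main__":
    example = (
        "Ga0066672_1021802323300005167Soil"
        "MPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQDG*"
    )
    print(parse_family_row(example))
```

Because the habitat must begin with a letter immediately after the ten-digit ID, the split between protein ID and taxon ID is forced to the last ten digits before the habitat text. Lines that do not fit the assumed layout, such as the column header, return `None` rather than a guessed split.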




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice