NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F097019

Metagenome Family F097019

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F097019
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 195 residues
Representative Sequence MRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Number of Associated Samples 91
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 47.12 %
% of genes near scaffold ends (potentially truncated) 50.96 %
% of genes from short scaffolds (< 2000 bps) 60.58 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(41.346 % of family members)
Environment Ontology (ENVO) Unclassified
(40.385 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(49.038 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 70.10%    β-sheet: 0.00%    Coil/Unstructured: 29.90%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 104 Family Scaffolds
PF01180DHO_dh 20.19
PF13524Glyco_trans_1_2 13.46
PF00296Bac_luciferase 5.77
PF12996DUF3880 2.88
PF00291PALP 1.92
PF01361Tautomerase 1.92
PF01790LGT 0.96
PF02511Thy1 0.96
PF01555N6_N4_Mtase 0.96
PF13103TonB_2 0.96
PF14238DUF4340 0.96
PF02152FolB 0.96
PF01618MotA_ExbB 0.96
PF01425Amidase 0.96
PF02472ExbD 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 104 Family Scaffolds
COG0042tRNA-dihydrouridine synthaseTranslation, ribosomal structure and biogenesis [J] 20.19
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 20.19
COG0167Dihydroorotate dehydrogenaseNucleotide transport and metabolism [F] 20.19
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 20.19
COG2070NAD(P)H-dependent flavin oxidoreductase YrpB, nitropropane dioxygenase familyGeneral function prediction only [R] 20.19
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 5.77
COG1942Phenylpyruvate tautomerase PptA, 4-oxalocrotonate tautomerase familySecondary metabolites biosynthesis, transport and catabolism [Q] 1.92
COG0154Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidaseTranslation, ribosomal structure and biogenesis [J] 0.96
COG0682Prolipoprotein diacylglyceryltransferaseCell wall/membrane/envelope biogenesis [M] 0.96
COG0848Biopolymer transport protein ExbDIntracellular trafficking, secretion, and vesicular transport [U] 0.96
COG0863DNA modification methylaseReplication, recombination and repair [L] 0.96
COG1041tRNA G10 N-methylase Trm11Translation, ribosomal structure and biogenesis [J] 0.96
COG1351Thymidylate synthase ThyX, FAD-dependent familyNucleotide transport and metabolism [F] 0.96
COG1539Dihydroneopterin aldolaseCoenzyme transport and metabolism [H] 0.96
COG2189Adenine specific DNA methylase ModReplication, recombination and repair [L] 0.96


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005166|Ga0066674_10145955All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1115Open in IMG/M
3300005167|Ga0066672_10035757All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2756Open in IMG/M
3300005174|Ga0066680_10031046All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3022Open in IMG/M
3300005174|Ga0066680_10524132All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium745Open in IMG/M
3300005176|Ga0066679_10581507All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium730Open in IMG/M
3300005177|Ga0066690_10002458All Organisms → cellular organisms → Bacteria7609Open in IMG/M
3300005180|Ga0066685_10349265All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1026Open in IMG/M
3300005186|Ga0066676_10075741All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1986Open in IMG/M
3300005332|Ga0066388_102240909All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium987Open in IMG/M
3300005445|Ga0070708_100027252All Organisms → cellular organisms → Bacteria4901Open in IMG/M
3300005445|Ga0070708_100275992All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1581Open in IMG/M
3300005446|Ga0066686_10008181All Organisms → cellular organisms → Bacteria5240Open in IMG/M
3300005447|Ga0066689_10611980All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium686Open in IMG/M
3300005467|Ga0070706_100030641All Organisms → cellular organisms → Bacteria4959Open in IMG/M
3300005471|Ga0070698_100572629All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1069Open in IMG/M
3300005557|Ga0066704_10476634All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium824Open in IMG/M
3300005559|Ga0066700_10280365All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1171Open in IMG/M
3300005561|Ga0066699_10004204All Organisms → cellular organisms → Bacteria6139Open in IMG/M
3300005569|Ga0066705_10110480All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1646Open in IMG/M
3300005586|Ga0066691_10325932All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium908Open in IMG/M
3300006046|Ga0066652_100022707All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4348Open in IMG/M
3300006794|Ga0066658_10156483All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1161Open in IMG/M
3300006797|Ga0066659_10422881All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1053Open in IMG/M
3300006852|Ga0075433_10477797All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1098Open in IMG/M
3300007076|Ga0075435_100780843All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium831Open in IMG/M
3300007255|Ga0099791_10070574All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1583Open in IMG/M
3300007265|Ga0099794_10505937All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium636Open in IMG/M
3300009012|Ga0066710_103384174All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium606Open in IMG/M
3300009038|Ga0099829_10010016All Organisms → cellular organisms → Bacteria → Proteobacteria6106Open in IMG/M
3300009038|Ga0099829_10122408All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2049Open in IMG/M
3300009088|Ga0099830_10153053All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1775Open in IMG/M
3300009088|Ga0099830_10362346All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1167Open in IMG/M
3300009089|Ga0099828_10027793All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4518Open in IMG/M
3300009090|Ga0099827_10008674All Organisms → cellular organisms → Bacteria6409Open in IMG/M
3300009090|Ga0099827_10065724All Organisms → cellular organisms → Bacteria2782Open in IMG/M
3300009090|Ga0099827_10708448All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium870Open in IMG/M
3300009818|Ga0105072_1090858All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium606Open in IMG/M
3300010359|Ga0126376_11861647All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium641Open in IMG/M
3300010360|Ga0126372_10530956All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1113Open in IMG/M
3300010398|Ga0126383_11145003All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium867Open in IMG/M
3300011271|Ga0137393_10536907All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1003Open in IMG/M
3300012096|Ga0137389_10044054All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3330Open in IMG/M
3300012096|Ga0137389_10393839All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1182Open in IMG/M
3300012189|Ga0137388_10014889All Organisms → cellular organisms → Bacteria5661Open in IMG/M
3300012189|Ga0137388_10396277All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1277Open in IMG/M
3300012199|Ga0137383_10039467All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3347Open in IMG/M
3300012201|Ga0137365_10110506All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2068Open in IMG/M
3300012202|Ga0137363_10073463All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2525Open in IMG/M
3300012205|Ga0137362_10092389All Organisms → cellular organisms → Bacteria2538Open in IMG/M
3300012206|Ga0137380_10000448All Organisms → cellular organisms → Bacteria29541Open in IMG/M
3300012285|Ga0137370_10220132All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1118Open in IMG/M
3300012350|Ga0137372_10083727All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2710Open in IMG/M
3300012357|Ga0137384_10017045All Organisms → cellular organisms → Bacteria → Proteobacteria5908Open in IMG/M
3300012359|Ga0137385_10017445All Organisms → cellular organisms → Bacteria6393Open in IMG/M
3300012363|Ga0137390_11379888All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium649Open in IMG/M
3300012685|Ga0137397_10000644All Organisms → cellular organisms → Bacteria24039Open in IMG/M
3300012917|Ga0137395_10314207All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1112Open in IMG/M
3300012918|Ga0137396_10394679All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1025Open in IMG/M
3300012922|Ga0137394_10004284All Organisms → cellular organisms → Bacteria10819Open in IMG/M
3300012925|Ga0137419_11061427All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium673Open in IMG/M
3300012925|Ga0137419_11613995All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium552Open in IMG/M
3300012927|Ga0137416_10535138All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1013Open in IMG/M
3300012944|Ga0137410_10059439All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2746Open in IMG/M
3300012944|Ga0137410_11320810All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium624Open in IMG/M
3300012976|Ga0134076_10345057All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium654Open in IMG/M
3300015241|Ga0137418_10168673All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1915Open in IMG/M
3300015245|Ga0137409_10639661All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium895Open in IMG/M
3300015264|Ga0137403_10109289All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2768Open in IMG/M
3300017656|Ga0134112_10157421All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium876Open in IMG/M
3300018056|Ga0184623_10333896All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium681Open in IMG/M
3300018431|Ga0066655_10594239All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium745Open in IMG/M
3300018433|Ga0066667_11193428All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium662Open in IMG/M
3300018482|Ga0066669_10011591All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4531Open in IMG/M
3300025910|Ga0207684_10042445All Organisms → cellular organisms → Bacteria3855Open in IMG/M
3300025922|Ga0207646_10212249All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1748Open in IMG/M
3300026309|Ga0209055_1012372All Organisms → cellular organisms → Bacteria4406Open in IMG/M
3300026317|Ga0209154_1081533All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1395Open in IMG/M
3300026328|Ga0209802_1092910All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1373Open in IMG/M
3300026328|Ga0209802_1246535All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium618Open in IMG/M
3300026332|Ga0209803_1030449All Organisms → cellular organisms → Bacteria2559Open in IMG/M
3300026334|Ga0209377_1012024All Organisms → cellular organisms → Bacteria4774Open in IMG/M
3300026335|Ga0209804_1174843All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium935Open in IMG/M
3300026342|Ga0209057_1115131All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1028Open in IMG/M
3300026354|Ga0257180_1042975All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium633Open in IMG/M
3300026360|Ga0257173_1013442All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium962Open in IMG/M
3300026371|Ga0257179_1025257All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium707Open in IMG/M
3300026469|Ga0257169_1009234All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1228Open in IMG/M
3300026480|Ga0257177_1017839All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium990Open in IMG/M
3300026528|Ga0209378_1003226All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria11074Open in IMG/M
3300026532|Ga0209160_1144387All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1110Open in IMG/M
3300026532|Ga0209160_1265607All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium583Open in IMG/M
3300026538|Ga0209056_10001986All Organisms → cellular organisms → Bacteria22300Open in IMG/M
3300026542|Ga0209805_1010822All Organisms → cellular organisms → Bacteria4917Open in IMG/M
3300026548|Ga0209161_10002029All Organisms → cellular organisms → Bacteria16648Open in IMG/M
3300027577|Ga0209874_1099744All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium693Open in IMG/M
3300027655|Ga0209388_1085851All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium905Open in IMG/M
3300027846|Ga0209180_10071972All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1940Open in IMG/M
3300027862|Ga0209701_10069397All Organisms → cellular organisms → Bacteria2232Open in IMG/M
3300027862|Ga0209701_10439783All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium719Open in IMG/M
3300027875|Ga0209283_10021479All Organisms → cellular organisms → Bacteria3963Open in IMG/M
3300028047|Ga0209526_10028975All Organisms → cellular organisms → Bacteria3892Open in IMG/M
3300028536|Ga0137415_10016830All Organisms → cellular organisms → Bacteria → Proteobacteria7249Open in IMG/M
(restricted) 3300031150|Ga0255311_1058073All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium818Open in IMG/M
(restricted) 3300031248|Ga0255312_1009898All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2270Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil41.35%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil30.77%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.77%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.81%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.85%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.88%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil1.92%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.92%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.92%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.92%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.96%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.96%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.96%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009818Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026354Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-BEnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027577Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
Ga0066674_1014595523300005166SoilETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0066672_1003575723300005167SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0066680_1003104623300005174SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPPRIR*
Ga0066680_1052413213300005174SoilMSRETLTPPRGARTVRVIGKIAARTGLFGAVRERSSVTPRRALPSLSIGLAAVQIFLLLGSAAVWPLAEVAAQDDPWQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADGLVELTDLLDRCARRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGI
Ga0066679_1058150713300005176SoilARGAMSRETLTPPRGARTVRVIGKIAARTGLFGAVRERSSVTPRRALPSLSIGLAAVQIFLLLGSAAVWPLAEVAAQDDPWQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADGLVELTDLLDRCARRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGISLEEKDEFKVEGSPPRRIR*
Ga0066690_1000245843300005177SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETATQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPPRIR*
Ga0066685_1034926513300005180SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVG
Ga0066676_1007574113300005186SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0066388_10224090913300005332Tropical Forest SoilVFAAVQIFLSVVTGAWWPLASALAQDDPWQWPADQKVEKATERLKDLKEWAQQFSKRDCPSTRVDYDGWLEEALDLVGSLVGLQDSTRQIRKTGAGADKLQAWDAVRAANGPGALAELTELFGRCGRRLMESDAAFRPPGQEEIIRRASVRIAEKKRSELAAVGIPFEEKDEPKIR*
Ga0070708_10002725243300005445Corn, Switchgrass And Miscanthus RhizosphereVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGFGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR*
Ga0070708_10027599223300005445Corn, Switchgrass And Miscanthus RhizosphereMGLAAVQIFLSLGSAAVWPLAEAVAQDDPWQWPADQKVEKATERLRDLKDTAQQFSRRDCPTARADYEAWLEEALDLAGSLIGLQDSTRPIRKTGAAADKLRTWDAVRASNGPDGIAELTDLFGRCGRRVRESDAAFRPPSREELFRRAAARIGEKKKSELASFGIPLEEKDEFKAEGLPARSIP*
Ga0066686_1000818113300005446SoilAARERRLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0066689_1061198013300005447SoilRNLDTVPGRPYGTNDRNNCHADGSSTAARERRLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKLEDSPPRRIR
Ga0070706_10003064123300005467Corn, Switchgrass And Miscanthus RhizosphereMRPTPCAAARERCLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQNSTRQIRKTGAGADKLRTWDAVRVGNGADGFGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR*
Ga0070698_10057262913300005471Corn, Switchgrass And Miscanthus RhizosphereGSAAVWPLAEAVAQDDPWQWPADQKVEKATERLRDLKDTAQQFSRRDCPTARADYEAWLEEALDLAGSLIGLQDSTRPIRKTGAAADKLRTWDAVRASNGPDGIAELTDLFGRCGRRVRESDAAFRPPSREELFRRAAARIGEKKKSELASFGIPLEEKDEFKAEGLPARSIP*
Ga0066704_1047663413300005557SoilMSRETLTPPRGARTVRVIGKIAARTGLFGAVRERSSVTPRRALPSLSIGLAAVQIFLLLGSAAVWPLAEVAAQDDPWQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADGLVELTDLLDRCARRVRESDAAFRPPSREELVRRAAERIGEK
Ga0066700_1028036523300005559SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKLEDSPPRRIR*
Ga0066699_1000420473300005561SoilLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKLEDSPPRRIR*
Ga0066705_1011048013300005569SoilGRPYGTNDRNNCHADGSSTAARERCLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0066691_1032593223300005586SoilMSRETLTPPRGARTVRVIGKIAARTGLFGAVRERSSVTPRRALPSLSIGLAAVQIFLLLGSAAVWPLAEVAAQDDPWQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADGLVELTDLLDRCARRVRESDAAFRPPSREELVRRAAERIGEKKKSEL
Ga0066652_10002270733300006046SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSRDELLRRAAARISEKKKSELAAVGIALEEKDEFKVEDSPPPRIR*
Ga0066658_1015648323300006794SoilMRRALPRLSIGLAAVQIFLSLVSVAVWPLAETAVQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEK
Ga0066659_1042288123300006797SoilLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0075433_1047779713300006852Populus RhizosphereVIRRALRRLSFGLVAVQVFLSLGAAAVWPLGETAAQDDPSQWPADQKVERATERLKDLKDAAQQFSRRDCPTARAGYHAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDTAFRPPSREELLRRAAARISEKKKSELAAVGISLEEKDELKVEDSPPRRIR*
Ga0075435_10078084323300007076Populus RhizosphereHADGSGTVARERFVVICHALPRLSIGLAAVQIFVLFGTAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKLEDSPPRRIR*
Ga0099791_1007057413300007255Vadose Zone SoilVIRRALPGLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVERATERLRDLKDAAQQFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDVVRASNGPDGIAELTDLLARCGRRVQESDAAFRPPSREELFRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0099794_1050593723300007265Vadose Zone SoilWQWPADQKVEKATERLRDLKAAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGISLEEKDEFKVEGSPPRRVR*
Ga0066710_10338417413300009012Grasslands SoilVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0099829_1001001623300009038Vadose Zone SoilVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPVDQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPRRRIR*
Ga0099829_1012240823300009038Vadose Zone SoilMIRVRALQRFSMALAATLVFLASGSAIVWPMAKAVARDDPWQWPADEKVERATERLKELKDTAEKFSRRDCPTARTDFDAWLDEALELAGSLIGLQDSTRQIRKTGAGLDKLRAWDAVRASNGPGGMAELTDLLSRCGRRVRESDAAFRSPSQEELLRRASQRIAEKKKSELASFGIPLEEKDEFKVEGSAPRSIR*
Ga0099830_1015305323300009088Vadose Zone SoilMIRVRALQRFSMALAATLVFLASGSAIVWPMAKAVARDDPWQWPADEKVERATERLKELKDTAEKFSRRDCPTARTDFDAWLDEALELAGSLIGLQDSTRQIRKTGAGLDKLRAWDAVRASNGPGGMAELTDLLSRCGRRVREGDAAFRPPSPEELLRRASQRIAEKKKSELASFGIPLEEKDEFKVEGSAPRSIR*
Ga0099830_1036234623300009088Vadose Zone SoilVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPVDQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR*
Ga0099828_1002779343300009089Vadose Zone SoilMIRVRALQRFSMALAATLVFLASGSAIVWPMAKAVARDDPWQWPADEKVERATERLKELKDTAEKFSRRDCPTARTDFDAWLDEALELAGSLIGLQDSTRQIRKTGAGLDKLRAWDAVRASNGPGGMAELTDLLSRCGRRVREGDAAFRPPSPEELLRRASQRIAEKKKSELGSFGIPLEEKDEFKVEGSAPRSIR*
Ga0099827_1000867443300009090Vadose Zone SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRLIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVRESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0099827_1006572413300009090Vadose Zone SoilMAKAVARDEPWQWPADEKVEKATERLKELKDTAEKFSRRDCPTARTDFDAWLDEALELAGSLIGLQDSTRQIRKTGAGLDKLRAWDAVRASHGPGGMAELTDLLSRCGRRVREGDAAFRPPSPEELLRRASQRIAEKKKSELASFGIPLEEKDEFKVEGSAPRSIR*
Ga0099827_1070844813300009090Vadose Zone SoilVIGKIAARTGLFGAVRERSSVTPRRALPSLSIGLAAVQIFLLLGSAAVWPLAEVAAQDDPWQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRANNGPDGMAELTDLLGRCGRRVQESDAAFRPPSREELLRRAATRIGERKKSELAAVGISLEE
Ga0105072_109085813300009818Groundwater SandARDDPWQWPADEKVEKATERLKELKDTAEKFSRRDCPTARTDFAAWLDEALELAGSLVGLQDSTRQIRKTGAGADKLRAWDAVRASNGPGGIADLTDLLGRCGRRVRESDASFRPPSQEELLRRAAQRIAEKKKSELASFGIPLEEKDEFKVEGSAPRSLR*
Ga0126376_1186164713300010359Tropical Forest SoilVTKDSGILRRFSIVFVALKIFLLVVTGARWSPDGARAQDDPWQWPADQKVEKATERLKDLKESAQQFSRRDCPSTRVGYDAWLEEALDLLGSLVGLQDSTRQIRKTGAGADKLQAWDTVRAANGPGDLTELMDLFGRCGRRVTESDATFRPPGQEEIIRRASFRIAEKKRSEL
Ga0126372_1053095613300010360Tropical Forest SoilVTKDSRILRRFSIVFVALQIFLLVVTGARWPLDGARAQDDPWQWPADQKVEKATERLKDLKESAQQFSRRDCPSTRVGYDAWLEEALDLVGSLVGLQDSTRQIRKTGAGADKLQAWDTVRTANGPGDLTELMDLFGRCGRRVTESDATFRPPGQEEIIRRASFRIAEKKRSELAAIGIALEEQDEPKIR*
Ga0126383_1114500323300010398Tropical Forest SoilVTKDSGILRRFSIVFVALQIFLLVVTGALWSLGGARAQDDPWQWPADQKVEKATERLKDLKESAQQFSRRDCPSTRVGYDAWLEEALDLLGSLVGLQDSTRQIRKTGAGADKLQAWDTVRAANGPGDLTELMDLFGRCGRRVTESDAAFRTPGQEEIIRRASLRIAEKKRSEL
Ga0137393_1053690713300011271Vadose Zone SoilMIRVRALQRFSMALAATLVFLASGSAIVWPMAKAVARDDPWQWPADEKVERATERLKELKDTAEKFSRRDCPTARTDFDAWLDEALELAGSLIGLQDSTRQIRKTGAGLDKLRAWDAVRASNGPGGMAELTDLLSRCGRRVREGDAAFRPPSPEELLRRASQRIA
Ga0137389_1004405433300012096Vadose Zone SoilVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPRRRIR*
Ga0137389_1039383923300012096Vadose Zone SoilQEMIRVRARQRFSMALAATLVFVAPGTAIVWPVAKAVARDDPWRWPADEKVERATERLKELKDTAEKFSRRDCPTARTDFDAWLDEALELAGSLIGLQDSTRQIRKTGAGLDKLRAWDAVRASNGPGGMAELTDLLDRCGRRVREGDAAFRPPSPEELLRRASQRIAEKKKSELASFGIPLEEKDEFKVEGSAPRSIR*
Ga0137388_1001488953300012189Vadose Zone SoilVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR*
Ga0137388_1039627723300012189Vadose Zone SoilMAKAVARDDPWQWPADEKVEKATERLKELKDTAEKFSRRDCPTARTDFRAWLDEALELAGSLIGLQDSTRQIRKTGAGADQLRAWDAVRASNGPGGMAELTDLLGRCGRRVRESDAAFRPPSQEELLRRASQRIAEKKRSELASFGIPLEEKDEFKVEGSAPRSTR*
Ga0137383_1003946733300012199Vadose Zone SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQESTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSRDELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137365_1011050613300012201Vadose Zone SoilMRRSLPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRHDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARIGEKKKSELAAVGIAVEEKDELKMEDSPP
Ga0137363_1007346323300012202Vadose Zone SoilVIRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVERATERLRDLKDAAQQFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDVVRASNGPDGIAELTDLLARCGRRVQESDAAFRPPSREELFRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137362_1009238913300012205Vadose Zone SoilVIRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVERATERLRDLKDAAQQFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDVVRASNGPDGIAELTDLLARCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137380_10000448103300012206Vadose Zone SoilMRRALPRLSIGLAAVQIFLSLVSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARIGEKKKSELAAVGIAVEEKDELKLEDSPPRRIR*
Ga0137370_1022013213300012285Vadose Zone SoilLVMRRALPRLSIGLAAVQIFLSLGSAAVGPLAETAAQDDPSQWPADQKVENATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQESTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSRDELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137372_1008372733300012350Vadose Zone SoilMHRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137384_1001704563300012357Vadose Zone SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRHDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137385_1001744563300012359Vadose Zone SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARIGEKKKSELAAVGIALEEEDELKVEDSPPRRIR*
Ga0137390_1137988813300012363Vadose Zone SoilMIRVRALQRFSMALAATLVLLASGSAIVWPRAKAVAREDPWQWPADEKVEKATERLKELKDTAEKFSRRDCPTARTDFDAWLDEALELAGSLIGLQDSTRQIRKTGAGADKLRAWDAVRAGNGADGLVELTDLLDRCARRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGISLEEKDEFKVEGSPPRRIR*
Ga0137397_10000644233300012685Vadose Zone SoilVIGKFAARTGLFGAVRERSFVTRCRALPRLSIGLAAVQIFLSLGPAAVWPLAEAVAQDDPWQWPADQKVEKATERLKDLKDAAQQFSRRDCPTARAHYDAWLGEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGISLEEKDEFKVEGSPPRRIR*
Ga0137395_1031420713300012917Vadose Zone SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVERATERLRDLKDAAQQFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAA
Ga0137396_1039467923300012918Vadose Zone SoilVIRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVERATERLRDLKDAAQQFSRRDCPTARSGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDVVRASNGPDGIAELTDLLARCGRRVQESDAAFRPPSREELFRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137394_1000428443300012922Vadose Zone SoilVIGKFAARTGLFGAVRERSFVTRCRALPRLSIGLAAVQIFLSLGAAAVWPLAEAVAQDDPWQWPADQKVEKATERLKDLKDAAQQFSRRDCPTARAHYDAWLGEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVQRAAERIGEKKKSELAAVGISLEEKDEFKVEGSPPRRIR*
Ga0137419_1106142713300012925Vadose Zone SoilIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVERATERLRDLKDAAQQFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDVVRASNGPDGIAELTDLLARCGRRVQESDAAFRPPSREELFRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137419_1161399513300012925Vadose Zone SoilLSIGLAAVQIFTSLGAAAVWPHAEAVAQDDPWQWPADQKVEKATERLKDLKDAAQQFSRRDCPTARAHYDAWLGEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVQRAAERIGEKKKSELAAVGISLEEKDEFKVEGSPP
Ga0137416_1053513823300012927Vadose Zone SoilVIGKFAARTGLFGAVRERSSVTPRRALPSLSIGLAAFQIFLLLGSAAVWPLAEVAAQDDPWQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGISLEEKDEFKVEGSPPRRIR*
Ga0137410_1005943953300012944Vadose Zone SoilVIGKFAARTGLFGAVRERSFVTRCRALPRLSIGLAAVQIFLSLGPAAVWPLAEAVAQDDPWQWPADQKVEKATERLKDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVQRAAEGIGEKKKSELAAVGISLEEKDEFKVEGSP
Ga0137410_1132081013300012944Vadose Zone SoilGTNDQNNCHADGFSTAARERRLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTARADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADGLGELTDLLDRCGRRVRERDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELK
Ga0134076_1034505713300012976Grasslands SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDVAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137418_1016867323300015241Vadose Zone SoilVIGKFAARTGLFGAVRERSSVTPRRALPSLSIGLAAVQIFTSLGAAAVWPHAEAVAQDDPWQWPADQKVEKATERLKDLKDAAQQFSRRDCPTARAHYDAWLGEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVQRAAERIGEKKKSELAAVGIALEEKDELKVEDSPPRRIR*
Ga0137409_1063966113300015245Vadose Zone SoilGTNDQNNCHADGFSTAARERRLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPAEQKVEKAIERLKDLKDTAQQFSRRDCPTARADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADGLGELTDLLDRCGRRVRERDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR*
Ga0137403_1010928933300015264Vadose Zone SoilMAQDDPWQWPADQKVEKATERLKDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGISLEEKDEFKVEGSPPRRIR*
Ga0134112_1015742113300017656Grasslands SoilEARNLDTVPGRPYGTNDRNNCHADGSSTAARERRLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKLEDSPPPRIR
Ga0184623_1033389613300018056Groundwater SedimentRARQRFSMVLAAALVFLASASAIVWPMAKTVARDDPWQWPADEKVGKATERLKELKDTAEKFSRRDCPTARTDFGAWLDEALELAGSLIGLQDSTRQIRKTGAGGDKLRAWDAVRASNGSGGMAELTDLLGRCGRRVRESDAAFRPPSPEELFRRTSQRIAEKKKSELASFGIPLEERDEFEVEGSAPRSIR
Ga0066655_1059423923300018431Grasslands SoilEGSLNVKATLRADGSSAAARERRLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGANKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSPEELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0066667_1119342813300018433Grasslands SoilLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0066669_1001159153300018482Grasslands SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATDRLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0207684_1004244533300025910Corn, Switchgrass And Miscanthus RhizosphereMRPTPCAAARERCLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQNSTRQIRKTGAGADKLRTWDAVRVGNGADGFGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR
Ga0207646_1021224923300025922Corn, Switchgrass And Miscanthus RhizosphereMGLAAVQIFLSLGSAAVWPLAEAVAQDDPWQWPADQKVEKATERLRDLKDTAQQFSRRDCPTARADYEAWLEEALDLAGSLIGLQDSTRPIRKTGAAADKLRTWDAVRASNGPDGIAELTDLFGRCGRRVRESDAAFRPPSREELFRRAAARIGEKKKSELASFGIPLEEKDEFKAEGLPARSIP
Ga0209055_101237243300026309SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPPRIR
Ga0209154_108153313300026317SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKK
Ga0209802_109291023300026328SoilRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPPRIR
Ga0209802_124653513300026328SoilPRGARTVRVIGKIAARTGLFGAVRERSSVTPRRALPSLSIGLAAVQIFLLLGSAAVWPLAEVAAQDDPWQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADGLVELTDLLDRCARRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGI
Ga0209803_103044933300026332SoilYRTNDRNNCHADGSSAAARERRLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0209377_101202463300026334SoilDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPPRIR
Ga0209804_117484313300026335SoilAAVQIFLSLGSAAVWPLAETATQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPPRIR
Ga0209057_111513123300026342SoilGTNDRNNCHADGSSTAARERCLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGANKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0257180_104297513300026354SoilPPPGARTVRMIGEIGMRPTPCAAARERCRVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRIWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKD
Ga0257173_101344213300026360SoilPPGARTVRMIGEIGMRPTPCAAARERCRVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR
Ga0257179_102525713300026371SoilPCAAARERCLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR
Ga0257169_100923413300026469SoilLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEGLDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR
Ga0257177_101783923300026480SoilMRPTPCAAARERCRVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGISLEEKDELKVEGSPPRRIR
Ga0209378_1003226113300026528SoilNNCHADGSSTAARERRLVMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSRDELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0209160_114438723300026532SoilRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0209160_126560713300026532SoilRGARTVRVIGKIAARTGLFGAVRERSSVTPRRALPSLSIGLAAVQIFLLLGSAAVWPLAEVAAQDDPWQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADGLVELTDLLDRCARRVRESDAAFRPPSREELVRRAAERIGEK
Ga0209056_10001986113300026538SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0209805_101082213300026542SoilMRDALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKLEDSPPRRIR
Ga0209161_10002029103300026548SoilMRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVEKATERLRDLKDAAQRFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDAVRTSNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELLRRAAARISEKKKSELAAVGIALEEKDELKLEDSPPPRIR
Ga0209874_109974413300027577Groundwater SandVIRLRALQRFSMALAATLVFLAAGSVMVWPIAKAVARDDPWQWPADEKVEKATERLKELKDTAEKFSRRDCPTARTDFAAWLDEALELAGSLVGLQDSTRQIRKTGAGADKLRAWDVVRASNGPGGIADLTDLLGRCGRRVRESDASFRPPSQEELLRRAAQRIAEKKKSELASFGI
Ga0209388_108585113300027655Vadose Zone SoilRPYGTNDRNNCHADGSSTAARERRLVIRRALPRLSIGLAAVQIFLSLGSAAVWPLAETAAQDDPSQWPADQKVERATERLRDLKDAAQQFSRRDCPTARAGYDAWLDDALDLAGSLIGLQDSTRQIRKTGAGADKLRSWDVVRASNGPDGIAELTDLLARCGRRVQESDAAFRPPSREELFRRAAARISEKKKSELAAVGIALEEKDELKVEDSPPRRIR
Ga0209180_1007197223300027846Vadose Zone SoilMRPTPCAAARERCLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPVDQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPRRRIR
Ga0209701_1006939723300027862Vadose Zone SoilMALAATLVFLASGSAIVWPMAKAVARDDPWQWPADEKVERATERLKELKDTAEKFSRRDCPTARTDFDAWLDEALELAGSLIGLQDSTRQIRKTGAGLDKLRAWDAVRASNGPGGMAELTDLLSRCGRRVREGDAAFRPPSPEELLRRASQRIAEKKKSELASFGIPLEEKDEFKVEGSAPRSIR
Ga0209701_1043978313300027862Vadose Zone SoilERCLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPVDQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELAAVGISLEEKDELKVEGSPRRRIR
Ga0209283_1002147913300027875Vadose Zone SoilMRPTPCAAARERCLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYDAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKK
Ga0209526_1002897523300028047Forest SoilVIRRALPRLSIGLAAVHIFLSLGAATVWPLAEAVAQDDPWQWPADQKVEKATVRLMDLKDAARQFSRRDCPTAREDYDAWLDEALDLVGSLIGLQDSTRQIRKTGAGADKLRSWDAVRASNGPDGIAELTDLLGRCGRRVQESDAAFRPPSREELFRRAAARIGEKKKSELAAVGISVEEKDEFKVEDSPPRRIR
Ga0137415_1001683033300028536Vadose Zone SoilVTPRRALPSLSIGLAAVQIFLLLGSAAVWPLAEVAAQDDPWQWPADQKVEKATERLRDLKDAAQQFSRRDCPTARAHYDAWLDEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRAGNGADALVELTDLLDRCGRRVRESDAAFRPPSREELVRRAAERIGEKKKSELAAVGISLEEKDEFKVEGSPPRRIR
(restricted) Ga0255311_105807323300031150Sandy SoilMRPTPCAAARERCLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYGAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSEL
(restricted) Ga0255312_100989823300031248Sandy SoilMRPTPCAAARERCLVIRRALPRLSIGLAAVQIFLSFGAAAVWPLADAVAQDDPSQWPADQKVEKAIERLKDLKDTAQQFSRRDCPTGRADYGAWLNEALDLAGSLIGLQDSTRQIRKTGAGADKLRTWDAVRVGNGADGLGELTDLLDRCGRRVRESDAAFRPPSREELVRRAVERIGEKKKSELTAVGISLEEKDELKVEGSPPRRIR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.