NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F103637

Metagenome / Metatranscriptome Family F103637

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103637
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 102 residues
Representative Sequence VPLGPPPERRPRPASRAQPLAGPAPETGFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE
Number of Associated Samples 75
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 65.35 %
% of genes near scaffold ends (potentially truncated) 37.62 %
% of genes from short scaffolds (< 2000 bps) 84.16 %
Associated GOLD sequencing projects 65
AlphaFold2 3D model prediction Yes
3D model pTM-score0.34

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (74.257 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(20.792 % of family members)
Environment Ontology (ENVO) Unclassified
(27.723 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.485 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 8.33%    β-sheet: 27.27%    Coil/Unstructured: 64.39%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.34
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF01883FeS_assembly_P 24.75
PF13686DrsE_2 23.76
PF01022HTH_5 13.86
PF01597GCV_H 12.87
PF02754CCG 2.97
PF01522Polysacc_deac_1 0.99
PF01012ETF 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG0509Glycine cleavage system protein H (lipoate-binding)Amino acid transport and metabolism [E] 12.87
COG0247Fe-S cluster-containing oxidoreductase, includes glycolate oxidase subunit GlcFEnergy production and conversion [C] 2.97
COG2048Heterodisulfide reductase, subunit BEnergy production and conversion [C] 2.97
COG0726Peptidoglycan/xylan/chitin deacetylase, PgdA/NodB/CDA1 familyCell wall/membrane/envelope biogenesis [M] 0.99
COG2025Electron transfer flavoprotein, alpha subunit FixBEnergy production and conversion [C] 0.99
COG2086Electron transfer flavoprotein, alpha and beta subunitsEnergy production and conversion [C] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms74.26 %
UnclassifiedrootN/A25.74 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300003324|soilH2_10027880All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia2133Open in IMG/M
3300004153|Ga0063455_100545693Not Available737Open in IMG/M
3300005093|Ga0062594_101202010Not Available751Open in IMG/M
3300005167|Ga0066672_10008640All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Pseudonocardiales → Pseudonocardiaceae → Saccharomonospora → Saccharomonospora marina4759Open in IMG/M
3300005171|Ga0066677_10054822All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia2004Open in IMG/M
3300005171|Ga0066677_10811596Not Available517Open in IMG/M
3300005174|Ga0066680_10219618All Organisms → cellular organisms → Bacteria1204Open in IMG/M
3300005176|Ga0066679_10525640All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria771Open in IMG/M
3300005186|Ga0066676_11072329Not Available532Open in IMG/M
3300005187|Ga0066675_10567072All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria852Open in IMG/M
3300005329|Ga0070683_100338659All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales1432Open in IMG/M
3300005445|Ga0070708_100617399All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1022Open in IMG/M
3300005445|Ga0070708_101266919All Organisms → cellular organisms → Bacteria689Open in IMG/M
3300005445|Ga0070708_102153645Not Available515Open in IMG/M
3300005447|Ga0066689_10053830All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia2169Open in IMG/M
3300005454|Ga0066687_10278232All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria942Open in IMG/M
3300005468|Ga0070707_100549695All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1117Open in IMG/M
3300005468|Ga0070707_101483553Not Available645Open in IMG/M
3300005529|Ga0070741_10001611All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia67111Open in IMG/M
3300005529|Ga0070741_10003720All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia37001Open in IMG/M
3300005529|Ga0070741_10666795All Organisms → cellular organisms → Bacteria921Open in IMG/M
3300005535|Ga0070684_102094125All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria534Open in IMG/M
3300005546|Ga0070696_100167156All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1624Open in IMG/M
3300005560|Ga0066670_10125369All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1473Open in IMG/M
3300005575|Ga0066702_10813882Not Available556Open in IMG/M
3300005575|Ga0066702_10859105Not Available541Open in IMG/M
3300005576|Ga0066708_10625319All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria688Open in IMG/M
3300005586|Ga0066691_10224124All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1100Open in IMG/M
3300005614|Ga0068856_101408892All Organisms → cellular organisms → Bacteria711Open in IMG/M
3300006032|Ga0066696_10616678All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria703Open in IMG/M
3300006175|Ga0070712_101621243Not Available566Open in IMG/M
3300006797|Ga0066659_10616059All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria882Open in IMG/M
3300006800|Ga0066660_10868169Not Available735Open in IMG/M
3300006876|Ga0079217_10034511All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia1919Open in IMG/M
3300006894|Ga0079215_10343311All Organisms → cellular organisms → Bacteria850Open in IMG/M
3300006894|Ga0079215_10791304All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia660Open in IMG/M
3300006918|Ga0079216_11121372All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia623Open in IMG/M
3300006918|Ga0079216_11248437All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria602Open in IMG/M
3300007004|Ga0079218_10353638All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Pseudonocardiales → Pseudonocardiaceae → Saccharomonospora → Saccharomonospora marina1225Open in IMG/M
3300007004|Ga0079218_10457862All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Pseudonocardiales → Pseudonocardiaceae → Saccharomonospora → Saccharomonospora marina1111Open in IMG/M
3300009012|Ga0066710_100872694All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1383Open in IMG/M
3300009012|Ga0066710_103482629Not Available596Open in IMG/M
3300010335|Ga0134063_10300401All Organisms → cellular organisms → Bacteria772Open in IMG/M
3300011271|Ga0137393_11742644Not Available511Open in IMG/M
3300012212|Ga0150985_104806609All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1204Open in IMG/M
3300012922|Ga0137394_10110350All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria2322Open in IMG/M
3300012924|Ga0137413_11763654Not Available510Open in IMG/M
3300012925|Ga0137419_10517329All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria949Open in IMG/M
3300012929|Ga0137404_10621548All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria972Open in IMG/M
3300012929|Ga0137404_11880333All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria557Open in IMG/M
3300012930|Ga0137407_11523771All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria636Open in IMG/M
3300015241|Ga0137418_10946853Not Available628Open in IMG/M
3300018422|Ga0190265_10413566All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae1449Open in IMG/M
3300018429|Ga0190272_10064620All Organisms → cellular organisms → Bacteria2190Open in IMG/M
3300018429|Ga0190272_11942204All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria620Open in IMG/M
3300018429|Ga0190272_12726955Not Available543Open in IMG/M
3300018466|Ga0190268_10002112All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria3968Open in IMG/M
3300018468|Ga0066662_11447275All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria713Open in IMG/M
3300018482|Ga0066669_10275819All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1348Open in IMG/M
3300019377|Ga0190264_10227105All Organisms → cellular organisms → Bacteria1058Open in IMG/M
3300019377|Ga0190264_10272524All Organisms → cellular organisms → Bacteria1001Open in IMG/M
3300019377|Ga0190264_10612254All Organisms → cellular organisms → Bacteria780Open in IMG/M
3300020181|Ga0196958_10001907All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria6475Open in IMG/M
3300021184|Ga0196959_10041072All Organisms → cellular organisms → Bacteria933Open in IMG/M
3300021184|Ga0196959_10202589Not Available510Open in IMG/M
3300022467|Ga0224712_10021864All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria2196Open in IMG/M
3300025910|Ga0207684_10245203All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1546Open in IMG/M
3300025912|Ga0207707_10087101All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria2729Open in IMG/M
3300025932|Ga0207690_11767015Not Available516Open in IMG/M
3300025944|Ga0207661_10368736All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → Ferrithrix → Ferrithrix thermotolerans1298Open in IMG/M
3300026298|Ga0209236_1089334All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1409Open in IMG/M
3300026315|Ga0209686_1101889All Organisms → cellular organisms → Bacteria995Open in IMG/M
3300026332|Ga0209803_1153289Not Available890Open in IMG/M
3300026542|Ga0209805_1400947All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria528Open in IMG/M
3300026547|Ga0209156_10496488Not Available505Open in IMG/M
3300026550|Ga0209474_10125136All Organisms → cellular organisms → Bacteria1700Open in IMG/M
3300026550|Ga0209474_10648666Not Available543Open in IMG/M
3300026552|Ga0209577_10116579All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria2144Open in IMG/M
3300026557|Ga0179587_10528650All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria775Open in IMG/M
3300027637|Ga0209818_1281258All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria510Open in IMG/M
3300027639|Ga0209387_1122495Not Available658Open in IMG/M
3300027886|Ga0209486_10600451All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria698Open in IMG/M
3300030006|Ga0299907_10572034All Organisms → cellular organisms → Bacteria883Open in IMG/M
(restricted) 3300031197|Ga0255310_10032343All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1354Open in IMG/M
3300031229|Ga0299913_11913949All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria540Open in IMG/M
3300031229|Ga0299913_12081001Not Available513Open in IMG/M
3300031720|Ga0307469_10434885All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1131Open in IMG/M
3300031720|Ga0307469_10930772All Organisms → cellular organisms → Bacteria807Open in IMG/M
3300031720|Ga0307469_11155154All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium730Open in IMG/M
3300031720|Ga0307469_11202938All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria716Open in IMG/M
3300031720|Ga0307469_11617510All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria623Open in IMG/M
3300031820|Ga0307473_10486072All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria830Open in IMG/M
3300032002|Ga0307416_100364899All Organisms → cellular organisms → Bacteria1468Open in IMG/M
3300032002|Ga0307416_101877151Not Available703Open in IMG/M
3300032005|Ga0307411_10312257Not Available1266Open in IMG/M
3300032174|Ga0307470_10257953All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1156Open in IMG/M
3300032180|Ga0307471_100003612All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia9315Open in IMG/M
3300032180|Ga0307471_100075879All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → Ferrithrix → Ferrithrix thermotolerans2928Open in IMG/M
3300032180|Ga0307471_104228689Not Available507Open in IMG/M
3300032205|Ga0307472_102390588Not Available536Open in IMG/M
3300034268|Ga0372943_0056126All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → Ferrithrix → Ferrithrix thermotolerans2215Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil20.79%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil10.89%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil9.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil8.91%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil8.91%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere7.92%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.95%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.96%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere3.96%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.97%
SoilEnvironmental → Terrestrial → Soil → Sand → Desert → Soil2.97%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere2.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.98%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.99%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.99%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.99%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.99%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.99%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.99%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005329Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005535Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.2-3L metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005614Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2Host-AssociatedOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006876Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200EnvironmentalOpen in IMG/M
3300006894Agricultural soil microbial communities from Utah to study Nitrogen management - NC ControlEnvironmentalOpen in IMG/M
3300006918Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS100EnvironmentalOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018466Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 TEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019377Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 TEnvironmentalOpen in IMG/M
3300020181Soil microbial communities from Anza Borrego desert, Southern California, United States - S1+v_10EnvironmentalOpen in IMG/M
3300021184Soil microbial communities from Anza Borrego desert, Southern California, United States - S1+v_20EnvironmentalOpen in IMG/M
3300022467Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5pm-2 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025932Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025944Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027637Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027639Agricultural soil microbial communities from Utah to study Nitrogen management - NC Control (SPAdes)EnvironmentalOpen in IMG/M
3300027886Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes)EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032002Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3Host-AssociatedOpen in IMG/M
3300032005Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-1Host-AssociatedOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300034268Forest soil microbial communities from Eldorado National Forest, California, USA - SNFC_MG_FRD_1.2EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
soilH2_1002788053300003324Sugarcane Root And Bulk SoilMPLGDAPERKQRPAARAQPLAGEPPVTTFSSDLATLCHHGERAVTVAGGPGQELVAVLGSPFDPEAFVTWRSDGIQVFFRGEAPARLDLQLSAGAFLTATAVYD*
Ga0063455_10054569313300004153SoilMPLGDAPERKQRPSARAEPLAGPAPATSFSPDLRMLCHHGERAVTVAPGPDRELVAVLGSPFDPEAFVTWRADGIQVFFRGEAPARLDLQLSAGAFVTA
Ga0062594_10120201023300005093SoilMPLGQPPERMPRPAARAQPLTGPPPVTTFSPELVTLCHHGERAITVAGGPDRDPVAVLGSPFDPEAFVTWRAEGIQVFYRGEAPTRLDLALSAGAFVTASATYD*
Ga0066672_1000864063300005167SoilVPLGPPPDRRPRPASRAQPLAGPGPETRFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE*
Ga0066677_1005482223300005171SoilVPLGPPPERRPRPASRAQPLAGPAPETGFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE*
Ga0066677_1081159613300005171SoilMPLGDAPERKRRPAARSEPVPGPAPETTFSSDLLTLCHHGERAVTVAGAPDGSLVAALGSPFDPEAFVTWRSEGIQVFFRGEIPAR
Ga0066680_1021961823300005174SoilVPLGPPPERRPRPASRAQPLAGPGPETRFSPDLAALCHHGGRAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE*
Ga0066679_1052564013300005176SoilDVPLGPPPERRPRPASRAQPLAGPAPETGFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE*
Ga0066676_1107232913300005186SoilMPLGERPERKPRPAARAQPLTGPPPATTFSSDLAALCHHGERAVTVAGGRDRELVAVLGSPFDPEAFTTWRAEGIQVFFRGEAPTRL
Ga0066675_1056707233300005187SoilVPLGEPPRRQPRPAARAEPLAGPAPETTFSPDLLTLCHHGERAVTVAGGPDGGLVAVLGSPFDPEAFVTWRTEGIQVFFRGEAPARLDLQLSAGAFVTAKAIYA*
Ga0070683_10033865913300005329Corn RhizospherePERKPRPAARAQAFAGDPPVTTFSSDLATLCHHGERAVTVAGRPGGELVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTATATYD*
Ga0070708_10061739923300005445Corn, Switchgrass And Miscanthus RhizosphereMPLGERPERRPRPAAKAQPLPGPPPVTTVSSDLATLCHHGERAVTVAVGPDRQLVAVLGSPFDPEAFVTWRVEGIQVFFRGEAPARLDLRLSAGAFVTATAGYD*
Ga0070708_10126691923300005445Corn, Switchgrass And Miscanthus RhizosphereMPLGDAPERKQRPSARAEPLAGPAPETTFSPDLRMLCHHGERAVTVAPGPDQELVAVLGSPFDPEAFVTWRADGIQVFFRGEAPARLELQLSAGAFVTAVATYE*
Ga0070708_10215364523300005445Corn, Switchgrass And Miscanthus RhizosphereMPLGQPPERRLRPAARAQPVTGPPPTTTFSADLATLCYHGERAVTVAGGPGAELVAVLGSPFDAEAFVTWRAEGIQVFYRGEAPTRLDLALSAGAFVTASATYD*
Ga0066689_1005383053300005447SoilVPLGPPPERRPRPASRAQPLAGPGPETRFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE*
Ga0066687_1027823233300005454SoilVPLGEPPRRQPRPAARAEPLAGPAPETTFSPDLLTLCHHGERAVTVAGGPDGGLVAVLGSPFDPEAFVTWRTEGIQVFFRGEAPAR
Ga0070707_10054969513300005468Corn, Switchgrass And Miscanthus RhizosphereMPLGQPPERMPRPAARAQPLTGPPPVTTFSPELVTLCHHGERAITVAGGPDRDPVAVLGSPFDPEAFVTWRADGIQVFYRGEAPA
Ga0070707_10148355323300005468Corn, Switchgrass And Miscanthus RhizosphereMPLGEPPRRTPRPAARAEPLTGPPPETTFSPDLVALCHHGERAVTVAGGPDGGLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVAYG*
Ga0070741_10001611193300005529Surface SoilVPLGEPPKREPRPAARAEPLAGPAPETTFSPDLLTLCHHGERAVTVAGGPDGRLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLQLSAGAFLTAVATYP*
Ga0070741_10003720143300005529Surface SoilMPLDRPPERKARPATRAQALPGPAPETTFSSDLATLCHHGERAVTVAARPGGELVAALGSPFDPEAFVTWRADGIQVFFRGEAPARLELQLSAGASVTAIAAYE*
Ga0070741_1066679533300005529Surface SoilMPLASDGMGSAPGRKRRPAAQATPWPGPAPETHFSPDLMTLCHHGERAVTVAGTPDGSLVAALGSPFDPEAFVTWRTEGIQVFFRGDPPA
Ga0070684_10209412523300005535Corn RhizosphereVPLGEPPERKPRPATQAQALSGPPPETTFSSDLAALCHHGERAVTVAVRPGGDLVAALGSPFDPEAFLTWRAEGIQVFFRGEAPARLELQLSAGAMVTAMADYGQPSGGPRGAEPPEC*
Ga0070696_10016715613300005546Corn, Switchgrass And Miscanthus RhizosphereMPLGQPPERRLRPAARAQPVTGPPPTTTFSADLATLCYHGERAVTVAGGPGAELVAVLGSPFDAEAFVTWRAEGIQVFYRGEAPTRLDLAL
Ga0066670_1012536923300005560SoilVPLGEPPRRQPRPAARAEPLAGPAPETTFSPDLLTLCHHGERAVTVAGGPDGGLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLQLSAGAFVTAKAIYA*
Ga0066702_1081388213300005575SoilVPLDRPPERRPRPAARAQPLAGPAPETHFSPDLITLCHHGERAVTVAGGPGRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE*
Ga0066702_1085910523300005575SoilMPLGDAPERRRRPAARAEPLAGPAPDTIFSSDLATLCHLGERAVTVAGGRDGDLVAVLGSPFDPEAFVTWRADGIQVFFRGEAPARLELQLSAGAFLSAVATWD*
Ga0066708_1062531913300005576SoilAPVPLGEPPRRQPRPAARAEPLAGPAPETTFSPDLLTLCHHGERAVTVAGGPDGGLVAVLGSPFDPEAFVTWRTEGIQVFFRGEAPARLDLQLSAGAFVTAKAIYA*
Ga0066691_1022412443300005586SoilVPLGPPPERRPRPASRAQPLAGPAPETGFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVT
Ga0068856_10140889223300005614Corn RhizosphereVPLGEPPERKPRPATQAQALSGPPPETTFSSDLAALCHHGERAVTVAVRPGGDLVAALGSPFDPEAFLTWRAEGIQVFFRGEAPARLELQLSAGASVTATATY*
Ga0066696_1061667823300006032SoilMPLGERPERRPRPAARAQPLPGLPPVTTVSSDLATLCHHGERAVTVAAGPDRQLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLGLSAGAFVTATAGYD*
Ga0070712_10162124323300006175Corn, Switchgrass And Miscanthus RhizosphereVPLGEPPPRRPRPVARAEPLAGPAPETTFSADLMALCHHGERAVTVAIGPDGGLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLQLSAGAFVTAVATYPSR*
Ga0066659_1061605913300006797SoilQPLAGPGPETGFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE*
Ga0066660_1086816913300006800SoilVPLDRPPERRPRPAARAQPLAGPAPETHFSPDLITLCHHGERAVTVAGGPGRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLELQLSAGASVTATAMYS*
Ga0079217_1003451123300006876Agricultural SoilMPLGDAPERKRRPAARAEPLAGPAPETVFSSDLLTLCHHGERAVTVAGAPGHELVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATY*
Ga0079215_1034331123300006894Agricultural SoilMPLGDAPERKRRPAARAQPLAGPPPATTFSPELATLCHHGERAVTVAGGPDRNLVAVLGSPFDPEAFVTWRADGIQVFFRGEAPTRLDLELSAGAFVTATATYD*
Ga0079215_1079130423300006894Agricultural SoilMPLGDRPDRKRRPAARAQPLTGPPPETTFSPDLATLCHHGERAVTVARGPDSEFVAVLGSPFDPEAFVTWRADGIQVFFRGEAPTRLDLRLSAGAFLTATATYD*
Ga0079216_1112137213300006918Agricultural SoilPDLGRRQSVPGNLMPLGDAPERKRRPAARAQPLAGPPPATTFSPELATLCHHGERAVTVAGGPDRNLVAVLGSPFDPEAFVTWRADGIQVFFRGEAPTRLDLELSAGAFVTATATYD*
Ga0079216_1124843713300006918Agricultural SoilRGQGLPGDLMPLGDAPERKRRPAARAEPLAGPAPETIFSSDLRALCHHGERAVTVAAPPGQEPVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATY*
Ga0079218_1035363833300007004Agricultural SoilARAEQLAGPVPETVFSPDLLTLCHHGERAVTVAAPPGQEPVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATY*
Ga0079218_1045786233300007004Agricultural SoilMPLGDAPERKRRPAARAEPLAGPAPETIFSSDLRALCHHGERAVTVAAPPGQEPVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATY*
Ga0066710_10087269413300009012Grasslands SoilVPLGPPPERRPRPASRAQPLAGPAPETRFSPDLAALCHHGERAVTVASGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE
Ga0066710_10348262913300009012Grasslands SoilVPLGEPPRRQPRPAARAEPLAGPAPETTFSPDLLTLCHHGERAVTVAGGPDGGLVAVLGSPFDPEAFVTWRAEGIQVFFRGKAPARLELQLGAGAFVTAVATYAD
Ga0134063_1030040133300010335Grasslands SoilMPLGERPERRPRPAARAQPLPGLPPVTTVSSDLATLCHHGERAVTVAAGPDHQLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLGLSAGAFVT
Ga0137393_1174264423300011271Vadose Zone SoilMPLGERPERRPRPAARAQPLPGPPPATTVSSDLATLCHHGERAVTVAAGPDRQLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLRLSAGAFVTATAVYD*
Ga0150985_10480660923300012212Avena Fatua RhizosphereMPLGDAPERKQRPSARAEPLAGPAPATSFSPDLRMLCHHGERAVTVAPGPDRELVAVLGSPFDPEAFVTWRADGIQVFFRGEAPARLDLQLSAGAFVTATATYA*
Ga0137394_1011035053300012922Vadose Zone SoilMPLGDAPQRKARPAARAEPLSGPVPETTFSADLRMLCQHGERAITVAGGPDGQLVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATYE*
Ga0137413_1176365423300012924Vadose Zone SoilMPLGDAPQRKARPAARAEPLAGPVPETTFSADLRMLCQHGERAVTVAGGPDGELVAALGSPFDPEAFVTWRAEGIQVFFRGEAPTRLELQLSAGAFVTAVAT
Ga0137419_1051732923300012925Vadose Zone SoilMPLANDSMGSAPERKQRPSARAEPLAGPAPETTFSSDLVALCHHGERAVTVATGPDGNLVAALGSPFDPEAFVTWRADGIQVFFRGTKAPARLELQLSAGAFVEALATY*
Ga0137404_1062154823300012929Vadose Zone SoilMPLGDAPQRKARPAARAEPLAGPVPETTFSADLRMLCQHGERAVTVAGGPDGELVAALGSPFDPEAFVTWRAEGIQVFFRGEAPTRLELQLSAGAFVTAVATYE*
Ga0137404_1188033323300012929Vadose Zone SoilMPLANDSMGSAPERKQRPSARAEPLAGPAPETTFSSDLVALCHHGERAVTVATGPDGNLVTALGSPFDPEAFVTWRADGIQVFFRGTKAPARLELQLSAGAFVEALATY*
Ga0137407_1152377113300012930Vadose Zone SoilNDSMGSAPERKQRPSARAEPLAGPAPETTFSSDLVALCHHGERAVTVATGPDGNLVAALGSPFDPEAFVTWREDGIQVFFRGTKAPARLELQLSAGAFVEALATY*
Ga0137418_1094685313300015241Vadose Zone SoilMPLGDAPQRKARPATRAEPLAGPVPETTFSADLRMLCQHGERAVTVAGGPDGELVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATYE*
Ga0190265_1041356633300018422SoilMPLGDAPERKRRPAARAEPLAGPPPETTFSSDLATLCHHGERAVTVAQRPDGNVVATLGSPFDPEAYVTWRADGIQVFFRGAKAPARLELQLSAGAFVDAVATY
Ga0190272_1006462043300018429SoilMPLANDSMGSAPERKQRPAARAEPLAGPAPETHFSPDLMTLCHHGERAVTVAGRPDGELVAALGSPFDPEAFITWRAEGIQVFFRGEAPARLDLQL
Ga0190272_1194220413300018429SoilAVVTGADLGRGQGVPGHLMPLGDAPERKRRPAARAEPLAGPPPETTFSSDLVTLCHYGERAVTVAGSPDGKLVATLGSPFDPEAFVTWRAEGIQVFFRGTKAPARLELQLSAGAFVDARAMYPDGL
Ga0190272_1272695523300018429SoilMPLGDAPERKRRPAARAEPLAGPPPETTFSSDLATLCHYGERAVTVAGGPDGKLVAALGSPFDPETFVTWQAEGIQVFFRGAKAPARLDLQL
Ga0190268_1000211253300018466SoilMPLGERPERKPRPAARAQPLAGPPPVTTFSSDLATLCHHGERALTVACAPDGGLVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELELSAGAFVTATAVYD
Ga0066662_1144727523300018468Grasslands SoilVPLGPPPERRPRPASRAQPLAGPAPETGFSPDLAALCHHGERAVTVAGGADRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE
Ga0066669_1027581913300018482Grasslands SoilVPLGDAPERKRRPAARAEPLAGPAPETTFSSDLAALCHHGERAVTVAGGPDGELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTARGYYLP
Ga0190264_1022710523300019377SoilMPLGERPERRPRPAARAQPLPGPPPVTTFSSDLATLCHHGERAVTVAGAPDGRLVAALGSPFDPEAFVTWRAEGIQVFFRGEPPARLELQLSAGAFVTATAVYDD
Ga0190264_1027252433300019377SoilMPLGDAPERKRRPAARAQPLAGPPPATTFSPELATLCHHGERAVTVASGPDRNLVAVLGSPFDPEAFVTWRADGIQVFFRGEAPTRLDLELSAGAFVTATATYD
Ga0190264_1061225433300019377SoilMPLGDAPERKRRPAARAVPLAGPVPETTVSPDLAAICHHGERAVTVAVGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLQLSAGAFVT
Ga0196958_1000190743300020181SoilMPLGERPDRRPRPAARAQPLTGPAPTTTFSPDLATLCHHGERAVTVAGGPEEGLVAVLGSPFDPEAFVTWRAEGIQVFFRGVAPTRLELRLSAGAFVTATASYD
Ga0196959_1004107233300021184SoilMPLGDRPDRKPRPAARAQPLTGPEPTTTFSADLATLCHHGERAVTVARGPGQELVAVLGSPFDPEAFVTWRADGIQVFFRGEPPSRLELRLSAGAFVTATATYD
Ga0196959_1020258913300021184SoilALVARADLRRRQGVPRDLTIMPLGERPDRRPRPAARAQPLTGPAPTTTFSPDLATLCHHGERAVTVAGGPEQGLVAVLGSPFDPEAFVTWRAEGIQVFFRGVAPTRLELRLSAGAFVTATASYD
Ga0224712_1002186443300022467Corn, Switchgrass And Miscanthus RhizosphereVPLGEPPERKPRPATQAQALSGPPPETTFSSDLAALCHHGERAVTVAVRPGGDLVAALGSPFDPEAFLTWRAEGIQVFFRGEAPARLELQLSAGASVTATATY
Ga0207684_1024520313300025910Corn, Switchgrass And Miscanthus RhizosphereMPLGQPPERMPRPAARAQPLTGPPPVTTFSPELVTLCHHGERAITVAGGPDRDPVAVLGSPFDPEAFVTWRAEGIQVFYRGEAPTRLDLALSAGAFVTASATYD
Ga0207707_1008710143300025912Corn RhizosphereMPLGQPPERRLRPAARAQPVTGPPPTTTFSADLATLCYHGERAVTVAGGPGAELVAVLGSPFDAEAFVTWRAEGIQVFYRGEAPTRLDLALSAGAFVTASATYD
Ga0207690_1176701523300025932Corn RhizosphereMPLGDAPERKRRPAAQAVPLPGPAPETTFSPDLVTLCHHGERAVTVAGSPDGELVAVLGSPFDPEAFVTWRANGIQVFFRGEAPARLELQLSAGAFLTATATY
Ga0207661_1036873613300025944Corn RhizospherePRPAARAQAFAGDPPVTTFSSDLATLCHHGERAVTVAGRPGGELVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTATATYD
Ga0209236_108933433300026298Grasslands SoilVPLGPPPERRPRPASRAQPLAGPAPETGFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE
Ga0209686_110188923300026315SoilVPLGPPPERRPRPASRAQPLAGPAPETGFSPDLAALCHHGERAVTVASGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE
Ga0209803_115328933300026332SoilVPLGPPPERRPRPASRAQPLAGPGPETRFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE
Ga0209805_140094723300026542SoilERRPRPAARAQPLAGPAPETHFSPDLITLCHHGERAVTVAGGPGRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLELQLSAGAFVTAVATY
Ga0209156_1049648823300026547SoilVPLGEPPRRQPRPAARAEPLAGPAPETTFSPDLLTLCHHGERAVTVAGGPDGGLVAVLGSPFDPEAFVTWRTEGIQVFFRGEAPARLDLQLSAGAFVTAKAIYA
Ga0209474_1012513623300026550SoilMPLGERPERRPRPAARAQPLPGLPPVTTVSSDLATLCHHGERAVTVAAGPDRQLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLGLSAGAFVTATAGYD
Ga0209474_1064866623300026550SoilMPLDRSPERRPRPAARAQPLAGPAPETRFSPDLIVLCHHGERAVTVAGEPDRDLVAVLGSPFEPEAFVTWRAEGIQVFFRGEAPTRLELQLS
Ga0209577_1011657913300026552SoilAPRPASRAQPLAGPAPETGFSPDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFVTAVATYE
Ga0179587_1052865013300026557Vadose Zone SoilMPLGDAPQRKARPATRAEPLAGPVPETTFSADLRMLCQHGERAVTVAGGPDGQLVAALGSPFDPEAFVTWRAEGIQVFFRGEAPTRLELQLSAGAFVTAVATYE
Ga0209818_128125813300027637Agricultural SoilGQGLLGHLMPLGDAPERKRRPAARAEPLAGPAPETIFSADLRALCHHGERAVTVAAPPGQEPVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATY
Ga0209387_112249523300027639Agricultural SoilMPLGDAPERKRRPAARAQPLAGPAPATTFSPELATLCHHGERAVTVAGGPDRNLVAVLGSPFDPEAFVTWRADGIQVFFRGEAPTRLDLELSAGAFVTATATYD
Ga0209486_1060045123300027886Agricultural SoilAARAEPLAGPAPETIFSADLRALCHHGERAVTVAAPPGQEPVAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATY
Ga0299907_1057203433300030006SoilMPLGDAPERKRRPAARAVPLAGPVPETTVSRDLAALCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLQLSAGAFVTATAIYE
(restricted) Ga0255310_1003234313300031197Sandy SoilAAQATPLPGPAPETHFSADLRTLCHHGERAVTVAGRPDGELVAALGSPFDPEAFVTWRADGIQVFYRGQPPARLDLQLSAGAFVSAVATY
Ga0299913_1191394913300031229SoilVVPGADLARGQGLPGHLMPLGERPERKPRPAARAQPLTGSPPITTFSSDLAVLCHHGERALTVAGSPDGGLVAALGSPFDPEAFVTWRADGIQVFFRGEAPARLDLQLSAGAFVTATATY
Ga0299913_1208100113300031229SoilMPLGDAPERKRRPAARAVPLAGPVPNTTVSPDLAALCHQGERAVTVAGRPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLQLSAGAFVTATAIYE
Ga0307469_1043488513300031720Hardwood Forest SoilLTGPPPEPTFSADLLTLCHHGERAVTVAGRPDGGLAAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATYGEATR
Ga0307469_1093077213300031720Hardwood Forest SoilMPLGDAPESKRRPAARATPLAGPPPETTFSSDLVTLCHHGERAVTVAGGPDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLQLSAGAFVTAVASYDD
Ga0307469_1115515433300031720Hardwood Forest SoilVPLGEPPPRRPRPAARAEPLAGPAPETSFSADLVALCHHGERAVTVASGPDGGLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLQLSAGAFVTAVATYPS
Ga0307469_1120293813300031720Hardwood Forest SoilMPLGQTPERRARPAARAAPLAGPPPATTFSSDLAALCHHGERAVTVAGGPDGKLVAVLGSPFDPEAFVTWRAEGIQVFYRGEAPTRLDLQ
Ga0307469_1161751023300031720Hardwood Forest SoilMPLGQPPERKPRPAARAEPLAGPPPETTISPDLAALCHHGERAVTVAGGRDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLALQLSAGAFVAANATWD
Ga0307473_1048607213300031820Hardwood Forest SoilVPLGEPPRRTPRPAARAEPLTGPTPETTFSADLLTLCHHGERAVTVAAGPDGGLVAALGSPFDPEAFVTWRAEGIQVFFRGEAPTRLELQLSAGA
Ga0307416_10036489943300032002RhizosphereMPLGDAPERKRRPAARAQPLAGPPPATTFSSELATLCHHGERAVTVAGGRDGNLVAVLGSPFDPEAFVTWRADGIQIFFRGEVPSRLDLQLSAGAFVTATATYD
Ga0307416_10187715123300032002RhizosphereMPLGDAPERKRRPAAQATPLPGPAPETHFSSDLRTLCHHGERAVTVAGKPDGDLVAALGSPFDPEAFVTRRSEGIQVFFRGEAPARLELQLSAGAFVT
Ga0307411_1031225723300032005RhizosphereMPLADRSMGGAPERKKRPAAQATPLPGPAPETHFSPDLRTLCHHGERAVTVANRPDGQLVAALGSPFDPEAFVTWRADGIQVFFRGEAPARLDLQLSAGAFLTATATY
Ga0307470_1025795333300032174Hardwood Forest SoilVPLGEPPRRTPRPAARAEPLTGPPPETTFSADLLTLCHHGERAVTVAAGPDGGLVAALGSPFDPEAFVTWRAEGIQVFFRGEAPTRLDLQLSAGAFLTATAIYE
Ga0307471_100003612133300032180Hardwood Forest SoilMPLGQPPERKPRPAARAEPLAGPPPETTISPDLAALCHHGERAVTVAGGRDRELVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPTRLALQLSAGAFVAATATWD
Ga0307471_10007587953300032180Hardwood Forest SoilVPLGEPPRRTPRPAARAEPLTGPPPETTFSADLLTLCHHGERAVTVAGRPDGGLAAALGSPFDPEAFVTWRAEGIQVFFRGEAPARLELQLSAGAFVTAVATYGDATR
Ga0307471_10422868923300032180Hardwood Forest SoilVPLGEPPPRRPRPTARAEPLAGPAPETTFSADLVALCHHGERAVTVAIGPDGGLVAVLGSPFDPEAFVTWRAEGIQVFFRGEAPARLDLQLSAGAFVTAVATYPSR
Ga0307472_10239058823300032205Hardwood Forest SoilVPLGEPPRRTPRPAARAEPLTGPTPETTFSADLLTLCHHGERAVTVAAGPDGGLVAALGSPFDPEAFVTWRAEGIQVFFRGEAPTRL
Ga0372943_0056126_634_9423300034268SoilMGSAPERKRRPSARAEPLAGPPPETSFSSDLVTLCQHGERAVTVATGPDGKLVAALGSPIDPEAFVTWRADGIQVFFRGTKAPARLELQLSAGAFVDARATY


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.