NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F035656

Metagenome / Metatranscriptome Family F035656

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F035656
Family Type Metagenome / Metatranscriptome
Number of Sequences 171
Average Sequence Length 76 residues
Representative Sequence MAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Number of Associated Samples 122
Number of Associated Scaffolds 171

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 46.15 %
% of genes near scaffold ends (potentially truncated) 32.16 %
% of genes from short scaffolds (< 2000 bps) 83.04 %
Associated GOLD sequencing projects 112
AlphaFold2 3D model prediction Yes
3D model pTM-score0.64

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (70.760 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(20.468 % of family members)
Environment Ontology (ENVO) Unclassified
(28.655 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(43.860 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 24.21%    β-sheet: 21.05%    Coil/Unstructured: 54.74%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.64
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 171 Family Scaffolds
PF00196GerE 7.02
PF01842ACT 3.51
PF13335Mg_chelatase_C 2.34
PF12974Phosphonate-bd 1.75
PF01068DNA_ligase_A_M 1.75
PF00072Response_reg 1.17
PF07238PilZ 1.17
PF14534DUF4440 1.17
PF08241Methyltransf_11 1.17
PF13408Zn_ribbon_recom 0.58
PF06224HTH_42 0.58
PF13489Methyltransf_23 0.58
PF01593Amino_oxidase 0.58
PF13545HTH_Crp_2 0.58
PF00118Cpn60_TCP1 0.58
PF00535Glycos_transf_2 0.58
PF07992Pyr_redox_2 0.58
PF028262-Hacid_dh_C 0.58
PF01717Meth_synt_2 0.58
PF09335SNARE_assoc 0.58
PF00005ABC_tran 0.58
PF00109ketoacyl-synt 0.58
PF01769MgtE 0.58
PF13541ChlI 0.58
PF13650Asp_protease_2 0.58
PF09347DUF1989 0.58
PF00326Peptidase_S9 0.58

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 171 Family Scaffolds
COG1423ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) familyReplication, recombination and repair [L] 1.75
COG1793ATP-dependent DNA ligaseReplication, recombination and repair [L] 1.75
COG0398Uncharacterized membrane protein YdjX, related to fungal oxalate transporter, TVP38/TMEM64 familyFunction unknown [S] 0.58
COG0459Chaperonin GroEL (HSP60 family)Posttranslational modification, protein turnover, chaperones [O] 0.58
COG0586Membrane integrity protein DedA, putative transporter, DedA/Tvp38 familyCell wall/membrane/envelope biogenesis [M] 0.58
COG0620Methionine synthase II (cobalamin-independent)Amino acid transport and metabolism [E] 0.58
COG1238Uncharacterized membrane protein YqaA, VTT domainFunction unknown [S] 0.58
COG1824Permease, similar to cation transportersInorganic ion transport and metabolism [P] 0.58
COG2239Mg/Co/Ni transporter MgtE (contains CBS domain)Inorganic ion transport and metabolism [P] 0.58
COG3214DNA glycosylase YcaQ, repair of DNA interstrand crosslinksReplication, recombination and repair [L] 0.58


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A70.76 %
All OrganismsrootAll Organisms29.24 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300004156|Ga0062589_102658552Not Available520Open in IMG/M
3300004463|Ga0063356_100300640Not Available1989Open in IMG/M
3300004463|Ga0063356_101900831Not Available897Open in IMG/M
3300005093|Ga0062594_101909413Not Available630Open in IMG/M
3300005206|Ga0068995_10062614Not Available695Open in IMG/M
3300005295|Ga0065707_10170930Not Available1485Open in IMG/M
3300005295|Ga0065707_11090456Not Available518Open in IMG/M
3300005332|Ga0066388_103904721Not Available760Open in IMG/M
3300005332|Ga0066388_105640244All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium633Open in IMG/M
3300005440|Ga0070705_101327976Not Available597Open in IMG/M
3300005444|Ga0070694_100289108Not Available1252Open in IMG/M
3300005445|Ga0070708_100548777Not Available1090Open in IMG/M
3300005458|Ga0070681_11988508Not Available509Open in IMG/M
3300005467|Ga0070706_100037428All Organisms → cellular organisms → Bacteria4481Open in IMG/M
3300005468|Ga0070707_100421318All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1295Open in IMG/M
3300005468|Ga0070707_102091718Not Available534Open in IMG/M
3300005518|Ga0070699_100253867All Organisms → cellular organisms → Bacteria1571Open in IMG/M
3300005549|Ga0070704_100378413All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae → Gemmatimonas → unclassified Gemmatimonas → Gemmatimonas sp. SM23_521202Open in IMG/M
3300006845|Ga0075421_100124212All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3238Open in IMG/M
3300007265|Ga0099794_10165921Not Available1124Open in IMG/M
3300009038|Ga0099829_10002719All Organisms → cellular organisms → Bacteria10242Open in IMG/M
3300009087|Ga0105107_10495594Not Available851Open in IMG/M
3300009087|Ga0105107_10614668Not Available757Open in IMG/M
3300009088|Ga0099830_10155990All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Candidatus Troglogloeales → Candidatus Manganitrophaceae → Candidatus Manganitrophus → Candidatus Manganitrophus noduliformans1759Open in IMG/M
3300009090|Ga0099827_10774165Not Available830Open in IMG/M
3300009153|Ga0105094_10749444Not Available573Open in IMG/M
3300009157|Ga0105092_10733194Not Available576Open in IMG/M
3300009157|Ga0105092_10922064Not Available516Open in IMG/M
3300009166|Ga0105100_10288694Not Available984Open in IMG/M
3300009804|Ga0105063_1041638Not Available630Open in IMG/M
3300009804|Ga0105063_1052509Not Available588Open in IMG/M
3300011269|Ga0137392_10002438All Organisms → cellular organisms → Bacteria11195Open in IMG/M
3300011271|Ga0137393_10007970All Organisms → cellular organisms → Bacteria7128Open in IMG/M
3300012174|Ga0137338_1019671Not Available1317Open in IMG/M
3300012203|Ga0137399_10354176All Organisms → cellular organisms → Bacteria1219Open in IMG/M
3300012207|Ga0137381_10566995Not Available990Open in IMG/M
3300012225|Ga0137434_1003158Not Available1513Open in IMG/M
3300012355|Ga0137369_10233716Not Available1400Open in IMG/M
3300012363|Ga0137390_11622918Not Available584Open in IMG/M
3300012685|Ga0137397_10245323Not Available1334Open in IMG/M
3300012918|Ga0137396_10999120Not Available606Open in IMG/M
3300012925|Ga0137419_10597208Not Available886Open in IMG/M
3300012927|Ga0137416_10561771All Organisms → cellular organisms → Bacteria990Open in IMG/M
3300012929|Ga0137404_10012327All Organisms → cellular organisms → Bacteria5955Open in IMG/M
3300012929|Ga0137404_11473305Not Available629Open in IMG/M
3300012930|Ga0137407_10326479Not Available1408Open in IMG/M
3300012944|Ga0137410_10657606Not Available870Open in IMG/M
3300012944|Ga0137410_11392751Not Available609Open in IMG/M
3300013100|Ga0157373_10920884Not Available649Open in IMG/M
3300014873|Ga0180066_1071279Not Available704Open in IMG/M
3300014881|Ga0180094_1024253Not Available1200Open in IMG/M
3300014884|Ga0180104_1139949Not Available709Open in IMG/M
3300015170|Ga0120098_1009490Not Available1042Open in IMG/M
3300015241|Ga0137418_10204780Not Available1702Open in IMG/M
3300015245|Ga0137409_10660842Not Available878Open in IMG/M
3300015259|Ga0180085_1101325Not Available849Open in IMG/M
3300017939|Ga0187775_10004002All Organisms → cellular organisms → Bacteria3509Open in IMG/M
3300017997|Ga0184610_1141072Not Available787Open in IMG/M
3300018000|Ga0184604_10069922All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1022Open in IMG/M
3300018028|Ga0184608_10175888Not Available931Open in IMG/M
3300018028|Ga0184608_10201047Not Available872Open in IMG/M
3300018028|Ga0184608_10451399Not Available553Open in IMG/M
3300018031|Ga0184634_10506268Not Available539Open in IMG/M
3300018051|Ga0184620_10026191All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1496Open in IMG/M
3300018052|Ga0184638_1083611Not Available1170Open in IMG/M
3300018052|Ga0184638_1099253Not Available1068Open in IMG/M
3300018052|Ga0184638_1113479Not Available992Open in IMG/M
3300018052|Ga0184638_1254945Not Available603Open in IMG/M
3300018052|Ga0184638_1302105Not Available541Open in IMG/M
3300018052|Ga0184638_1344218Not Available500Open in IMG/M
3300018053|Ga0184626_10086565Not Available1325Open in IMG/M
3300018054|Ga0184621_10234420Not Available656Open in IMG/M
3300018056|Ga0184623_10118565Not Available1225Open in IMG/M
3300018056|Ga0184623_10384393Not Available622Open in IMG/M
3300018061|Ga0184619_10148807Not Available1068Open in IMG/M
3300018063|Ga0184637_10007261All Organisms → cellular organisms → Bacteria6710Open in IMG/M
3300018063|Ga0184637_10095234Not Available1831Open in IMG/M
3300018063|Ga0184637_10123860All Organisms → cellular organisms → Bacteria1594Open in IMG/M
3300018075|Ga0184632_10036056Not Available2118Open in IMG/M
3300018075|Ga0184632_10078040Not Available1445Open in IMG/M
3300018075|Ga0184632_10273655Not Available734Open in IMG/M
3300018075|Ga0184632_10327015Not Available660Open in IMG/M
3300018075|Ga0184632_10386114Not Available591Open in IMG/M
3300018076|Ga0184609_10011679All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae3294Open in IMG/M
3300018076|Ga0184609_10027385All Organisms → cellular organisms → Bacteria2316Open in IMG/M
3300018078|Ga0184612_10322788Not Available788Open in IMG/M
3300018422|Ga0190265_10175838Not Available2125Open in IMG/M
3300018422|Ga0190265_10201138All Organisms → cellular organisms → Bacteria2003Open in IMG/M
3300018422|Ga0190265_10295025Not Available1689Open in IMG/M
3300018429|Ga0190272_10052110All Organisms → cellular organisms → Bacteria → Terrabacteria group → unclassified Terrabacteria group → Terrabacteria group bacterium ANGP12376Open in IMG/M
3300018429|Ga0190272_10165853Not Available1545Open in IMG/M
3300018469|Ga0190270_12352865Not Available594Open in IMG/M
3300019360|Ga0187894_10074829All Organisms → cellular organisms → Bacteria1881Open in IMG/M
3300019360|Ga0187894_10142498All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1227Open in IMG/M
3300019458|Ga0187892_10003359All Organisms → cellular organisms → Bacteria29460Open in IMG/M
3300019458|Ga0187892_10013136All Organisms → cellular organisms → Bacteria9641Open in IMG/M
3300019458|Ga0187892_10463202All Organisms → cellular organisms → Bacteria590Open in IMG/M
3300019487|Ga0187893_10080871All Organisms → cellular organisms → Bacteria2994Open in IMG/M
3300019881|Ga0193707_1015812All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2525Open in IMG/M
3300019886|Ga0193727_1070109Not Available1085Open in IMG/M
3300019889|Ga0193743_1227825Not Available564Open in IMG/M
3300019998|Ga0193710_1006478Not Available1175Open in IMG/M
3300020001|Ga0193731_1135417Not Available616Open in IMG/M
3300020002|Ga0193730_1062254Not Available1069Open in IMG/M
3300020003|Ga0193739_1007924All Organisms → cellular organisms → Bacteria2823Open in IMG/M
3300020003|Ga0193739_1009816All Organisms → cellular organisms → Bacteria2521Open in IMG/M
3300020006|Ga0193735_1108956Not Available765Open in IMG/M
3300020061|Ga0193716_1135557Not Available1009Open in IMG/M
3300020063|Ga0180118_1304145Not Available576Open in IMG/M
3300020170|Ga0179594_10206496Not Available737Open in IMG/M
3300021073|Ga0210378_10014568All Organisms → cellular organisms → Bacteria → Terrabacteria group → unclassified Terrabacteria group → Terrabacteria group bacterium ANGP13239Open in IMG/M
3300021073|Ga0210378_10038660Not Available1894Open in IMG/M
3300021073|Ga0210378_10045360Not Available1735Open in IMG/M
3300021081|Ga0210379_10059098Not Available1542Open in IMG/M
3300021081|Ga0210379_10221369Not Available818Open in IMG/M
3300021081|Ga0210379_10411237Not Available598Open in IMG/M
3300021344|Ga0193719_10349190Not Available615Open in IMG/M
3300021344|Ga0193719_10405517All Organisms → cellular organisms → Bacteria561Open in IMG/M
3300022534|Ga0224452_1084127Not Available966Open in IMG/M
3300022534|Ga0224452_1126159Not Available787Open in IMG/M
3300022694|Ga0222623_10018691All Organisms → cellular organisms → Bacteria2579Open in IMG/M
3300022694|Ga0222623_10062727Not Available1435Open in IMG/M
3300022694|Ga0222623_10173033Not Available840Open in IMG/M
3300025324|Ga0209640_10015459All Organisms → cellular organisms → Bacteria6651Open in IMG/M
3300025910|Ga0207684_10079458Not Available2790Open in IMG/M
3300025912|Ga0207707_11458880Not Available544Open in IMG/M
3300025922|Ga0207646_10182031All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1898Open in IMG/M
3300026351|Ga0257170_1068600Not Available501Open in IMG/M
3300026358|Ga0257166_1004750All Organisms → cellular organisms → Bacteria1515Open in IMG/M
3300026480|Ga0257177_1024326Not Available873Open in IMG/M
3300026507|Ga0257165_1087138Not Available576Open in IMG/M
3300027187|Ga0209869_1007716All Organisms → cellular organisms → Bacteria1161Open in IMG/M
3300027527|Ga0209684_1014662All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1229Open in IMG/M
3300027650|Ga0256866_1047452Not Available1136Open in IMG/M
3300027650|Ga0256866_1065228All Organisms → cellular organisms → Bacteria971Open in IMG/M
3300027815|Ga0209726_10246096Not Available907Open in IMG/M
3300027818|Ga0209706_10448186Not Available595Open in IMG/M
3300027862|Ga0209701_10187675All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Candidatus Troglogloeales → Candidatus Manganitrophaceae → Candidatus Manganitrophus → Candidatus Manganitrophus noduliformans1240Open in IMG/M
3300027909|Ga0209382_10648421All Organisms → cellular organisms → Bacteria1143Open in IMG/M
3300027947|Ga0209868_1036000Not Available529Open in IMG/M
3300027954|Ga0209859_1022398All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → candidate division KSB1 → unclassified candidate division KSB1 → candidate division KSB1 bacterium1099Open in IMG/M
3300027957|Ga0209857_1062973Not Available641Open in IMG/M
3300028536|Ga0137415_10452110All Organisms → cellular organisms → Bacteria1090Open in IMG/M
3300028673|Ga0257175_1000003All Organisms → cellular organisms → Bacteria9224Open in IMG/M
3300028792|Ga0307504_10406129Not Available537Open in IMG/M
3300028803|Ga0307281_10007766All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2959Open in IMG/M
3300028814|Ga0307302_10347023Not Available732Open in IMG/M
3300028819|Ga0307296_10287041Not Available897Open in IMG/M
3300028819|Ga0307296_10740250Not Available537Open in IMG/M
3300028824|Ga0307310_10443418Not Available649Open in IMG/M
3300028828|Ga0307312_10476132Not Available823Open in IMG/M
3300030006|Ga0299907_10825066Not Available697Open in IMG/M
3300030006|Ga0299907_10980840Not Available622Open in IMG/M
3300030006|Ga0299907_11265931Not Available526Open in IMG/M
3300030619|Ga0268386_10005315All Organisms → cellular organisms → Bacteria9582Open in IMG/M
3300030619|Ga0268386_10265842Not Available1255Open in IMG/M
3300030619|Ga0268386_10623267Not Available716Open in IMG/M
3300030620|Ga0302046_10271971Not Available1394Open in IMG/M
3300031229|Ga0299913_10624222Not Available1059Open in IMG/M
3300031229|Ga0299913_11204058All Organisms → cellular organisms → Bacteria717Open in IMG/M
(restricted) 3300031248|Ga0255312_1013103Not Available1981Open in IMG/M
3300031740|Ga0307468_100107874Not Available1675Open in IMG/M
3300032180|Ga0307471_100665718Not Available1204Open in IMG/M
3300033811|Ga0364924_149006Not Available551Open in IMG/M
3300034155|Ga0370498_058216All Organisms → cellular organisms → Bacteria865Open in IMG/M
3300034155|Ga0370498_184852Not Available513Open in IMG/M
3300034164|Ga0364940_0107701Not Available788Open in IMG/M
3300034257|Ga0370495_0025657Not Available1758Open in IMG/M
3300034257|Ga0370495_0132787All Organisms → cellular organisms → Bacteria784Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil20.47%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment17.54%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil14.04%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.85%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment4.09%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment4.09%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil3.51%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand3.51%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil2.34%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.92%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.75%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks1.75%
Bio-OozeEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze1.75%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere1.17%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.17%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.17%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.17%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere1.17%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.17%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.17%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.17%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.58%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands0.58%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.58%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.58%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.58%
FossillEnvironmental → Terrestrial → Soil → Fossil → Unclassified → Fossill0.58%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.58%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005206Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009087Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm September2015EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009153Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 10-12cm March2015EnvironmentalOpen in IMG/M
3300009157Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009166Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm May2015EnvironmentalOpen in IMG/M
3300009804Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012174Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT366_2EnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012225Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2EnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300013100Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C6-5 metaGHost-AssociatedOpen in IMG/M
3300014873Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10DEnvironmentalOpen in IMG/M
3300014881Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1DaEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300015170Fossil microbial communities from human bone sample from Teposcolula Yucundaa, Mexico - TP48EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015259Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10DEnvironmentalOpen in IMG/M
3300017939Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018051Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1EnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019886Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c2EnvironmentalOpen in IMG/M
3300019889Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2c2EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3m1EnvironmentalOpen in IMG/M
3300020001Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a2EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a1EnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020061Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2c1EnvironmentalOpen in IMG/M
3300020063Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT730_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026358Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300027187Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027527Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 6 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027650Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 HiSeqEnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027818Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm September2015 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300027947Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027954Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027957Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028803Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033811Sediment microbial communities from East River floodplain, Colorado, United States - 28_j17EnvironmentalOpen in IMG/M
3300034155Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17EnvironmentalOpen in IMG/M
3300034164Sediment microbial communities from East River floodplain, Colorado, United States - 14_s17EnvironmentalOpen in IMG/M
3300034257Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
Ga0062589_10265855223300004156SoilVTYRIGRICIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPNVRLTLERMSKLGL*
Ga0063356_10030064063300004463Arabidopsis Thaliana RhizosphereMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPNVRLTLERMSKLGL*
Ga0063356_10190083123300004463Arabidopsis Thaliana RhizosphereMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVMLTLERMSKLGL*
Ga0062594_10190941333300005093SoilVTYRIGRICIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPNVR
Ga0068995_1006261423300005206Natural And Restored WetlandsLRRSSVRTEQSVRNRAVTHRVGRLWIRRSGLVYQVMFVPTGNGEFQIKTEDGLRVFLWGASIPPERIEEAVTALRREPEHEIPNVVLTLERMSKLGL*
Ga0065707_1017093023300005295Switchgrass RhizosphereMVYDVMFAPTGNGEFRIKTEDGLRAFLWGASIPAERIEEAVAALRHDAEHEIPNVLLTLERMSKLGL*
Ga0065707_1109045613300005295Switchgrass RhizosphereGVTRRVGRLWIRRTGGKTYDVMFRPYGNGEFRIATEDGLRAFLWGATIPAERIEEAVAALRGDTEHEIPNVVLTLERMSKLGL*
Ga0066388_10390472123300005332Tropical Forest SoilMGRLWIRRTGPSTYEVMFRPTGNGEFRVATEDGLRALLWGATISSDRIEQAILALRKETEHEIPDVVLTLERMSKLRL*
Ga0066388_10564024423300005332Tropical Forest SoilMGRLWIRRTGPSTYDVMFRPSGNGEFRVATEDGLRALLWGATVPSDRIELAILALRKETEHEIHDVVLTLERMSKLRL*
Ga0070705_10132797623300005440Corn, Switchgrass And Miscanthus RhizosphereLIPAPLSGDRGTGRRGAVTYRVGRICIRRTGLAYDVMFAPTGNGEFHIATAAGLRAFLWEATIPPERIDEALDALRHGTEHEIHNVMLTLERMSKLGL*
Ga0070694_10028910823300005444Corn, Switchgrass And Miscanthus RhizosphereMTPAVTHRLGRLWIRRTGGKTYDVMFRPYGNGEFRIATENGLRAFLWGATIPTERIEEAVAALRGDTEHEIPNVVLTLERMSKLGL*
Ga0070708_10054877733300005445Corn, Switchgrass And Miscanthus RhizosphereVTHRLGRLWIRRVGMVYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPADRIEQAVAALRSDTEHEIRNVVLTLEQMSKLGL*
Ga0070681_1198850813300005458Corn RhizosphereTPAPLPGERGTGWRGAVTYRIGRICIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPNVRLTLERMSKLGL*
Ga0070706_10003742863300005467Corn, Switchgrass And Miscanthus RhizosphereMVYDVMLAPTGNGEFRIKTEDGLRAFLWGATIPADRIEQAVAALRSDTEHEIPNVVLTLEQMSRLGL*
Ga0070707_10042131823300005468Corn, Switchgrass And Miscanthus RhizosphereMFRPTGNGEFRSTTEDGLRAFLWGATIPAERIEEAVAALRDEMELEIPDVVLTLERMSKLGL*
Ga0070707_10209171813300005468Corn, Switchgrass And Miscanthus RhizosphereAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVMLTLERMSKLGL
Ga0070699_10025386733300005518Corn, Switchgrass And Miscanthus RhizosphereVTHRLGRLWIRRVGMVYDVMLAPTGNGEFRIKTEDGLRAFLWGATIPADRIEQAVAALRSDTEHEIPNVVLTLEQMSRLGL*
Ga0070704_10037841313300005549Corn, Switchgrass And Miscanthus RhizosphereMTHAVTHRVGRLWIRRTGGKTYDVMFRPYGNGDFRIATEDGLRAFLWGATIPGERIEEAVAALRGDTEHEIPNVVLTLERMSKLGL*
Ga0075421_10012421243300006845Populus RhizosphereMAYDVMFAPTGNGEFRIASAAGLRAFLWEATIPAERIDDALDALRTDTEHEIPDVRLTLERMSKLGL*
Ga0099794_1016592123300007265Vadose Zone SoilMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL*
Ga0099829_1000271993300009038Vadose Zone SoilMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLSLERMSKLGL*
Ga0105107_1049559423300009087Freshwater SedimentLPGEQGIGRIGAVTHRSGRIWIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPNVMLTLERMSKLGL*
Ga0105107_1061466813300009087Freshwater SedimentRTGMAYDVMFAPTGNGEFRIKTEDGLRAFLWGASIPAERIEEAVAALRHDAEHEIPDVVLTLERMSKLGL*
Ga0099830_1015599023300009088Vadose Zone SoilMFAPTGNGEFRIKTENGLRAFLWGATIPSERIEEAVAALRNDTEHEIPNVVLTLEQMSQLGL*
Ga0099827_1077416523300009090Vadose Zone SoilMFRPTGNGEFRITTEDGLRTFLWGATIPSERIEEAVAALRTDTEHEIPDVVLTLERMSELGL*
Ga0105094_1074944413300009153Freshwater SedimentMGRLWIRRTGMAYDVMFAPTGNGEFRIKTEDGLRAFLWGASIPAERIEEAVAALRHDAEHEIPDVVLTLERMSKLGL*
Ga0105092_1073319413300009157Freshwater SedimentQGIERTGAVTHRIGRIWIRRTAQTYDVMFAPTGNGEFRIATAAGLRAFLWEATVPPERIDEALDALRHETEHEIPNVVLTLELMSKLGL*
Ga0105092_1092206413300009157Freshwater SedimentTHRTGRIWIRHTGSAYDVMFAPSGNGEFRIATAAGLRAFLWQVTIPAERIDEALDALRHDTEHEIPNVVLTLERMSKLGL*
Ga0105100_1028869423300009166Freshwater SedimentMGRLWIRRSGLVYDVMFAPTGNGEFRIKTEDGLRAFLWGASIPAERIEEAVAALRHDAEHEIPDVVLTLERMGKLGL*
Ga0105063_104163813300009804Groundwater SandMFRPTGNGEFRIKTEDGLRALLWGATIPAERIEEAVAALRNDTEHEIPNVVLTLERMSKLGL*
Ga0105063_105250913300009804Groundwater SandMFAPAGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRNDTEHEIPNVMLTLERMSKLGL*
Ga0137392_1000243853300011269Vadose Zone SoilMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL*
Ga0137393_1000797073300011271Vadose Zone SoilMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLSLERMSKLGL*
Ga0137338_101967133300012174SoilMAYDVMFAPTGNGEFRIKTEDGLRAFLGDATIPVDRIEEAVAALRRDTEHEIRNVVLTLERMSKLGL*
Ga0137399_1035417623300012203Vadose Zone SoilLTRDHLLAAQDAGRSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVLLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMRKLGL*
Ga0137381_1056699513300012207Vadose Zone SoilMGRAGRSGSTPAEFPLSAPLLGEPGTARSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGDVRIATAAGLRAFLWEATIPPERIDVALDALRHDTEHEIPNVMLTLERMSKLGL*
Ga0137434_100315813300012225SoilMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRNDTEHEIPNVVLTLERMSKLGL*
Ga0137369_1023371623300012355Vadose Zone SoilMFRPSGNGEFRIVTEDGLRAFLWGATIPGERIEEAVAALRSDTEHEIPDVVLTLERMSKLGL*
Ga0137390_1162291813300012363Vadose Zone SoilLTRDHLLAEQDPGRSGAVTHRIGRIWIRRTGSVYDVMFAPTGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRTDTEHEIPNVRLTLERMSQIGL*
Ga0137397_1024532323300012685Vadose Zone SoilMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVVLTLERMSKLGL*
Ga0137396_1099912023300012918Vadose Zone SoilMFAPTGNGEFRIKTEDGLRVLLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMRKLGL*
Ga0137419_1059720823300012925Vadose Zone SoilVEFPLTRDHLLAAQDAGRSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVLLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMRKLGL*
Ga0137416_1056177123300012927Vadose Zone SoilGAGRAGRPRAAYAEFPLTRDHLLAAQDAGRSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVLLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMRKLGL*
Ga0137404_1001232733300012929Vadose Zone SoilMFRPYGNGEFRIATENGLRAFLWGATIPTERIEEAVAALRGDTEHEIPNVVLTLERMRVS
Ga0137404_1147330523300012929Vadose Zone SoilLTRDHLLAAQDAGRSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEEGLRVLLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMRKLGL*
Ga0137407_1032647923300012930Vadose Zone SoilMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVVLTLERMSKLRL*
Ga0137410_1065760623300012944Vadose Zone SoilMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVVLTLERVSKLRL*
Ga0137410_1139275113300012944Vadose Zone SoilMGRAGRSGSTPAEFPLSAPLLGEQGSGRIGAVTHRIGRLWIRRTGSAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVMLTLERMSKLGL*
Ga0157373_1092088413300013100Corn RhizospherePAPLPGERGTGWRGAVTYRIGRICIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPNVRLTLERMSKLGL*
Ga0180066_107127923300014873SoilMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRTDTEHEIPNVVLTLERMSKLGL*
Ga0180094_102425333300014881SoilMAYDVMFAPTGNGEFRIKTEDGLRAFLGDATIPADRIEEAVAALRRDTEHEIRNVVLTLERMSKLGL*
Ga0180104_113994923300014884SoilMFAPTGNGECRIATAAGLRAFLWEATIPAERIDEALDALRTDTEHEIPNVLLTLERMSKLGL*
Ga0120098_100949043300015170FossillMFAPTGNGEFRLASAAGLRAFLWEATIPAERIDEALDALRSDTEHEIPNVTLTLERMSKLGL*
Ga0137418_1020478033300015241Vadose Zone SoilVEFPLTRDHLLAAQDAGRSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVLLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMSKLGL*
Ga0137409_1066084213300015245Vadose Zone SoilMGRAGRSRSTPAEFPLSTPLLGEQGSGRIGAVTHRIGRLWIRRTGSAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVMLTLERMSKLGL*
Ga0180085_110132543300015259SoilMFAPTGNGECRIATAAGLRAFLWEATIPAERIDEALDALRTDTEHEIPNVMLTLERMSKLGL*
Ga0187775_1000400233300017939Tropical PeatlandVNSRAGRIEIRRQGKAYDVMFAPRGNGEFRIQTESALRAFLMQVAVPGDRIEDAVEALRRDSQHAIPDVVLTLERMSKLGL
Ga0184610_114107213300017997Groundwater SedimentMAYDVMFVPTGNGEFRIKTEDGLRAFLSGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Ga0184604_1006992223300018000Groundwater SedimentLTRDHLLAEQDAERSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVLLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMSKLGL
Ga0184608_1017588823300018028Groundwater SedimentMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRTDTEHEINVVLTLERMSKLG
Ga0184608_1020104713300018028Groundwater SedimentRIWIRRTGLAYDVMFAPTGNGEFRIATAAGLRAFLCDAPIPAKRIDEALDALRHDTEHEIPDVMLTLERMSKLGL
Ga0184608_1045139913300018028Groundwater SedimentVTHRIGRIWTRRAGSAYDVMFAPAGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRTDTEHEINVVLTLERMRKLGL
Ga0184634_1050626823300018031Groundwater SedimentYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTDHEIPNVVLTLERMSKLGL
Ga0184620_1002619133300018051Groundwater SedimentMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVVLTLERMSKLRL
Ga0184638_108361113300018052Groundwater SedimentVTHRIGRIWIRRTGSAYDVMFAPAGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPDVMLTLERMSKL
Ga0184638_109925323300018052Groundwater SedimentMTYDVMFRPTGNGEFRIATENGLRAFLWGATIPADRIEEAVAALRNDTEHEIPNVVLTLERMSKLGL
Ga0184638_111347943300018052Groundwater SedimentMFAPTGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVRLTLERMSKL
Ga0184638_125494513300018052Groundwater SedimentVTHRVGRIWIRRTGMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Ga0184638_130210523300018052Groundwater SedimentFRIKTEDGLRAFLWGASIPAERIEEAVAVLRNDTEHEIPNVVLTLERMSKLGL
Ga0184638_134421813300018052Groundwater SedimentIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPDVMLTLERMSKLGL
Ga0184626_1008656533300018053Groundwater SedimentVTHRIGRIWIRRTGSAYDVMFAPAGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRTDTDHEIPDVMLTLERMSKLGL
Ga0184621_1010888823300018054Groundwater SedimentVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHE
Ga0184621_1023442023300018054Groundwater SedimentHLLAQQDTGRRRAVTHRVGRIWIRRTGSAYDVMFAPAGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPDVMLTLERMSKLGL
Ga0184623_1011856523300018056Groundwater SedimentVTHRRGRIWIRRTGMAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIRNVVLTLERMSKLGL
Ga0184623_1038439323300018056Groundwater SedimentVTHRVGRIWIRRTGMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERM
Ga0184619_1014880733300018061Groundwater SedimentMFAPTGNGECHLATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVVLTLERMSKLRL
Ga0184637_1000726153300018063Groundwater SedimentVTHRLGRLWIRRTGVVYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRNDTEHEIPNVALTLEQMSKLGL
Ga0184637_1009523423300018063Groundwater SedimentVTHRIGRIWIRRTGSAYDVMFAPAGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPDVMLTLERMSKLGL
Ga0184637_1012386033300018063Groundwater SedimentAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Ga0184632_1003605613300018075Groundwater SedimentHRVGRIWIRRTGMAYDVMFVPTGNGEFRIKTEDGLRAFLWGASIPAERIEEAVAVLRNDTEHEIPNVVLTLERMSKLGL
Ga0184632_1007804023300018075Groundwater SedimentMFAPTGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVRLTLERMSKLGL
Ga0184632_1027365533300018075Groundwater SedimentMFRPTGNGEFRIATEDGLRAFLWGATIPADRIEEAVAALRSETEHEIPDAVLRLERMSKR
Ga0184632_1032701513300018075Groundwater SedimentMTYDVMFRPTGNGEFRIATENGLRAFLWGATIPADRIEEAVAALRNDTEHEIPNVVLTL
Ga0184632_1038611413300018075Groundwater SedimentGRSRAAHAEFPLTRGNRLAEQDTGRRGAVTHRIGRIWIRRTGSAYDVMFAPAGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRTDTDHEIPDVMLTLERMSKLGL
Ga0184609_1001167953300018076Groundwater SedimentVTHRVGRLWIRQTGSAYGVMFAPTGNGECRIATAAGLRAFLWEATIPPERIDEALDARRHDTEHEIPNVMLTLDRMSKLGL
Ga0184609_1002738553300018076Groundwater SedimentVTHRVGRIWIRRTGMAYDVMFAPTGNGELRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Ga0184612_1032278823300018078Groundwater SedimentVGRIWIRRTGSAYDVMFAPAGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVMLTLERMSKLGL
Ga0190265_1017583833300018422SoilMFAPTGNGELRLATAAGLRAFLWEATIPAERIDEALDTLRNDTEHEIPVRLTLERMSKLG
Ga0190265_1020113823300018422SoilVGCAGRSRAAPAEFPLTSAPLPDEQGIGRTGTVTHRTGRIWIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPDVTLTLERMSKLGL
Ga0190265_1029502523300018422SoilMFAPTGNGECRLATAAGLRAFLWQATIPAERIDEALDALRTDTEHEIRNVVLTLERMSKLGL
Ga0190272_1005211063300018429SoilMFAPAGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRTDTEHEINVVLTLDRMSKLG
Ga0190272_1016585343300018429SoilVTHRIGRICIRRTGSAYDVMFAPTGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPDVMLTLERMSKLGL
Ga0190270_1235286513300018469SoilVTHRIGRLWIRRSGMVYDVMFAPTGNGEFRIKTEDGLRAFLWGASIPAERIEEAVAALRHDVEHEIPNVVLTLERMSKLGL
Ga0187894_1007482923300019360Microbial Mat On RocksMAYDVMFAPSGNGEFRIASAAGLRAFLWEATIPAERIDDALDALRTDTEHEIPNVTLTLERMSKLGL
Ga0187894_1014249823300019360Microbial Mat On RocksFAPTGNGGFRIATAAGLRAFLWEATILPERIDEAPDALRHDTEHEIPNVMLTLERMSKLG
Ga0187892_10003359143300019458Bio-OozeVHVRRHVVTHRVGRLWIRRTDVAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRNDTEHEIPNVALSLEQMSKLGL
Ga0187892_1001313643300019458Bio-OozeMAYDVMFAPSGNGEFRIASAAGLRAFLWEATIPAERIDDALDALRTDTEHEIPNVSLTLERMSKLGL
Ga0187892_1046320213300019458Bio-OozeSGNGEFRIASAAGLRAFLWEATIPAERIDDALDALRTDTEHEIPNVTLTLERMSKLGL
Ga0187893_1008087143300019487Microbial Mat On RocksMAYDVMFAPSGNGEFRIASAAGLRAFLWEATIPAERIDDALDALRTDTEHEIPNVMLTLERMSKLGL
Ga0193723_106933133300019879SoilVTSRVGRLWIRRTGPAYDVMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRTDTEHEI
Ga0193707_101581243300019881SoilVTSRVGRLWIRRTGPAYDVMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVVLTLERMSKLGL
Ga0193727_107010933300019886SoilVTSRVGRLWIRRTGPAYDVMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVVLTLERMSKLRL
Ga0193743_122782513300019889SoilMFAPAGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRTDTEHEIPDVRLTLERMSKLGL
Ga0193710_100647833300019998SoilMFAPTGNGECRIATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVVLTLERMSKLGL
Ga0193731_113541723300020001SoilVTSRVGRLWIRRTGPAYDVMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRHDTEHEIPNVMLTLERMSKLGLLTL
Ga0193730_106225423300020002SoilVTSRVGRLWIRRTGPGYDVMFAPTGNGECHIATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVVLTLERMSKLRL
Ga0193739_100792443300020003SoilVTHRVGRIWIRRTGMAYDVMFAPTGNGEFRIKTEDGLRAFLWGSTIPAERIEEAVAALRSDTEHEIPNVVLTLEPGA
Ga0193739_100981673300020003SoilMFAPAGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPDVMLT
Ga0193735_110895623300020006SoilAEQDAGRSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVFLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMSKLGL
Ga0193716_113555733300020061SoilRVGRLWIRRTGPAYDVMFAPTGNGECRIATAAGLRAFLWEATISAERIDEALDALRTDTEHEIRNVELTLERMSKLGL
Ga0180118_130414513300020063Groundwater SedimentVTHRVGRIWIRRTGMAYDVMFAPTGNGEFRIKTEDGLRAFLGDATIPVDRIEEAVAALRRDTEHEIRNVVLTLERMSKLGL
Ga0179594_1020649613300020170Vadose Zone SoilVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVLLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMRKLGL
Ga0210378_1001456823300021073Groundwater SedimentVTHRIGRLWIRRTGSAYDVMFAPTGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVRLTLERMSKLGL
Ga0210378_1003866013300021073Groundwater SedimentVTHRIGRIWIRRTGSAYDVMFAPAGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVMLTLERMSKLGL
Ga0210378_1004536043300021073Groundwater SedimentLTRDHLLAQQNTGRIGAVTHRIGRIWIRRTGSAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRTDREHEIPNVLLTLERMSKLGL
Ga0210379_1005909823300021081Groundwater SedimentVTHRIGRIWIRRTGSAYDVMFAPAGNGEFRLATAAGLRAFLWEATIPAERIDDALDALRHDTEHEIPDVMLTLERMSKLGL
Ga0210379_1022136913300021081Groundwater SedimentVTHRVGRIWIRRTGLVYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Ga0210379_1041123713300021081Groundwater SedimentFRIKTEDGRRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Ga0193719_1034919013300021344SoilGRSGAAYADFPLTRDHLLAEQDAGRSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVFLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMSKLGL
Ga0193719_1040551713300021344SoilRRTGGKTYDVMFRPYGNGEFRIATENGLRAFLWGATIPAERIEEAVAALRGDTEHEIPNVVLTLERMSKLGL
Ga0224452_108412743300022534Groundwater SedimentMAYDVMFAPTGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRTDTEHEISNVVLTLN
Ga0224452_112615923300022534Groundwater SedimentTGNGECRIATAAGLRAFLWEATIPPERIDEALDARRHDTEHEIPNVMLTLDRMSKLGL
Ga0222623_1001869143300022694Groundwater SedimentHRVGRIWIRRTGMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLEPGA
Ga0222623_1006272723300022694Groundwater SedimentMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRTDTEHEIPDVMLTLERLSKLGL
Ga0222623_1017303343300022694Groundwater SedimentVTHRLGRIWIRRTGMAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIRNVVLT
Ga0209640_1001545943300025324SoilVTHREGRIWIRRTGMAYDVMFAPTGNGEFRIKTEAGLRAFLWDATIPADRIEEAVAALRGDTEHEIRNVVLTLERMSKLGL
Ga0207684_1007945873300025910Corn, Switchgrass And Miscanthus RhizosphereVTHRLGRLWIRRVGMVYDVMLAPTGNGEFRIKTEDGLRAFLWGATIPADRIEQAVAALRSDTEHEIPNVVLTLEQMSRLGL
Ga0207707_1145888023300025912Corn RhizosphereGRSLLAPPEVPLTPAPLPGERGTGWRGAVTYRIGRICIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPNVRLTLERMSKLGL
Ga0207646_1018203123300025922Corn, Switchgrass And Miscanthus RhizosphereVQRRAVTHRLGRIWIRRTDRAYDVMFRPTGNGEFRSTTEDGLRAFLWGATIPAERIEEAVAALRDEMELEIPDVVLTLERMSKLGL
Ga0257170_106860023300026351SoilVTHRVGRIWIRRTGMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLSLERMSKLGL
Ga0257166_100475043300026358SoilRRRAVTHRVGRIWIRRTGMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Ga0257177_102432643300026480SoilMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Ga0257165_108713813300026507SoilARPARRRPCGGRRCVGAARAGRSRPAHAGFPLTHDHLLAQQDTGRRRAVTHRVGRIWIRRTGSAYDVMFAPAGNGEFRLATAAGLRAFLWEATIPAERIDEALDALRTDTEHEIRNVMLTLERMSKLGL
Ga0209869_100771623300027187Groundwater SandYDVMFRPTGNGEFRIKTEDGLRALLWGATIPAERIEEAVAALRNDTEHEIPNVVLTLERMSKLGL
Ga0209684_101466233300027527Tropical Forest SoilMGRLWIRRTGPSTYDVMFRPSGNGEFRVATEDGLRALLWGATVPSDRIEQAILALRKETEHEIHDVVLTLERMSKLRL
Ga0256866_104745253300027650SoilVTHRLGRIWIRRTGMAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPDVMLTLERMSKLGL
Ga0256866_106522823300027650SoilMFAPTGNGEFRIATAAGLRAFLWEATMPPERIDEALDALRHDTEHEIPNVMLTLERMSKLGL
Ga0209726_1024609613300027815GroundwaterMAYDVMFAPTGNGEFRIASAAGLRAFLWEATIPAERIDEALDALRSDTEHEIPNVMLTLERMSKLGL
Ga0209706_1044818623300027818Freshwater SedimentMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPNVMLTLERMSKLGL
Ga0209701_1018767523300027862Vadose Zone SoilMFAPTGNGEFRIKTENGLRAFLWGATIPSERIEEAVAALRNDTEHEIPNVVLTLEQMSQLGL
Ga0209382_1064842113300027909Populus RhizosphereMAYDVMFAPTGNGEFRIASAAGLRAFLWEATIPAERIDDALDALRTDTEHEIPDVRLTLERMSKLGL
Ga0209868_103600023300027947Groundwater SandRISFRVHRRTVTDRRGRIWIRRTGMVYDVMFRPTGNGEFRIKTEDGLRALLWGATIPAERIEEAVAALRNDTEHEIPNVVLTLERMSKLGL
Ga0209859_102239833300027954Groundwater SandMVYDVMFRPTGNGEFRIKTEDGLRALLWGATIPAERIEEAVAALRNDTEHEIPNVVLTLE
Ga0209857_106297323300027957Groundwater SandVSQRVGRIWIRRTGSAYDVMFAPAGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRNDTEHEIPNVMLTLERMSKLGL
Ga0137415_1045211023300028536Vadose Zone SoilGGRGRLGASRAGRPRAAYAEFPLTRDHLLAAQDAGRSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVLLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMRKLGL
Ga0257175_100000373300028673SoilMAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLSLERMSKLGL
Ga0307504_1040612913300028792SoilVTPRVGRIEIRRIGLAYDVMFAPRGNGEFRIKTEPGFRAFLWAVGIPADRIEEAVDALRRDTLHAIPD
Ga0307281_1000776613300028803SoilVTHRQGRIWIRRTGSAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRTDTEHEIPNVRLTLERMSKLGL
Ga0307302_1034702323300028814SoilVTHRVGRIWIRRTGLAYDVMFAPTGNGELRLATAAGLRAFLWEATIEPERIDEALDALRNDTEHEIPVRLTLERMSKLGL
Ga0307296_1028704123300028819SoilMGGAGRSRSTPTESPLSAPLLGEQGTGRRGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVMLTLERMSKLGL
Ga0307296_1074025013300028819SoilLAYDVMFAPTGNGEFRIKTEDGLRALLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMSKLGL
Ga0307310_1044341823300028824SoilIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRAFLWGATIPAERIEEAVAALRSDTEHEIPNVVLTLERMSKLGL
Ga0307312_1047613223300028828SoilAYAEFPLTRDHLLAEQDAGRSGAVTHRIGRIWIRRTGLAYDVMFAPTGNGEFRIKTEDGLRVFLWGASIPAERIEEAVVALRRETEHEIPNVVLTLERMSKLGL
Ga0299907_1082506623300030006SoilMVYDVMFAPAGNGEFRIKTEDGLRALLWGASIPAERIEEAVAALRTDTEHEIPNVVLTLERLSKLGL
Ga0299907_1098084033300030006SoilLTRSPLPGEQGIGRIGAVTHRIGRIWIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEIPNVMLTLERM
Ga0299907_1126593113300030006SoilYDVMFAPTGNGECRIATAAGLRAFLWEATIPPERIDEALDALRTDTEHEIPNVLLTLERMSKLGL
Ga0268386_10005315133300030619SoilVTHRVGRLWIRRTGLAFDVMFAPTGNGEFRIVTAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVMLTLERMSKLGL
Ga0268386_1026584223300030619SoilVAHRLGRIWIRRTGMAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPDVMLTLERMSKLGL
Ga0268386_1062326733300030619SoilLTRSPLPGEQGIGRIGAVTHRIGRIWIRRTALAYDVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRHDTEHEILNVMLTLERMSKLGL
Ga0302046_1027197133300030620SoilMFAPAGNGEFRLATAAGLRAFLWEATIPADRIDEALEALRTDTEHEIRNVRLTLERMSKLGL
Ga0299913_1062422213300031229SoilMAYDVMFAPTGNGEFRIATAGGLRAFLWEATIPAERIDEALDALRTDTEHEIPNVLLTLERMSKLGL
Ga0299913_1120405813300031229SoilVMFAPTGNGEFRIATAAGLRAFLWEATIPPERIDEALDALRQDTEHEIPDVMLTLERMSKLGL
(restricted) Ga0255312_101310353300031248Sandy SoilMAYDVMFAPTGNGEFRIKTEDGLRAFLWGASIPAERIEEAVAALRHDAEHEIPDVVLTLERMSKLGL
Ga0307468_10010787433300031740Hardwood Forest SoilLIPAPLSGDRGTGRRGAVTYRVGRICIRRTGLAYDVMFAPTGNGEFHIATAAGLRAFLWEATIPPERIDEALDALRHGTEHEIHNVMLTLERMSKLGL
Ga0307471_10066571813300032180Hardwood Forest SoilYDVMFAPTGNGEFHIATAAGLRAFLWEATIPPERIDEALDALRHGTEHEIHNVMLTLERMSKLGL
Ga0364924_149006_1_1683300033811SedimentGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTEHEIPNVMLTLERMSKLGL
Ga0370498_058216_660_8483300034155Untreated Peat SoilMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDEALDALRHDTDHEILNVALTLDRMSKLGL
Ga0370498_184852_49_2373300034155Untreated Peat SoilMFAPTGNGECRIATAAGLRAFLWEATIPAERIDETLDALRHDRDHEILNVALTLDRMSKLGL
Ga0364940_0107701_475_6783300034164SedimentMAYDVMFAPTGNGEFRIKTEDGLRAFLWDATIPAERIEEAVAALRRDTEHEIPNVVLTLERMSKLGL
Ga0370495_0025657_1405_15933300034257Untreated Peat SoilMFAPAGNGEVRIKTEDGLHALLWGASIPAERIEEAVAALRHDTEHEIPNVTLTLERLSKLGL
Ga0370495_0132787_531_7193300034257Untreated Peat SoilMFAPTGNGEFRIATAAGLRAFLWEATIPAERIDKALDALGHDTEHEILNVALTLERMSKLGL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.