NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F096548


Overview

Basic Information
Family ID: F096548
Family Type: Metagenome
Number of Sequences: 104
Average Sequence Length: 73 residues
Representative Sequence: MTLLPNHALQRTPGFGVQLPGAALVRPAQSRAVLPAMKPGTARAFASRRRAHSRAPGPESLSLGSLGV
Number of Associated Samples: 34
Number of Associated Scaffolds: 98

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 34.62 %
% of genes near scaffold ends (potentially truncated): 54.81 %
% of genes from short scaffolds (< 2000 bps): 72.12 %
Associated GOLD sequencing projects: 34
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.25

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group: Unclassified (62.500 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (40.385 % of family members)
Environment Ontology (ENVO): Unclassified (79.808 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (43.269 % of family members)
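
The percentages reported in this overview are fractions of the family's 104 member sequences, so they can be converted back to approximate member counts. The following is a minimal Python sketch, assuming each percentage was computed as count / 104 and then rounded; the function name `members_from_percent` is hypothetical and not part of NMPFamsDB.

```python
def members_from_percent(percent: float, family_size: int = 104) -> int:
    """Estimate how many family members a reported percentage corresponds to.

    Assumption: the percentage was derived as count / family_size * 100.
    """
    return round(percent / 100 * family_size)


# Percentages as reported on this page for family F096548.
reported = {
    "Unclassified taxonomy": 62.5,
    "Vadose Zone Soil (GOLD)": 40.385,
    "Unclassified (ENVO)": 79.808,
    "Plant rhizosphere (EMPO)": 43.269,
}

for label, percent in reported.items():
    print(f"{label}: about {members_from_percent(percent)} of 104 sequences")
```

Rounding to the nearest integer is a deliberate choice here, because the page shows percentages with only two or three decimals.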




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: No
Secondary Structure distribution: α-helix: 18.75%, β-sheet: 0.00%, Coil/Unstructured: 81.25%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.25

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
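
The low-confidence rule quoted above can be expressed as a simple threshold check. This is a minimal Python sketch, assuming only the cutoff stated on this page (pTM < 0.7); the constant and function names are hypothetical and do not come from NMPFamsDB or AlphaFold2.

```python
# Cutoff quoted on this page for a low-confidence model (assumed to be strict).
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < threshold


# pTM-score reported for family F096548.
print(is_low_confidence(0.25))
```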



Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name              % Frequency in 98 Family Scaffolds
PF05598   DUF772            2.04
PF00717   Peptidase_S24     2.04
PF13676   TIR_2             1.02
PF05257   CHAP              1.02
PF01844   HNH               1.02
PF13495   Phage_int_SAM_4   1.02
PF00550   PP-binding        1.02
PF12680   SnoaL_2           1.02
PF09965   DUF2199           1.02
PF14237   GYF_2             1.02
PF06296   RelE              1.02
PF07661   MORN_2            1.02
PF14137   DUF4304           1.02
PF14300   DUF4375           1.02
PF03372   Exo_endo_phos     1.02
PF01926   MMR_HSR1          1.02
PF01551   Peptidase_M23     1.02
PF13673   Acetyltransf_10   1.02
PF13751   DDE_Tnp_1_6       1.02
PF13751DDE_Tnp_1_6 1.02

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name   Functional Category   % Frequency in 98 Family Scaffolds
COG2849   Antitoxin component YwqK of the YwqJK toxin-antitoxin module   Defense mechanisms [V]   1.02
COG4737   Uncharacterized conserved protein   Function unknown [S]   1.02



Phylogeny

NCBI Taxonomy

Name            Rank   Taxonomy        Distribution
Unclassified    root   N/A             62.50 %
All Organisms   root   All Organisms   37.50 %


Associated Scaffolds


Scaffold  Taxonomy  Length (bp)
3300002835|B570J40625_100128676  All Organisms → cellular organisms → Bacteria  2978
3300005093|Ga0062594_101028771  Not Available  796
3300005187|Ga0066675_11234389  All Organisms → cellular organisms → Bacteria  553
3300005445|Ga0070708_100928876  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → unclassified Chthoniobacterales → Chthoniobacterales bacterium  816
3300005450|Ga0066682_10700387  Not Available  622
3300005544|Ga0070686_100140642  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales  1680
3300005585|Ga0049084_10238821  Not Available  613
3300006059|Ga0075017_100050041  All Organisms → cellular organisms → Bacteria → PVC group  2805
3300006059|Ga0075017_100291252  Not Available  1203
3300009012|Ga0066710_102266178  All Organisms → cellular organisms → Bacteria → Proteobacteria  791
3300009181|Ga0114969_10044067  All Organisms → cellular organisms → Bacteria  3014
3300011271|Ga0137393_10515278  Not Available  1026
3300012096|Ga0137389_10900295  Not Available  759
3300012201|Ga0137365_10826341  Not Available  677
3300012207|Ga0137381_11084738  Not Available  689
3300012349|Ga0137387_11137534  Not Available  554
3300012361|Ga0137360_11242219  Not Available  644
3300012361|Ga0137360_11714768  Not Available  534
3300012362|Ga0137361_10287295  All Organisms → cellular organisms → Bacteria  1504
3300012362|Ga0137361_10325009  Not Available  1410
3300012362|Ga0137361_10432332  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → unclassified Roseimicrobium → Roseimicrobium sp. ORNL1  1209
3300012362|Ga0137361_10440551  Not Available  1197
3300012362|Ga0137361_10642951  Not Available  971
3300012362|Ga0137361_10649309  Not Available  965
3300012362|Ga0137361_10964229  Not Available  772
3300012362|Ga0137361_11074633  Not Available  725
3300012362|Ga0137361_11116835  Not Available  709
3300012362|Ga0137361_11151642  Not Available  697
3300012362|Ga0137361_11241368  Not Available  669
3300012362|Ga0137361_11344161  Not Available  638
3300012362|Ga0137361_11374255  Not Available  630
3300012362|Ga0137361_11478371  Not Available  602
3300012362|Ga0137361_11540692  Not Available  586
3300012362|Ga0137361_11590880  Not Available  574
3300012362|Ga0137361_11928840  Not Available  507
3300012917|Ga0137395_10658869  Not Available  757
3300012917|Ga0137395_10659622  Not Available  757
3300012930|Ga0137407_12111009  Not Available  538
3300014969|Ga0157376_11323370  Not Available  751
3300017656|Ga0134112_10259876  Not Available  691
3300018468|Ga0066662_12786439  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium  518
3300019888|Ga0193751_1036793  All Organisms → cellular organisms → Bacteria  2230
3300019888|Ga0193751_1080442  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Chthoniobacter → Chthoniobacter flavus → Chthoniobacter flavus Ellin428  1303
3300019888|Ga0193751_1177600  All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria  736
3300019888|Ga0193751_1186517  Not Available  708
3300019890|Ga0193728_1185040  Not Available  887
3300019890|Ga0193728_1339609  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → unclassified Parcubacteria group → Parcubacteria group bacterium GW2011_GWC2_39_11  546
3300020016|Ga0193696_1094934  Not Available  769
3300020059|Ga0193745_1002453  All Organisms → cellular organisms → Bacteria → PVC group  3836
3300020059|Ga0193745_1003188  All Organisms → cellular organisms → Bacteria  3420
3300020059|Ga0193745_1003594  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia  3249
3300020059|Ga0193745_1003826  All Organisms → cellular organisms → Bacteria  3158
3300020059|Ga0193745_1003988  All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales  3100
3300020059|Ga0193745_1005406  Not Available  2715
3300020059|Ga0193745_1006049  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiota incertae sedis → Lentimonas  2582
3300020059|Ga0193745_1006049  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiota incertae sedis → Lentimonas  2582
3300020059|Ga0193745_1006121  All Organisms → cellular organisms → Bacteria  2565
3300020059|Ga0193745_1006386  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium  2519
3300020059|Ga0193745_1007270  Not Available  2369
3300020059|Ga0193745_1007270  Not Available  2369
3300020059|Ga0193745_1007807  Not Available  2292
3300020059|Ga0193745_1008185  All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Lacipirellulaceae → Lacipirellula → Lacipirellula limnantheis  2245
3300020059|Ga0193745_1008787  Not Available  2172
3300020059|Ga0193745_1009173  Not Available  2129
3300020059|Ga0193745_1009173  Not Available  2129
3300020059|Ga0193745_1009235  All Organisms → cellular organisms → Bacteria  2121
3300020059|Ga0193745_1013421  Not Available  1775
3300020059|Ga0193745_1016184  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium  1620
3300020059|Ga0193745_1018392  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1521
3300020059|Ga0193745_1018435  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Haloferula → Haloferula luteola  1519
3300020059|Ga0193745_1020357  Not Available  1446
3300020059|Ga0193745_1022404  Not Available  1379
3300020059|Ga0193745_1022809  Not Available  1367
3300020059|Ga0193745_1026972  Not Available  1260
3300020059|Ga0193745_1027076  Not Available  1258
3300020059|Ga0193745_1047683  Not Available  940
3300020059|Ga0193745_1051856  Not Available  898
3300020059|Ga0193745_1057102  All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Pseudanabaenales → Pseudanabaenaceae → Pseudanabaena → unclassified Pseudanabaena → Pseudanabaena sp.  853
3300020059|Ga0193745_1058667  Not Available  841
3300020059|Ga0193745_1061138  Not Available  823
3300020059|Ga0193745_1064576  Not Available  799
3300020059|Ga0193745_1068674  Not Available  774
3300020059|Ga0193745_1074048  Not Available  742
3300021080|Ga0210382_10510197  Not Available  533
3300021086|Ga0179596_10016398  All Organisms → cellular organisms → Bacteria → Thermodesulfobacteria → Thermodesulfobacteria → Thermodesulfobacteriales → Thermodesulfobacteriaceae → Thermosulfurimonas → Thermosulfurimonas dismutans  2582
3300021086|Ga0179596_10016398  All Organisms → cellular organisms → Bacteria → Thermodesulfobacteria → Thermodesulfobacteria → Thermodesulfobacteriales → Thermodesulfobacteriaceae → Thermosulfurimonas → Thermosulfurimonas dismutans  2582
3300021086|Ga0179596_10025387  All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia  2213
3300021086|Ga0179596_10025387  All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia  2213
3300021086|Ga0179596_10026972  Not Available  2165
3300021086|Ga0179596_10028574  Not Available  2122
3300021086|Ga0179596_10028574  Not Available  2122
3300021086|Ga0179596_10163090  Not Available  1067
3300021086|Ga0179596_10174872  All Organisms → cellular organisms → Bacteria  1034
3300021086|Ga0179596_10219108  All Organisms → cellular organisms → Bacteria → Proteobacteria  931
3300021086|Ga0179596_10267597  Not Available  847
3300021086|Ga0179596_10327017  Not Available  767
3300021086|Ga0179596_10412045  Not Available  682
3300021404|Ga0210389_11088914  Not Available  618
3300022208|Ga0224495_10259303  Not Available  710
3300027512|Ga0209179_1018423  Not Available  1333
3300027512|Ga0209179_1110415  All Organisms → cellular organisms → Bacteria  613
3300032050|Ga0315906_10184068  All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales  1983
3300034101|Ga0335027_0302089  All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium  1080
3300034103|Ga0335030_0189051  All Organisms → cellular organisms → Bacteria  1441
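
Each scaffold entry above joins two identifiers with a `|` character: the number before it matches the Taxon OID column of the Associated Samples table, and the part after it names the scaffold. The following is a minimal Python sketch of splitting such an entry, assuming that layout holds for every row; `ScaffoldRef` and `parse_scaffold_ref` are hypothetical names, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # sample identifier (Taxon OID), digits only
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split a 'TaxonOID|ScaffoldID' string into its two parts."""
    taxon_oid, sep, scaffold_id = text.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold reference: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold_ref("3300002835|B570J40625_100128676")
print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the Taxon OID as a string avoids any accidental numeric formatting when it is later matched against the sample table.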




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil  40.38%
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  40.38%
Freshwater  Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater  1.92%
Watersheds  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds  1.92%
Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil  1.92%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  1.92%
Groundwater Sediment  Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment  0.96%
Freshwater  Environmental → Aquatic → Freshwater → Lentic → Epilimnion → Freshwater  0.96%
Freshwater Lentic  Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lentic  0.96%
Freshwater Lake  Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake  0.96%
Freshwater  Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater  0.96%
Sediment  Environmental → Aquatic → Marine → Sediment → Unclassified → Sediment  0.96%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil  0.96%
Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil  0.96%
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  0.96%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  0.96%
Switchgrass Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere  0.96%
Miscanthus Rhizosphere  Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere  0.96%




Associated Samples

Taxon OID  Sample Name  Habitat Type
3300002835  Freshwater microbial communities from Lake Mendota, WI - (Lake Mendota Combined Ray assembly, ASSEMBLY_DATE=20140605)  Environmental
3300005093  Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks  Environmental
3300005187  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124  Environmental
3300005445  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG  Environmental
3300005450  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131  Environmental
3300005544  Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG  Environmental
3300005585  Freshwater lentic microbial communities from great Laurentian Lakes, MI, USA - Great Lakes metaG ON33MSRF  Environmental
3300006059  Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012  Environmental
3300009012  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159  Environmental
3300009181  Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130807_MF_MetaG  Environmental
3300011271  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG  Environmental
3300012096  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG  Environmental
3300012201  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG  Environmental
3300012207  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG  Environmental
3300012349  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG  Environmental
3300012361  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG  Environmental
3300012362  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG  Environmental
3300012917  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG  Environmental
3300012930  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)  Environmental
3300014969  Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaG  Host-Associated
3300017656  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015  Environmental
3300018468  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111  Environmental
3300019888  Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1c2  Environmental
3300019890  Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c1  Environmental
3300020016  Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3m1  Environmental
3300020059  Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1a2  Environmental
3300021080  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo  Environmental
3300021086  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)  Environmental
3300021404  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O  Environmental
3300022208  Sediment microbial communities from San Francisco Bay, California, United States - SF_Jul11_sed_USGS_4_1  Environmental
3300027512  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 (SPAdes)  Environmental
3300032050  Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA122  Environmental
3300034101  Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME19Sep2005-rr0107  Environmental
3300034103  Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME27Sep2002-rr0119  Environmental





Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
B570J40625_10012867613300002835FreshwaterMNETPNHALQRTPGFGVQLPSAALIRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGS
Ga0062594_10102877113300005093SoilVSPFSDSLNQALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLDTLLGSKG*
Ga0066675_1123438913300005187SoilNHALQRTPGFGVQLPSAAVVRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGSLAAC*
Ga0070708_10092887613300005445Corn, Switchgrass And Miscanthus RhizosphereMNTANCKSSKAPNHALQRTPGFGVQLPGAALIRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGS
Ga0066682_1070038713300005450SoilPNHALQRTPGFGVQLPGAAVVRPAQSRAVLPALKPGTARAFASRRRAHTRAPGPESLSLGSLGVFTRLSR*
Ga0070686_10014064263300005544Switchgrass RhizosphereMAAPMTATPNQALQRTPGFGVQLPGAAVVRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGSLG
Ga0049084_1023882123300005585Freshwater LenticMTATPNHALQRTPGFGVQLPSAALIRPAQSRAVLPAMKPGTARAFALRRRAQNRAPGPESLSLGSLGVA
Ga0075017_10005004123300006059WatershedsMTPNHALQRTPGFGVQLPGAALVRPAQSRAVRLAMKPGTARAFASRRRAPMRAPGPESLSLGSLGVATRLL*
Ga0075017_10029125223300006059WatershedsMRPETPNHALQRTPGFGVQLPGAAFVRPAQSRAVRPAMKPGTARAFALRRRAHTRAPGPESLSLGSLGVFTHSEK*
Ga0066710_10226617823300009012Grasslands SoilMITPPNHALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRRAHTLAPGPESLSLGSLGATFAFAFQTKP
Ga0114969_1004406743300009181Freshwater LakeMDATPNHALQRTPGFGVQLPRAALVRPAQSRAVRPAMKPGTARAFASRRRAHSRAPGPESLSLGSLGVATRFL*
Ga0137393_1051527813300011271Vadose Zone SoilLQSLTTSVISLASRRQTKPATPNHALQRTPGFGVQLPSAAVVRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGVVARLSW*
Ga0137389_1090029513300012096Vadose Zone SoilSSSCTLTTRANTDESATPNHALQRTPGFGVQLPSAAVVRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGVFTRLSR*
Ga0137365_1082634113300012201Vadose Zone SoilMKTPNHALQRTPGFGVQLPSAALIRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGS
Ga0137381_1108473823300012207Vadose Zone SoilHALQRTPGFGVQLPGAAVVRPSQSRAVLPAMKPGTARAFALRRRSHIRAPGPESLSLGSLGVVAPLFRERSPMKTPNTSVS*
Ga0137387_1113753413300012349Vadose Zone SoilMLSTQTATPNHALQRTPGFGVQLPSAALVRPAQSRAVLPAMKPSTARACASRRRAHSRVPGPESLSLG
Ga0137360_1124221933300012361Vadose Zone SoilLQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRRAHSRVPGPESLSFGSLGVARIIPPMSAEP*
Ga0137360_1171476813300012361Vadose Zone SoilLQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRRAHRRAPGPESLSLGSLGDFTHAIQ*
Ga0137361_1028729533300012362Vadose Zone SoilMKLRHEQRPNHALQRTPGFGVQLPGAAVIRPAQSRAVLPAMKPGTARAFALRRRAHSRVPGPESLSLGSLGVATRIC*
Ga0137361_1032500933300012362Vadose Zone SoilMTQLQMTATPNHALQRTPGFGVQLPGTAFVRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGVARVF*
Ga0137361_1043233223300012362Vadose Zone SoilMTATPNHALQRTPGFGVQFPSAAVARPAQSRAVRPAMKPGTARAFASRRRALIRAPGPESLSLGSLGVARVIP*
Ga0137361_1044055123300012362Vadose Zone SoilMTAATPNHALQRTPGFGVQLPGAALVRPAQSRAVLPAMKPGTARAFALRRRAHSRVPGPESLSFGSLGGFTS*
Ga0137361_1064295113300012362Vadose Zone SoilMQTNQTPNHALQRTPGFGVQLPSAALVRPTQSRVVLPAMKPGTACAFALRRRAHGRAPGPES
Ga0137361_1064930913300012362Vadose Zone SoilMTLLPNHALQRTPGFGVQLPGAALVRPAQSRAVLPAMKPGTARAFASRRRAHSRAPGPESLSLGSLGV
Ga0137361_1096422913300012362Vadose Zone SoilTPNHALQRTPGFGVQLPSAAVVRPAQSRAVRPAMKPGTARAFALRRRAPSRAPGPESLSLGSLGDTA*
Ga0137361_1107463313300012362Vadose Zone SoilMDTHLTPEPATPNHALQRTPGFGVQLPGAALVRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGP
Ga0137361_1111683523300012362Vadose Zone SoilMTATPNHALQRTPGFGVQLPRAAVVRPAPSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGHF
Ga0137361_1115164213300012362Vadose Zone SoilMSPNLPLNDLLPTPATPNHALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGS
Ga0137361_1124136813300012362Vadose Zone SoilLQRTPGFGVQLPSAAVVRPAQSRAVLPAMKPGTARAFALRRYASMRAPGPESLSLGSLGVARVIQH*
Ga0137361_1134416113300012362Vadose Zone SoilPGFGVQLPSAALIRPAQSRAVRPAMKPGTARAFALRRRAHTRAPGPESLSLGPLGDFARAIQ*
Ga0137361_1137425513300012362Vadose Zone SoilMTATPNHALQRTPGFGVQLPSAALIRPAQSRAVLPTMKPGTARAFALRRRAHGRAPGPESLSLGSLGVARVHR
Ga0137361_1147837133300012362Vadose Zone SoilAVVRPAQSRAVRPAMKPGTARAFALRRRAHSRVPGPESLSFGSLGVARIIPPMSAEP*
Ga0137361_1154069213300012362Vadose Zone SoilTPGFGVQLPTAALIRPAQSRAVLPAMKPGTARAFASRRRAHTRAPGPESLSLGSLGVATRLA*
Ga0137361_1159088013300012362Vadose Zone SoilNHALQRTPGFGVQLPSAAVVRPAQSRAVLPAMKPGTARAFALRRRAHSRVPGPESLSLGSLGVFRAPIHTQII*
Ga0137361_1192884013300012362Vadose Zone SoilSTESNRTPNHALQRTPGFGVQLPSAAVVRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGVFTRLSR*
Ga0137395_1065886923300012917Vadose Zone SoilMNSPATPNHALQRTPGFSVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRRAHRRAPGPESLSLGSL
Ga0137395_1065962233300012917Vadose Zone SoilMPSPALQRTPGFGVQLPSAALVRPAQSRAVLPAMKPGTARAFALRRRAHTRAPGPESL
Ga0137407_1211100913300012930Vadose Zone SoilQRLIRMTATPNHALQRTPGFGVQLPGAALIRPAQSRAVRPALKPGTARAFASRRRALMRAPGPESLSLGSLGVADVLFTNEQ*
Ga0157376_1132337013300014969Miscanthus RhizosphereMTLRNQALQRTPGFGVQLPSAALIRPQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGDSPHAHEYWLPILS*
Ga0134112_1025987613300017656Grasslands SoilHVSTESNRTPNHALQRTPGFGVQLPNAALIRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGVVAPLFRERSP
Ga0066662_1278643923300018468Grasslands SoilMNTATPNHALQRTPGFGVQLPSAAVVRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGAATRT
Ga0193751_103679323300019888SoilMRRPNQALQRTPGFGVQLPNAAVVRPAQSRAVLPTMKPGTARAFASRRRAHTRAPGPESLSLGSLGAC
Ga0193751_108044223300019888SoilMNRFQAPTPNHALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALPRCAHSRVPGPESLSLGSLGVTACLS
Ga0193751_117760013300019888SoilMSDYPRAPNHALQRTPGFGVQLPGAAVVRPAQSRAVLPAMKPGTARAFALRRRAHSRVSG
Ga0193751_118651713300019888SoilMTPTPNHALQRTPGFGVQLPSAAVARPAQSRAVRPAMKPRTARAFALRRRAHTRAPGPGVAELGVV
Ga0193728_118504023300019890SoilALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRRAPMRAPGPESLSLGSLGVATRTV
Ga0193728_133960923300019890SoilPESERTPNQALQRTPGFGVQLPSAGLVRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGDSARIL
Ga0193696_109493413300020016SoilEAVTPNHALQRTPGFGVQLPGAALIRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGVFTRLSR
Ga0193745_100245333300020059SoilMLLLNHTPNQALQRTPGFGVQLPGAALVRPAQSRAVLPAMKPGTARAFALRRRAHSRASGPESLSLGSLGVIARVLE
Ga0193745_100318833300020059SoilMNRPPNTALQRTPGFGVQLPGAAFIRPAQSRAVLPAMKPGTARAFASRRHALMRAPGPESLSFWSLGHYHALCRTHV
Ga0193745_100359443300020059SoilMTSIPITATPNQALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRHAHSRTPGPESLSFGSLGVFTRLSL
Ga0193745_100382643300020059SoilMKHPRRANQALQRTPGFGVQFPGAALIRPAQSRAVLPAMKPGTARAFALRRRAHSRASGPESLSFGSLGVSHTFVLIPSGS
Ga0193745_100398823300020059SoilMTPNNALQRTPGFGVQLPRAALIRPAQSRALLPALKPGTARAFALRRRAPMRAPGPEPLS
Ga0193745_100540623300020059SoilMTNAPNHALQRTPGFGVQLPSAAEVRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGDFAHLP
Ga0193745_100604923300020059SoilMNPSPNHALQRTPGFGVQLPSAVLIRPAQSRAVLPAMKPGTARAFASRRRGHIRVPGPESLSLGSLGA
Ga0193745_100604953300020059SoilMTRQSQTPATPNHALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFASRRGAHSRAPGPESLSLGSLGDTRRLP
Ga0193745_100612123300020059SoilMTRTPPNHALQRTPGFGVQLPSAALIRPAQSRAVLPAMKPGTARAFALRRRAQSRAPGPESLSLGSLGHAAHIL
Ga0193745_100638633300020059SoilMNAPDQALQRTPGFGVQFPGAAVVRPAQSRAVLPARVKGLRPHAAGAPGTARAFALRRRARSRAPGPESLSLGSLGVAPRLL
Ga0193745_100727023300020059SoilMNSNPTSPNHALQRTPGFGVQLPGAALVRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLDGQP
Ga0193745_100727043300020059SoilMNASKNNPPSKSPNHALPRTPGFGVQLPHAALIRPAQSRAVLPAMKPGTARAFASRRRAHSRAPGPESLSLGSLGDFTHLP
Ga0193745_100780723300020059SoilMTATPNQAPQRTPGFGVQLPSAALIRPAQSRAVRPAMKPGTARAFALRRRAHNRAPGPESLSLGSLGDLHP
Ga0193745_100818513300020059SoilMRDPKHQRSNHALQRTPGFGVQLPGAAFVRPAQSRAVLPAMKPGTARAFASRRRAHTRAPGPESLSLRSLGVFTRLSR
Ga0193745_100878733300020059SoilMNQIQEPATPNQALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFASRRRAHHRAPGPESLSLGSLGH
Ga0193745_100917323300020059SoilMRTQPNHALQRTPGFGVQLPSAALIRPAQSRAVRPAMKPGTARAFASRRRAHSRAPGPESLSLGSLGGIRSYGHNREFEATP
Ga0193745_100917353300020059SoilMNQTMTLNHALQRTPGFGVQLPSAALIRPAQSRAVLPAMKPGTARAFASRRRAHRRAPGPESLSLGSLGASP
Ga0193745_100923523300020059SoilMRCSELRAFGVQLPGAAFVRPAQSRAVRPAMKPGTARAFASRRHAHTRAPGPESLSLESLGYHP
Ga0193745_101342133300020059SoilMTPNHALQRTPGFGVQLPGAALIRPAQSRAVLPALKPGTARAFASRRRAHSRAPGPESLSLGSLGDFAHLP
Ga0193745_101618433300020059SoilMTTPLNHALQRTPGFGVQLPRAALVRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLG
Ga0193745_101839223300020059SoilMTALSKFRPKKSYSMNPQTPNHALQRTPGFGVQLPGAALVRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGLLGD
Ga0193745_101843533300020059SoilMEQSKFQSAKLPVFERTSPNQALQRTPGFGVQLPGAALVRPAQSRAVLPPIKPGTARAFALRRRAHTRAPGPESLSLGSLG
Ga0193745_102035713300020059SoilPGFGVQLPGTALVRPAQSRAVLPAMKPGTARAFALRRRAQNRAPGPESLSLGSLGDFRTS
Ga0193745_102240423300020059SoilMTSTETPNHALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGDSDALSTTEANQCS
Ga0193745_102280933300020059SoilMTVWPTNHALQRTPGFGVQLPSAAVVRPAQSRAVRPAMKPGTARAFASRRRAHSRAPGPESLSLGSLGVATRSVF
Ga0193745_102697233300020059SoilMISITTSPNHALQRTPGFGVQLPSAALIRPAQSRAVRPTMKPGTARAFASRRRAQSRAPGPESLSLGSLGDYAQHGPYTRRDSL
Ga0193745_102707613300020059SoilMALAITLPASMHADTKSPNHALQRTPGFGVQLPSAAVVRPAQSRAVRPAMKPGTARAFALRRRAHSRAP
Ga0193745_104768313300020059SoilMIGGIYRKSDSRKWPNHALQRTPGFGVQLPGAALIRPAQSRAVRPAMKPGTARAFASRRRAHRRAPGPESLSLGSLGVIT
Ga0193745_105185623300020059SoilMTATPNHALQRTPGFGVQLPSAALVRPAQSRAVLPAMKPGTARAFASRRRAHSRAPGPE
Ga0193745_105710213300020059SoilMFTQETLASTPPNHALQRTPGFGVQLPRAALIRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPASLS
Ga0193745_105866723300020059SoilSPESERTPNQALQRTPGFGVQLPGAALVRPAQSRAVLPAMKPGTARAFASRRRAQSRAPGPESLSLGSLGVTTSTL
Ga0193745_106113813300020059SoilYKNRPEAGIEPSPNQALQRTPGFGVQLPSAALIRPAQSRAVLPATKPGTARAFASRRRAHSRAPGPESLSLGSLGVRSLL
Ga0193745_106457613300020059SoilWYLSIGSSFRRSVEDWQKLKRLNQALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGVAHV
Ga0193745_106867423300020059SoilMNHRKSPNQALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPGTARAFALHRRAHTRAPGPESLS
Ga0193745_107404813300020059SoilDNSLSHTTSSVLHLLCEFAGPNQALQRTPGFGVQLPSAALIRPAQSRAVLPAMKPGTARAFASRRRAHTRAPGPESLSLGSLGVATRFLITTL
Ga0210382_1051019713300021080Groundwater SedimentTTTPNHALQRTPGFGVQLPSAALIRPAQSRAVLPAMKPGTARAFALPRRAHSRAPGPESLSLGSLGVFTRLSR
Ga0179596_1001639833300021086Vadose Zone SoilMTPNHALQRTPGFGVQLPSAALVRPAQSRAVRHAIKPGTARAFASRRRAHTRAPGPESLSLGSLGEDTI
Ga0179596_1001639863300021086Vadose Zone SoilMRPETPNHALQRTPGFGVQLPGAALNRPAQSRAVLPAMKPGTARAFALRRCAHSRAPGPESLSLGIS
Ga0179596_1002538713300021086Vadose Zone SoilPVSPETPNHALQRTPGFGVQLPGAALIRPAQSRAVLPAMKPSTARAFALRRRAHSRAPGPESLSLGSLGD
Ga0179596_1002538733300021086Vadose Zone SoilMNPNRPSPNHALQRTPGFGVQLPSAALIRPAQSRAVRPAMKPGTARAFALRRRAHSRVPGPESLSFGSLGVATSLP
Ga0179596_1002697223300021086Vadose Zone SoilMTATPNHALQRTPGFGVQLPSAAVIRPAQSRAVRPAMKPGTARTFALRRRAHSRAPGPESLSLGSLGV
Ga0179596_1002857433300021086Vadose Zone SoilMNTATPNQALQRTPGFGVQLPSAALVRPAQSRAVRPAMKPAYSDVAATRLYAGTARAFVLRRRAHIRAPGPESLSLGSLGHSHNRSL
Ga0179596_1002857463300021086Vadose Zone SoilRTPGIGVQLPSAALIRPAQSRAVRPAMKPGTARAFALRRRAHTRAPGPESLSLGSLGDLQAEI
Ga0179596_1016309013300021086Vadose Zone SoilVNNTTPATPNHALQRTPGFGVQLPGAALIRPAQSRAVRPAMKPGTARAFALRRRAHTPRPRPGVAELGVVRPLAPIFP
Ga0179596_1017487223300021086Vadose Zone SoilMNDFFPTPATPNHALQRTPGFGVQLPGAAVVRPAQSRAVLPAMKPGTARAFASRRRAHTRAPGPESLSLGSLGD
Ga0179596_1021910813300021086Vadose Zone SoilMNSTSHPDETPNHALQRTPGFGVQLPGAAVVRPAQSRAVLPAMKPGTARAFASRRRAHTRAPGPESLSLGSLAVASRSV
Ga0179596_1026759723300021086Vadose Zone SoilETPNHALQRTPGFGVQLPGAAFVRPAQSRAVLPAMKPGTARAFALRRRAQSRAPGPESLSLGSLGDYAHP
Ga0179596_1032701723300021086Vadose Zone SoilMTPNHALQRTPGFGVQLPGAGSNPTGTARAFALRRRAHIRVLGPESLSLGSLGVATRLL
Ga0179596_1041204513300021086Vadose Zone SoilPRPNQALQRTPGFGVQLPSAALIRPAQSRAVRPAMKPRPHRYAKHCGRAGTARAFASRRRAHTRAPGPESLSLGSLGVATHTPE
Ga0210389_1108891413300021404SoilMRPETPNHELQRTPGFGVQLPGAALIRPAQSRAALPAMKPGTARAFASRRRAHSRAPGPE
Ga0224495_1025930313300022208SedimentAMEPSTPNHALQRTPGFGVQLPSAALIRPAQSRAVLPAMKPGTARAFASRRRAHTRAPGPESLSLGSLGVFTHSEK
Ga0209179_101842333300027512Vadose Zone SoilMNSESDESPNHALQRTPGFGVQLPSAAVVRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGSLGDTTRLSRERKP
Ga0209179_111041513300027512Vadose Zone SoilMTPALNSTATPNHALQRTPGFGVQLPSAAVVRPAQSRAVLPATKPSTARAFALRRRAHSRAPGPESLSLGSLGV
Ga0315906_1018406833300032050FreshwaterVLGQGVGRHDLKGNNHALQRTPGFGVQFPGAALIRPAQSRAVRPAMKPGTARAFASRRHAHSRAPGPESLSLGSRLATRSL
Ga0335027_0302089_353_5623300034101FreshwaterMNETPNHALQRTPGFGVQLPSAALIRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGSSGRC
Ga0335030_0189051_1141_13623300034103FreshwaterMNETPNHALQRTPGFGVQLPSAALIRPAQSRAVRPAMKPGTARAFALRRRAHSRAPGPESLSLGSFGDAALVP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice