Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Mining conditional phosphorylation motifs

Published: 01 September 2014 Publication History

Abstract

Phosphorylation motifs represent position-specific amino acid patterns around the phosphorylation sites in the set of phosphopeptides. Several algorithms have been proposed to uncover phosphorylation motifs, whereas the problem of efficiently discovering a set of significant motifs with sufficiently high coverage and non-redundancy still remains unsolved. Here we present a novel notion called conditional phosphorylation motifs. Through this new concept, the motifs whose over-expressiveness mainly benefits from its constituting parts can be filtered out effectively. To discover conditional phosphorylation motifs, we propose an algorithm called C-Motif for a non-redundant identification of significant phosphorylation motifs. C-Motif is implemented under the Apriori framework, and it tests the statistical significance together with the frequency of candidate motifs in a single stage. Experiments demonstrate that C-Motif outperforms some current algorithms such as MMFPh and Motif-All in terms of coverage and non-redundancy of the results and efficiency of the execution. The source code of C-Motif is available at: https://sourceforge. net/projects/cmotif/.

References

[1]
P. Cohen, "The regulation of protein function by multisite phosphorylation-a 25 year update," Trends Biochem. Sci., vol. 25, no. 12, pp. 596-601, 2000.
[2]
G. Manning, G. D. Plowman, T. Hunter, and S. Sudarsanam, "Evolution of protein kinase signaling from yeast to man," Trends Biochem. Sci., vol. 27, no. 10, pp. 514-520, 2002.
[3]
S. B. Ficarro, M. L. McCleland, P. T. Stukenberg, D. J. Burke, M. M. Ross, J. Shabanowitz, D. F. Hunt, and F. M. White, "Phosphoproteome analysis by mass spectrometry and its application to saccharomyces cerevisiae," Nature Biotechnol., vol. 20, no. 3, pp. 301-305, 2002.
[4]
B. E. Turk, "Understanding and exploiting substrate recognition by protein kinases," Current Opinion Chem. Biol., vol. 12, no. 1, pp. 4-10, 2008.
[5]
R. Amanchy, B. Periaswamy, S. Mathivanan, R. Reddy, S. G. Tattikota, and A. Pandey, "A curated compendium of phosphorylation motifs," Nature Biotechnol., vol. 25, no. 3, pp. 285-286, 2007.
[6]
A. N. Kettenbach and S. A. Gerber, "Rapid and reproducible single-stage phosphopeptide enrichment of complex peptide mixtures: Application to general and phosphotyrosine-specific phosphoproteomics experiments," Anal. Chem., vol. 83, no. 20, pp. 7635-7644, 2011.
[7]
Y. Yu, S.-O. Yoon, G. Poulogiannis, Q. Yang, X. M. Ma, J. Villén, N. Kubica, G. R. Hoffman, L. C. Cantley, S. P. Gygi, and J. Blenis, "Phosphoproteomic analysis identifies Grb10 as an mTORC1 substrate that negatively regulates insulin signaling," Science, vol. 332, no. 6035, pp. 1322-1326, 2011.
[8]
S. Matsuoka, B. A. Ballif, A. Smogorzewska, E. R. McDonald, K. E. Hurov, J. Luo, C. E. Bakalarski, Z. Zhao, N. Solimini, Y. Lerenthal, Y. Shiloh, S. P. Gygi, and S. J. Elledge, "ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage," Science, vol. 316, no. 5828, pp. 1160-1166, 2007.
[9]
D. Schwartz and S. P. Gygi, "An iterative statistical approach to the identification of protein phosphorylation motifs from large-scale data sets," Nature Biotechnol., vol. 23, no. 11, pp. 1391-1398, 2005.
[10]
A. Ritz, G. Shakhnarovich, A. R. Salomon, and B. J. Raphael, "Discovery of phosphorylation motif mixtures in phosphoproteomics data," Bioinformatics, vol. 25, no. 1, pp. 14-21, 2009.
[11]
Y.-C. Chen, K. Aguan, C.-W. Yang, Y.-T. Wang, N. R. Pal, and I.-F. Chung, "Discovery of protein phosphorylation motifs through exploratory data analysis," PloS One, vol. 6, no. 5, p. e20025, 2011.
[12]
T. Wang, A. N. Kettenbach, S. A. Gerber, and C. Bailey-Kellogg, "MMFPh: A maximal motif finder for phosphoproteomics datasets," Bioinformatics, vol. 28, no. 12, pp. 1562-1570, 2012.
[13]
Z. He, C. Yang, G. Guo, N. Li, and W. Yu, "Motif-all: Discovering all phosphorylation motifs," BMC Bioinformat., vol. 12, p. S22, 2011.
[14]
Z. He and H. Gong, "Comments on'MMFPh: A maximal motif finder for phosphoproteomics datasets," Bioinformatics, vol. 28, no. 16, pp. 2211-2212, 2012.
[15]
T. Wang, A. N. Kettenbach, S. A. Gerber, and C. Bailey-Kellogg, "Response to 'comments on 'MMFPh: A maximal motif finder for phosphoproteomics datasets," Bioinformatics, vol. 28, no. 16, pp. 2211-2212, 2012.
[16]
P. I. Good, Permutation, Parametric and Bootstrap Tests of Hypotheses, New York, NY, USA: Springer-Verlag, 2005, ch. 9.
[17]
H. Gong and Z. He, "Permutation methods for testing the significance of phosphorylation motifs," Statist. interface, vol. 5, pp. 61-73, 2012.
[18]
L. Ma, T. L. Assimes, N. B. Asadi, C. Iribarren, T. Quertermous, and W. H. Wong, "An almost exhaustive search-based sequential permutation method for detecting epistasis in disease association studies," Genetic Epidemiol., vol. 34, pp. 434-443, 2010.
[19]
R. Agrawal and R. Srikant, "Fast algorithms for mining association rules in large databases," in Proc. 20th Int. Conf. Very Large Data Bases, 1994, pp. 487-499.
[20]
R. J. Bayardo Jr., R. Agrawal, and D. Gunopulos, "Constraint-based rule mining in large, dense databases," Data Mining Knowl. Discovery, vol. 4, no. 2-3, pp. 217-240, 2000.
[21]
H. Dinkel, C. Chica, A. Via, C. M. Gould, L. J. Jensen, T. J. Gibson, and F. Diella, "Phospho. ELM: A database of phosphorylation sites-update 2011," Nucleic Acids Res., vol. 39, no. 1, pp. D261-D267, 2011.
[22]
N. Farriol-Mathis, J. S. Garavelli, B. Boeckmann, S. Duvaud, E. Gasteiger, A. Gateau, A.-L. Veuthey, and A. Bairoch, "Annotation of post-translational modifications in the Swiss-Prot knowledge base," Proteomics, vol. 4, no. 6, pp. 1537-1550, 2004.
[23]
J. Gao, J. J. Thelen, A. K. Dunker, and D. Xu, "Musite, a tool for global prediction of general and kinase-specific phosphorylation sites," Molecular Cellular Proteomics, vol. 9, no. 12, pp. 2586-2600, 2010.
[24]
S. F. Altschul, T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman, "Gapped BLAST and PSI-BLAST: A new generation of protein database search programs," Nucleic Acids Res., vol. 25, no. 17, pp. 3389-3402, 1997.
[25]
H. Gong, X. Liu, J. Wu, and Z. He, "Data construction for phosphorylation site prediction," Briefings Bioinformat., 2013.
[26]
N. Blom, S. Gammeltoft, and S. Brunak, "Sequence and structure-based prediction of eukaryotic protein phosphorylation sites," J. Molecular Biol., vol. 294, no. 5, pp. 1351-1362, 1999.
[27]
T. H. Dang, K. Van Leemput, A. Verschoren, and K. Laukens, "Prediction of kinase-specific phosphorylation sites using conditional random fields," Bioinformatics, vol. 24, no. 24, pp. 2857-2864, 2008.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE/ACM Transactions on Computational Biology and Bioinformatics
IEEE/ACM Transactions on Computational Biology and Bioinformatics  Volume 11, Issue 5
September/October 2014
206 pages
ISSN:1545-5963
  • Editor:
  • Ying Xu
Issue’s Table of Contents

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 01 September 2014
Accepted: 21 April 2014
Revised: 04 April 2014
Received: 09 November 2013
Published in TCBB Volume 11, Issue 5

Author Tags

  1. data mining
  2. frequent pattern
  3. phosphorylation motif
  4. protein phosphorylation

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media