Abstract
The interactions between proteins are fundamental to a broad area of biological functions. In this paper, we try to predict protein-protein interactions directly from its amino acid sequence and only one associated physicochemical feature using a Support Vector Machine (SVM). We train a SVM learning system to recognize and predict interactions using a database of known protein interactions. Each amino acid has diverse features such as hydrophobicity, polarity, charge, surface tension, etc. We select only one among these features and combine it to amino acid sequence of interacting proteins. According to the experiments, we get approximately 94% accuracy, 99% precision, and 90% recall in average when using hydrophobicity feature, which is better than the result of previous work using several features simultaneously. Therefore, we can reduce a data size and processing time to 1/n and get a better result than the previous work using n features. When using other features except hydrophobicity, experiment results show approximately 50% accuracy, which is not so good to predict interactions.
This work was supported by grant NO. R01-2003-000-10860-0 from the Basic Research Program of the Korea Science & Engineering Foundation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Alberts, B., Bray, D., Lewis, J., Raff, M., Roberts, K., Watson, J.D.: Molecular Biology of the Cell, 2nd edn., Garland, New York (1989)
Bock, J.R., Gough, D.A.: Predicting protein–protein interactions from primary structure. Bioinformatics 17, 455–460 (2001)
Coward, E.: Shufflet: Shuffling sequences while conserving the k-let counts. Bioinformatics 15, 1058–1059 (1999)
Joachims, T.: Making large-scale support vector machine learning practical. In: Advances in Kernel Methods - Support Vector learning. ch. 11, pp. 169–184. MIT Press, Cambridge (1999)
Anfinsen, C.B.: Principles that govern the folding of protein chains. Science 181, 223–230 (1973)
Joachims, T.: Making large-scale support vector machine learning practical. In: Advances in Kernel Methods-Support Vector Learning. ch. 11, pp. 169–184. MIT Press, Cambridge (1999)
Baldi, P., Brunak, S.: Bioinformatics: the machine learning approach. In: Adaptive Computation and Machine learning, MIT press, Cambridge (1998)
Hecht-Nielsen, R.: Neurocomputing. Addison-Wesley, Reading (1989)
Baldi, P., Brunak, S., Chauvin, Y., Andersen, C.A., Nielsen, H.: Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics 1(6), 412–424 (2000)
Hopp, T.P., Woods, K.R.: Predicting of protein antigenic determinants from amino acid sequences. Proc. Natl. Acad. Sci. USA 7(8), 3824–3828 (1981)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chung, Y., Kim, GM., Hwang, YS., Park, H. (2004). Predicting Protein-Protein Interactions from One Feature Using SVM. In: Orchard, B., Yang, C., Ali, M. (eds) Innovations in Applied Artificial Intelligence. IEA/AIE 2004. Lecture Notes in Computer Science(), vol 3029. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24677-0_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-24677-0_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22007-7
Online ISBN: 978-3-540-24677-0
eBook Packages: Springer Book Archive