Computation of consensus hydrophobicity scales with self-organizing maps and fuzzy clustering along with applications to protein fold prediction
Abstract
Hydrophobicity is probably the most important property of amino acid that is being often exploited to predict protein folds. There are many amino acid hydrophobicity scales available in the literature. Here we propose two computational approaches based on self-organizing map (SOM) and fuzzy clustering to find some consensus scales. Although SOM and fuzzy clustering produce centroids, we propose new schemes to compute more effective representative scales that exploit the properties of SOM and fuzzy memberships. To demonstrate the utility of the new scales, we apply them to predict the protein folds of a benchmark data set using neural networks. Our experiments show that it is possible to generate useful scales with better utility compared to some existing scales. There can be other applications of the proposed scales.
References
[1]
{1} James C. Bezdek, James M. Keller, Raghu Krishnapuram, and Nikhil R. Pal. Fuzzy Models and Algorithms for Pattern Recognition and Image Processing. Boston, 1999.
[2]
{2} M Charton and BI Charton. The structural dependence of amino acid hydrophobicity parameters. Journal of Theoritical Biology, 99: 629-644, 1982.
[3]
{3} C Chothia and AV Finkelstein. The classification and origin of folding patterns. Anual Rev. Biochem., 59: 1007-1039, 1990.
[4]
{4} JL Cornette, KB Cease, H Margalit, JL Spouge, JA Berzofsky, and C DeLisi. Hydrophobiicty scales and computational techniques for detecting amphiphatic structures in proteins. Journal of Molecular Biology, 195: 659-685, 1987.
[5]
{5} I Dubchuk, I Muchnik, SR Holbrook, and SH Kim. Prediction of protein folding class using global description of amino acid sequence. Proceedings of National Academy of Sciences, USA, 92: 8700-8704, 1995.
[6]
{6} I Dubchuk, I Muchnik, C Mayor, I Dralyuk, and SH Kim. Recognition of a protein fold in the context of the scop classification. PROTEINS: Structure, Function and Genetics, 35: 401-407, 1999.
[7]
{7} Richard A. Johnson and Dean W. Wichern. Applied Multivariate Statistical Analysis. Prentice Hall, 5th edition, 2002.
[8]
{8} Teuvo Kohonen. Self-Organizing Maps, volume 30. Springer Series in Information Sciences, 2nd edition, 1997.
[9]
{9} ID Kuntz. Hydration of macromolecules. J. Amer. Chem. Soc., 93(2): 514-518, 1971.
[10]
{10} J Kyte and RF Dolittle. A simple method for displaying the hydropathic character of proteins. Journal of Molecular Biology, 157: 105-132, 1982.
[11]
{11} Arijit Laha and Nikhil Ranjan Pal. Dynamic generation of prototypes with self-organizing feature maps for classifier design. Pattern Recognition, 34: 315-321, 2001.
[12]
{12} Hao Li, Chao Tang, and Ned Wingreen. Nature of driving force for protein folding-a result from analyzing the statistical potential. Phys. Rev. Lett., 79:765, 1997.
[13]
{13} S Miyazawa and RL Jernigan. Residue-residue potentials with a favo-rable contact pair term and an unfavorable high packing den-sity term, for simulation and threading. Journal of Molecular Biology, 256: 623-644, 1996.
[14]
{14} A Neumaier, W Huyer, and E Bornberg-Bauer. Hydrophobicity analysis of amino acids, world wide web, http://www.mat.univie.ac.at/~neum/software/protein/aminoacids.html.
[15]
{15} Nikhil Ranjan Pal and Debrup Chakraborty. Some new features for protein fold prediction. In ICANN, pages 1176-1183. Springer-Verlag, Heidelberg, 2003.
Index Terms
- Computation of consensus hydrophobicity scales with self-organizing maps and fuzzy clustering along with applications to protein fold prediction
Recommendations
Transmembrane helix prediction in proteins using hydrophobicity properties and higher-order statistics
Prediction of the transmembrane (TM) helices is important in the study of membrane proteins. A novel method to predict the location and length of both single and multiple TM helices in human proteins is presented. The proposed method is based on a ...
Mining of protein contact maps for protein fold prediction
The three-dimensional structure of proteins is useful to carry out the biophysical and biochemical functions in a cell. Approaches to protein structure/fold prediction typically extract amino acid sequence features, and machine learning approaches are ...
Comments
Please enable JavaScript to view thecomments powered by Disqus.Information & Contributors
Information
Published In
Publisher
Dynamic Publishers, Inc.
United States
Publication History
Published: 01 March 2007
Author Tags
Qualifiers
- Article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 0Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Reflects downloads up to 16 Feb 2025