Abstract
As the representative technology of protein spatial structure exploration, NMR technology provides an unprecedented opportunity for modern life science research. But subsequent large data analysis has become a major problem. It is an important means to study protein structure and functional relationship by known information proteins’ three-dimensional structures to predict the unknown spatial structure of proteins. A method for similarity comparison of 3D protein structures based on Riemannian manifold theory is proposed in this paper. By constructing Cα frames and extracting geometric feature of protein, 3D coordinates of proteins are converted into one dimension sequences with rotation and translation invariance. The Riemann distance is used as the three-dimensional structure similarity degree index. Spatial transformation on protein structure is not needed in this method, which avoiding errors when matching two proteins in the traditional method for registration by the least squares fitting. This method is independent of sequence information completely. It has realistic significance for proteins which do not have a similarity between sequences. Three experiments are designed according to 3 sets of data: proteins of different similarity, ten pairs whose protein structures are more difficult to identify proposed by Fischer, 700 proteins in the HOMSTRAD database. Compared with the traditional method, the experiment results show that the matching accuracy of this method has been greatly enhanced.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Liang, Y.: Structural Biology. Science Press, Ann Arbor (2005)
Peng, Q.S., Hu, M.: Approaches for 3D protein structure similarity comparison-a survey. J. Comput. Aided Des. Comput. Graph. 18(10), 1465–1471 (2006)
Holm, L., Sander, C.: Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233(1), 123–138 (1993)
Shindyalov, I.N., Bourne, P.E.: Protein structure alignment by incremental combinatorial extension(CE) of the optimal path. Protein Eng. 11(9), 739–747 (1998)
Gibrat, J.F., Madej, T., Bryant, S.H.: Surprising similarities in structure comparison. Curr. Opin. Struct. Biol. 6(3), 377–385 (1996)
Levitt, M.: STRUCTAL. A structural alignment program (1994)
Krissinel, E., Henrick, K.: Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr. Sect. D Biol. Crystallogr. 60(12), 2256–2268 (2004)
Zhang, Y., Skolnick, J.: TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 33(7), 2302–2309 (2005)
Louie, A.H., Somorjai, R.L.: Differential geometry of proteins: a structural and dynamical representation of patterns. J. Theor. Biol. 98(2), 189–209 (1982)
Joshi, S.H, Klassen, E., Srivastava, A., et al.: A novel representation for Riemannian analysis of elastic curves in Rn. In: IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, MN, pp. 1–7 (2007)
Klassen, E., Srivastava, A., Mio, W., et al.: Analysis of planar shapes using geodesic paths on shape spaces. IEEE Trans. Pattern Anal. Mach. Intell. 26(3), 372–383 (2004)
Liu, W., Srivastava, A., Zhang, J.: A mathematical framework for protein structure comparison. PLoS Comput. Biol. 7(2), e1001075 (2011)
Berman, H.M., Westbrook, J., Feng, Z., et al.: The protein data bank. Nucleic Acids Res. 28(1), 235–242 (2000)
Hanson, A.J., Thakur, S.: Quaternion maps of global protein structure. J. Mol. Graph. Model. 38, 256–278 (2012)
Ji, Y.Q., Xu, Z.C.: Differential Manifold and Riemannian Geometry. Shaanxi Normal University Press, Xi’an (1996)
Collard, A., Bonnabel, S., Phillips, C., et al.: An anisotropy preserving metric for DTI processing. arXiv:1210.2826 (2012)
Fischer, D., Elofsson, A., Rice, D.W., et al.: Assessing the performance of inverted protein folding methods by means of an extensive benchmark. In: Proceeding of the First Pacific Symposium on Biocomputing, pp. 300–318 (1996)
Murzin, A.G., Brenner, S.E., Hubbard, T., et al.: SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247(4), 536–540 (1995)
Maiorov, V.N., Crippen, G.M.: Significance of root-mean-square deviation in comparing three-dimensional structures of globular proteins. J. Mol. Biol. 235(2), 625–634 (1994)
Zhang, Y., Skolnick, J.: Scoring function for automated assessment of protein structure template quality. Protein Struct. Funct. Bioinf. 57(4), 702–710 (2004)
Shindyalov, I.N., Bourne, P.E.: Protein structure alignment by incremental combinatorial extension(CE) of the optimal path. Protein Eng. 11(9), 739–747 (1998)
Yang, Y., Zhan, J., Zhao, H., et al.: A new size-independent score for pairwise protein structure alignment and its application to structure classification and nucleic-acid binding prediction. Protein Struct. Funct. Bioinf. 80(8), 2080–2088 (2012)
Elofsson, A., Fischer, D., Rice, D.W., et al.: A study of combined structure/sequence profiles. Fold Des. 1(6), 451–461 (1996)
Acknowledgment
The authors thank the members of Machine Learning and Artificial Intelligence Laboratory, School of Computer Science and Technology, Wuhan University of Science and Technology, for their helpful discussion within seminars. This work was supported in part by National Natural Science Foundation of China (No. 61502356).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Fengli, Z., Xiaoli, L. (2017). Similarity Comparison of 3D Protein Structure Based on Riemannian Manifold. In: Huang, DS., Jo, KH., Figueroa-García, J. (eds) Intelligent Computing Theories and Application. ICIC 2017. Lecture Notes in Computer Science(), vol 10362. Springer, Cham. https://doi.org/10.1007/978-3-319-63312-1_34
Download citation
DOI: https://doi.org/10.1007/978-3-319-63312-1_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63311-4
Online ISBN: 978-3-319-63312-1
eBook Packages: Computer ScienceComputer Science (R0)