Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2649387.2660837acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Variational Bayesian clustering on protein cavity conformations for detecting influential amino acids

Published: 20 September 2014 Publication History

Abstract

Proteins are large flexible biological molecules and conformational flexibility is a shared challenge in comparisons of protein structure. Many tools have been developed to identify remote homologs in cases where backbone flexibilities are considered. However, these methods require comparisons of structures of more than one proteins, and this is not always available. To assist this process, this paper presents an unsupervised method to predict amino acids that exhibit substantial flexibility to change the binding site when only one protein structure is available. Our method is applied on conformational samples of sequentially nonredundant structures of the serine protease proteins. We observed that influential amino acids can be predicted with high specificities in our whole data set. The results suggest our method as a tool to detect significant side chain motions that affect binding specificity of one protein in the presence of great flexibility.

References

[1]
Akaike, H. A new look at the statistical model identification. Automatic Control, IEEE Transactions on 19, 6 (1974), 716--723.
[2]
Berendsen, H., Postma, J., van Gunsteren, W., and Hermans, J. Intermolecular forces. Pullman, B., Ed.; Reidel Publishing Company: Dordrecht (1981), 331--342.
[3]
Berglund, G. I., Smalas, A. O., Outzen, H., and Willassen, N. P. Purification and characterization of pancreatic elastase from north atlantic salmon (salmo salar). Molecular marine biology and biotechnology 7, 2 (1998), 105--114.
[4]
Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T., Weissig, H., Shindyalov, I. N., and Bourne, P. E. The protein data bank. Nucleic acids research 28, 1 (2000), 235--242.
[5]
Binkowski, T. A., Adamian, L., and Liang, J. Inferring functional relationships of proteins from local sequence and spatial surface patterns. Journal of molecular biology 332, 2 (2003), 505--526.
[6]
Binkowski, T. A., and Joachimiak, A. Protein functional surfaces: global shape matching and local spatial alignments of ligand binding sites. BMC structural biology 8, 1 (2008), 45.
[7]
Birzele, F., Gewehr, J. E., Csaba, G., and Zimmer, R. Vorolign---fast structural alignment using voronoi contacts. Bioinformatics 23, 2 (2007), e205--e211.
[8]
Bishop, C. M., et al. Pattern Recognition and Machine Learning, vol. 1. springer New York, 2006.
[9]
Bryant, D. H., Moll, M., Finn, P. W., and Kavraki, L. E. Combinatorial clustering of residue position subsets predicts inhibitor affinity across the human kinome. PLoS computational biology 9, 6 (2013), e1003087.
[10]
Chen, B., and Bandyopadhyay, S. A statistical model of overlapping volume in ligand binding cavities. In Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on (2011), IEEE, pp. 424--431.
[11]
Chen, B. Y., and Bandyopadhyay, S. VASP-S: A volumetric analysis and statistical model for predicting steric influences on protein-ligand binding specificity. In Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on (2011), IEEE, pp. 22--29.
[12]
Chen, B. Y., Fofanov, V. Y., Bryant, D. H., Dodson, B. D., Kristensen, D. M., Lisewski, A. M., Kimmel, M., Lichtarge, O., and Kavraki, L. E. The MASH pipeline for protein function prediction and an algorithm for the geometric refinement of 3d motifs. Journal of Computational Biology 14, 6 (2007), 791--816.
[13]
Chen, B. Y., and Honig, B. VASP: A volumetric analysis of surface properties yields insights into protein-ligand binding specificity. PLoS computational biology 6, 8 (2010), e1000881.
[14]
Connolly, M. L. Solvent-accessible surfaces of proteins and nucleic acids. Science 221, 4612 (1983), 709--713.
[15]
DeLano, W. L. The pymol molecular graphics system.
[16]
Dhanik, A., and Kavraki, L. E. Protein--ligand interactions: Computational docking. eLS (2001).
[17]
Dundas, J., Ouyang, Z., Tseng, J., Binkowski, A., Turpaz, Y., and Liang, J. CASTp: computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues. Nucleic acids research 34, suppl 2 (2006), W116--W118.
[18]
Gibrat, J.-F., Madej, T., and Bryant, S. H. Surprising similarities in structure comparison. Current opinion in structural biology 6, 3 (1996), 377--385.
[19]
Godshall, B. G., and Chen, B. Y. Improving accuracy in binding site comparison with homology modeling. In Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on (2012), IEEE, pp. 662--669.
[20]
Godshall, B. G., Tang, Y., Yang, W., and Chen, B. Y. An aggregate analysis of many predicted structures to reduce errors in protein structure comparison caused by conformational flexibility. BMC structural biology 13, Suppl 1 (2013), S10.
[21]
Gunasekaran, K., and Nussinov, R. How different are structurally flexible and rigid binding sites? sequence and structural features discriminating proteins that do and do not undergo conformational change upon ligand binding. Journal of molecular biology 365, 1 (2007), 257--273.
[22]
Guo, Z., Kuhlengel, T., Stinson, S., Blumenthal, S., Bandyopadhyay, S. R., and Chen, Y. B. Flexible volumetric comparison of protein cavities can reveal patterns in ligand binding specificity. In Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine (2014), ACM, p. in press.
[23]
Hess, B. P-LINCS: a parallel linear constraint solver for molecular simulation. Journal of Chemical Theory and Computation 4, 1 (Jan. 2008), 116--122.
[24]
Hess, B., Kutzner, C., van der Spoel, D., and Lindahl, E. GROMACS 4: Algorithms for highly efficient, load-balanced, and scalable molecular simulation. Journal of Chemical Theory and Computation 4, 3 (Mar. 2008), 435--447.
[25]
Holm, L., and Sander, C. Mapping the protein universe. Science 273, 5275 (1996), 595--602.
[26]
Jaccard, P. The distribution of the flora in the alpine zone. 1. New phytologist 11, 2 (1912), 37--50.
[27]
Jolliffe, I. Principal Component Analysis. Wiley Online Library, 2005.
[28]
Kinoshita, K., and Nakamura, H. Identification of the ligand binding sites on the molecular surface of proteins. Protein Science 14, 3 (2005), 711--718.
[29]
Konc, J., and Janežič, D. Probis algorithm for detection of structurally similar protein binding sites by local structural alignment. Bioinformatics 26, 9 (2010), 1160--1168.
[30]
Menke, M., Berger, B., and Cowen, L. Matt: local flexibility aids protein multiple structure alignment. PLoS computational biology 4, 1 (2008), e10.
[31]
Morihara, K., and Tsuzuki, H. Comparison of the specificities of various serine proteinases from microorganisms. Archives of biochemistry and biophysics 129, 2 (1969), 620--634.
[32]
Mosca, R., and Schneider, T. R. Rapido: a web server for the alignment of protein structures in the presence of conformational changes. Nucleic acids research 36, suppl 2 (2008), W42--W46.
[33]
Nose, S., and Klein, M. Constant pressure molecular dynamics for molecular systems. Molecular Physics 50, 5 (1983), 1055--1076.
[34]
Nussinov, R., and Wolfson, H. J. Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques. Proceedings of the National Academy of Sciences 88, 23 (1991), 10495--10499.
[35]
Opper, M., and Saad, D. Advanced Mean Field Methods: Theory and Practice. MIT press, 2001.
[36]
Orengo, C. A., and Taylor, W. R. Ssap: sequential structure alignment program for protein structure comparison. Computer methods for macromolecular sequence analysis (1996).
[37]
Parrinello, M., and Rahman, A. Polymorphic transitions in single crystals: A new molecular dynamics method. Journal of Applied physics 52 (1981), 7182.
[38]
Petrey, D., and Honig, B. Grasp2: visualization, surface properties, and electrostatics of macromolecular structures and sequences. Methods in enzymology 374 (2002), 492--509.
[39]
Poirrette, A. R., Artymiuk, P. J., Rice, D. W., and Willett, P. Comparison of protein surfaces using a genetic algorithm. Journal of Computer-Aided Molecular Design 11, 6 (1997), 557--569.
[40]
Rosen, M., Lin, S. L., Wolfson, H., and Nussinov, R. Molecular shape comparisons in searches for active sites and functional similarity. Protein Engineering 11, 4 (1998), 263--277.
[41]
Russell, R. B. Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution. Journal of molecular biology 279, 5 (1998), 1211--1227.
[42]
Schaer, J., and Stone, M. Face traverses and a volume algorithm for polyhedra. In New Results and New Trends in Computer Science. Springer, 1991, pp. 290--297.
[43]
Schechter, I., and Berger, A. On the size of the active site in proteases. i. papain. Biochemical and biophysical research communications 27, 2 (1967), 157--162.
[44]
Shatsky, M., Nussinov, R., and Wolfson, H. J. Flexprot: alignment of flexible protein structures without a predefinition of hinge regions. Journal of Computational Biology 11, 1 (2004), 83--106.
[45]
Shindyalov, I. N., and Bourne, P. E. Protein structure alignment by incremental combinatorial extension (ce) of the optimal path. Protein engineering 11, 9 (1998), 739--747.
[46]
Shotton, D., and Watson, H. Three-dimensional structure of tosyl-elastase. Nature 225 (1970), 811--816.
[47]
Vesterstrøm, J., and Taylor, W. R. Flexible secondary structure based protein structure comparison applied to the detection of circular permutation. Journal of Computational Biology 13, 1 (2006), 43--63.
[48]
Xie, L., and Bourne, P. E. A robust and efficient algorithm for the shape description of protein structures and its application in predicting ligand binding sites. BMC bioinformatics 8, Suppl 4 (2007), S9.
[49]
Yang, A.-S., and Honig, B. An integrated approach to the analysis and modeling of protein sequences and structures. i. protein structural alignment and a quantitative measure for protein structural distance. Journal of molecular biology 301, 3 (2000), 665--678.
[50]
Ye, Y., and Godzik, A. Multiple flexible structure alignment using partial order graphs. Bioinformatics 21, 10 (2005), 2362--2369.

Cited By

View all
  • (2023)Clustering Protein Binding Pockets and Identifying Potential Drug Interactions: A Novel Ligand-Based Featurization MethodJournal of Chemical Information and Modeling10.1021/acs.jcim.3c0072263:21(6655-6666)Online publication date: 17-Oct-2023
  • (2020) Effect of light intensity on digestion and immune responses, plasma cortisol and amino acid composition of Scylla paramamosain during indoor overwintering Aquaculture Research10.1111/are.14836Online publication date: 26-Aug-2020
  • (2016)A map of binding cavity conformations reveals differences in binding specificity2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM.2016.7822486(13-19)Online publication date: Dec-2016
  • Show More Cited By

Index Terms

  1. Variational Bayesian clustering on protein cavity conformations for detecting influential amino acids

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        BCB '14: Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
        September 2014
        851 pages
        ISBN:9781450328944
        DOI:10.1145/2649387
        • General Chairs:
        • Pierre Baldi,
        • Wei Wang
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 20 September 2014

        Permissions

        Request permissions for this article.

        Check for updates

        Qualifiers

        • Research-article

        Funding Sources

        Conference

        BCB '14
        Sponsor:
        BCB '14: ACM-BCB '14
        September 20 - 23, 2014
        California, Newport Beach

        Acceptance Rates

        Overall Acceptance Rate 254 of 885 submissions, 29%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 28 Nov 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)Clustering Protein Binding Pockets and Identifying Potential Drug Interactions: A Novel Ligand-Based Featurization MethodJournal of Chemical Information and Modeling10.1021/acs.jcim.3c0072263:21(6655-6666)Online publication date: 17-Oct-2023
        • (2020) Effect of light intensity on digestion and immune responses, plasma cortisol and amino acid composition of Scylla paramamosain during indoor overwintering Aquaculture Research10.1111/are.14836Online publication date: 26-Aug-2020
        • (2016)A map of binding cavity conformations reveals differences in binding specificity2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM.2016.7822486(13-19)Online publication date: Dec-2016
        • (2015)Superposition of protein structures using electrostatic isopotentialsProceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM.2015.7359658(75-82)Online publication date: 9-Nov-2015
        • (2015)Conformational Sampling Reveals Amino Acids with a Steric Influence on SpecificityJournal of Computational Biology10.1089/cmb.2015.011722:9(861-875)Online publication date: Sep-2015

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media