Nothing Special   »   [go: up one dir, main page]

Protein binding site prediction using an empirical scoring function

Nucleic Acids Res. 2006 Aug 7;34(13):3698-707. doi: 10.1093/nar/gkl454. Print 2006.

Abstract

Most biological processes are mediated by interactions between proteins and their interacting partners including proteins, nucleic acids and small molecules. This work establishes a method called PINUP for binding site prediction of monomeric proteins. With only two weight parameters to optimize, PINUP produces not only 42.2% coverage of actual interfaces (percentage of correctly predicted interface residues in actual interface residues) but also 44.5% accuracy in predicted interfaces (percentage of correctly predicted interface residues in the predicted interface residues) in a cross validation using a 57-protein dataset. By comparison, the expected accuracy via random prediction (percentage of actual interface residues in surface residues) is only 15%. The binding sites of the 57-protein set are found to be easier to predict than that of an independent test set of 68 proteins. The average coverage and accuracy for this independent test set are 30.5 and 29.4%, respectively. The significant gain of PINUP over expected random prediction is attributed to (i) effective residue-energy score and accessible-surface-area-dependent interface-propensity, (ii) isolation of functional constraints contained in the conservation score from the structural constraints through the combination of residue-energy score (for structural constraints) and conservation score and (iii) a consensus region built on top-ranked initial patches.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Amino Acids / chemistry
  • Binding Sites
  • Computational Biology / methods*
  • Empirical Research
  • Models, Molecular
  • Multiprotein Complexes / chemistry*
  • Multiprotein Complexes / metabolism
  • Protein Conformation
  • Software

Substances

  • Amino Acids
  • Multiprotein Complexes