Towards cross project vulnerability prediction in open source web applications
Proceedings of the The International Conference on Engineering & MIS 2015, 2015•dl.acm.org
Building secure software is challenging, time-consuming, and expensive. Software
vulnerability prediction models that identify vulnerable software components are usually
used to focus security efforts, with the aim of helping to reduce the time and effort needed to
secure software. Existing vulnerability prediction models use process or product metrics and
machine learning techniques to identify vulnerable software components. Cross project
vulnerability prediction plays a significant role in appraising the most likely vulnerable …
vulnerability prediction models that identify vulnerable software components are usually
used to focus security efforts, with the aim of helping to reduce the time and effort needed to
secure software. Existing vulnerability prediction models use process or product metrics and
machine learning techniques to identify vulnerable software components. Cross project
vulnerability prediction plays a significant role in appraising the most likely vulnerable …
Building secure software is challenging, time-consuming, and expensive. Software vulnerability prediction models that identify vulnerable software components are usually used to focus security efforts, with the aim of helping to reduce the time and effort needed to secure software. Existing vulnerability prediction models use process or product metrics and machine learning techniques to identify vulnerable software components. Cross project vulnerability prediction plays a significant role in appraising the most likely vulnerable software components, specifically for new or inactive projects. Little effort has been spent to deliver clear guidelines on how to choose the training data for project vulnerability prediction. In this work, we present an empirical study aiming at clarifying how useful cross project prediction techniques in predicting software vulnerabilities. Our study employs the classification provided by different machine learning techniques to improve the detection of vulnerable components. We have elaborately compared the prediction performance of five well-known classifiers. The study is conducted on a publicly available dataset of several PHP open source web applications and in the context of cross project vulnerability prediction, which represents one of the main challenges in the vulnerability prediction field.
ACM Digital Library