Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3178212.3178221acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicsebConference Proceedingsconference-collections
research-article

Empirical Study on Software Bug Prediction

Published: 28 December 2017 Publication History

Abstract

Software defect prediction is a vital research direction in software engineering field. Software defect prediction predicts whether software errors are present in the software by using machine learning analysis on software metrics. It can help software developers to improve the quality of the software. Software defect prediction is usually a binary classification problem, which relies on software metrics and the use of classifiers. There have been many research efforts to improve accuracy in software defect prediction using a variety of classifiers and data preprocessing techniques. However, the "classic classifier validity" and "data preprocessing techniques can enhance the functionality of software defect prediction" has not yet been answered explicitly. Therefore, it is necessary to conduct an empirical analysis to compare these studies. In software defect prediction, the category of interest is a defective module, and the number of defective modules is much less than that of a non-defective module in data. This leads to a category of imbalance problem that reduces the accuracy of the prediction. Therefore, the problem of imbalance is a key problem that needs to be solved in software defect prediction. In this paper, we proposed an experimental model and used the NASA MDP data set to analyze the software defect prediction. Five research questions were defined and analyzed experimentally. In addition to experimental analysis, this paper focuses on the improvement of SMOTE. SMOTE ASMO algorithm has been proposed to overcome the shortcomings of SMOTE.

References

[1]
Norman Fenton, Martin Neil, William Marsh, Peter Hearty, David Marquez, Paul Krause, Rajat Mishra. (2007). Predicting software defects in varying development lifecycles using Bayesian nets, Information and Software Technology, 49(1), 32--43.
[2]
Biwen Li, Beijun Shen, Jun Wang, Yuting Chen, Tao Zhang, Jinshuang Wang. (2014). A Scenario-Based Approach to Predicting Software Defects Using compressed C4.5 Model, Proceeding of IEEE 38th Annual International Computers, Software and Applications Conference (pp406--415).
[3]
Karim O. Elish, Mahmoud O. Elish. (2008). Predicting defect-prone software modules using support vector machines, The Journal of Systems and Software, 81(50), 649--660.
[4]
Kalai Magal. R, Shomona Gracia Jacob. (2015). Improved Random Forest Algorithm for Software Defect Prediction through Data Mining Techniques, International Journal of Computer Applications (0975-8887), 117(23), 18--22.
[5]
Tim Menzies, Jeremy Greenwald, Art Frank. (2007). Data Mining Static Code Attributes to Learn Defect Predictors, IEEE Transactions on Software Engineering, 32(11), 1--12.
[6]
Shivkumar Shivaji, E. James Whitehead, Jr. Ram Akella, Sunghun Kim. (2009). Reducing Features to Improve Bug Prediction, Proceeding of IEEE/ACM International Conference: Automated Software Engineering (pp.600--604).
[7]
Huanjing Wang, Taghi M. Khoshgoftaar, Amri Napolitano (2010). A Comparative Study of Ensemble Feature Selection Techniques for Software Defect, Prediction Proceeding of Ninth International Conference: Machine Learning and Applications (pp135--140)
[8]
Huanjing Wang, Taghi M. Khoshgoftaar, Jason Van Hulse. (2010). A Comparative Study of Threshold-based Selection Techniques, Proceeding of IEEE International Conference: Granular Computing (pp.499--504).
[9]
Ye Xia, Guoying Yan, Qianran Si. (2013). A Study on the Significance of Software Metrics in Defect Prediction, Proceeding of Sixth International Symposium: Computational Intelligence and Design (ISCID) (pp.343--346).
[10]
Ye Xia, Guoying Yan, Xingwei Jiang, Yanyan Yang. (2014). A New Metrics Selection Method for Software Defect Prediction, Proceeding of International Conference: Progress in Informatics & Computing (pp.433--436).

Cited By

View all
  • (2023)Ensemble Classifiers in Software Defect Prediction: A Systematic Literature Review2023 11th International Conference in Software Engineering Research and Innovation (CONISOFT)10.1109/CONISOFT58849.2023.00011(1-8)Online publication date: 6-Nov-2023
  • (2022)Software Quality Prediction Using Machine LearningInternational Journal of Software Innovation10.4018/IJSI.29799710:1(1-35)Online publication date: 1-Apr-2022
  • (2021)The impact of using biased performance metrics on software defect prediction researchInformation and Software Technology10.1016/j.infsof.2021.106664(106664)Online publication date: Jun-2021

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICSEB '17: Proceedings of the 2017 International Conference on Software and e-Business
December 2017
141 pages
ISBN:9781450354882
DOI:10.1145/3178212
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • Wuhan Univ.: Wuhan University, China

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 December 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Classification
  2. Data preprocessing
  3. Defect prediction
  4. SMOTE

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICSEB 2017

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Ensemble Classifiers in Software Defect Prediction: A Systematic Literature Review2023 11th International Conference in Software Engineering Research and Innovation (CONISOFT)10.1109/CONISOFT58849.2023.00011(1-8)Online publication date: 6-Nov-2023
  • (2022)Software Quality Prediction Using Machine LearningInternational Journal of Software Innovation10.4018/IJSI.29799710:1(1-35)Online publication date: 1-Apr-2022
  • (2021)The impact of using biased performance metrics on software defect prediction researchInformation and Software Technology10.1016/j.infsof.2021.106664(106664)Online publication date: Jun-2021

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media