Abstract
With the rapid growth of software scale and complexity, a large number of bug reports are submitted to the bug tracking system. In order to speed up defect repair, these reports need to be accurately classified so that they can be sent to the appropriate developers. However, the existing classification methods only use the text information of the bug report, which leads to their low performance. To solve the above problems, this paper proposes a new automatic classification method of bug reports. The innovation is that when categorizing bug reports, in addition to using the text information of the report, the intention of the report (i.e. “suggestion” or “explanation”) is also considered, thereby improving the performance of the classification. First, we collect bug reports from four ecosystems (Apache, Eclipse, Gentoo, Mozilla) and manually annotate them to construct an experimental data set. Then, we use Natural Language Processing technology to preprocess the data. On this basis, BERT and TF-IDF are used to extract the features of the intention and the multiple text information. Finally, the features are used to train the classifiers. The experimental result on five classifiers (including K-Nearest Neighbor, Naive Bayes, Logistic Regression, Support Vector Machine and Random Forest) show that our proposed method achieves better performance and its F-Measure achieves from 87.3% to 95.5%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Meng, F., Cheng, W., Wang, J.: Semi-supervised software defect prediction model based on tri-training. KSII Trans. Internet Inf. Syst. 15(11), 4028–4042 (2021)
Guo, S., Chen, R., Li, H.: Using knowledge transfer and rough set to predict the severity of android test reports via text mining. Symmetry 9(8), 144–161 (2017)
Yang, G., Min, K., Lee, J.W.: Applying topic modeling and similarity for predicting bug severity in cross projects. KSII Trans. Internet Inf. Syst. 13(3), 1583–1589 (2019)
Kim, S., Zhang, H., Wu, R., Gong, L.: Dealing with noise in defect prediction. In: 2011 33rd International Conference on Software Engineering (ICSE), pp. 481–490. ACM (2011)
Kochhar, P.S., Le, T.D.B., Lo, D.: Dealing with noise in defect prediction. In: 2014 11th Working Conference on Mining Software Repositories (MSR), pp. 296–299. IEEE (2014)
Antoniol, G., Ayari, K., Di, P.M., Khomh, F., Guéhéneuc, Y.G.: Is it a bug or an enhancement? A text-based approach to classify change requests. In: 2008 Conference of the Centre for Advanced Studies on Collaborative Research: Meeting of Minds, pp. 304–318 (2008)
Zhou, Y., Tong, Y., Gu, R., Gall, H.: Combining text mining and data mining for bug report classification. J. Softw.: Evol. Process 28(3), 150–176 (2016)
Lamkanfi, A., Demeyer, S., Giger, E., Goethals, B.: Predicting the severity of a reported bug. In: 2010 7th IEEE/ACM Working Conference on Mining Software Repositories (MSR), pp. 1–10. IEEE (2010)
Tian, Y., Lo, D., Sun, C.: Information retrieval based nearest neighbor classification for fine-grained bug severity prediction. In: 2012 19th Working Conference on Reverse Engineering, pp. 215–224 (2012)
Feng, Y., Chen, Z., Jones, J., Fang, C., Xu, B.: Test report prioritization to assist crowdsourced testing. In: 2015 10th Joint Meeting on Foundations of Software Engineering, pp. 225–236 (2015)
Zhang, T., Chen, Y., Yang, X., Zhu, H.: Approach of bug reports classification based on cost extreme learning machine. J. Softw. 30(5), 1386–1406 (2019)
Yang, X.L., Lo, D., Xia, X., Huang, Q., Sun, J.L.: High-impact bug report identification with imbalanced learning strategies. J. Comput. Sci. Technol. 32(1), 181–198 (2017)
Kukkar, A., Mohana, R.: A supervised bug report classification with incorporate and textual field knowledge. Proc. Comput. Sci. 132, 352–361 (2018)
Zhang, T., Jiang, H., Luo, X., Chen, A.T.: A literature review of research in bug resolution: tasks, challenges and future directions. Comput. J. 59(5), 741–773 (2016)
Chillarege, R., et al.: Orthogonal defect classification-a concept for in-process measurements. IEEE Trans. Softw. Eng. 18(11), 943–956 (1992)
Pingclasai, N., Hata, H., Matsumoto, K.I.: Classifying bug reports to bugs and other requests using topic modelling. In: 2013 20th Asia-Pacific Software Engineering Conference (APSEC), vol. 2, pp. 13–18 (2011)
Menzies, T., Marcus, A.: Automated severity assessment of software defect reports. In: 2008 IEEE International Conference on Software Maintenance (ICSM), pp. 346–355. IEEE (2008)
Sari, G.I.P., Siahaan, D.O.: An attribute selection for severity level determination according to the support vector machine classification result. In: 1st International Conference on Information Systems for Business Competitiveness (ICISBC) (2012)
Zhang, T., Chen, J., Yang, G., Lee, B., Luo, X.: Towards more accurate severity prediction and fixer recommendation of software bugs. J. Syst. Softw. 177(10), 166–184 (2016)
Kukkar, A., Mohana, R., Nayyar, A., Kim, J., Kang, B.G., Chilamkurti, N.: A novel deep-learning-based bug severity classification technique using convolutional neural networks and random forest with boosting. Sensors 19(13), 2943–2964 (2019)
Du, X., Zheng, Z., Xiao, G., Yin, B.: The automatic classification of fault trigger based bug report. In: 2017 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW), pp. 259–265. IEEE (2017)
Tan, L., Liu, C., Li, Z., Wang, X., Zhou, Y., Zhai, C.: Bug characteristics in open source software. Empir. Softw. Eng. 19(6), 1665–1705 (2013). https://doi.org/10.1007/s10664-013-9258-8
Catolino, G., Palomba, F., Zaidman, A., Ferrucci, F.: Not all bugs are the same: understanding, characterizing, and classifying bug types. J. Syst. Softw. 152(10), 165–181 (2019)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805 (2018)
Herzig, K., Just, S., Zeller, A.: It’s not a bug, it’s a feature: how misclassification impacts bug prediction. In: 2013 35th International Conference on Software Engineering (ICSE), pp. 392–401. ACM (2013)
Acknowledgment
This work is supported by the Science and Technology Research Project of the Jilin Provincial Department of Education, “Research on Overtime Risk Assessment and Early Warning Technology of Industrial Control Code” (No. JJKH20210097KJ).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Meng, F., Wang, X., Wang, J., Wang, P. (2022). Automatic Classification of Bug Reports Based on Multiple Text Information and Reports’ Intention. In: Aït-Ameur, Y., Crăciun, F. (eds) Theoretical Aspects of Software Engineering. TASE 2022. Lecture Notes in Computer Science, vol 13299. Springer, Cham. https://doi.org/10.1007/978-3-031-10363-6_9
Download citation
DOI: https://doi.org/10.1007/978-3-031-10363-6_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-10362-9
Online ISBN: 978-3-031-10363-6
eBook Packages: Computer ScienceComputer Science (R0)