Automatic Classification of Bug Reports Based on Multiple Text Information and Reports’ Intention

Fanqi Meng, Xuesong Wang, Jingdong Wang & Peifang Wang

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13299)

Included in the following conference series:

International Symposium on Theoretical Aspects of Software Engineering

Abstract

With the rapid growth of software scale and complexity, a large number of bug reports are submitted to bug tracking systems. To speed up defect repair, these reports need to be classified accurately so that they can be sent to the appropriate developers. However, existing classification methods use only the text information of the bug report, which limits their performance. To address this problem, this paper proposes a new method for automatically classifying bug reports. The innovation is that, in addition to the text information of the report, the method also considers the intention of the report (i.e., “suggestion” or “explanation”), thereby improving classification performance. First, we collect bug reports from four ecosystems (Apache, Eclipse, Gentoo, Mozilla) and manually annotate them to construct an experimental data set. Then, we use natural language processing techniques to preprocess the data. On this basis, BERT and TF-IDF are used to extract features of the intention and the multiple text information. Finally, the features are used to train the classifiers. Experimental results on five classifiers (K-Nearest Neighbor, Naive Bayes, Logistic Regression, Support Vector Machine and Random Forest) show that the proposed method achieves better performance, with F-Measure ranging from 87.3% to 95.5%.
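The pipeline outlined in the abstract (text features plus an intention feature, fed to a classifier and scored by F-Measure) can be illustrated with a small sketch. This is not the authors' implementation: it uses only the Python standard library, replaces the BERT-based intention features with a one-hot encoding of an already-annotated intention label, and stands in a 1-nearest-neighbor rule for the five classifiers. The field names (`summary`, `description`, `intention`, `label`), the category labels (`crash`, `ui`) and the toy reports are all hypothetical.

```python
import math
from collections import Counter

INTENTIONS = ("suggestion", "explanation")  # the two intentions named in the abstract

def tokenize(text):
    # Lowercase and split on whitespace; a real pipeline would also
    # remove stop words and stem or lemmatize.
    return text.lower().split()

def fit_idf(docs):
    # Smoothed inverse document frequency computed over the training documents.
    n = len(docs)
    df = Counter(term for doc in docs for term in set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}

def report_text(report):
    # Combine multiple text fields of one report (hypothetical field names).
    return report["summary"] + " " + report["description"]

def features(report, idf, vocab):
    # TF-IDF vector of the text, concatenated with a one-hot intention vector.
    # Assumption: the one-hot vector replaces the paper's BERT-based features.
    counts = Counter(tokenize(report_text(report)))
    tfidf = [counts[term] * idf[term] for term in vocab]
    onehot = [1.0 if report["intention"] == name else 0.0 for name in INTENTIONS]
    return tfidf + onehot

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def predict_1nn(x, train_X, train_y):
    # Return the label of the most similar training report (K-Nearest Neighbor, k = 1).
    best = max(range(len(train_X)), key=lambda i: cosine(x, train_X[i]))
    return train_y[best]

def f_measure(y_true, y_pred, positive):
    # Harmonic mean of precision and recall for one class.
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0

# Toy, hand-written reports (hypothetical data, not from the paper's data set).
train = [
    {"summary": "crash on startup", "description": "app crashes when opening file",
     "intention": "explanation", "label": "crash"},
    {"summary": "add dark mode", "description": "please add a dark theme option",
     "intention": "suggestion", "label": "ui"},
]
idf = fit_idf([report_text(r) for r in train])
vocab = sorted(idf)
train_X = [features(r, idf, vocab) for r in train]
train_y = [r["label"] for r in train]

query = {"summary": "crash when saving", "description": "app crashes on save",
         "intention": "explanation"}
predicted = predict_1nn(features(query, idf, vocab), train_X, train_y)
```

Here the query shares both vocabulary and intention with the first training report only, so `predicted` is that report's label. Concatenating the two feature groups is the simplest way to let one classifier see both signals; how the paper actually combines them is not stated in the abstract.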

Acknowledgment

This work is supported by the Science and Technology Research Project of the Jilin Provincial Department of Education, “Research on Overtime Risk Assessment and Early Warning Technology of Industrial Control Code” (No. JJKH20210097KJ).

Author information

Authors and Affiliations

School of Computer, Northeast Electric Power University, Jilin City, Jilin, China
Fanqi Meng, Xuesong Wang, Jingdong Wang & Peifang Wang

Corresponding authors

Correspondence to Xuesong Wang or Jingdong Wang.

Editor information

Editors and Affiliations

IRIT, Toulouse, France
Yamine Aït-Ameur
Babeș-Bolyai University, Cluj-Napoca, Romania
Florin Crăciun

About this paper

Cite this paper

Meng, F., Wang, X., Wang, J., Wang, P. (2022). Automatic Classification of Bug Reports Based on Multiple Text Information and Reports’ Intention. In: Aït-Ameur, Y., Crăciun, F. (eds) Theoretical Aspects of Software Engineering. TASE 2022. Lecture Notes in Computer Science, vol 13299. Springer, Cham. https://doi.org/10.1007/978-3-031-10363-6_9

DOI: https://doi.org/10.1007/978-3-031-10363-6_9
Published: 03 July 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-10362-9
Online ISBN: 978-3-031-10363-6
eBook Packages: Computer Science, Computer Science (R0)
