Automatic Classification of Bug Reports Based on Multiple Text Information and Reports' Intention

Meng, Fanqi; Wang, Xuesong; Wang, Jingdong; Wang, Peifang

doi:10.1007/978-3-031-10363-6_9

Computer Science > Software Engineering

arXiv:2208.01274 (cs)

[Submitted on 2 Aug 2022]

Title:Automatic Classification of Bug Reports Based on Multiple Text Information and Reports' Intention

Authors:Fanqi Meng, Xuesong Wang, Jingdong Wang, Peifang Wang

View PDF

Abstract:With the rapid growth of software scale and complexity, a large number of bug reports are submitted to the bug tracking system. In order to speed up defect repair, these reports need to be accurately classified so that they can be sent to the appropriate developers. However, the existing classification methods only use the text information of the bug report, which leads to their low performance. To solve the above problems, this paper proposes a new automatic classification method for bug reports. The innovation is that when categorizing bug reports, in addition to using the text information of the report, the intention of the report (i.e. suggestion or explanation) is also considered, thereby improving the performance of the classification. First, we collect bug reports from four ecosystems (Apache, Eclipse, Gentoo, Mozilla) and manually annotate them to construct an experimental data set. Then, we use Natural Language Processing technology to preprocess the data. On this basis, BERT and TF-IDF are used to extract the features of the intention and the multiple text information. Finally, the features are used to train the classifiers. The experimental result on five classifiers (including K-Nearest Neighbor, Naive Bayes, Logistic Regression, Support Vector Machine, and Random Forest) show that our proposed method achieves better performance and its F-Measure achieves from 87.3% to 95.5%.

Subjects:	Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2208.01274 [cs.SE]
	(or arXiv:2208.01274v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2208.01274
Related DOI:	https://doi.org/10.1007/978-3-031-10363-6_9

Submission history

From: Fanqi Meng [view email]
[v1] Tue, 2 Aug 2022 06:44:51 UTC (791 KB)

Computer Science > Software Engineering

Title:Automatic Classification of Bug Reports Based on Multiple Text Information and Reports' Intention

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Automatic Classification of Bug Reports Based on Multiple Text Information and Reports' Intention

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators