Production and test bug report classification based on transfer learning
- Authors
- Ai, Misoo Kim; Kim, Youngkyoung; Lee, Eunseok
- Issue Date
- May-2025
- Publisher
- ELSEVIER
- Keywords
- Empirical study; Software reliability; Information retrieval-based bug localization; Bug report classification; Transfer learning
- Citation
- INFORMATION AND SOFTWARE TECHNOLOGY, v.181
- Indexed
- SCIE
SCOPUS
- Journal Title
- INFORMATION AND SOFTWARE TECHNOLOGY
- Volume
- 181
- URI
- https://scholarx.skku.edu/handle/2021.sw.skku/120800
- DOI
- 10.1016/j.infsof.2025.107685
- ISSN
- 0950-5849
1873-6025
- Abstract
- Context: Recent studies indicate that the classification of production and test bug reports can substantially enhance the accuracy of performance evaluation and the effectiveness of information retrieval-based bug localization (IRBL) for software reliability. Objective: However, manually classifying these bug reports is time-consuming for developers. This study introduces a production and test bug report classification (ProTeC) framework for automatically classifying these reports. Methods: The framework's novelty lies in leveraging a set of production- and test-source files and employing transfer learning to address the issue of insufficient and sparse bug reports in machine-learning applications. The ProTeC framework trains and fine-tunes a source file classifier to develop a bug report classifier by transferring production-test distinguishing knowledge. Results: To validate the effectiveness and general practicality of ProTeC, we conducted large-scale experiments using 2,522 bug reports across 12 machine/deep learning model variations to train an automatic classifier. Our results, on average, demonstrate that ProTeC's macro F1-score is 28.6% higher than that of a bug report-based classifier, and it can improve the mean average precision of IRBL by 17.6%. Conclusion: These positive trends were observed in most model variations, indicating that ProTeC consistently performs well in classifying bug reports regardless of the model used, thereby improving IRBL performance.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Computing and Informatics > Computer Science and Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.