pith. sign in

arxiv: 1704.04769 · v1 · pith:RBTQATOZnew · submitted 2017-04-16 · 💻 cs.SE

Automatic Bug Triage using Semi-Supervised Text Classification

classification 💻 cs.SE
keywords reportsapproachclassifierclassificationlabeledapproachesexistinglabels
0
0 comments X
read the original abstract

In this paper, we propose a semi-supervised text classification approach for bug triage to avoid the deficiency of labeled bug reports in existing supervised approaches. This new approach combines naive Bayes classifier and expectation-maximization to take advantage of both labeled and unlabeled bug reports. This approach trains a classifier with a fraction of labeled bug reports. Then the approach iteratively labels numerous unlabeled bug reports and trains a new classifier with labels of all the bug reports. We also employ a weighted recommendation list to boost the performance by imposing the weights of multiple developers in training the classifier. Experimental results on bug reports of Eclipse show that our new approach outperforms existing supervised approaches in terms of classification accuracy.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Duplicate Bug Report Detection: How Far Are We?

    cs.SE 2022-12 unverdicted novelty 6.0

    A new bias-aware benchmark for duplicate bug report detection shows simpler techniques outperform recent sophisticated methods on most projects and match industry tools.

  2. Not All Bugs Are the Same: Understanding, Characterizing, and Classifying the Root Cause of Bugs

    cs.SE 2019-07 unverdicted novelty 5.0

    Manual analysis of 1,280 bug reports across three ecosystems produces a nine-category root cause taxonomy; an ML classifier achieves 64% F-Measure and 74% AUC-ROC overall.