Skip to main content
Article
Automatic defect categorization
WCRE 2012: Proceedings of the 19th Working Conference on Reverse Engineering, Kingston, Ontario, 15-18 October 2012
  • Ferdian THUNG, Singapore Management University
  • David LO, Singapore Management University
  • Lingxiao JIANG, Singapore Management University
Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
10-2012
Abstract

Defects are prevalent in software systems. In order to understand defects better, industry practitioners often categorize bugs into various types. One common kind of categorization is the IBM’s Orthogonal Defect Classification (ODC). ODC proposes various orthogonal classification of defects based on much information about the defects, such as the symptoms and semantics of the defects, the root cause analysis of the defects, and many more. With these category labels, developers can better perform post-mortem analysis to find out what the common characteristics of the defects that plague a particular software project are. Albeit the benefits of having these categories, for many software systems, these category labels are often missing. To address this problem, we propose a text mining solution that can categorize defects into various types by analyzing both texts from bug reports and code features from bug fixes. To this end, we have manually analyzed the data about 500 defects from three software systems, and classified them according to ODC. In addition, we propose a classification-based approach that can automatically classify defects into three supercategories that are comprised of ODC categories: control and data flow, structural, and non-functional. Our empirical evaluation shows that the automatic classification approach is able to label defects with an average accuracy of 77.8% by using the SVM multiclass classification algorithm.

ISBN
9781467345361
Identifier
10.1109/WCRE.2012.30
Publisher
IEEE Computer Society
City or Country
Los Alamitos, CA
Copyright Owner and License
Authors
Creative Commons License
Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International
Additional URL
http://doi.org/10.1109/WCRE.2012.30
Citation Information
Ferdian THUNG, David LO and Lingxiao JIANG. "Automatic defect categorization" WCRE 2012: Proceedings of the 19th Working Conference on Reverse Engineering, Kingston, Ontario, 15-18 October 2012 (2012) p. 205 - 214
Available at: http://works.bepress.com/david_lo/108/