Skip to main content
Article
Active refinement of clone anomaly reports
ICSE'12: 34th International Conference on Software Engineering: Proceedings: June 2-9, 2012, Zurich, Switzerland
  • LUCIA, Singapore Management University
  • David LO, Singapore Management University
  • Lingxiao JIANG, Singapore Management University
  • Aditya BUDI, Singapore Management University
Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
6-2012
Abstract

Software clones have been widely studied in the recent literature and shown useful for finding bugs because inconsistent changes among clones in a clone group may indicate potential bugs. However, many inconsistent clone groups are not real bugs (true positives). The excessive number of false positives could easily impede broad adoption of clone-based bug detection approaches. In this work, we aim to improve the usability of clone-based bug detection tools by increasing the rate of true positives found when a developer analyzes anomaly reports. Our idea is to control the number of anomaly reports a user can see at a time and actively incorporate incremental user feedback to continually refine the anomaly reports. Our system first presents top few anomaly reports from the list of reports generated by a tool in its default ordering. Users then either accept or reject each of the reports. Based on the feedback, our system automatically and iteratively refines a classification model for anomalies and re-sorts the rest of the reports. Our goal is to present the true positives to the users earlier than the default ordering. The rationale of the idea is based on our observation that false positives among the inconsistent clone groups could share common features (in terms of code structure, programming patterns, etc.), and these features can be learned from the incremental user feedback. We evaluate our refinement process on three sets of clone-based anomaly reports from three large real programs: the Linux Kernel (C), Eclipse, and ArgoUML (Java), extracted by a clone-based anomaly detection tool. The results show that compared to the original ordering of bug reports, we can improve the rate of true positives found (i.e., true positives are found faster) by 11%, 87%, and 86% for Linux kernel, Eclipse, and ArgoUML, respectively.

ISBN
9781467310673
Identifier
10.1109/ICSE.2012.6227175
Publisher
IEEE
City or Country
Piscataway, NJ
Copyright Owner and License
Authors
Creative Commons License
Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International
Additional URL
http://doi.org/10.1109/ICSE.2012.6227175
Citation Information
LUCIA, David LO, Lingxiao JIANG and Aditya BUDI. "Active refinement of clone anomaly reports" ICSE'12: 34th International Conference on Software Engineering: Proceedings: June 2-9, 2012, Zurich, Switzerland (2012) p. 397 - 407
Available at: http://works.bepress.com/david_lo/76/