Skip to main content
Article
Combining word embedding with information retrieval to recommend similar bug reports
ISSRE 2016: Proceedings of the 27th IEEE International Symposium on Software Reliability Engineering: Ottawa, October 23-27, 2016
  • Xinli YANG, Zhejiang University
  • David LO, Singapore Management University
  • Xin XIA, Zhejiang University
  • Lingfeng BAO, Zhejiang University
  • Jianling SUN, Zhejiang University
Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
10-2016
Abstract

Similar bugs are bugs that require handling of many common code files. Developers can often fix similar bugs with a shorter time and a higher quality since they can focus on fewer code files. Therefore, similar bug recommendation is a meaningful task which can improve development efficiency. Rocha et al. propose the first similar bug recommendation system named NextBug. Although NextBug performs better than a start-of-the-art duplicated bug detection technique REP, its performance is not optimal and thus more work is needed to improve its effectiveness. Technically, it is also rather simple as it relies only upon a standard information retrieval technique, i.e., cosine similarity. In the paper, we propose a novel approach to recommend similar bugs. The approach combines a traditional information retrieval technique and a word embedding technique, and takes bug titles and descriptions as well as bug product and component information into consideration. To evaluate the approach, we use datasets from two popular open-source projects, i.e., Eclipse and Mozilla, each of which contains bug reports whose bug ids range from [1,400000]. The results show that our approach improves the performance of NextBug statistically significantly and substantially for both projects.

Keywords
  • Information Retrieval,
  • Recommendation Systems,
  • Similar Bugs,
  • Word Embedding
ISBN
9781467390019
Identifier
10.1109/ISSRE.2016.33
Publisher
IEEE
City or Country
Piscataway, NJ
Copyright Owner and License
Authors
Creative Commons License
Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International
Additional URL
https://doi.org/10.1109/ISSRE.2016.33
Citation Information
Xinli YANG, David LO, Xin XIA, Lingfeng BAO, et al.. "Combining word embedding with information retrieval to recommend similar bug reports" ISSRE 2016: Proceedings of the 27th IEEE International Symposium on Software Reliability Engineering: Ottawa, October 23-27, 2016 (2016) p. 127 - 137
Available at: http://works.bepress.com/david_lo/226/