Skip to main content
Article
Early prediction of merged code changes to prioritize reviewing tasks
Empirical Software Engineering
  • Yuanrui FAN, Zhejiang University
  • Xin XIA, Monash University
  • David LO, Singapore Management University
  • Shanping LI, Zhejiang University
Publication Type
Journal Article
Version
acceptedVersion
Publication Date
12-2018
Abstract

Modern Code Review (MCR) has been widely used by open source and proprietary software projects. Inspecting code changes consumes reviewers much time and effort since they need to comprehend patches, and many reviewers are often assigned to review many code changes. Note that a code change might be eventually abandoned, which causes waste of time and effort. Thus, a tool that predicts early on whether a code change will be merged can help developers prioritize changes to inspect, accomplish more things given tight schedule, and not waste reviewing effort on low quality changes. In this paper, motivated by the above needs, we build a merged code change prediction tool. Our approach first extracts 34 features from code changes, which are grouped into 5 dimensions: code, file history, owner experience, collaboration network, and text. And then we leverage machine learning techniques such as random forest to build a prediction model. To evaluate the performance of our approach, we conduct experiments on three open source projects (i.e., Eclipse, LibreOffice, and OpenStack), containing a total of 166,215 code changes. Across three datasets, our approach statistically significantly improves random guess classifiers and two prediction models proposed by Jeong et al. (2009) and Gousios et al. (2014) in terms ofseveral evaluation metrics. Besides, we also study the important features which distinguishmerged code changes from abandoned ones.

Keywords
  • Code review,
  • Predictive model,
  • Features
Identifier
10.1007/s10664-018-9602-0
Publisher
Springer
Copyright Owner and License
Authors
Creative Commons License
Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International
Additional URL
https://doi.org/10.1007/s10664-018-9602-0
Citation Information
Yuanrui FAN, Xin XIA, David LO and Shanping LI. "Early prediction of merged code changes to prioritize reviewing tasks" Empirical Software Engineering Vol. 23 Iss. 6 (2018) p. 3346 - 3393 ISSN: 1382-3256
Available at: http://works.bepress.com/david_lo/247/