Skip to main content
Article
Mining Iterative Generators and Representative Rules for Software Specification Discovery
IEEE Transactions on Knowledge and Data Engineering
  • David LO, Singapore Management University
  • Jinyan LI, NTU
  • Limsoon Wong, NUS
  • Siau-Cheng Khoo, NUS
Publication Type
Journal Article
Publication Date
2-2011
Abstract
Billions of dollars are spent annually on software-related cost. It is estimated that up to 45 percent of software cost is due to the difficulty in understanding existing systems when performing maintenance tasks (i.e., adding features, removing bugs, etc.). One of the root causes is that software products often come with poor, incomplete, or even without any documented specifications. In an effort to improve program understanding, Lo et al. have proposed iterative pattern mining which outputs patterns that are repeated frequently within a program trace, or across multiple traces, or both. Frequent iterative patterns reflect frequent program behaviors that likely correspond to software specifications. To reduce the number of patterns and improve the efficiency of the algorithm, Lo et al. have also introduced mining closed iterative patterns, i.e., maximal patterns without any superpattern having the same support. In this paper, to technically deepen research on iterative pattern mining, we introduce mining iterative generators, i.e., minimal patterns without any subpattern having the same support. Iterative generators can be paired with closed patterns to produce a set of rules expressing forward, backward, and in-between temporal constraints among events in one general representation. We refer to these rules as representative rules. A comprehensive performance study shows the efficiency of our approach. A case study on traces of an industrial system shows how iterative generators and closed iterative patterns can be merged to form useful rules shedding light on software design.
Keywords
  • Frequent pattern mining,
  • sequence database,
  • iterative patterns,
  • generators,
  • representative rules,
  • software engineering,
  • reverse engineering,
  • program comprehension.
Identifier
10.1109/TKDE.2010.24
Publisher
IEEE
Additional URL
http://dx.doi.org/10.1109/TKDE.2010.24
Citation Information
David LO, Jinyan LI, Limsoon Wong and Siau-Cheng Khoo. "Mining Iterative Generators and Representative Rules for Software Specification Discovery" IEEE Transactions on Knowledge and Data Engineering Vol. 23 Iss. 2 (2011) p. 282 - 296 ISSN: 1041-4347
Available at: http://works.bepress.com/david_lo/50/