Skip to main content
Presentation
On accelerating ultra-large-scale mining
Proceedings of the 39th International Conference on Software Engineering: New Ideas and Emerging Results Track
  • Ganesha Upadhyaya, Iowa State University
  • Hridesh Rajan, Iowa State University
Document Type
Conference Proceeding
Conference
The 14th International Conference on Mining Software Repositories (ICSE-NIER '17)
Publication Version
Accepted Manuscript
Link to Published Version
https://doi.org/10.1109/ICSE-NIER.2017.11
Publication Date
1-1-2017
DOI
10.1109/ICSE-NIER.2017.11
Conference Date
May 20-28, 2017
Geolocation
(-34.6036844, -58.381559100000004)
Abstract

Ultra-large-scale mining has been shown to be useful for a number of software engineering tasks e.g. mining specifications, defect prediction. We propose a new research direction for accelerating ultra-large-scale mining that goes beyond parallelization. Our key idea is to analyze the interaction pattern between the mining task and the artifact to cluster artifacts such that running the mining task on one candidate artifact from each cluster is sufficient to produce results for other artifacts in the same cluster. Our artifact clustering criteria go beyond syntactic, semantic, and functional similarities to mining-task-specific similarity, where the interaction pattern between the mining task and the artifact is used for clustering. Our preliminary evaluation demonstrates that our technique significantly reduces the overall mining time.

Comments

This is a manuscript of a proceeding published as Upadhyaya, Ganesha, and Hridesh Rajan. "On accelerating ultra-large-scale mining." In Proceedings of the 39th International Conference on Software Engineering: New Ideas and Emerging Results Track, pp. 39-42. IEEE Press, 2017. doi: 10.1109/ICSE-NIER.2017.11. Posted with permission.

Rights
© 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Copyright Owner
IEEE Press
Language
en
File Format
application/pdf
Citation Information
Ganesha Upadhyaya and Hridesh Rajan. "On accelerating ultra-large-scale mining" Buenos Aires, ArgentinaProceedings of the 39th International Conference on Software Engineering: New Ideas and Emerging Results Track (2017) p. 39 - 42
Available at: http://works.bepress.com/hridesh-rajan/105/