Skip to main content
Article
An Improved Implementation of Parallel Selection on GPUs
In International Symposium on Software Engineering and Applications.Los Angeles.
  • Janche Sang, Cleveland State University
  • Darius Bakunas Milanowski
  • Vernon Rego
  • Chansu Yu, Cleveland State University
Document Type
Conference Proceeding
Publication Date
1-1-2015
Disciplines
Abstract

The computing power of current Graphical Processing Units (GPUs) has increased rapidly over the years. They offer much more computational power than recent CPUs by providing a vast number of simple, data parallel, multithreaded cores. In this paper, we proposed an improved implementation of parallel selection and compare the performance of different parallel selection algorithms on the current generation of NVIDIA GPUs. That is, given a massively large array of elements, we were interested in how we could use a GPU to efficiently select those elements that meet certain criteria and then store them into a target array for further processing. The optimization techniques used and implementation issues encountered are discussed in detail. Furthermore, the experimental results show that our advanced implementation performs an average of 2.88 times faster than Thrust, an open-source parallel algorithms library.

DOI
10.2316/P.2015.829-011
Citation Information
Janche Sang, Darius Bakunas Milanowski, Vernon Rego and Chansu Yu. "An Improved Implementation of Parallel Selection on GPUs" In International Symposium on Software Engineering and Applications.Los Angeles. (2015) p. 829 - 831
Available at: http://works.bepress.com/chansu_yu/31/