Skip to main content
Article
Holistic Optimization of RNA-seq Workflow for Multi-Threaded Environments
School of Engineering and Technology Publications
  • Ling-Hong Hung
  • Wes Lloyd
  • Radhika Agumbe Sridhar
  • Saranya Devi Ravishankar
  • Yuguang Xiong
  • Eric Sobie
  • Ka Yee Yeung, University of Washington Tacoma
Publication Date
6-14-2018
Document Type
Article
Abstract

For many next-generation sequencing pipelines, the most computationally intensive step is the alignment of reads to a reference sequence. As a result, alignment software such as the Burrows-Wheeler Aligner (BWA) is optimized for speed and and is often executed in parallel on the cloud. However, there are other less demanding steps that can also be optimized and significantly increase the speed especially when using many threads. We demonstrate this using a Unique-molecular-identifier (UMI) RNA sequencing pipeline consisting of 3 steps: split, align, and merge. Optimization of all three steps yields a 40% increase in speed when executed using a single thread. However, when executed using 16 threads, we observe a 4-fold improvement over the original parallel implementation and more than an 8-fold improvement over the original single-threaded implementation. In contrast, optimizing only the alignment step results in just a 13% improvement over the original parallel workflow using 16 threads.

Disciplinary Repository
bioRxiv
DOI
10.1101/345819
Publisher Policy
open access
Citation Information
Ling-Hong Hung, Wes Lloyd, Radhika Agumbe Sridhar, Saranya Devi Ravishankar, et al.. "Holistic Optimization of RNA-seq Workflow for Multi-Threaded Environments" (2018)
Available at: http://works.bepress.com/ky-yeung/12/