Skip to main content
Article
Copy number variation detection using next generation sequencing read counts
BMC Bioinformatics
  • Heng Wang, Lyman Briggs College
  • Dan Nettleton, Iowa State University
  • Kai Ying, National Institutes of Health
Document Type
Article
Publication Version
Published Version
Publication Date
1-1-2014
DOI
10.1186/1471-2105-15-109
Abstract

Background: A copy number variation (CNV) is a difference between genotypes in the number of copies of a genomic region. Next generation sequencing (NGS) technologies provide sensitive and accurate tools for detecting genomic variations that include CNVs. However, statistical approaches for CNV identification using NGS are limited. We propose a new methodology for detecting CNVs using NGS data. This method (henceforth denoted by m-HMM) is based on a hidden Markov model with emission probabilities that are governed by mixture distributions. We use the Expectation-Maximization (EM) algorithm to estimate the parameters in the model.

Results: A simulation study demonstrates that our proposed m-HMM approach has greater power for detecting copy number gains and losses relative to existing methods. Furthermore, application of our m-HMM to DNA sequencing data from the two maize inbred lines B73 and Mo17 to identify CNVs that may play a role in creating phenotypic differences between these inbred lines provides results concordant with previous array-based efforts to identify CNVs.

Conclusions: The new m-HMM method is a powerful and practical approach for identifying CNVs from NGS data.

Comments

This article is published as Wang, Heng, Dan Nettleton, and Kai Ying. "Copy number variation detection using next generation sequencing read counts." BMC bioinformatics 15 (2014): 109. doi: 10.1186/1471-2105-15-109.

Rights
Works produced by employees of the U.S. Government as part of their official duties are not copyrighted within the U.S. The content of this document is not copyrighted.
Language
en
File Format
application/pdf
Citation Information
Heng Wang, Dan Nettleton and Kai Ying. "Copy number variation detection using next generation sequencing read counts" BMC Bioinformatics Vol. 15 (2014) p. 109
Available at: http://works.bepress.com/dan-nettleton/64/