Skip to main content
Article
A Spectral Conversion Approach to Single-Channel Speech Enhancement
Departmental Papers (ESE)
  • Athanasios Mouchtaris, University of Crete
  • Jan Van der Spiegel, University of Pennsylvania
  • Paul Mueller, Corticon, Inc.
  • Panagiotis Tsakalides, University of Crete
Document Type
Journal Article
Date of this Version
5-1-2007
Comments
Copyright 2007 IEEE. Reprinted from IEEE Transactions on Audio, Speech, and Language Processing, Volume 15, Issue 4, May 2007, pages 1180-1193.

This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of the University of Pennsylvania's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.
Abstract
In this paper, a novel method for single-channel speech enhancement is proposed, which is based on a spectral conversion feature denoising approach. Spectral conversion has been applied previously in the context of voice conversion, and has been shown to successfully transform spectral features with particular statistical properties into spectral features that best fit (with the constraint of a piecewise linear transformation) different target statistics. This spectral transformation is applied as an initialization step to two well-known single channel enhancement methods, namely the iterativeWiener filter (IWF) and a particular iterative implementation of the Kalman filter. In both cases, spectral conversion is shown here to provide a significant improvement as opposed to initializations using the spectral features directly from the noisy speech. In essence, the proposed approach allows for applying these two algorithms in a user-centric manner, when "clean" speech training data are available from a particular speaker. The extra step of spectral conversion is shown to offer significant advantages regarding output signal-to-noise ratio (SNR) improvement over the conventional initializations, which can reach 2 dB for the IWF and 6 dB for the Kalman filtering algorithm, for low input SNRs and for white and colored noise, respectively.
Keywords
  • gaussian mixture model (gmm),
  • parameter adaptation,
  • spectral conversion,
  • speech enhancement
Citation Information
Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller and Panagiotis Tsakalides. "A Spectral Conversion Approach to Single-Channel Speech Enhancement" (2007)
Available at: http://works.bepress.com/jan_vanderspiegel/38/