Skip to main content
Presentation
Robust speaker identification under noisy conditions using feature compensation and signal to noise ratio estimation
2016 IEEE 59th International Midwest Symposium on Circuits and Systems (2016)
  • Megan N. Frankle, Rowan University
  • Ravi P. Ramachandran, Rowan University
Abstract
For wireless remote access security, forensics, electronic commerce and surveillance applications, there is a growing need for biometric speaker identification systems to be robust to noise. This paper examines the robustness issue for the case of additive white noise at signal to noise ratios ranging from 0 to 30 dB. A Gaussian mixture model classifier based on adaptation of a universal background model is used. The system is trained on clean speech and tested on clean and noisy speech. To mitigate the performance loss due to mismatched training and testing conditions, five robust features, feature compensation and decision level fusion strategies are used. The feature compensation is based on blind estimation of the signal to noise ratio of the test speech and the selection of an affine transform among a repertoire. A two-way analysis of variance compares the experimental scenarios (benchmark, control and practical) and the individual features/fusion at each signal to noise ratio. The practical scenario is always statistically better than the benchmark and sometimes equivalent to the control scenario.
Publication Date
October 16, 2016
Location
Abu Dhabi, United Arab Emirates
DOI
10.1109/MWSCAS.2016.7869973
Citation Information
Megan N. Frankle and Ravi P. Ramachandran. "Robust speaker identification under noisy conditions using feature compensation and signal to noise ratio estimation" 2016 IEEE 59th International Midwest Symposium on Circuits and Systems (2016)
Available at: http://works.bepress.com/ravi-ramachandran/3/