Auditory-based speech processing based on the average localized synchrony detection
Article comments
Copyright 2000 IEEE. Reprinted from Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '2000), Volume 3, pages 1623–1626.
This material is posted here with permission of the IEEE. Such permission
of the IEEE does not in any way imply IEEE endorsement of any of the
University of Pennsylvania's products or services. Internal or personal
use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.
Abstract
In this paper, a new auditory-based speech processing system based on the biologically rooted property of average localized synchrony detection (ALSD) is proposed. The system detects periodicity in the speech signal at Bark-scaled frequencies while reducing the response's spurious peaks and sensitivity to implementation mismatches, and hence presents a consistent and robust representation of the formants. The system is evaluated for its formant extraction ability while reducing spurious peaks. It is compared with other auditory-based front-end processing systems in the task of vowel recognition on clean speech from the TIMIT database and in the presence of noise. The results illustrate the advantage of the ALSD system in extracting the formants and reducing the spurious peaks. They also indicate the superiority of the synchrony measures over the mean-rate in the presence of noise.
Suggested Citation
Ahmed M. Abdelatty Ali, Jan Van der Spiegel, and Paul Mueller. "Auditory-based speech processing based on the average localized synchrony detection" Departmental Papers (ESE) (2000).
Available at: http://works.bepress.com/jan_vanderspiegel/15