Contribution to Book
Evaluating Automatic Speech Recognition and Natural Language Understanding in an Incremental Setting
Proceedings of the 27th Workshop on the Semantics and Pragmatics of Dialogue - Full Papers (2023)
  • Ryan Whetten, Boise State University
  • Enoch Levandovsky, Boise State University
  • Mir Tahsin Imtiaz
  • Casey Redd Kennington, Boise State University
Abstract
Spoken dialogue systems enable people to interact with machines using speech. Many such systems use automatic speech recognition and language understanding to interpret the input and decide how to respond. Unlike humans, many systems operate on complete sentences, waiting for a length of silence before attempting to process the input. In contrast, incremental spoken dialogue systems enable faster and more natural interaction by operating at a more fine-grained level. In this work, we evaluate six speech recognizers and RASA for language understanding in an incremental spoken dialogue system. The results suggest that, for speech recognition, online/cloud models can be slower and less stable than local models. We also show that incremental language understanding can enable a system to make decisions before the end of the utterance rather than waiting for it.
Keywords
  • incremental
  • automatic speech recognition
  • natural language understanding
Publication Date
August 16, 2023
Publisher
SEMDIAL
Citation Information
Ryan Whetten, Enoch Levandovsky, Mir Tahsin Imtiaz and Casey Redd Kennington. "Evaluating Automatic Speech Recognition and Natural Language Understanding in an Incremental Setting." Proceedings of the 27th Workshop on the Semantics and Pragmatics of Dialogue - Full Papers, Maribor, Slovenia (2023)
Available at: http://works.bepress.com/casey-kennington/66/