"Evaluating Automatic Speech Recognition and Natural Language Understanding in an Incremental Setting" by Ryan Whetten

Selected Works of Casey R. Kennington

Follow Contact

Contribution to Book

Evaluating Automatic Speech Recognition and Natural Language Understanding in an Incremental Setting

Proceedings of the 27th Workshop on the Semantics and Pragmatics of Dialogue - Full Papers (2023)

Ryan Whetten, Boise State University
Enoch Levandovsky, Boise State University
Mir Tahsin Imtiaz
Casey Redd Kennington, Boise State University

Link

Abstract

Spoken dialogue systems enable people to interact with machines using speech, many of which involve the use of automatic speech recognition and language understanding in order to react to and determine a decision about how to respond. Unlike humans, many systems operate on complete sentences, waiting for a length of silence before attempting to process the input. In contrast, incremental spoken dialogue systems enable faster and more natural interaction by operating at a more fine-grained level. In this work, we evaluate six speech recognizers and RASA for language understanding in an incremental spoken dialogue system. The results suggest that, for speech recognition, online/cloud models can be slower and less stable than local models and we show that incremental language understanding can enable a system to make decisions earlier than waiting for the end of the utterance.

Keywords

incremental,
automatic speech recognition,
natural language understanding

Disciplines

Artificial Intelligence and Robotics

Publication Date

Summer August 16, 2023

Publisher

SEMDIAL

Citation Information

Ryan Whetten, Enoch Levandovsky, Mir Tahsin Imtiaz and Casey Redd Kennington. "Evaluating Automatic Speech Recognition and Natural Language Understanding in an Incremental Setting" Maribor, SloveniaProceedings of the 27th Workshop on the Semantics and Pragmatics of Dialogue - Full Papers (2023)
Available at: http://works.bepress.com/casey-kennington/66/