CUED Publications database

Continuous ASR for flexible incremental dialogue

Breslin, C and Gasic, M and Henderson, M and Kim, D and Szummer, M and Thomson, B and Tsiakoulis, P and Young, S (2013) Continuous ASR for flexible incremental dialogue. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. pp. 8362-8366. ISSN 1520-6149

Full text not available from this repository.


Spoken dialogue systems provide a convenient way for users to interact with a machine using only speech. However, they often rely on a rigid turn-taking regime in which a voice activity detection (VAD) module determines when the user is speaking and decides when the system should respond. This paper investigates replacing the VAD and discrete utterance recogniser of a conventional turn-taking system with a continuously operating recogniser that is always listening, and using the recogniser's 1-best path to guide turn taking. This makes a flexible framework for incremental dialogue management possible. Experimental results show that the VAD component can be removed and the recogniser's best path used to identify user speech, with more robustness to noise, potentially lower latency, and a reduction in overall recognition error rate compared with the conventional approach. © 2013 IEEE.
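
The abstract does not give implementation details, so the following is only a minimal illustrative sketch of the general idea of using a recogniser's 1-best path, rather than a VAD, to find user speech. Everything in it is an assumption made for illustration and is not taken from the paper: the `Segment` type, the `<sil>` label for non-speech, the `find_user_turns` function, and the `min_pause` threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple

# Hypothetical label for non-speech segments in the 1-best path (an assumption).
SILENCE = "<sil>"


@dataclass
class Segment:
    label: str    # a recognised word, or SILENCE
    start: float  # start time in seconds
    end: float    # end time in seconds


def find_user_turns(
    best_path: Iterable[Segment], min_pause: float = 0.5
) -> List[Tuple[float, float, List[str]]]:
    """Group words on the 1-best path into turns.

    A turn is closed once the silence following the last recognised word
    has lasted at least `min_pause` seconds. Returns (start, end, words).
    """
    turns: List[Tuple[float, float, List[str]]] = []
    words: List[str] = []
    turn_start = 0.0
    last_speech_end = 0.0

    for seg in best_path:
        if seg.label == SILENCE:
            # Close the open turn only if the pause is long enough.
            if words and seg.end - last_speech_end >= min_pause:
                turns.append((turn_start, last_speech_end, words))
                words = []
            continue
        if not words:
            turn_start = seg.start  # first word of a new turn
        words.append(seg.label)
        last_speech_end = seg.end

    # Flush a turn that is still open when the path ends.
    if words:
        turns.append((turn_start, last_speech_end, words))
    return turns


example_path = [
    Segment(SILENCE, 0.0, 1.0),
    Segment("hello", 1.0, 1.4),
    Segment(SILENCE, 1.4, 1.6),   # short pause: stays within the same turn
    Segment("there", 1.6, 2.0),
    Segment(SILENCE, 2.0, 2.8),   # long pause: closes the turn
    Segment("yes", 2.8, 3.1),
]
example_turns = find_user_turns(example_path)
```

In this sketch the short pause between "hello" and "there" does not end the turn, while the longer pause before "yes" does. A real incremental system would process the path as it grows rather than as a finished list.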

Item Type: Article
Divisions: Div F > Machine Intelligence
Depositing User: Cron Job
Date Deposited: 17 Jul 2017 19:12
Last Modified: 22 May 2018 07:18