Scientific report: "Incremental Dialogue Processing in a Micro-Domain"

Gabriel Skantze¹, Dept. of Speech, Music and Hearing, KTH, Stockholm, Sweden, gabriel@
David Schlangen, Department of Linguistics, University of Potsdam, Germany, das@

Abstract

This paper describes a fully incremental dialogue system that can engage in dialogues in a simple domain, number dictation. Because it uses incremental speech recognition and prosodic analysis, the system can give rapid feedback as the user is speaking, with a very short latency of around 200 ms. Because it uses incremental speech synthesis and self-monitoring, the system can react to feedback from the user as the system is speaking. A comparative evaluation shows that naïve users preferred this system over a non-incremental version, and that it was perceived as more human-like.

1 Introduction

A traditional simplifying assumption for spoken dialogue systems is that the dialogue proceeds with strict turn-taking between user and system. The minimal unit of processing in such systems is the utterance, which is processed in whole by each module of the system before it is handed on to the next. When the system is speaking an utterance, it assumes that the user will wait for it to end before responding. (Some systems accept barge-ins, but then treat the interrupted utterance as basically unsaid.) Obviously, this is not how natural human-human dialogue proceeds.

Humans understand and produce language incrementally: they use multiple knowledge sources to determine when it is appropriate to speak, they give and receive backchannels in the middle of utterances, they start to speak before knowing exactly what to say, and they incrementally monitor the listener's reactions to what they say (Clark, 1996).

This paper presents a dialogue system, called Numbers, in which all components operate incrementally. We had two aims: First, to explore technical ...

¹ The work reported in this paper was done while the first author was at the University of Potsdam.
