tailieunhanh - Báo cáo khoa học: "Producing Contextually Appropriate Intonation in an Information-State Based Dialogue System"

Our goal is to improve the contextual appropriateness of spoken output in a dialogue system. We explore the use of the information state to determine the information structure of system utterances. We concentrate on the realization of information structure by intonation. We present the results of evaluating the contextual appropriateness of varied system output produced with a text-to-speech synthesis system that supports intonation annotation. | Producing Contextually Appropriate Intonation in an Information-State Based Dialogue System Ivana Kruijff-Korbayova1 Stina Ericsson2 Kepa J. Rodríguez1 Elena Karagjosova1 University of the Saarland Germany 2University of Gothenburg Sweden korbay kepa elka @ stinae@ Abstract Our goal is to improve the contextual appropriateness of spoken output in a dialogue system. We explore the use of the information state to determine the information structure of system utterances. We concentrate on the realization of information structure by intonation. We present the results of evaluating the contextual appropriateness of varied system output produced with a text-to-speech synthesis system that supports intonation annotation. 1 Introduction Most commercial spoken dialogue systems use carefully scripted dialogues. This has the advantage that the system output can be pre-recorded and have high quality. The disadvantage is limited dialogue flexibility as user-initiative must be restricted to ensure the dialogue adheres to the script. More flexible dialogues need dynamically produced output. As the range of possible system utterances grows pre-recording becomes infeasible and speech synthesis becomes necessary. One challenge for systems using synthesized speech is the generation of contextually appropriate intonation. With dynamically produced output the same sequence of words may appear in different contexts possibly needing different intonation. For example the intonation of an an swer needs to correspond to the respective question whereas in IS the nuclear intonation center has a default placement in 2S it does 1 U What is the status of the stove S The stove is switched ON. H LL 2 U Which device is switched on S The STOVE is switched on. H LL Contextually inappropriate intonation may have negative effect on intelligibility or even lead to confusion for example when 1U is answered with 2S or 2U with IS a mismatch arises. The details of relating .

TỪ KHÓA LIÊN QUAN