Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Combining Acoustic and Pragmatic Features to Predict Recognition Performance in Spoken Dialogue Systems"

Ngọc Quyên 71 8 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

We use machine learners trained on a combination of acoustic conﬁdence and pragmatic plausibility features computed from dialogue context to predict the accuracy of incoming n-best recognition hypotheses to a spoken dialogue system. Our best results show a 25% weighted f-score improvement over a baseline system that implements a “grammar-switching” approach to context-sensitive speech recognition. | Combining Acoustic and Pragmatic Features to Predict Recognition Performance in Spoken Dialogue Systems Malte Gabsdil Department of Computational Linguistics Saarland University Germany gabsdil@coli.uni-sb.de Oliver Lemon School of Informatics Edinburgh University Scotland olemon@inf.ed.ac.uk Abstract We use machine learners trained on a combination of acoustic confidence and pragmatic plausibility features computed from dialogue context to predict the accuracy of incoming n-best recognition hypotheses to a spoken dialogue system. Our best results show a 25 weighted f-score improvement over a baseline system that implements a grammar-switching approach to context-sensitive speech recognition. 1 Introduction A crucial problem in the design of spoken dialogue systems is to decide for incoming recognition hypotheses whether a system should accept consider correctly recognized reject assume misrecognition or ignore classify as noise or speech not directed to the system them. In addition a more sophisticated dialogue system might decide whether to clarify or confirm certain hypotheses. Obviously incorrect decisions at this point can have serious negative effects on system usability and user satisfaction. On the one hand accepting misrecognized hypotheses leads to misunderstandings and unintended system behaviors which are usually difficult to recover from. On the other hand users might get frustrated with a system that behaves too cautiously and rejects or ignores too many utterances. Thus an important feature in dialogue system engineering is the tradeoff between avoiding task failure due to misrecognitions and promoting overall dialogue efficiency flow and naturalness. In this paper we investigate the use of machine learners trained on a combination of acoustic confidence and pragmatic plausibility features i.e. computed from dialogue context to predict the quality of incoming n-best recognition hypotheses to a spoken dialogue system. These predictions are then used .

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation"

Báo cáo khoa học: "Combining Textual Entailment and Argumentation Theory for Supporting Online Debates Interactions"

Báo cáo khoa học: "Combining Word-Level and Character-Level Models for Machine Translation Between Closely-Related Languages"

Báo cáo khoa học: "Deciphering Foreign Language by Combining Language Models and Context Vectors"

Báo cáo khoa học: "Combining data and mathematical models of language change"

Báo cáo khoa học: "Combining Orthogonal Monolingual and Multilingual Sources of Evidence for All Words WSD"

Báo cáo khoa học: "Combining Indicators of Allophony"

Báo cáo khoa học: "Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding"

Báo cáo khoa học: "Combining Morpheme-based Machine Translation with Post-processing Morpheme Prediction"

Báo cáo khoa học: "Combining POMDPs trained with User Simulations and Rule-based Dialogue Management in a Spoken Dialogue System"