Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Learning to Say It Well: Reranking Realizations by Predicted Synthesis Quality"

Phi Hoàng 72 8 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

This paper presents a method for adapting a language generator to the strengths and weaknesses of a synthetic voice, thereby improving the naturalness of synthetic speech in a spoken language dialogue system. The method trains a discriminative reranker to select paraphrases that are predicted to sound natural when synthesized. The ranker is trained on realizer and synthesizer features in supervised fashion, using human judgements of synthetic voice quality on a sample of the paraphrases representative of the generator’s capability. . | Learning to Say It Well Reranking Realizations by Predicted Synthesis Quality Crystal Nakatsu and Michael White Department of Linguistics The Ohio State University Columbus OH 43210 USA cnakatsu mwhite @ling.ohio-state.edu Abstract This paper presents a method for adapting a language generator to the strengths and weaknesses of a synthetic voice thereby improving the naturalness of synthetic speech in a spoken language dialogue system. The method trains a discriminative reranker to select paraphrases that are predicted to sound natural when synthesized. The ranker is trained on realizer and synthesizer features in supervised fashion using human judgements of synthetic voice quality on a sample of the paraphrases representative of the generator s capability. Results from a cross-validation study indicate that discriminative paraphrase reranking can achieve substantial improvements in naturalness on average ameliorating the problem of highly variable synthesis quality typically encountered with today s unit selection synthesizers. 1 Introduction Unit selection synthesis1 a technique which concatenates segments of natural speech selected from a database has been found to be capable of producing high quality synthetic speech especially for utterances that are similar to the speech in the database in terms of style delivery and coverage Black and Lenzo 2001 . In particular in the limited domain of a spoken language dialogue system it is possible to achieve highly natural synthesis with a purpose-built voice Black and Lenzo 2000 . However it can be difficult to develop 1See e.g. Hunt and Black 1996 Black and Taylor 1997 Beutnagel et al. 1999 . a synthetic voice for a dialogue system that produces natural speech completely reliably and thus in practice output quality can be quite variable. Two important factors in this regard are the labeling process for the speech database and the direction of the dialogue system s further development after the voice has been built when

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Learning Condensed Feature Representations from Large Unsupervised Data Sets for Supervised Learning"

Báo cáo khoa học: "Learning Better Data Representation using Inference-Driven Metric Learning"

Báo cáo khoa học: "A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query"

B.A Thesis: English major students’ difficulties and expectations in learning written translation at Dong Thap university

Báo cáo đề tài nghiên cứu khoa học cấp trường: Áp dụng mô hình học tập Blended Learning trong giảng dạy học phần Basic IELTS 1 cho sinh viên theo chương trình đào tạo chất lượng cao năm thứ nhất trường Đại học Thương mại

Báo cáo đề tài nghiên cứu khoa học cấp trường: Nâng cao động lực học tiếng Anh cho sinh viên thông qua phương pháp học theo dự án (project-based learning)

Báo cáo đề tài nghiên cứu khoa học cấp trường: Nghiên cứu một số thuật toán học máy (machine learning) ứng dụng cho bài toán xác định các chủ đề quan tâm của khách hàng trực tuyến

Báo cáo khoa học: "Applications of GPC Rules and Character Structures in Games for Learning Chinese Characters"

Báo cáo khoa học: "Learning and Translating by Machines"

Báo cáo khoa học: "Discriminative Learning for Joint Template Filling"