tailieunhanh - Báo cáo khoa học: "Learning to predict pitch accents and prosodic boundaries in Dutch"

We train a decision tree inducer (CART) and a memory-based classifier (MBL) on predicting prosodic pitch accents and breaks in Dutch text, on the basis of shallow, easy-to-compute features. We train the algorithms on both tasks individually and on the two tasks simultaneously. The parameters of both algorithms and the selection of features are optimized per task with iterative deepening, an efficient wrapper procedure that uses progressive sampling of training data. Results show a consistent significant advantage of MBL over CART, and also indicate that task combination can be done at the cost of little generalization score loss | Learning to predict pitch accents and prosodic boundaries in Dutch Erwin Marsi1 Martin Reynaert1 Antal van den Bosch1 Walter Daelemans2 Veronique Hoste2 1 Tilburg University ILK Computational Linguistics and AI Tilburg The Netherlands reynaert @ 2 University of Antwerp cNts Antwerp Belgium daelem hoste @ Abstract We train a decision tree inducer CART and a memory-based classifier MBL on predicting prosodic pitch accents and breaks in Dutch text on the basis of shallow easy-to-compute features. We train the algorithms on both tasks individually and on the two tasks simultaneously. The parameters of both algorithms and the selection of features are optimized per task with iterative deepening an efficient wrapper procedure that uses progressive sampling of training data. Results show a consistent significant advantage of MBL over CART and also indicate that task combination can be done at the cost of little generalization score loss. Tests on cross-validated data and on held-out data yield F-scores of MBL on accent placement of 84 and 87 respectively and on breaks of 88 and 91 respectively. Accent placement is shown to outperform an informed baseline rule reliably predicting breaks other than those already indicated by intra-sentential punctuation however appears to be more challenging. 1 Introduction Any text-to-speech TTS system that aims at producing understandable and natural-sounding output needs to have on-board methods for predicting prosody. Most systems start with generating a prosodic representation at the linguistic or symbolic level followed by the actual phonetic realization in terms of primarily pitch pauses and segmental durations. The first step involves placing pitch accents and inserting prosodic boundaries at the right locations and may involve tune choice as well . Pitch accents correspond roughly to pitch movements that lend emphasis to certain words in an utterance. Prosodic breaks are audible .

crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.