tailieunhanh - Báo cáo khoa học: "AUTOMATIC SPEECH RECOGNITION AND ITS APPLICATION TO INFORMATION EXTRACTION"

This paper describes recent progress and the author's perspectives of speech recognition technology. Applications of speech recognition technology can be classified into two main areas, dictation and human-computer dialogue systems. In the dictation domain, the automatic broadcast news transcription is now actively investigated, especially under the DARPA project. | AUTOMATIC SPEECH RECOGNITION AND ITS APPLICATION TO INFORMATION EXTRACTION Sadaoki Furui Department of Computer Science Tokyo Institute of Technology 2-12-1 Ookayama Meguro-ku Tokyo 152-8552 Japan furui@ ABSTRACT This paper describes recent progress and the author s perspectives of speech recognition technology. Applications of speech recognition technology can be classified into two main areas dictation and human-computer dialogue systems. In the dictation domain the automatic broadcast news transcription is now actively investigated especially under the DARPA project. The broadcast news dictation technology has recently been integrated with information extraction and retrieval technology and many application systems such as automatic voice document indexing and retrieval systems are under development. In the human-computer interaction domain a variety of experimental systems for information retrieval through spoken dialogue are being investigated. In spite of the remarkable recent progress we are still behind our ultimate goal of understanding free conversational speech uttered by any speaker under any environment. This paper also describes the most important research issues that we should attack in order to advance to our ultimate goal of fluent speech recognition. 1. INTRODUCTION The field of automatic speech recognition has witnessed a number of significant advances in the past 5-10 years spurred on by advances in signal processing algorithms computational architectures and hardware. These advances include the widespread adoption of a statistical pattern recognition paradigm a data-driven approach which makes use of a rich set of speech utterances from a large population of speakers the use of stochastic acoustic and language modeling and the use of dynamic programmingbased search methods. A series of D ARPA projects have been a major driving force of the recent progress in research on large-vocabulary continuous-speech recognition. Specifically

TỪ KHÓA LIÊN QUAN
crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.