tailieunhanh - Báo cáo khoa học: "Integration of Speech to Computer-Assisted Translation Using Finite-State Automata"

State-of-the-art computer-assisted translation engines are based on a statistical prediction engine, which interactively provides completions to what a human translator types. The integration of human speech into a computer-assisted system is also a challenging area and is the aim of this paper. So far, only a few methods for integrating statistical machine translation (MT) models with automatic speech recognition (ASR) models have been studied. They were mainly based on N best rescoring approach. . | Integration of Speech to Computer-Assisted Translation Using Finite-State Automata Shahram Khadivi Richard Zens Hermann Ney Lehrstuhl fur Informatik 6 - Computer Science Department RWTH Aachen University D-52056 Aachen Germany khadivi zens ney @ Abstract State-of-the-art computer-assisted translation engines are based on a statistical prediction engine which interactively provides completions to what a human translator types. The integration of human speech into a computer-assisted system is also a challenging area and is the aim of this paper. So far only a few methods for integrating statistical machine translation MT models with automatic speech recognition ASR models have been studied. They were mainly based on N-best rescoring approach. N-best rescoring is not an appropriate search method for building a real-time prediction engine. In this paper we study the incorporation of MT models and ASR models using finite-state automata. We also propose some transducers based on MT models for rescoring the ASR word graphs. 1 Introduction A desired feature of computer-assisted translation CAT systems is the integration of the human speech into the system as skilled human translators are faster at dictating than typing the translations Brown et al. 1994 . Additionally incorporation of a statistical prediction engine . a statistical interactive machine translation system to the CAT system is another useful feature. A statistical prediction engine provides the completions to what a human translator types Foster et al. 1997 Och et al. 2003 . Then one possible procedure for skilled human translators is to provide the oral translation of a given source text and then to post-edit the recognized text. In the post-editing step a prediction engine helps to decrease the amount of human interaction Och et al. 2003 . In a CAT system with integrated speech two sources of information are available to recognize the speech input the target language speech and the .