Scientific report: "A Web-based Demonstrator of a Multi-lingual Phrase-based Translation System"

Roldano Cattoni, Nicola Bertoldi, Mauro Cettolo, Boxing Chen and Marcello Federico
ITC-irst - Centro per la Ricerca Scientifica e Tecnologica
38050 Povo - Trento, Italy
surname @

Abstract

This paper describes a multi-lingual phrase-based Statistical Machine Translation system accessible through a Web page. The user can issue translation requests from Arabic, Chinese or Spanish into English. The same phrase-based statistical technology is used for all three supported language pairs, and new language pairs can easily be added to the demonstrator. The Web-based interface makes the translation system usable from any computer connected to the Internet.

1 Introduction

Statistical Machine Translation (SMT) has so far proven empirically to be the most competitive approach in international evaluations such as the NIST Evaluation Campaigns¹ and the International Workshops on Spoken Language Translation (IWSLT-2004² and IWSLT-2005³).

In this paper we describe our multi-lingual phrase-based Statistical Machine Translation system, which can be accessed through a Web page. Section 2 presents the general log-linear framework for SMT and gives an overview of our phrase-based SMT system. Section 3 outlines the software architecture of the demo. Section 4 focuses on the currently supported language pairs: Arabic-to-English, Chinese-to-English and Spanish-to-English. Section 5 describes the Web-based interface of the demo.

¹ http speech tests mt
² http IWSLT2004
³ http iwslt2005

2 SMT System Description

Log-Linear Model

Given a string f in the source language, the goal of statistical machine translation is to select the string e in the target language that maximizes the posterior distribution Pr(e | f). By introducing the hidden word alignment variable a, the following approximate optimization criterion can be applied for that purpose
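
As a sketch only, the criterion can be written in the form that is usual in the phrase-based SMT literature; this is not reproduced from the text above. The posterior is marginalized over the hidden alignment a, and the sum is then approximated by its largest term. Under a log-linear model the joint posterior is proportional to an exponentiated weighted sum of feature functions. The feature functions h_r, their weights \lambda_r and their number R are notational assumptions introduced here for illustration.

```latex
\begin{align}
e^{*} &= \arg\max_{e} \Pr(e \mid f)
       = \arg\max_{e} \sum_{a} \Pr(e, a \mid f) \\
      &\approx \arg\max_{e} \max_{a} \Pr(e, a \mid f), \\
\Pr(e, a \mid f) &\propto \exp\Big\{ \sum_{r=1}^{R} \lambda_r \, h_r(e, f, a) \Big\}
\end{align}
```

Because the normalization constant of the log-linear model does not depend on e or a, the search only needs to maximize the weighted feature sum \sum_r \lambda_r h_r(e, f, a) over candidate translations and alignments.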
