Scientific report: "Automatically Creating Bilingual Lexicons for Machine Translation from Bilingual Text"
Davide Turcato
TCC Communications, 100-6722 Oldfield Road, Victoria, BC V8M 2A3, Canada
Natural Language Lab, School of Computing Science, Simon Fraser University, Burnaby, BC V5A 1S6, Canada

Abstract

A method is presented for automatically augmenting the bilingual lexicon of an existing Machine Translation system by extracting bilingual entries from aligned bilingual text. The proposed method only relies on the resources already available in the MT system itself. It is based on the use of bilingual lexical templates to match the terminal symbols in the parses of the aligned sentences.

1 Introduction

A novel approach to automatically building bilingual lexicons is presented here. The term "bilingual lexicon" denotes a collection of complex equivalences, as used in Machine Translation (MT) transfer lexicons, not just word equivalences. In addition to words, such lexicons involve syntactic and semantic descriptions and means to perform a correct transfer between the two sides of a bilingual lexical entry. A symbolic, rule-based approach of the parse-parse-match kind is proposed. The core idea is to use the resources of bidirectional transfer MT systems for this purpose, taking advantage of their features to convert them to a novel use. In addition to having them use their bilingual lexicons to produce translations, it is proposed to have them use translations to produce bilingual lexicons.
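
To make the parse-parse-match idea more concrete, the following is a minimal Python sketch, not the paper's implementation. It assumes each parse has already been reduced to a flat list of terminal symbols carrying only a lemma and a category, whereas the lexicons described here also involve syntactic and semantic descriptions. The names `Terminal`, `Template`, and `extract_entries`, the category labels, and the English/Spanish sample are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Terminal:
    """A terminal symbol of a parse (hypothetical, heavily simplified)."""
    lemma: str
    cat: str  # category label such as "N" or "V" (assumed notation)


@dataclass(frozen=True)
class Template:
    """A bilingual lexical template: category patterns for the two sides."""
    src_cats: tuple
    tgt_cats: tuple


def extract_entries(src_terms, tgt_terms, lexicon, templates):
    """Propose new bilingual entries from one pair of aligned sentences.

    lexicon is a set of known (source lemma, target lemma) pairs.
    """
    src_left = list(src_terms)
    tgt_left = list(tgt_terms)

    # Step 1: set aside terminals already accounted for by known entries.
    for s in list(src_left):
        for t in list(tgt_left):
            if (s.lemma, t.lemma) in lexicon:
                src_left.remove(s)
                tgt_left.remove(t)
                break

    # Step 2: a template applies only if it accounts for every terminal
    # that is still unmatched on both sides.
    src_cats = tuple(s.cat for s in src_left)
    tgt_cats = tuple(t.cat for t in tgt_left)
    new_entries = []
    for tpl in templates:
        if tpl.src_cats == src_cats and tpl.tgt_cats == tgt_cats:
            new_entries.append((
                tuple(s.lemma for s in src_left),
                tuple(t.lemma for t in tgt_left),
            ))
    return new_entries


# Hypothetical aligned pair: "the dog sleeps" / "el perro duerme".
src = [Terminal("the", "Det"), Terminal("dog", "N"), Terminal("sleep", "V")]
tgt = [Terminal("el", "Det"), Terminal("perro", "N"), Terminal("dormir", "V")]
known = {("the", "el"), ("sleep", "dormir")}
templates = [Template(("N",), ("N",))]

print(extract_entries(src, tgt, known, templates))
# prints [(('dog',), ('perro',))]
```

In this sketch a small initial lexicon accounts for most of the sentence pair, and a single noun-to-noun template turns the remaining unmatched terminals into a candidate entry. Requiring a template to cover all leftover terminals is a conservative choice made for the sketch, not something stated in the text.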
Although other uses might be conceived, the most appropriate use is to have an MT system automatically augment its own bilingual lexicon from a small initial sample. The core of the described approach consists of using a set of bilingual lexical templates in matching the parses of two aligned sentences and in turning the lexical equivalences thus established into new bilingual lexical entries.

2 Theoretical framework

The basic requirement that an MT system should meet for the present ...