Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Inducing Word Alignments with Bilexical Synchronous Trees"

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ

This paper compares different bilexical tree-based models for bilingual alignment. EM training for the new model benefits from the dynamic programming “hook trick”. The model produces improved dependency structure for both languages. that we must choose how to lexicalize very carefully to control complexity. In this paper we compare two approaches to lexicalization, both of which incorporate bilexical probabilities. | Inducing Word Alignments with Bilexical Synchronous Trees Hao Zhang and Daniel Gildea Computer Science Department University of Rochester Rochester NY 14627 Abstract This paper compares different bilexical tree-based models for bilingual alignment. EM training for the new model benefits from the dynamic programming hook trick . The model produces improved dependency structure for both languages. 1 Introduction A major difficulty in statistical machine translation is the trade-off between representational power and computational complexity. Real-world corpora for language pairs such as Chinese-English have complex reordering relationships that are not captured by current phrase-based MT systems despite their state-of-the-art performance measured in competitive evaluations. Synchronous grammar formalisms that are capable of modeling such complex relationships while maintaining the context-free property in each language have been proposed for many years Aho and Ullman 1972 Wu 1997 Yamada and Knight 2001 Melamed 2003 Chiang 2005 but have not been scaled to large corpora and long sentences until recently. In Synchronous Context Free Grammars there are two sources of complexity grammar branching factor and lexicalization. In this paper we focus on the second issue constraining the grammar to the binary-branching Inversion Transduction Grammar of Wu 1997 . Lexicalization seems likely to help models predict alignment patterns between languages and has been proposed by Melamed 2003 and implemented by Alshawi et al. 2000 and Zhang and Gildea 2005 . However each piece of lexical information considered by a model multiplies the number of states of dynamic programming algorithms for inference meaning that we must choose how to lexicalize very carefully to control complexity. In this paper we compare two approaches to lexicalization both of which incorporate bilexical probabilities. One model uses bilexical probabilities across languages while the other uses bilex-ical .