tailieunhanh - Scientific report: "Ambiguity Resolution for Machine Translation of Telegraphic Messages"

Ambiguity Resolution for Machine Translation of Telegraphic Messages

Young-Suk Lee, Lincoln Laboratory, MIT, Lexington, MA 02173, USA
Clifford Weinstein, Lincoln Laboratory, MIT, Lexington, MA 02173, USA
Stephanie Seneff, SLS, LCS, MIT, Cambridge, MA 02139, USA
Dinesh Tummala, Lincoln Laboratory, MIT, Lexington, MA 02173, USA

Abstract

Telegraphic messages with numerous instances of omission pose a new challenge to parsing in that a sentence with omission causes a higher degree of ambiguity than a sentence without omission. Misparsing induced by omissions has a far-reaching consequence in machine translation. Namely, a misparse of the input often leads to a translation into the target language which has incoherent meaning in the given context. This is more frequently the case if the structures of the source and target languages are quite different, as in English and Korean. Thus, the question of how to parse telegraphic messages accurately and efficiently becomes a critical issue in machine translation. In this paper, we describe a technical solution to this issue and present the performance evaluation of a machine translation system on telegraphic messages before and after adopting the proposed solution. The solution lies in a grammar design in which lexicalized grammar rules defined in terms of semantic categories and syntactic rules defined in terms of part-of-speech are utilized together.
The proposed grammar achieves higher parsing coverage without increasing the amount of ambiguity and misparsing when compared with a purely lexicalized semantic grammar, and achieves a lower degree of ambiguity and fewer misparses without decreasing parsing coverage when compared with a purely syntactic grammar.

1 Introduction

Achieving the goal of producing high-quality machine translation output is hindered by lexical and syntactic ambiguity of the input sentences. Lexical ambiguity may be greatly reduced by limiting the domain
