tailieunhanh - Báo cáo khoa học: "Learning Tense Translation from Bilingual Corpora"

This paper studies and evaluates disambiguation strategies for the translation of tense between German and English, using a bilingual corpus of appointment scheduling dialogues. It describes a scheme to detect complex verb predicates based on verb form subcategorization and grammatical knowledge. The extracted verb and tense information is presented and the role of different context factors is discussed. | Learning Tense Translation from Bilingual Corpora Michael Schiehlen Institute for Computational Linguistics University of Stuttgart Azenbergstr. 12 70174 Stuttgart Abstract This paper studies and evaluates disambiguation strategies for the translation of tense between German and English using a bilingual corpus of appointment scheduling dialogues. It describes a scheme to detect complex verb predicates based on verb form subcategorization and grammatical knowledge. The extracted verb and tense information is presented and the role of different context factors is discussed. 1 Introduction A problem for translation is its context dependence. For every ambiguous word the part of the context relevant for disambiguation must be identified disambiguation strategy and every word potentially occurring in this context must be assigned a bias for the translation decision disambiguation information . Manual construction of disambiguation components is quite a chore. Fortunately the task can be partly automated if the tables associating words with biases are learned from a corpus. Statistical approaches also support empirical evaluation of different disambiguation strategies. The paper studies disambiguation strategies for tense translation between German and English. The experiments are based on a corpus of appointment scheduling dialogues counting 150 281 German and 154 773 English word tokens aligned in 16 857 turns. The dialogues were recorded transcribed and translated in the German national Verbmobil project that aims to develop a tri-lingual spoken language translation system. Tense is interesting since it occurs in nearly every sentence. Tense can be ex This work was funded by the German Federal Ministry of Education Science Research and Technology BMBF in the framework of the Verbmobil Project under Grant 01 IV 101 u. Many thanks are due to G. Carroll M. Emele u. Heid and the colleagues in Verbmobil. pressed on the surface lexically as

crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.