tailieunhanh - Báo cáo khoa học: "Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task"

Cross-lingual tasks are especially difficult due to the compounding effect of errors in language processing and errors in machine translation (MT). In this paper, we present an error analysis of a new cross-lingual task: the 5W task, a sentence-level understanding task which seeks to return the English 5W's (Who, What, When, Where and Why) corresponding to a Chinese sentence. We analyze systems that we developed, identifying specific problems in language processing and MT that cause errors. | Who What When Where Why Comparing Multiple Approaches to the Cross-Lingual 5W Task Kristen Parton Kathleen R. McKeown Bob Coyne Mona T. Diab Ralph Grishmanf Dilek Hakkani-TurỊ Mary Harper Heng Ji Wei Yun Ma Adam Meyersf Sara Stolbach Ang Sunf Gokhan Tur Wei Xuf and Sibel YamanỊ Columbia University New York NY USA kristen kathy coyne mdiab ma sara @ fNew York University New York NY USA grishman meyers asun xuwei @ International Computer Science Institute Berkeley CA USA dilek sibel @ Human Lang. Tech. Ctr. of Excellence Johns Hopkins and U. of Maryland College Park mharper@ City University of New York New York NY USA hengj i@ SRI International Palo Alto CA USA gokhan@ Abstract Cross-lingual tasks are especially difficult due to the compounding effect of errors in language processing and errors in machine translation MT . In this paper we present an error analysis of a new cross-lingual task the 5W task a sentence-level understanding task which seeks to return the English 5W s Who What When Where and Why corresponding to a Chinese sentence. We analyze systems that we developed identifying specific problems in language processing and MT that cause errors. The best cross-lingual 5W system was still 19 worse than the best monolingual 5W system which shows that MT significantly degrades sentence-level understanding. Neither source-language nor targetlanguage analysis was able to circumvent problems in MT although each approach had advantages relative to the other. A detailed error analysis across multiple systems suggests directions for future research on the problem. 1 Introduction In our increasingly global world it is ever more likely for a mono-lingual speaker to require information that is only available in a foreign language document. Cross-lingual applications address this need by presenting information in the speaker s language even when it originally appeared in some other language .

TỪ KHÓA LIÊN QUAN