tailieunhanh - Báo cáo khoa học: "Synonymous Collocation Extraction Using Translation Information"
Automatically acquiring synonymous collocation pairs such as and from corpora is a challenging task. For this task, we can, in general, have a large monolingual corpus and/or a very limited bilingual corpus. Methods that use monolingual corpora alone or use bilingual corpora alone are apparently inadequate because of low precision or low coverage. In this paper, we propose a method that uses both these resources to get an optimal compromise of precision and coverage. | Synonymous Collocation Extraction Using Translation Information Hua WU Ming ZHOU Microsoft Research Asia 5F Sigma Center Zhichun Road Haidian District Beijing 100080 China wu_hua_ @ mingzhou @ microsoft. com Abstract Automatically acquiring synonymous collocation pairs such as turn on OBJ light and switch on OBJ light from corpora is a challenging task. For this task we can in general have a large monolingual corpus and or a very limited bilingual corpus. Methods that use monolingual corpora alone or use bilingual corpora alone are apparently inadequate because of low precision or low coverage. In this paper we propose a method that uses both these resources to get an optimal compromise of precision and coverage. This method first gets candidates of synonymous collocation pairs based on a monolingual corpus and a word thesaurus and then selects the appropriate pairs from the candidates using their translations in a second language. The translations of the candidates are obtained with a statistical translation model which is trained with a small bilingual corpus and a large monolingual corpus. The translation information is proved as effective to select synonymous collocation pairs. Experimental results indicate that the average precision and recall of our approach are 74 and 64 respectively which outperform those methods that only use monolingual corpora and those that only use bilingual corpora. 1 Introduction This paper addresses the problem of automatically extracting English synonymous collocation pairs using translation information. A synonymous collocation pair includes two collocations which are similar in meaning but not identical in wording. Throughout this paper the term collocation refers to a lexically restricted word pair with a certain syntactic relation. For instance turn on OBJ light is a collocation with a syntactic relation verb-object and turn on OBJ light and switch on OBJ light are a synonymous collocation pair. In this paper .
đang nạp các trang xem trước