tailieunhanh - Báo cáo khoa học: "Grammar Error Correction Using Pseudo-Error Sentences and Domain Adaptation"

This paper presents grammar error correction for Japanese particles that uses discriminative sequence conversion, which corrects erroneous particles by substitution, insertion, and deletion. The error correction task is hindered by the difficulty of collecting large error corpora. We tackle this problem by using pseudoerror sentences generated automatically. | Grammar Error Correction Using Pseudo-Error Sentences and Domain Adaptation Kenji Imamura Kuniko Saito Kugatsu Sadamitsu and Hitoshi Nishikawa NTT Cyber Space Laboratories NTT Corporation 1-1 Hikari-no-oka Yokosuka 239-0847 Japan @ Abstract This paper presents grammar error correction for Japanese particles that uses discriminative sequence conversion which corrects erroneous particles by substitution insertion and deletion. The error correction task is hindered by the difficulty of collecting large error corpora. We tackle this problem by using pseudoerror sentences generated automatically. Furthermore we apply domain adaptation the pseudo-error sentences are from the source domain and the real-error sentences are from the target domain. Experiments show that stable improvement is achieved by using domain adaptation. 1 Introduction Case marks of a sentence are represented by postpositional particles in Japanese. Incorrect usage of the particles causes serious communication errors because the cases become unclear. For example in the following sentence it is unclear what must be deleted. mail o todoi tara sakujo onegai-shi-masu mail ACC. arrive when delete please When Ộ has arrived an e-mail please delete it. If the accusative particle o is replaced by a nominative one ga it becomes clear that the writer wants to delete the e-mail When the e-mail has arrived please delete it. . Such particle errors frequently occur in sentences written by non-native Japanese speakers. This paper presents a method that can automatically correct Japanese particle errors. This task 388 corresponds to preposition article error correction in English. For English error correction many studies employ classifiers which select the appropriate prepositions articles by restricting the error types to articles and frequent prepositions Gamon 2010 Han et al. 2010 Rozovskaya and Roth 2011 . On the contrary Mizumoto et al.

TÀI LIỆU LIÊN QUAN
TỪ KHÓA LIÊN QUAN