Scientific report: "Automatic error detection in the Japanese learners’ English spoken data"

Emi IZUMI, Kiyotaka UCHIMOTO, Toyomi SAIGA
emi@ uchimoto@ hoshi@
Thepchai Supnithi, Hitoshi ISAHARA
thepchai@ isahara@

Computational Linguistics Group, Communications Research Laboratory, 3-5 Hikaridai, Seika-cho, Soraku-gun, Kyoto, Japan
Graduate School of Science and Technology, Kobe University, 1-1 Rokkodai, Nada-ku, Kobe, Japan
TIS Inc., 9-1 Toyotsu, Suita, Osaka, Japan
National Electronics and Computer Technology Center, 112 Pahonyothin Road, Klong 1, Klong Luang, Pathumthani 12120, Thailand

Abstract

This paper describes a method of detecting grammatical and lexical errors made by Japanese learners of English and other techniques that improve the accuracy of error detection with a limited amount of training data. In this paper, we demonstrate to what extent the proposed methods hold promise by conducting experiments using our learner corpus, which contains information on learners’ errors.

1 Introduction

One of the most important things in keeping up with our current information-driven society is the acquisition of foreign languages, especially English, for international communications. In developing a computer-assisted language teaching and learning environment, we have compiled a large-scale speech corpus of Japanese learner English, which provides a great deal of useful information on the construction of a model for the developmental stages of Japanese learners’ speaking abilities.
In the support system for language learning, we have assumed that learners must be informed of what kind of errors they have made and in which part of their utterances. To do this, we need a framework that will allow us to detect learners’ errors automatically. In this paper, we introduce a method of detecting learners’ errors, and we examine to what extent this can be accomplished using our learner corpus data, which includes error tags marking the learners’ errors.

2 SST
