Scientific report: "Lexicon Features for Japanese Syntactic Analysis in Mu-Project-JE"
Yoshiyuki Sakamoto, Masayuki Satoh, Tetsuya Ishikawa

Electrotechnical Laboratory, Sakura-mura, Niihari-gun, Ibaraki, Japan
The Japan Information Center of Science and Technology, Nagata-cho, Chiyoda-ku, Tokyo, Japan
Univ. of Library and Information Science, Yatabe-machi, Tsukuba-gun, Ibaraki, Japan

0. Abstract

In this paper, we focus on the features of a lexicon for Japanese syntactic analysis in Japanese-to-English translation. Japanese word order is almost unrestricted, and Kaku-joshi (postpositional case particle) is an important device which acts as the case label (case marker) in Japanese sentences. Therefore case grammar is the most effective grammar for Japanese syntactic analysis. The case frame governed by Yougen and having surface case (Kaku-joshi), deep case (case label) and semantic markers for nouns is analyzed here to illustrate how we apply case grammar to Japanese syntactic analysis in our system. The parts of speech are classified into 56 sub-categories. We analyze semantic features for nouns and pronouns classified into sub-categories, and we present a system for semantic markers. The lexicon formats for syntactic and semantic features are composed of different features for each part of speech.
As this system uses LISP as the programming language, the lexicons are written as S-expressions in LISP, punched onto tapes, and stored as files in the computer.

1. Introduction

The Mu-project is a national project supported by the STA (Science and Technology Agency), the full name of which is "Research on a Machine Translation System (Japanese - English) for Scientific and Technological Documents." We are currently restricting the domain of translation to abstracts of papers in scientific and technological fields. The system is based on a transfer approach and consists of three phases: analysis, transfer, and generation. In the first phase of machine translation, analysis, morphological analysis divides the sentence into lexical items.
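
The case frame described in the abstract (a surface case particle, a deep case label, and semantic markers for the noun) can be illustrated with a minimal sketch. It is written in Python rather than the project's LISP, and it is not the Mu-project's lexicon format: the class `CaseSlot`, the function `assign_deep_cases`, the romanized entries, the deep case names, and the semantic markers are all hypothetical, chosen only to show how particles allow phrases to be labeled regardless of word order. The greedy slot matching is likewise a simplifying assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CaseSlot:
    particle: str         # surface case: the Kaku-joshi, romanized (e.g. "ga", "wo")
    deep_case: str        # deep case label (hypothetical names)
    markers: frozenset    # semantic markers a filler noun may carry (hypothetical)


# Hypothetical noun lexicon: noun -> set of semantic markers.
NOUNS = {
    "kenkyuusha": {"HUMAN"},     # researcher
    "ronbun": {"DOCUMENT"},      # paper
}

# Hypothetical case frame for the predicate "kaku" (to write).
CASE_FRAMES = {
    "kaku": [
        CaseSlot("ga", "AGENT", frozenset({"HUMAN"})),
        CaseSlot("wo", "OBJECT", frozenset({"DOCUMENT"})),
    ],
}


def assign_deep_cases(predicate, phrases):
    """Map (noun, particle) phrases, given in any order, to deep cases.

    Returns a dict deep_case -> noun, or None if some phrase fits no
    unfilled slot of the predicate's case frame.
    """
    free = list(CASE_FRAMES[predicate])
    result = {}
    for noun, particle in phrases:
        for slot in free:
            # A slot accepts the phrase when the particle matches and the
            # noun shares at least one semantic marker with the slot.
            if slot.particle == particle and NOUNS.get(noun, set()) & slot.markers:
                result[slot.deep_case] = noun
                free.remove(slot)
                break
        else:
            return None  # no slot accepted this phrase
    return result


if __name__ == "__main__":
    # The same two phrases in either order receive the same deep cases.
    print(assign_deep_cases("kaku", [("kenkyuusha", "ga"), ("ronbun", "wo")]))
    print(assign_deep_cases("kaku", [("ronbun", "wo"), ("kenkyuusha", "ga")]))
```

Because each slot is selected by particle and semantic marker rather than by position, reordering the noun phrases does not change the result, which is the property that makes case grammar a natural fit for a language with nearly free word order.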