tailieunhanh - Báo cáo khoa học: "Manually Constructed Context-Free Grammar For Myanmar Syllable Structure"

Myanmar language and script are unique and complex. Up to our knowledge, considerable amount of work has not yet been done in describing Myanmar script using formal language theory. This paper presents manually constructed context free grammar (CFG) with “111” productions to describe the Myanmar Syllable Structure. We make our CFG in conformity with the properties of LL(1) grammar so that we can apply conventional parsing technique called predictive top-down parsing to identify Myanmar syllables. We present Myanmar syllable structure according to orthographic rules. . | Manually Constructed Context-Free Grammar For Myanmar Syllable Structure Tin Htay Hlaing Nagaoka University of Technology Nagaoka JAPAN tinhtayhlaing@ Abstract Myanmar language and script are unique and complex. Up to our knowledge considerable amount of work has not yet been done in describing Myanmar script using formal language theory. This paper presents manually constructed context free grammar CFG with 111 productions to describe the Myanmar Syllable Structure. We make our CFG in conformity with the properties of LL 1 grammar so that we can apply conventional parsing technique called predictive top-down parsing to identify Myanmar syllables. We present Myanmar syllable structure according to orthographic rules. We also discuss the preprocessing step called contraction for vowels and consonant conjuncts. We make LL 1 grammar in which 1 does not mean exactly one character of lookahead for parsing because of the above mentioned contracted forms. We use five basic sub syllabic elements to construct CFG and found that all possible syllable combinations in Myanmar Orthography can be parsed correctly using the proposed grammar. 1 Introduction Formal Language Theory is a common way to represent grammatical structures of natural languages and programming languages. The origin of grammar hierarchy is the pioneering work of Noam Chomsky Noam Chomsky 1957 . A huge amount of work has been done in Natural Language Processing where Chomsky s grammar is used to describe the grammatical rules of natural languages. However formulation rules have not been established for grammar for Myanmar script. The long term goal of this study is to develop automatic syllabification of Myanmar polysyllabic words using regular grammar and or finite state methods so that syllabified strings can be used for Myanmar sorting. In this paper as a preliminary stage we describe the structure of a Myanmar syllable in context-free grammar and parse the syllables using predictive top-down .

TỪ KHÓA LIÊN QUAN