Scientific report: "A Unification-based Approach to Morpho-syntactic Parsing of Agglutinative and Other (Highly) Inflectional Languages"

Gábor Proszeky, Balazs Kis
proszeky@ kis@
MorphoLogic, Késmárki u. 8., Budapest, Hungary, H-1118
http

Abstract

This paper introduces a new approach to morpho-syntactic analysis through Humor 99 (High-speed Unification Morphology), a reversible, unification-based morphological analyzer that has already been integrated into a variety of industrial applications. Humor 99 copes effectively with the problems of agglutinative languages (e.g. Hungarian, Turkish, Estonian) and other (highly) inflectional languages (e.g. Polish, Czech, German). The authors conclude the paper by arguing that the approach used in Humor 99 is general enough to be well suited to a wide range of languages, and that it can serve as a basis for higher-level linguistic operations such as shallow parsing.

Introduction

Several linguistic phenomena can be processed with morphological tools in agglutinative and other highly inflectional languages, whereas processing the same features requires a syntactic parser in other languages such as English. This paper provides a brief description of Humor 99, first presenting the general theoretical background of the system. This is followed by examples of the most recent applications, in addition to those listed earlier, where the authors argue that the approach used in Humor 99 is general enough to be well suited to a wide range of languages and can serve as a basis for higher-level linguistic operations such as shallow or even full parsing.

1 Affix arrays rather than affixes

Segmentation of a word-form in Humor 99 is based on surface patterns; that is, typical sequences of separate suffix morphemes are analyzed as a whole. For example, the English nominal ending string er + s + ' (NtoV + PL + POSS) is a complex affix handled as an atomic string in Humor 99. The string ers' is generated from er + s + ' in an
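
The idea of treating a whole ending as one atomic string can be illustrated with a minimal Python sketch. The stem list, the table contents, and the function name `segment` are assumptions made for illustration only; they are not taken from Humor 99, whose actual data structures are not described in this excerpt. The tag labels are copied from the example in the text.

```python
# Hypothetical toy lexicon: stems mapped to a category label.
STEMS = {"teach": "V", "work": "V", "read": "V"}

# Hypothetical affix-array table: each complete ending is stored as ONE
# surface string together with the full tag sequence it stands for.
# Tag labels (NtoV, PL, POSS) follow the example in the text.
AFFIX_ARRAYS = {
    "": (),
    "er": ("NtoV",),
    "ers": ("NtoV", "PL"),
    "ers'": ("NtoV", "PL", "POSS"),
}


def segment(word):
    """Return every (stem, category, ending, tags) split of `word`.

    The ending is looked up as a whole in AFFIX_ARRAYS; it is never
    decomposed into individual morphemes at analysis time.
    """
    analyses = []
    for cut in range(1, len(word) + 1):
        stem, ending = word[:cut], word[cut:]
        if stem in STEMS and ending in AFFIX_ARRAYS:
            analyses.append((stem, STEMS[stem], ending, AFFIX_ARRAYS[ending]))
    return analyses


if __name__ == "__main__":
    for form in ("teacher", "teachers'", "xyz"):
        print(form, segment(form))
```

In this sketch a word-form needs only one stem lookup and one ending lookup per candidate split, which is the practical appeal of storing precompiled affix sequences instead of chaining single-suffix rules during analysis.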
