tailieunhanh - Báo cáo khoa học: "A Computational Framework for Composition in Multiple Linguistic Domains"

We describe a computational framework for a grammar architecture in which different linguistic domains such as morphology, syntax, and semantics are treated not as separate components but compositional domains. The framework is based on Combinatory Categorial Grammars and it uses the morpheme as the basic building block of the categorial lexicon. | A Computational Framework for Composition in Multiple Linguistic Domains Elvan Gõgmen Computer Engineering Department Middle East Technical University 06531 Ankara Turkey elvan@ Abstract We describe a computational framework for a grammar architecture in which different linguistic domains such as morphology syntax and semantics are treated not as separate components but compositional domains. The framework is based on Combinatory Categorial Grammars and it uses the morpheme as the basic building block of the categorial lexicon. 1 Introduction In this paper we address the problem of modelling interactions between different levels of language analysis. In agglutinative languages affixes are attached to stems to form a word that may correspond to an entire phrase in a language like English. For instance in Turkish word formation is based on suffixation of derivational and inflectional morphemes. Phrases may be formed in a similar way 1 . 1 Yoksul-la -ttr-il-makta-lar poor-V-CAUS-PASS-ADV-PERS They are being made poor impoverished . In Turkish there is a significant amount of interaction between morphology and syntax. For instance causative suffixes change the valence of the verb and the reciprocal suffix subcategorize the verb for a noun phrase marked with the comitative case. Moreover the head that a bound morpheme modifies may be not its stem but a compound head crossing over the word boundaries . 2 iyi oku-muỊ Qocuk well read-REL child well-educated child In 2 the relative suffix -muị in past form of subject participle modifies iyi oku to give the scope iyi oku mu QOcuk . If syntactic composition is performed after morphological composition we would get compositions such as iyi okumuỊ Qocuk or ỊỊiyi okumu gocuk which yield ill-formed semantics for this utterance. As pointed out by Oehrle 1988 there is no reason to assume a layered grammatical architecture which has linguistic division of labor into components acting on one domain at a time. As a