tailieunhanh - Báo cáo khoa học: "Local constraints on sentence markers and focus in Somali"

We present a computationally tractable account of the interactions between sentence markers and focus marking in Somali. Somali, as a Cushitic language, has a basic pattern wherein a small ‘core’ clause is preceded, and in some cases followed by, a set of ‘topics’, which provide sceneseting information against which the core is interpreted. Some topics appear to carry a ‘focus marker’, indicating that they are particularly salient. | Local constraints on sentence markers and focus in Somali Katherine Hargreaves School of Informatics University of Manchester Manchester M60 1QD UK kat@ Allan Ramsay School of Informatics University of Manchester Manchester M60 1QD UK Abstract We present a computationally tractable account of the interactions between sentence markers and focus marking in Somali. Somali as a Cushitic language has a basic pattern wherein a small core clause is preceded and in some cases followed by a set of topics which provide scene-seting information against which the core is interpreted. Some topics appear to carry a focus marker indicating that they are particularly salient. We will outline a computationally tractable grammar for Somali in which focus marking emerges naturally from a consideration of the use of a range of sentence markers. 1 Introduction This paper presents a computationally tractable account of a number of phenomena in Somali. Somali displays a number of properties which distinguish it from most languages for which computational treatments are available and which are potentially problematic. We therefore start with a brief introduction to the major properties of the language together with a description of how we cover the key phenomena within a general purpose NLP framework. 2 Morphology Somali has a fairly standard set of inflectional affixes for nouns and verbs as outlined below. In addition there are a substantial set of spelling rules which insert and delete graphemes at the boundaries between roots and suffixes and clitics . There is not that much to be said about the spelling rules - Fig. 1 shows the format of a typical rule which we compile into an FST to be used during the process of lexical lookup. q x c h t v0 k v0 Figure 1 Insert k and a morpheme boundary between q x c h and a following vowel The rule in Fig. 1 would for instance say that the surface form saca might correspond to the underlying form sac ka

crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.