tailieunhanh - Báo cáo khoa học: "Morphological Disambiguation by Voting Constraints"

We present a constraint-based morphological disambiguation system in which individual constraints vote on matching morphological parses, and disambiguation of all the tokens in a sentence is performed at the end by selecting parses that receive the highest votes. This constraint application paradigm makes the outcome of the disambiguation independent of the rule sequence, and hence relieves the rule developer from worrying about potentially conflicting rule sequencing. Our results for disambiguating Turkish indicate that using about 500 constraint rules and some additional simple statistics, we can attain a recall of 95-96~ and a precision of 94-95~ with about parses. | Morphological Disambiguation by Voting Constraints Kemal Oflazer and Gokhan Tur Department of Computer Engineering and Information Science Bilkent University Bilkent TR-06533 Turkey ko tur @ Abstract We present a constraint-based morphological disambiguation system in which individual constraints vote on matching morphological parses and disambiguation of all the tokens in a sentence is performed at the end by selecting parses that receive the highest votes. This constraint application paradigm makes the outcome of the disambiguation independent of the rule sequence and hence relieves the rule developer from worrying about potentially conflicting rule sequencing. Our results for disambiguating Turkish indicate that using about 500 constraint rules and some additional simple statistics we can attain a recall of 95-96 and a precision of 9ị-95 with about parses per token. Our system is implemented in Prolog and we are currently investigating an efficient implementation based on finite state transducers. 1 Introduction Automatic morphological disambiguation is an important component in higher level analysis of natural language text corpora. There has been a large number of studies in tagging and morphological disambiguation using various techniques such as statistical techniques . Church 1988 Cutting et al. 1992 DeRose 1988 constraint-based techniques Karlsson et al. 1995 Voutilainen 1995b Vouti-lainen Heikkilă and Anttila 1992 Voutilainen and Tapanainen 1993 Oflazer and Kuruoz 1994 Oflazer and Tur 1996 and transformation-based techniques Brill 1992 Brill 1994 Brill 1995 . This paper presents a novel approach to constraint based morphological disambiguation which relieves the rule developer from worrying about conflicting rule ordering requirements. The approach depends on assigning votes to constraints according to their complexity and specificity and then letting constraints cast votes on matching parses of a given lexical item. This approach

TỪ KHÓA LIÊN QUAN
crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.