Scientific report: "Insights into Non-projectivity in Hindi"

Prashanth Mannem, Himani Chaudhry, Akshar Bharati
Language Technologies Research Center, International Institute of Information Technology, Gachibowli, Hyderabad, India - 500032
prashanth himani @

Abstract

Large-scale efforts are underway to create dependency treebanks and parsers for Hindi and other Indian languages. Hindi, being a morphologically rich, flexible word order language, brings challenges such as handling non-projectivity in parsing. In this work, we look at non-projectivity in the Hyderabad Dependency Treebank (HyDT) for Hindi. Non-projectivity has been analysed from two perspectives: graph properties that restrict non-projectivity, and the linguistic phenomena behind non-projectivity in HyDT. Since Hindi has ample instances of non-projectivity (14% of all structures in HyDT are non-projective), it presents a case for an in-depth study of this phenomenon for better insight from both of these perspectives. We have looked at graph constraints like planarity, gap degree, edge degree and well-nestedness on structures in HyDT. We also analyse non-projectivity in Hindi in terms of various linguistic parameters such as the causes of non-projectivity, its rigidity, the possibility of reordering, and whether the reordered construction is the natural one.

1 Introduction

Non-projectivity occurs when dependents do not either immediately follow or precede their heads in a sentence (Tesnière 1959). These dependents may be spread out over a discontinuous region of the sentence. It is well known that this poses problems for both theoretical grammar formalisms and parsing systems (Kuhlmann and Möhl 2007; McDonald and Nivre 2007; Nivre et al. 2007).

Hindi is a verb-final, flexible word order language and therefore has frequent occurrences of non-projectivity in its dependency structures. Bharati et al. (2008a) showed that a major chunk of errors in their parser is due to non-projectivity. So there is a need to analyse non-projectivity in Hindi
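
To make the notions of non-projectivity and gap degree concrete, the following is a minimal sketch in Python. It is not the procedure used on HyDT. It assumes the commonly used formal definition, under which an arc is projective if every word lying between the head and the dependent is dominated by the head, and it takes the gap degree of a tree to be the largest number of discontinuities in the yield of any node. The encoding (a list `heads` where `heads[i-1]` is the head of word `i` and `0` marks the root) and the function names are illustrative assumptions, and the example tree is abstract rather than drawn from the treebank.

```python
from typing import List, Set, Tuple


def descendants(heads: List[int], node: int) -> Set[int]:
    """Return the words dominated by `node`, including `node` itself."""
    result = {node}
    stack = [node]
    while stack:
        current = stack.pop()
        for child, head in enumerate(heads, start=1):
            if head == current and child not in result:
                result.add(child)
                stack.append(child)
    return result


def non_projective_arcs(heads: List[int]) -> List[Tuple[int, int]]:
    """List (head, dependent) arcs that span a word not dominated by the head."""
    arcs = []
    for dep, head in enumerate(heads, start=1):
        if head == 0:  # the root has no incoming arc
            continue
        lo, hi = sorted((head, dep))
        subtree = descendants(heads, head)
        if any(k not in subtree for k in range(lo + 1, hi)):
            arcs.append((head, dep))
    return arcs


def gap_degree(heads: List[int]) -> int:
    """Maximum number of gaps in the yield of any node of the tree."""
    worst = 0
    for node in range(1, len(heads) + 1):
        positions = sorted(descendants(heads, node))
        gaps = sum(1 for a, b in zip(positions, positions[1:]) if b - a > 1)
        worst = max(worst, gaps)
    return worst


# Hypothetical 4-word tree: word 1 is the root, words 3 and 4 depend on
# word 1, and word 2 depends on word 4, so the arc 4 -> 2 crosses word 3.
example = [0, 4, 1, 1]
print(non_projective_arcs(example))  # [(4, 2)]
print(gap_degree(example))           # 1
```

In this sketch a structure counts as non-projective as soon as `non_projective_arcs` returns a non-empty list. The quadratic descendant search is chosen for readability, not for efficiency on a full treebank.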
