tailieunhanh - Báo cáo khoa học: "Adapting Self-training for Semantic Role Labeling"

Supervised semantic role labeling (SRL) systems trained on hand-crafted annotated corpora have recently achieved state-of-the-art performance. However, creating such corpora is tedious and costly, with the resulting corpora not sufficiently representative of the language. This paper describes a part of an ongoing work on applying bootstrapping methods to SRL to deal with this problem. Previous work shows that, due to the complexity of SRL, this task is not straight forward. | Adapting Self-training for Semantic Role Labeling Rasoul Samad Zadeh Kaljahi FCSIT University of Malaya 50406 Kuala Lumpur Malaysia. rsk7945@ Abstract Supervised semantic role labeling SRL systems trained on hand-crafted annotated corpora have recently achieved state-of-the-art performance. However creating such corpora is tedious and costly with the resulting corpora not sufficiently representative of the language. This paper describes a part of an ongoing work on applying bootstrapping methods to SRL to deal with this problem. Previous work shows that due to the complexity of SRL this task is not straight forward. One major difficulty is the propagation of classification noise into the successive iterations. We address this problem by employing balancing and preselection methods for self-training as a bootstrapping algorithm. The proposed methods could achieve improvement over the base line which do not use these methods. 1 Introduction Semantic role labeling has been an active research field of computational linguistics since its introduction by Gildea and Jurafsky 2002 . It reveals the event structure encoded in the sentence which is useful for other NLP tasks or applications such as information extraction question answering and machine translation Surdea-nu et al. 2003 . Several CoNLL shared tasks Carreras and Marquez 2005 Surdeanu et al. 2008 dedicated to semantic role labeling affirm the increasing attention to this field. One important supportive factor of studying supervised statistical SRL has been the existence of hand-annotated semantic corpora for training SRL systems. FrameNet Baker et al. 1998 was the first such resource which made the emergence of this research field possible by the seminal work of Gildea and Jurafsky 2002 . However this corpus only exemplifies the semantic role assignment by selecting some illustrative examples for annotation. This questions its suita bility for statistical learning. Propbank was started by .

TỪ KHÓA LIÊN QUAN
crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.