Scientific report: "Relation Guided Bootstrapping of Semantic Lexicons"

Tara McIntosh, Lars Yencken, James R. Curran, Timothy Baldwin
NICTA Victoria Research Lab; Dept. of Computer Science and Software Engineering, The University of Melbourne
School of Information Technologies, The University of Sydney
nlp@ lars@ james@ tb@

Abstract

State-of-the-art bootstrapping systems rely on expert-crafted semantic constraints such as negative categories to reduce semantic drift. Unfortunately, their use introduces a substantial amount of supervised knowledge. We present the Relation Guided Bootstrapping (RGB) algorithm, which simultaneously extracts lexicons and open relationships to guide lexicon growth and reduce semantic drift. This removes the necessity for manually crafting category and relationship constraints, and manually generating negative categories.

1 Introduction

Many approaches to extracting semantic lexicons extend the unsupervised bootstrapping framework (Riloff and Shepherd, 1997). These use a small set of seed examples from the target lexicon to identify contextual patterns, which are then used to extract new lexicon items (Riloff and Jones, 1999). Bootstrappers are prone to semantic drift, caused by selection of poor candidate terms or patterns (Curran et al., 2007), which can be reduced by semantically constraining the candidates.

Multi-category bootstrappers, such as NOMEN (Yangarber et al., 2002) and WMEB (McIntosh and Curran, 2008), reduce semantic drift by extracting multiple categories simultaneously in competition. The inclusion of manually-crafted negative categories to multi-category bootstrappers achieves the best results, by clarifying the boundaries between categories (Yangarber et al., 2002). For example, female names are often bootstrapped with the negative categories flowers (e.g. Rose, Iris) and gem stones (e.g. Ruby, Pearl) (Curran et al., 2007). Unfortunately, negative categories are difficult to design, introducing a substantial amount of human ...
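The bootstrapping loop and the effect of a negative category described above can be illustrated with a minimal Python sketch. This is not NOMEN, WMEB or RGB: the function `bootstrap`, the scoring of patterns by overlap count, the rule that discards any pattern or term claimed by more than one category, and the toy contexts are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def bootstrap(contexts, seeds, iterations=2, n_patterns=2, n_terms=3):
    """Grow one lexicon per category from (term, pattern) pairs and seed terms.

    All names and the scoring scheme are hypothetical, for illustration only.
    """
    terms_by_pattern = defaultdict(set)
    for term, pattern in contexts:
        terms_by_pattern[pattern].add(term)
    lexicons = {cat: set(terms) for cat, terms in seeds.items()}

    for _ in range(iterations):
        # 1. Each category keeps the patterns matching most of its current terms.
        patterns = {}
        for cat, lexicon in lexicons.items():
            scores = {p: len(ts & lexicon) for p, ts in terms_by_pattern.items()}
            ranked = sorted((p for p in scores if scores[p] > 0),
                            key=lambda p: (-scores[p], p))
            patterns[cat] = ranked[:n_patterns]

        # 2. Competition: a pattern chosen by several categories is discarded.
        pattern_claims = Counter(p for ps in patterns.values() for p in ps)

        # 3. Each category proposes unseen terms matched by its exclusive patterns.
        known = set().union(*lexicons.values())
        proposals = {}
        for cat, ps in patterns.items():
            votes = Counter()
            for p in ps:
                if pattern_claims[p] == 1:
                    votes.update(terms_by_pattern[p] - known)
            proposals[cat] = votes

        # 4. Competition again: a term proposed by several categories is discarded.
        term_claims = Counter(t for votes in proposals.values() for t in votes)
        for cat, votes in proposals.items():
            ranked = sorted((t for t in votes if term_claims[t] == 1),
                            key=lambda t: (-votes[t], t))
            lexicons[cat].update(ranked[:n_terms])
    return lexicons


# Toy, lowercased contexts (hypothetical data): "rose" and "iris" are ambiguous.
contexts = [(t, "ms. X said") for t in ("mary", "anna", "rose", "iris", "julia")]
contexts += [(t, "bouquet of X") for t in ("rose", "iris", "tulip", "daisy", "orchid")]

alone = bootstrap(contexts, {"names": {"mary", "anna"}})
guarded = bootstrap(contexts, {"names": {"mary", "anna"},
                               "flowers": {"tulip", "daisy"}})
print(sorted(alone["names"]))
print(sorted(guarded["names"]))
```

On this toy data the single-category run first adds "rose" and "iris" as names, then follows them into the flower pattern and absorbs "tulip", "daisy" and "orchid", which is the semantic drift discussed above. With the hand-written flowers category competing, the names lexicon stops at "anna", "julia" and "mary". The ambiguous terms are dropped from both lexicons, and the flowers seeds still had to be supplied manually, which is the supervision cost the paper aims to remove.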
