Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Local and Global Algorithms for Disambiguation to Wikipedia"

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ

Disambiguating concepts and entities in a context sensitive way is a fundamental problem in natural language processing. The comprehensiveness of Wikipedia has made the online encyclopedia an increasingly popular target for disambiguation. Disambiguation to Wikipedia is similar to a traditional Word Sense Disambiguation task, but distinct in that the Wikipedia link structure provides additional information about which disambiguations are compatible. In this work we analyze approaches that utilize this information to arrive at coherent sets of disambiguations for a given document (which we call “global” approaches), and compare them to more traditional (local) approaches. . | Local and Global Algorithms for Disambiguation to Wikipedia Lev Ratinov 1 Dan Roth1 Doug Downey2 Mike Anderson3 University of Illinois at Urbana-Champaign ratinov2 danr @uiuc.edu 2Northwestern University ddowney@eecs.northwestern.edu 3Rexonomy mrander@gmail.com Abstract Disambiguating concepts and entities in a context sensitive way is a fundamental problem in natural language processing. The comprehensiveness of Wikipedia has made the online encyclopedia an increasingly popular target for disambiguation. Disambiguation to Wikipedia is similar to a traditional Word Sense Disambiguation task but distinct in that the Wikipedia link structure provides additional information about which disambiguations are compatible. In this work we analyze approaches that utilize this information to arrive at coherent sets of disambiguations for a given document which we call global approaches and compare them to more traditional local approaches. We show that previous approaches for global disambiguation can be improved but even then the local disambiguation provides a baseline which is very hard to beat. 1 Introduction Wikification is the task of identifying and linking expressions in text to their referent Wikipedia pages. Recently Wikification has been shown to form a valuable component for numerous natural language processing tasks including text classification Gabrilovich and Markovitch 2007b Chang et al. 2008 measuring semantic similarity between texts Gabrilovich and Markovitch 2007a crossdocument co-reference resolution Finin et al. 2009 Mayfield et al. 2009 and other tasks Kulkarni et al. 2009 . 1375 Previous studies on Wikification differ with respect to the corpora they address and the subset of expressions they attempt to link. For example some studies focus on linking only named entities whereas others attempt to link all interesting expressions mimicking the link structure found in Wikipedia. Regardless all Wikification systems are faced with a key Disambiguation to .