Scientific report: "Can Click Patterns across User's Query Logs Predict Answers to Definition Questions?"

Alejandro Figueroa
Yahoo! Research Latin America
Blanco Encalada 2120, Santiago, Chile
afiguero@

Abstract

In this paper, we examine click patterns produced by users of the Yahoo! search engine when submitting definition questions. Regularities across these click patterns are then used to construct a large and heterogeneous training corpus for answer ranking. In a nutshell, answers are extracted from clicked web-snippets originating from any class of web-site, including Knowledge Bases (KBs). Nonanswers, on the other hand, are acquired from redundant pieces of text across web-snippets. The effectiveness of this corpus was assessed by training two state-of-the-art models, which were then used to identify answers to unseen queries. These testing queries were also submitted by search engine users, and their answer candidates were taken from their respective returned web-snippets. This corpus helped both techniques to finish with an accuracy higher than 70% and to predict over 85% of the answers clicked by users. In particular, our results underline the importance of non-KB training data.
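The corpus construction summarized in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the record layout (`Snippet`), the fragment splitting on the "..." truncation marker, the `min_repeats` threshold, and the rule that a fragment is "redundant" when it recurs under several distinct queries are all assumptions made for illustration, as is the toy log data in the usage note.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Snippet:
    """One web-snippet returned for a query (hypothetical record layout)."""
    text: str
    clicked: bool


def split_fragments(text):
    """Split a snippet into fragments on the '...' truncation marker."""
    return [part.strip() for part in text.split("...") if part.strip()]


def build_corpus(log, min_repeats=2):
    """Return (positives, negatives) from a {query: [Snippet, ...]} log.

    Assumption: a fragment is treated as redundant (a nonanswer) when it
    occurs under at least `min_repeats` distinct queries.
    """
    # Count, for every fragment, the number of distinct queries it appears under.
    query_count = Counter()
    for snippets in log.values():
        seen = set()
        for snippet in snippets:
            seen.update(split_fragments(snippet.text))
        query_count.update(seen)

    redundant = {frag for frag, n in query_count.items() if n >= min_repeats}

    # Answer candidates: non-redundant fragments of clicked snippets.
    positives = []
    for query, snippets in log.items():
        for snippet in snippets:
            if not snippet.clicked:
                continue
            for fragment in split_fragments(snippet.text):
                if fragment not in redundant:
                    positives.append((query, fragment))

    return positives, sorted(redundant)
```

With a made-up two-query log, the clicked fragments become answer candidates and the repeated site text becomes a nonanswer:

```python
log = {
    "who is benjamin millepied": [
        Snippet("Benjamin Millepied is a ballet choreographer ... Click here for more", True),
        Snippet("Latest news ... Click here for more", False),
    ],
    "tell me about bank of america": [
        Snippet("Bank of America is a financial institution ... Click here for more", True),
    ],
}
positives, negatives = build_corpus(log)
```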
1 Introduction

It is well known that definition queries are very popular among users of commercial search engines (Rose and Levinson, 2004). The essential characteristic of definition questions is that they aim to discover as much descriptive information as possible about the concept being defined (the definiendum, pl. definienda). Examples of this kind of query include "Who is Benjamin Millepied?" and "Tell me about Bank of America". It is standard practice for definition question answering (QA) systems to mine KBs (e.g., online encyclopedias and dictionaries) for reliable descriptive information on the definiendum (Sacaleanu et al., 2008). Normally, these pieces of information (i.e., nuggets) explain different facets of the definiendum (e.g., "ballet choreographer" and "born in Bordeaux"), and the main idea ...
