tailieunhanh - Báo cáo khoa học: "Summarizing Definition from Wikipedia"
Wikipedia provides a wealth of knowledge, where the first sentence, infobox (and relevant sentences), and even the entire document of a wiki article could be considered as diverse versions of summaries (definitions) of the target topic. We explore how to generate a series of summaries with various lengths based on them. To obtain more reliable associations between sentences, we introduce wiki concepts according to the internal links in Wikipedia. | Summarizing Definition from Wikipedia Shiren Ye and Tat-Seng Chua and Jie Lu Lab of Media Search National University of Singapore yesr chuats luj @ Abstract Wikipedia provides a wealth of knowledge where the first sentence infobox and relevant sentences and even the entire document of a wiki article could be considered as diverse versions of summaries definitions of the target topic. We explore how to generate a series of summaries with various lengths based on them. To obtain more reliable associations between sentences we introduce wiki concepts according to the internal links in Wikipedia. In addition we develop an extended document concept lattice model to combine wiki concepts and non-textual features such as the outline and infobox. The model can concatenate representative sentences from non-overlapping salient local topics for summary generation. We test our model based on our annotated wiki articles which topics come from TREC-QA 2004-2006 evaluations. The results show that the model is effective in summarization and definition QA. 1 Introduction Nowadays ask Wikipedia has become as popular as Google it during Internet surfing as Wikipedia is able to provide reliable information about the concept entity that the users want. As the largest online encyclopedia Wikipedia assembles immense human knowledge from thousands of volunteer editors and exhibits significant contributions to NLP problems such as semantic relatedness word sense disambiguation and question answering QA . For a given definition query many search engines . specified by define in Google often place the first sentence of the corresponding wiki1 article at the top of the returned list. The use of 1 For readability we follow the upper lower case rule on web say web pages and on the Web and utilize one-sentence snippets provides a brief and concise description of the query. However users often need more information beyond such a one-sentence definition while feeling that the .
đang nạp các trang xem trước