tailieunhanh - Báo cáo khoa học: " An Intelligent Microblog Analysis and Summarization System"

This paper presents a system to summarize a Microblog post and its responses with the goal to provide readers a more constructive and concise set of information for efficient digestion. We introduce a novel two-phase summarization scheme. In the first phase, the post plus its responses are classified into four categories based on the intention, interrogation, sharing, discussion and chat. | IMASS An Intelligent Microblog Analysis and Summarization System Jui-Yu Weng Cheng-Lun Yang Bo-Nian Chen Yen-Kai Wang Shou-De Lin Department of Computer Science and Information Engineering National Taiwan University r98922060 r99944042 f92025 b97081 sdlin @ Abstract This paper presents a system to summarize a Microblog post and its responses with the goal to provide readers a more constructive and concise set of information for efficient digestion. We introduce a novel two-phase summarization scheme. In the first phase the post plus its responses are classified into four categories based on the intention interrogation sharing discussion and chat. For each type of post in the second phase we exploit different strategies including opinion analysis response pair identification and response relevancy detection to summarize and highlight critical information to display. This system provides an alternative thinking about machinesummarization by utilizing AI approaches computers are capable of constructing deeper and more user-friendly abstraction. 1 Introduction As Microblog services such as Twitter have become increasingly popular it is critical to reconsider the applicability of the existing NLP technologies on this new media sources. Take summarization for example a Microblog user usually has to browse through tens or even hundreds of posts together with their responses daily therefore it can be beneficial if there is an intelligent tool assisting summarizing those information. Automatic text summarization ATS has been investigated for over fifty years but the majority of the existing techniques might not be appropriate for Microblog write-ups. For instance a popular kind of approaches for summarization tries to identify a subset of information usually in sentence form from longer pieces of writings as summary Das and Martins 2007 . Such extraction-based 133 methods can hardly be applied to Microblog texts because many posts responses contain only one .

TỪ KHÓA LIÊN QUAN