tailieunhanh - Báo cáo khoa học: "Topical Keyphrase Extraction from Twitter"

Summarizing and analyzing Twitter content is an important and challenging task. In this paper, we propose to extract topical keyphrases as one way to summarize Twitter. We propose a context-sensitive topical PageRank method for keyword ranking and a probabilistic scoring function that considers both relevance and interestingness of keyphrases for keyphrase ranking. We evaluate our proposed methods on a large Twitter data set. Experiments show that these methods are very effective for topical keyphrase extraction. . | Topical Keyphrase Extraction from Twitter Wayne Xin Zhao Jing Jiang- Jing He Yang Song Palakorn Achananuparp-Ee-Peng Lim- Xiaoming Lit tSchool of Electronics Engineering and Computer Science Peking University School of Information Systems Singapore Management University batmanfly songyangmagic @ jingjiang eplim palakorna @ lxm@ Abstract Summarizing and analyzing Twitter content is an important and challenging task. In this paper we propose to extract topical keyphrases as one way to summarize Twitter. We propose a context-sensitive topical PageRank method for keyword ranking and a probabilistic scoring function that considers both relevance and interestingness of keyphrases for keyphrase ranking. We evaluate our proposed methods on a large Twitter data set. Experiments show that these methods are very effective for topical keyphrase extraction. 1 Introduction Twitter a new microblogging website has attracted hundreds of millions of users who publish short messages . tweets on it. They either publish original tweets or retweet . forward others tweets if they find them interesting. Twitter has been shown to be useful in a number of applications including tweets as social sensors of realtime events Sakaki et al. 2010 the sentiment prediction power of Twitter Tumasjan et al. 2010 etc. However current explorations are still in an early stage and our understanding of Twitter content still remains limited. How to automatically understand extract and summarize useful Twitter content has therefore become an important and emergent research topic. In this paper we propose to extract keyphrases as a way to summarize Twitter content. Traditionally keyphrases are defined as a short list of terms to summarize the topics of a document Turney 2000 . 379 It can be used for various tasks such as document summarization Litvak and Last 2008 and indexing Li et al. 2004 . While it appears natural to use keyphrases to summarize Twitter .

TỪ KHÓA LIÊN QUAN