tailieunhanh - Báo cáo khoa học: "Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political debates"

We investigate the expression of opinions about human entities in user-generated content (UGC). A set of 2,800 online news comments (8,000 sentences) was manually annotated, following a rich annotation scheme designed for this purpose. We conclude that the challenge in performing opinion mining in such type of content is correctly identifying the positive opinions, because (i) they are much less frequent than negative opinions and (ii) they are particularly exposed to verbal irony. | Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political debates Paula Carvalho University of Lisbon Faculty of Sciences LASIGE Lisbon Portugal pcc@ Jorge Teixeira Labs Sapo UP University of Porto Faculty of Engineering LIACC Porto Portugal jft@ Luís Sarmento Labs Sapo UP University of Porto Faculty of Engineering LIACC Porto Portugal las@ Mário J. Silva University of Lisbon Faculty of Sciences LASIGE Lisbon Portugal mjs@ Abstract We investigate the expression of opinions about human entities in user-generated content UGC . A set of 2 800 online news comments 8 000 sentences was manually annotated following a rich annotation scheme designed for this purpose. We conclude that the challenge in performing opinion mining in such type of content is correctly identifying the positive opinions because i they are much less frequent than negative opinions and ii they are particularly exposed to verbal irony. We also show that the recognition of human targets poses additional challenges on mining opinions from UGC since they are frequently mentioned by pronouns definite descriptions and nicknames. 1 Introduction Most of the existing approaches to opinion mining propose algorithms that are independent of the text genre the topic and the target involved. However practice shows that the opinion mining challenges are substantially different depending on these fac 564 tors whose interaction has not been exhaustively studied so far. This study focuses on identifying the most relevant challenges in mining opinions targeting media personalities namely politicians in comments posted by users to online news articles. We are interested in answering open research questions related to the expression of opinions about human entities in UGC. It has been suggested that the target identification is probably the easiest step in mining opinions on products using product reviews Liu 2010 . But is this also true for human targets namely .

TỪ KHÓA LIÊN QUAN
crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.