Scientific report: "PageRanking WordNet Synsets: An Application to Opinion Mining∗"

Andrea Esuli and Fabrizio Sebastiani
Istituto di Scienza e Tecnologie dell'Informazione, Consiglio Nazionale delle Ricerche
Via Giuseppe Moruzzi 1, 56124 Pisa, Italy

Abstract

This paper presents an application of PageRank, a random-walk model originally devised for ranking Web search results, to ranking WordNet synsets in terms of how strongly they possess a given semantic property. The semantic properties we use for exemplifying the approach are positivity and negativity, two properties of central importance in sentiment analysis. The idea derives from the observation that WordNet may be seen as a graph in which synsets are connected through the binary relation “a term belonging to synset s_k occurs in the gloss of synset s_i”, and on the hypothesis that this relation may be viewed as a transmitter of such semantic properties. The data for this relation can be obtained from eXtended WordNet, a publicly available sense-disambiguated version of WordNet. We argue that this relation is structurally akin to the relation between hyperlinked Web pages, and thus lends itself to PageRank analysis. We report experimental results supporting our intuitions.
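The graph view described in the abstract can be illustrated with a minimal sketch. It is not the implementation used in the paper: the four synsets and their gloss links are hypothetical toy data, the damping factor 0.85 is the conventional PageRank default, and the edge direction (from the synset s_k whose term occurs in a gloss to the synset s_i being defined) is an assumption made here for illustration.

```python
def pagerank(nodes, edges, damping=0.85, iters=100, tol=1e-10):
    """Power-iteration PageRank over a directed graph of (src, dst) pairs."""
    n = len(nodes)
    index = {node: i for i, node in enumerate(nodes)}
    out = [[] for _ in range(n)]
    for src, dst in edges:
        out[index[src]].append(index[dst])

    rank = [1.0 / n] * n
    for _ in range(iters):
        # Every node receives the uniform "random jump" share.
        new = [(1.0 - damping) / n] * n
        dangling = 0.0
        for i in range(n):
            if out[i]:
                share = damping * rank[i] / len(out[i])
                for j in out[i]:
                    new[j] += share
            else:
                # Nodes without out-links spread their mass uniformly.
                dangling += damping * rank[i]
        new = [value + dangling / n for value in new]
        delta = sum(abs(a - b) for a, b in zip(new, rank))
        rank = new
        if delta < tol:
            break
    return {node: rank[index[node]] for node in nodes}


# Hypothetical toy data: synset -> synsets whose terms occur in its gloss.
gloss_links = {
    "good": [],
    "pleasant": ["good"],
    "nice": ["pleasant", "good"],
    "table": [],
}

# Assumed direction: the property flows from s_k (in the gloss) to s_i (defined).
synsets = list(gloss_links)
edges = [(s_k, s_i) for s_i, in_gloss in gloss_links.items() for s_k in in_gloss]

scores = pagerank(synsets, edges)
for synset in sorted(scores, key=scores.get, reverse=True):
    print(f"{synset}: {scores[synset]:.4f}")
```

Reversing each pair in `edges` gives the opposite flow direction, so either reading of the relation can be tried with the same function.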
1 Introduction

Recent years have witnessed an explosion of work on opinion mining (aka sentiment analysis), the discipline that deals with the quantitative and qualitative analysis of text for the purpose of determining its opinion-related properties (ORPs). An important part of this research has been the work on the automatic determination of the ORPs of terms, as e.g. in determining whether an adjective tends to give a positive, a negative, or a neutral nature to the noun phrase it appears in. While many works Esuli and ...

∗ This work was partially supported by Project ONTOTEXT “From Text to Knowledge for the Semantic Web”, funded by the Provincia Autonoma di Trento under the 2004-2006 “Fondo Unico per la Ricerca” funding scheme.
