tailieunhanh - Báo cáo khoa học: "Behind the Article: Recognizing Dialog Acts in Wikipedia Talk Pages"

In this paper, we propose an annotation schema for the discourse analysis of Wikipedia Talk pages aimed at the coordination efforts for article improvement. We apply the annotation schema to a corpus of 100 Talk pages from the Simple English Wikipedia and make the resulting dataset freely available for download1 . Furthermore, we perform automatic dialog act classification on Wikipedia discussions and achieve an average F1 -score of with our classification pipeline. | Behind the Article Recognizing Dialog Acts in Wikipedia Talk Pages Oliver Ferschkf- Iryna Gurevych and Yevgen Chebotar f Ubiquitous Knowledge Processing Lab UKP-DIPF German Institute for Educational Research and Educational Information ị Ubiquitous Knowledge Processing Lab UKP-TUDA Department of Computer Science Technische Universitat Darmstadt http Abstract In this paper we propose an annotation schema for the discourse analysis of Wikipedia Talk pages aimed at the coordination efforts for article improvement. We apply the annotation schema to a corpus of 100 Talk pages from the Simple English Wikipedia and make the resulting dataset freely available for download1. Furthermore we perform automatic dialog act classification on Wikipedia discussions and achieve an average Fl-score of with our classification pipeline. 1 Introduction Over the past decade the paradigm of information sharing in the web has shifted towards participatory and collaborative content production. Texts are no longer exclusively prepared by individuals and then shared with the community. They are increasingly created collaboratively by multiple authors and iteratively revised by the community. When researchers first conducted surveys on professional writers in the 1980s they found that the collaborative writing process differs considerably from the way individual writing is done Posner and Baecker 1992 . In joint writing the writers have to externalize processes that are otherwise not made explicit like the planning and the organization of the text. The authors have to communicate how the text should be written and what exactly it should contain. Today many tools are available that support collaborative writing. A tool that has particularly taken hold is the Wiki a web-based asyn 1http data wikidiscourse chronous co-authoring tool. A unique characteristic of Wikis is the documentation of the edit history which keeps track of every change that

TỪ KHÓA LIÊN QUAN