tailieunhanh - Báo cáo khoa học: "Simple English Wikipedia: A New Text Simplification Task"

In this paper we examine the task of sentence simplification which aims to reduce the reading complexity of a sentence by incorporating more accessible vocabulary and sentence structure. We introduce a new data set that pairs English Wikipedia with Simple English Wikipedia and is orders of magnitude larger than any previously examined for sentence simplification. | Simple English Wikipedia A New Text Simplification Task William Coster Computer Science Department Pomona College Claremont CA 91711 wpc02009@ David Kauchak Computer Science Department Pomona College Claremont CA 91711 dkauchak@ Abstract In this paper we examine the task of sentence simplification which aims to reduce the reading complexity of a sentence by incorporating more accessible vocabulary and sentence structure. We introduce a new data set that pairs English Wikipedia with Simple English Wikipedia and is orders of magnitude larger than any previously examined for sentence simplification. The data contains the full range of simplification operations including rewording reordering insertion and deletion. We provide an analysis of this corpus as well as preliminary results using a phrase-based translation approach for simplification. 1 Introduction The task of text simplification aims to reduce the complexity of text while maintaining the content Chandrasekar and Srinivas 1997 Carroll et al. 1998 Feng 2008 . In this paper we explore the sentence simplification problem given a sentence the goal is to produce an equivalent sentence where the vocabulary and sentence structure are simpler. Text simplification has a number of important applications. Simplification techniques can be used to make text resources available to a broader range of readers including children language learners the elderly the hearing impaired and people with aphasia or cognitive disabilities Carroll et al. 1998 Feng 2008 . As a preprocessing step simplification can improve the performance of NLP tasks including parsing semantic role labeling machine translation and summarization Miwa et al. 2010 Jonnala-665 gadda et al. 2009 Vickrey and Koller 2008 Chan-drasekar and Srinivas 1997 . Finally models for text simplification are similar to models for sentence compression advances in simplification can benefit compression which has applications in mobile devices .

TỪ KHÓA LIÊN QUAN