tailieunhanh - Báo cáo khoa học: "Information Fusion in the Context of Multi-Document Summarization"

We present a method to automatically generate a concise s u m m a r y by identifying and synthesizing similar elements across related text from a set of multiple documents. Our approach is unique in its usage of language generation to reformulate the wording of the summary. Information overload has created an acute need for summarization. Typically, the same information is described by many different online documents. | Information Fusion in the Context of Multi-Document Summarization Regina Barzilay and Kathleen R. McKeown Dept of Computer Science Columbia University New York NY 10027 USA Michael Elhadad Dept of Computer Science Ben-Gurion University Beer-Sheva Israel Abstract We present a method to automatically generate a concise summary by identifying and synthesizing similar elements across related text from a set of multiple documents. Our approach is unique in its usage of language generation to reformulate the wording of the summary. 1 Introduction Information overload has created an acute need for summarization. Typically the same information is described by many different online documents. Hence summaries that synthesize common information across documents and emphasize the differences would significantly help readers. Such a summary would be beneficial for example to a user who follows a single event through several newswires. In this paper we present research on the automatic fusion of similar information across multiple documents using language generation to produce a concise summary. We propose a method for summarizing a specific type of input news articles presenting different descriptions of the same event. Hundreds of news stories on the same event are produced daily by news agencies. Repeated information about the event is a good indicator of its impor-tancy to the event and can be used for summary generation. Most research on single document summarization particularly for domain independent tasks uses sentence extraction to produce a summary Lin and Hovy 1997 Marcu 1997 Salton et al. 1991 . In the case of multidocument summarization of articles about the same event the original articles can include both similar and contradictory information. Extracting all similar sentences would produce a verbose and repetitive summary while ex tracting some similar sentences could produce a summary biased towards some sources. Instead we move beyond sentence extraction using a

crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.