tailieunhanh - Báo cáo khoa học: "Support Vector Machines for Query-focused Summarization trained and evaluated on Pyramid data"

This paper presents the use of Support Vector Machines (SVM) to detect relevant information to be included in a queryfocused summary. Several SVMs are trained using information from pyramids of summary content units. Their performance is compared with the best performing systems in DUC-2005, using both ROUGE and autoPan, an automatic scoring method for pyramid evaluation. | Support Vector Machines for Query-focused Summarization trained and evaluated on Pyramid data Maria Fuentes TALP Research Center Universitat Politecnica de Catalunya mfuentes@ Enrique Alfonseca Computer Science Departament Universidad Autonoma de Madrid Horacio Rodríguez TALP Research Center Universitat Politecnica de Catalunya horacio@ Abstract This paper presents the use of Support Vector Machines SVM to detect relevant information to be included in a query-focused summary. Several SVMs are trained using information from pyramids of summary content units. Their performance is compared with the best performing systems in DUC-2005 using both ROUGE and autoPan an automatic scoring method for pyramid evaluation. 1 Introduction Multi-Document Summarization MDS is the task of condensing the most relevant information from several documents in a single one. In terms of the DUC contests1 a query-focused summary has to provide a brief well-organized fluent answer to a need for information described by a short query two or three sentences . DUC participants have to synthesize 250-word sized summaries for fifty sets of 25-50 documents in answer to some queries. In previous DUC contests from 2001 to 2004 the manual evaluation was based on a comparison with a single human-written model. Much information in the evaluated summaries both human and automatic was marked as related to the topic but not directly expressed in the model summary . Ideally this relevant information should be scored during the evaluation. The pyramid method Nenkova and Pas-sonneau 2004 addresses the problem by using multiple human summaries to create a gold-standard 1http projects duc and by exploiting the frequency of information in the human summaries in order to assign importance to different facts. However the pyramid method requires to manually matching fragments of automatic summaries peers to the Semantic Content Units SCUs in the

crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.