Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Combining Multiple Knowledge Sources for Discourse Segmentation"

Kiều Trinh 59 8 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

We predict discourse segment boundaries from linguistic features of utterances, using a corpus of spoken narratives as data. We present two methods for developing segmentation algorithms from training data: hand tuning and machine learning. When multiple types of features are used, results approach human performance on an independent test set (both methods), and using cross-validation (machine learning). | Combining Multiple Knowledge Sources for Discourse Segmentation Diane J. Litman AT T Bell Laboratories 600 Mountain Avenue Murray Hill NJ 07974 diane@research.att.com Rebecca J. Passonneau Bellcore 445 South Street Morristown NJ 07960 beck@bellcore.com Abstract We predict discourse segment boundaries from linguistic features of utterances using a corpus of spoken narratives as data. We present two methods for developing segmentation algorithms from training data hand tuning and machine learning. When multiple types of features are used results approach human performance on an independent test set both methods and using cross-validation machine learning . 1 Introduction Many have argued that discourse has a global structure above the level of individual utterances and that linguistic phenomena like prosody cue phrases and nominal reference are partly conditioned by and reflect this structure cf. Grosz and Hirschberg 1992 Grosz and Sidner 1986 Hirschberg and Grosz 1992 Hirschberg and Litman 1993 Hirschberg and Pierrehumbert 1986 Hobbs 1979 Lascarides and Oberlander 1992 Linde 1979 Mann and Thompson 1988 Polanyi 1988 Reichman 1985 Webber 1991 . However an obstacle to exploiting the relation between global structure and linguistic devices in natural language systems is that there is too little data about how they constrain one another. We have been engaged in a study addressing this gap. In previous work Passonneau and Litman 1993 we reported on a method for empirically validating global discourse units and on our evaluation of algorithms to identify these units. We found significant agreement among naive subjects on a discourse segmentation task which suggests that global discourse units have some objective reality. However we also found poor correlation of three untuned algorithms based on features of referential noun phrases cue words and pauses respectively with the subjects segmentations. In this paper we discuss two methods for developing segmentation algorithms

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation"

Báo cáo khoa học: "Combining Textual Entailment and Argumentation Theory for Supporting Online Debates Interactions"

Báo cáo khoa học: "Combining Word-Level and Character-Level Models for Machine Translation Between Closely-Related Languages"

Báo cáo khoa học: "Deciphering Foreign Language by Combining Language Models and Context Vectors"

Báo cáo khoa học: "Combining data and mathematical models of language change"

Báo cáo khoa học: "Combining Orthogonal Monolingual and Multilingual Sources of Evidence for All Words WSD"

Báo cáo khoa học: "Combining Indicators of Allophony"

Báo cáo khoa học: "Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding"

Báo cáo khoa học: "Combining Morpheme-based Machine Translation with Post-processing Morpheme Prediction"

Báo cáo khoa học: "Combining POMDPs trained with User Simulations and Rule-based Dialogue Management in a Spoken Dialogue System"

crossorigin="anonymous">

Đã phát hiện trình chặn quảng cáo AdBlock

Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.