Scientific report: "An Ontology-based Semantic Tagger for IE system"
Narjes Boufaden
Department of Computer Science, Université de Montréal, Quebec H3C 3J7, Canada
boufaden@

Abstract

In this paper, we present a method for the semantic tagging of word chunks extracted from a written transcription of conversations. This work is part of an ongoing project for an information extraction system in the field of maritime Search And Rescue (SAR). Our purpose is to automatically annotate parts of texts with concepts from a SAR ontology. Our approach combines two knowledge sources, a SAR ontology and the Wordsmyth dictionary-thesaurus, and uses a similarity measure for the classification. Evaluation is carried out by comparing the output of the system with the key answers of predefined extraction templates.

1 Introduction

This work is part of a project aiming to implement an information extraction (IE) system in the field of maritime Search And Rescue (SAR). The project was originally conducted by the Defense Research Establishment Valcartier (DREV) to develop a decision support tool that helps produce SAR plans from the information extracted by the SAR IE system from a collection of transcribed dialogs. The goal of our project is to develop a robust approach to extracting relevant words from small-scale corpora and transcribed speech dialogs.
To achieve this task, we developed a semantic tagger, which annotates words with domain-specific information, and a selection process that extracts or rejects a word according to its semantic tag and its context. The rationale behind our approach is that the relevance of a word depends strongly on how close it is to the SAR domain and on its context of use. We believe that reasoning on semantic tags instead of the words themselves is a way of getting around some of the problems of small-scale corpora. In this paper, we focus on semantic tagging based on a domain-specific ontology, a dictionary-thesaurus, and the overlapping coefficient similarity measure (Manning and Schütze, 2001) to ...
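
To make the similarity measure concrete, the following is a minimal Python sketch of the overlap coefficient in its usual set-based form, |X ∩ Y| / min(|X|, |Y|), applied to a toy tagging step. It is an illustration under stated assumptions, not the paper's implementation: the function names, the concept labels, and the word sets in CONCEPT_WORDS are hypothetical, and the way the actual system derives word sets from the SAR ontology and the Wordsmyth dictionary-thesaurus is not shown in this excerpt.

```python
def overlap_coefficient(x, y):
    """Set-based overlap coefficient: |X & Y| / min(|X|, |Y|)."""
    x, y = set(x), set(y)
    if not x or not y:
        # Convention chosen for this sketch: an empty set overlaps with nothing.
        return 0.0
    return len(x & y) / min(len(x), len(y))


def tag_chunk(chunk_words, concept_words):
    """Return (concept, score) for the concept whose word set overlaps most.

    Returns (None, 0.0) when no concept shares a word with the chunk.
    """
    best_concept, best_score = None, 0.0
    for concept, words in concept_words.items():
        score = overlap_coefficient(chunk_words, words)
        if score > best_score:
            best_concept, best_score = concept, score
    return best_concept, best_score


# Hypothetical toy data: these labels and word sets are invented for
# illustration and are not taken from the SAR ontology or from Wordsmyth.
CONCEPT_WORDS = {
    "VESSEL": {"boat", "ship", "vessel", "craft"},
    "WEATHER": {"wind", "fog", "storm", "wave"},
}

if __name__ == "__main__":
    print(tag_chunk({"fishing", "boat"}, CONCEPT_WORDS))
```

Because the denominator is the size of the smaller set, a short word chunk that is fully contained in a concept's word set scores 1.0 regardless of how large that word set is, which is one reason this measure can suit short chunks.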