tailieunhanh - Báo cáo khoa học: "TAILORING LEXICALCHOICE TO THEUSER'S VOCABULARY IN MULTIMEDI A EXPLANATION GENERATION"

In this paper, we discuss the different strategies used in COMET (COordinated Multimedia Explanation Testbed) for selecting words with which the user is familiar. When pictures cannot be used to disambiguate a word or phrase, COMET has four strategies for avoiding unknown words. We give examples for each of these strategies and show how they are implementedin COMET. | TAILORING LEXICAL CHOICE TO THE USER S VOCABULARY IN MULTIMEDIA EXPLANATION GENERATION Kathleen McKeown Jacques Robin Michael Tanenblatt Department of Computer Science 450 Computer Science Building Columbia University New York . 10027 kathy robin tanenbla @ ABSTRACT In this paper we discuss the different strategies used in COMET coordinated Multimedia Explanation Testbed for selecting words with which the user is familiar. When pictures cannot be used to disambiguate a word or phrase COMET has four strategies for avoiding unknown words. We give examples for each of these strategies and show how they are implemented in COMET. 1. Introduction A language generation system should select words that its user knows. While this would seem to involve simply selecting a known word instead of an unknown word as is done for example in 1 in many cases it requires entữely rephrasing the rest of the sentence. For example in our domain of equipment maintenance and repair if the user does not know the word polarity a sentence like Check the polarity. will be rephrased as Make sure the plus on the battery lines up with the plus on the battery compartment. Even when alternative words can be usea instead of an unknown word . a descriptive expression can be used instead of an object name the alternative phrase may interact with other parts of the sentence which then need to be reworded as well. In this paper we discuss the different strategies used in COMET for selecting words with which the user is familiar. Since COMET integrates text and pictures in a single explanation1 unknown words are frequently disambiguated through accompanying pictures. For example when the accompanying picture clearly shows the object and its location COMET will use the most common object name even if the user is unfamiliar with the name2. When pictures cannot be used to disambiguate a word or phrase COMET has four strategies for avoiding unknown words 1. Selecting an alternative word .

TÀI LIỆU LIÊN QUAN