tailieunhanh - Báo cáo khoa học: "A Bootstrapping Approach to Unsupervised Detection of Cue Phrase Variants"

We investigate the unsupervised detection of semi-fixed cue phrases such as “This paper proposes a novel approach. . . 1 ” from unseen text, on the basis of only a handful of seed cue phrases with the desired semantics. The problem, in contrast to bootstrapping approaches for Question Answering and Information Extraction, is that it is hard to find a constraining context for occurrences of semi-fixed cue phrases. Our method uses components of the cue phrase itself, rather than external context, to bootstrap. . | A Bootstrapping Approach to Unsupervised Detection of Cue Phrase Variants Rashid M. Abdalla and Simone Teufel Computer Laboratory University of Cambridge 15 JJ Thomson Avenue Cambridge CB3 OFD UK rma33@ sht25@ Abstract We investigate the unsupervised detection of semi-fixed cue phrases such as This paper proposes a novel approach. 1 from unseen text on the basis of only a handful of seed cue phrases with the desired semantics. The problem in contrast to bootstrapping approaches for Question Answering and Information Extraction is that it is hard to find a constraining context for occurrences of semi-fixed cue phrases. Our method uses components of the cue phrase itself rather than external context to bootstrap. It successfully excludes phrases which are different from the target semantics but which look superficially similar. The method achieves 88 accuracy outperforming standard bootstrapping approaches. 1 Introduction Cue phrases such as This paper proposes a novel approach to. no method for. exists or even you will hear from my lawyer are semi-fixed in that they constitute a formulaic pattern with a clear semantics but with syntactic and lexical variations which are hard to predict and thus hard to detect in unseen text . a new algorithm for .is suggested in the current paper or I envisage legal action . In scientific discourse such metadiscourse Myers 1992 Hyland 1998 abounds and plays an important role in marking the discourse structure of the texts. Finding these variants can be useful for many text understanding tasks because semi-fixed cue phrases act as linguistic markers indicating the importance and or the rhetorical role of some adjacent text. For the summarisation of scientific dn contrast to standard work in discourse linguistics which mostly considers sentence connectives and adverbials as cue phrases our definition includes longer phrases sometimes even entire sentences. papers cue phrases such as Our paper deals with. . . are .