Scientific report: "Resolving It, This, and That in Unrestricted Multi-Party Dialog"

Christoph Müller
EML Research gGmbH, Villa Bosch, Schloß-Wolfsbrunnenweg 33, 69118 Heidelberg, Germany

Abstract

We present an implemented system for the resolution of it, this, and that in transcribed multi-party dialog. The system handles NP-anaphoric as well as discourse-deictic anaphors, i.e. pronouns with VP antecedents. Selectional preferences for NP or VP antecedents are determined on the basis of corpus counts. Our results show that the system performs significantly better than a recency-based baseline.

1 Introduction

This paper describes a fully automatic system for resolving the pronouns it, this, and that in unrestricted multi-party dialog. The system processes manual transcriptions from the ICSI Meeting Corpus (Janin et al., 2003). The following is a short fragment from one of these transcripts. The letters FN in the speaker tag mean that the speaker is a female non-native speaker of English. The brackets and subscript numbers are not part of the original transcript.

FN083: Maybe you can also read through the - all the text which is on the web pages cuz I'd like to change the text a bit cuz sometimes [it]_1's too long sometimes [it]_2's too short inbreath maybe the English is not that good so inbreath um but anyways - So I tried to do [this]_3 today and if you could do [it]_4 afterwards [it]_5 would be really nice cuz I'm quite sure that I can't find every like orthographic mistake in [it]_6 or something. (Bns003)

For each of the six 3rd-person pronouns in the example, the task is to automatically identify its referent, i.e. the entity (if any) to which the speaker makes reference. Once a referent has been identified, the pronoun is resolved by linking it to one of its antecedents, i.e. one of the referent's earlier mentions. For humans, identification of a pronoun's referent is often easy: [it]_1, [it]_2, and [it]_6 are probably used to refer to the text on the web pages, while [it]_4 is probably used
