tailieunhanh - Báo cáo khoa học: "State-of-the-art NLP Approaches to Coreference Resolution: Theory and Practical Recipes

The identification of different nominal phrases in a discourse as used to refer to the same (discourse) entity is essential for achieving robust natural language understanding (NLU). The importance of this task is directly amplified by the field of Natural Language Processing (NLP) currently moving towards high-level linguistic tasks requiring NLU capabilities such as . recognizing textual entailment. This tutorial aims at providing the NLP community with a gentle introduction to the task of coreference resolution from both a theoretical and an application-oriented perspective. . | State-of-the-art NLP Approaches to Coreference Resolution Theory and Practical Recipes Simone Paolo Ponzetto Seminar fur Computerlinguistik University of Heidelberg ponzetto@ Massimo Poesio DISI University of Trento 1 Introduction The identification of different nominal phrases in a discourse as used to refer to the same discourse entity is essential for achieving robust natural language understanding NLU . The importance of this task is directly amplified by the field of Natural Language Processing NLP currently moving towards high-level linguistic tasks requiring NLU capabilities such as . recognizing textual entailment. This tutorial aims at providing the NLP community with a gentle introduction to the task of coreference resolution from both a theoretical and an application-oriented perspective. Its main purposes are 1 to introduce a general audience of NLP researchers to the core ideas underlying state-of-the-art computational models of coreference 2 to provide that same audience with an overview of NLP applications which can benefit from coreference information. 2 Content Overview 1. Introduction to machine learning approaches to coreference resolution. We start by focusing on machine learning based approaches developed in the seminal works from Soon et al. 2001 and Ng Cardie 2002 . We then analyze the main limitations of these approaches . their clustering of mentions from a local pairwise classification of nominal phrases in text. We finally move on to present more complex models which attempt to model coreference as a global discourse phenomenon Yang et al. 2003 Luo et al. 2004 Daume III Marcu 2005 inter alia . 2. Lexical and encyclopedic knowledge for coreference resolution. Resolving anaphors to their correct antecedents requires in many cases lexical and encyclopedic knowledge. We accordingly introduce approaches which attempt to include semantic information into the coreference models from a variety of

TÀI LIỆU LIÊN QUAN