Scientific report: "In-domain Relation Discovery with Meta-constraints via Posterior Regularization"

Harr Chen, Edward Benson, Tahira Naseem, and Regina Barzilay
Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology
{harr, eob, tahira, regina}@

Abstract

We present a novel approach to discovering relations and their instantiations from a collection of documents in a single domain. Our approach learns relation types by exploiting meta-constraints that characterize the general qualities of a good relation in any domain. These constraints state that instances of a single relation should exhibit regularities at multiple levels of linguistic structure, including lexicography, syntax, and document-level context. We capture these regularities via the structure of our probabilistic model as well as a set of declaratively-specified constraints enforced during posterior inference. Across two domains, our approach successfully recovers hidden relation structure, comparable to or outperforming previous state-of-the-art approaches. Furthermore, we find that a small set of constraints is applicable across the domains, and that using domain-specific constraints can further improve performance. (1)

1 Introduction

In this paper, we introduce a novel approach for the unsupervised learning of relations and their instantiations from a set of in-domain documents. Given a collection of news articles about earthquakes, for example, our method discovers relations such as the earthquake's location and resulting damage, and extracts phrases representing the relations' instantiations. Clusters of similar in-domain documents are

(1) The source code for this work is available at http rbg code relatiomextraction

A strong earthquake rocked the Philippine island of Mindoro early Tuesday, destroying some homes (arg).
A strong earthquake hit the China-Burma border early Wednesday. The official Xinhua News Agency said some houses (arg) were damaged (ind).
A strong earthquake
