tailieunhanh - Báo cáo khoa học: "Exploiting Feature Hierarchy for Transfer Learning in Named Entity Recognition"

We present a novel hierarchical prior structure for supervised transfer learning in named entity recognition, motivated by the common structure of feature spaces for this task across natural language data sets. The problem of transfer learning, where information gained in one learning task is used to improve performance in another related task, is an important new area of research. In the subproblem of domain adaptation, a model trained over a source domain is generalized to perform well on a related target domain, where the two domains’ data are distributed similarly, but not identically. . | Exploiting Feature Hierarchy for Transfer Learning in Named Entity Recognition Andrew Arnold Ramesh Nallapati and William W. Cohen Machine Learning Department Carnegie Mellon University Pittsburgh PA USA aarnold nmramesh wcohen @ Abstract We present a novel hierarchical prior structure for supervised transfer learning in named entity recognition motivated by the common structure of feature spaces for this task across natural language data sets. The problem of transfer learning where information gained in one learning task is used to improve performance in another related task is an important new area of research. In the subproblem of domain adaptation a model trained over a source domain is generalized to perform well on a related target domain where the two domains data are distributed similarly but not identically. We introduce the concept of groups of closely-related domains called genres and show how inter-genre adaptation is related to domain adaptation. We also examine multitask learning where two domains may be related but where the concept to be learned in each case is distinct. We show that our prior conveys useful information across domains genres and tasks while remaining robust to spurious signals not related to the target domain and concept. We further show that our model generalizes a class of similar hierarchical priors smoothed to varying degrees and lay the groundwork for future exploration in this area. 1 Introduction Problem definition Consider the task of named entity recognition NER . Specifically you are given a corpus of news articles in which all tokens have been labeled as either belonging to personal name mentions or not. The standard supervised machine learning problem is to learn a classifier over this training data that will successfully label unseen test data drawn from the same distribution as the training data where same distribution could mean anything from having the train and test articles written by the same author

TÀI LIỆU MỚI ĐĂNG