tailieunhanh - Scientific report: "Using WordNet to Automatically Deduce Relations between Words in Noun-Noun Compounds"

Fintan J. Costello, School of Computer Science, University College Dublin, Dublin 6, Ireland
Tony Veale, Department of Computer Science, University College Dublin, Dublin 6, Ireland
Simon Dunne, Department of Computer Science, University College Dublin, Dublin 6, Ireland, sdunne@

Abstract

We present an algorithm for automatically disambiguating noun-noun compounds by deducing the correct semantic relation between their constituent words. This algorithm uses a corpus of 2,500 compounds annotated with WordNet senses and covering 139 different semantic relations (we make this corpus available online for researchers interested in the semantics of noun-noun compounds). The algorithm takes as input the WordNet senses for the nouns in a compound, finds all parent senses (hypernyms) of those senses, and searches the corpus for other compounds containing any pair of those senses. The relation with the highest proportional cooccurrence with any sense pair is returned as the correct relation for the compound. This algorithm was tested using a leave-one-out procedure on the corpus of compounds. The algorithm identified the correct relations for compounds with high precision: in 92% of cases where a relation was found with a proportional cooccurrence of [...], it was the correct relation for the compound being disambiguated.
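The procedure summarised in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the hypernym table stands in for WordNet, the sense labels, relation names and four-compound corpus are all hypothetical, and "proportional cooccurrence" is assumed here to mean the share of corpus compounds containing a given sense pair that carry a given relation. Breaking ties by raw count is a further assumption, and the leave-one-out function reports plain accuracy rather than the precision measure used in the paper.

```python
from collections import Counter, defaultdict
from itertools import product

# Hypothetical stand-in for WordNet: each sense maps to its direct hypernyms.
HYPERNYMS = {
    "kitchen": ["room"], "room": ["area"], "garden": ["area"],
    "knife": ["tool"], "spade": ["tool"], "tool": ["artifact"],
    "steel": ["metal"], "iron": ["metal"], "metal": ["substance"],
}

# Hypothetical annotated corpus: (modifier sense, head sense, relation).
CORPUS = [
    ("kitchen", "knife", "located_in"),
    ("garden", "spade", "located_in"),
    ("steel", "knife", "made_of"),
    ("iron", "spade", "made_of"),
]


def ancestors(sense):
    """Return the sense together with all of its hypernyms, transitively."""
    found, stack = set(), [sense]
    while stack:
        current = stack.pop()
        if current not in found:
            found.add(current)
            stack.extend(HYPERNYMS.get(current, []))
    return found


def sense_pairs(modifier, head):
    """All (modifier-side sense, head-side sense) pairs for one compound."""
    return set(product(ancestors(modifier), ancestors(head)))


def build_index(corpus):
    """Map each sense pair to a count of the relations observed with it."""
    index = defaultdict(Counter)
    for modifier, head, relation in corpus:
        for pair in sense_pairs(modifier, head):
            index[pair][relation] += 1
    return index


def deduce_relation(modifier, head, corpus):
    """Return (relation, proportion) with the highest proportional cooccurrence."""
    index = build_index(corpus)
    best_relation, best_key = None, (0.0, 0)
    for pair in sorted(sense_pairs(modifier, head)):
        counts = index.get(pair)
        if not counts:
            continue
        total = sum(counts.values())
        for relation, n in sorted(counts.items()):
            key = (n / total, n)  # assumption: ties are broken by raw count
            if key > best_key:
                best_relation, best_key = relation, key
    return best_relation, best_key[0]


def leave_one_out(corpus):
    """Hold out each compound in turn and predict it from the remainder."""
    hits = 0
    for i, (modifier, head, gold) in enumerate(corpus):
        rest = corpus[:i] + corpus[i + 1:]
        predicted, _ = deduce_relation(modifier, head, rest)
        hits += predicted == gold
    return hits / len(corpus)
```

With this toy data, `deduce_relation("garden", "knife", CORPUS)` finds no compound sharing the exact senses but matches on the more general pair of "area" and "tool", which is the role the hypernym expansion plays in the description above.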
Keywords: Noun-Noun Compounds, Conceptual Combination, Word Relations, WordNet

1 Introduction

Noun-noun compounds are short phrases made up of two or more nouns. These compounds are common in everyday language and are especially frequent and important in technical documents (Justeson and Katz (1995) report that such phrases form the majority of the technical content of the scientific and technical documents surveyed). Understanding these compounds requires the listener or reader to infer the correct semantic relationship between the words making up the [...]
