tailieunhanh - Báo cáo khoa học: "Term Recognition Using Technical Dictionary Hierarchy"

In recent years, statistical approaches on ATR (Automatic Term Recognition) have achieved good results. However, there are scopes to improve the performance in extracting terms still further. For example, domain dictionaries can improve the performance in ATR. This paper focuses on a method for extracting terms using a dictionary hierarchy. Our method produces relatively good results for this task. | Term Recognition Using Technical Dictionary Hierarchy Jong-Hoon Oh KyungSoon Lee and Key-Sun Choi Computer Science Dept. Advanced Information TechnologyResearch Center AITrc and Korea Terminology Research Center for Language and Knowledge Engineering KORTERM Korea Advanced Institute of Science Technology KAIST Kusong-Dong Yusong-Gu Taejon 305-701 Republic of Korea rovellia kslee kschoi @ Abstract In recent years statistical approaches on ATR Automatic Term Recognition have achieved good results. However there are scopes to improve the performance in extracting terms still further. For example domain dictionaries can improve the performance in ATR. This paper focuses on a method for extracting terms using a dictionary hierarchy. Our method produces relatively good results for this task. Introduction In recent years statistical approaches on ATR Automatic Term Recognition Bourigault 1992 Dagan et al 1994 Justeson and Katz 1995 Frantzi 1999 have achieved good results. However there are scopes to improve the performance in extracting terms still further. For example the additional technical dictionaries can be used for improving the accuracy in extracting terms. Although the hardship on constructing an electronic dictionary was major obstacles for using an electronic technical dictionary in term recognition the increasing development of tools for building electronic lexical resources makes a new chance to use them in the field of terminology. From these endeavour a number of electronic technical dictionaries domain dictionaries have been acquired. Since newly produced terms are usually made out of existing terms dictionaries can be used as a source of them. For example distributed database is composed of distributed and database that are terms in a computer science domain. Further concepts and terms of a domain are frequently imported from related domains. For example the term Geographical Information System GIS is used not only in a computer science .

TỪ KHÓA LIÊN QUAN