Scientific report: "PARSING VS. TEXT PROCESSING IN THE ANALYSIS OF DICTIONARY DEFINITIONS"

Thomas Ahlswede and Martha Evens
Computer Science Dept., Illinois Institute of Technology, Chicago, IL 60616
312-567-5153

ABSTRACT

We have analyzed definitions from Webster's Seventh New Collegiate Dictionary using Sager's Linguistic String Parser and again using basic UNIX text processing utilities such as grep and awk. This paper evaluates both procedures, compares their results, and discusses possible future lines of research exploiting and combining their respective strengths.

Introduction

As natural language systems grow more sophisticated, they need larger and more detailed lexicons. Efforts to automate the process of generating lexicons have been going on for years, and have often been combined with the analysis of machine-readable dictionaries. Since 1979, a group at IIT under the leadership of Martha Evens has been using the machine-readable version of Webster's Seventh New Collegiate Dictionary (W7) in text generation, information retrieval, and the theory of lexical-semantic relations. This paper describes some of our recent work in extracting semantic information from W7, primarily in the form of word pairs linked by lexical-semantic relations.
We have used two methods: parsing definitions with Sager's Linguistic String Parser (LSP), and text processing with a combination of UNIX utilities and interactive editing. We will use the terms "parsing" and "text processing" here primarily with reference to our own use of the LSP and UNIX utilities respectively, but will also use them more broadly. "Parsing" in this more general sense will mean a computational technique of text analysis drawing on an extensive database of linguistic knowledge, such as the lexicon, syntax, and/or semantics of English; "text processing" will refer to any computational technique that involves little or no such knowledge.

This research is supported by National Science Foundation grant IST 87-03580. Our thanks also to the G. & C. Merriam Company for ...
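To make the notion of "text processing" in the broad sense concrete, the following is a minimal Python sketch of pattern-based extraction of word pairs from definition text, using no linguistic knowledge beyond a few surface formulas. It is not the authors' procedure, which relied on UNIX utilities and interactive editing. The sample definitions, the relation labels (`RELATED_TO`, `YOUNG_OF`, `CAUSE_TO_BE`), and the function name `extract_pairs` are all assumptions made for illustration; the definitions are invented and are not quoted from W7.

```python
import re

# Hypothetical definition texts, invented for illustration (not quoted from W7).
DEFINITIONS = {
    "canine": "of or relating to dogs",
    "lamb": "a young sheep",
    "whiten": "to make white",
    "oak": "any of a genus of trees",
}

# Each entry pairs an assumed relation label with a surface formula.
# The labels are illustrative, not the paper's relation inventory.
PATTERNS = [
    ("RELATED_TO", re.compile(r"^of or relating to (?:an? |the )?(\w+)")),
    ("YOUNG_OF", re.compile(r"^an? young (\w+)")),
    ("CAUSE_TO_BE", re.compile(r"^to make (\w+)")),
]


def extract_pairs(definitions):
    """Return (headword, relation, related_word) triples for definitions
    that begin with one of the known formulas; others are skipped."""
    pairs = []
    for headword, text in definitions.items():
        for relation, pattern in PATTERNS:
            match = pattern.search(text.lower())
            if match:
                pairs.append((headword, relation, match.group(1)))
                break  # keep only the first matching formula per definition
    return pairs


if __name__ == "__main__":
    for triple in extract_pairs(DEFINITIONS):
        print(triple)
```

A definition that fits none of the formulas (here, the one for "oak") is passed over. This illustrates the trade-off with parsing: matching surface patterns is cheap, but it covers only the phrasings anticipated in advance.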
