tailieunhanh - Báo cáo khoa học: "Automatic Determination of Parts of Speech of English Words"

The classifying of words according to syntactic usage is basic to language handling; this paper describes an algorithm for automatically classifying words according to thirteen commonly used parts of speech: noun, adjective, verb, past verb, adverb, preposition, conjunction, pronoun, interjection, present participle, past participle, auxiliary verb, and plural or collective noun. | Mechanical Translation and Computational Linguistics 4 September and December 1967 Automatic Determination of Parts of Speech of English Words by Lois L. Earl Lockheed Palo Alto Research Laboratory Palo Alto California Introduction This paper describes the development and details of a procedure for automatically assigning part-of-speech characteristics to English words largely from graphemic considerations. The development of the algorithm began with the observation of Dolby and Resnikoff1 that the parts of speech associated with one-syllable words are frequently noun or noun and adjective and verb while the parts of speech associated with multisyllable words are usually noun and adjective only. Development of a working part-of-speech algorithm required the study of exceptions to this general rule so that analytical subrules and exception lists sufficient to identify automatically all such exceptions could be derived. Two analyses were utilized for the isolation and study of exceptions 1 Exhaustive sorts of a 73 582-word dictionary on magnetic tape were used to separate words consistent with the general rule from those words that were not and to classify them. 2 Computer analysis of possible part-of-speech implications of affixes was carried out on the same dictionary. The algorithm developed utilizes a prepared dictionary of around nine hundred words and an affix list of less than two hundred entries. Parts of Speech Assigned and Their Abbreviations The tape dictionary used for both analyses contained 73 582 words with part-of-speech and word-status in- I wish to thank J. L. Dolby and H. L. Resnikoff who have acted as consultants on Office of Naval Research contract Nonr 4440 00 which supported this research. The classifying of words according to syntactic usage is basic to language handling this paper describes an algorithm for automatically classifying words according to thirteen commonly used parts of speech noun adjective verb past verb adverb .

TỪ KHÓA LIÊN QUAN