Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Development and Evaluation of a Broad-Coverage Probabilistic Grammar of English-Language Computer Manuals"
Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ
Tải xuống
We present an approach to grammar development where the task is decomposed into two separate subtasks. The first task is hnguistic, with the goal of producing a set of rules that have a large coverage (in the sense that the correct parse is among the proposed parses) on a bhnd test set of sentences. The second task is statistical, with the goal of developing a model of the grammar which assigns maximum probability for the correct parse. | Development and Evaluation of a Broad-Coverage Probabilistic Grammar of English-Language Computer Manuals Ezra Black John Lafferty Salim Roukos CblackIjlaffIroukosXtaatson.ibm.com IBM Thomas J. Watson Research Center P.O. Box 704 Yorktown Heights New York 10598 ABSTRACT We present an approach to grammar development where the task is decomposed into two separate subtasks. The first task is linguistic with the goal of producing a set of rules that have a large coverage in the sense that the correct parse is among the proposed parses on a blind test set of sentences. The second task is statistical with the goal of developing a model of the grammar which assigns maximum probability for the correct parse. We give parsing results on text from computer manuals. 1. Introduction Many language understanding systems and machine translation systems rely on a parser of English as the first step in processing an input sentence. The general impression may be that parsers with broad coverage of English are readily available. In an effort to gauge the state of the art in parsing the authors conducted an experiment in Summer 1990 in which 35 sentences all of length 13 words or less were selected randomly from a several-millionword corpus of Associated Press news wire. The sentences were parsed by four of the major large-coverage parsers for general English.1 Each of the authors working separately scored 140 parses for correctness of constituent boundaries constituent labels and part-of-speech labels. All that was required of parses was accuracy in delimiting and identifying obvious constituents such as noun phrases prepositional phrases and clauses along with at least rough correctness in assigning part-of-speech labels e.g. a noun could not be labelled as a verb. The tallies of each evaluator were compared and were identical or very close in all cases. The best-performing parser was correct for 60 of the sentences and the the remaining parsers were below 40 . More recently in early