tailieunhanh - Báo cáo khoa học: "OPTIMIZING THE COMPUTATION ALL EXICALIZATION OF LARGE GRAMMARS"

The computational lexicalization of a grammar is the optimization of the links between lexicalized rules and lexical items in order to improve the quality of the bottom-up filtering during parsing. This problem is N P - c o m p l e t e and untractable on large grammars. An approximation algorithm is presented. The quality of the suboptimal solution is evaluated on real-world grammars as well as on randomly generated ones. | OPTIMIZING THE COMPUTATIONAL LEXICALIZATION OF LARGE GRAMMARS Christian JACQUEMIN Institut de Recherche en Informatique de Nantes IRIN IUT de Nantes - 3 rue du Maréchal Joffre F-44041 NANTES Cedex 01 - FRANCE e-mail jacquemin@ Abstract The computational lexicalization of a grammar is the optimization of the links between lexicalized rules and lexical items in order to improve the quality of the bottom-up filtering during parsing. This problem is NP-complete and untractable on large grammars. An approximation algorithm is presented. The quality of the suboptimal solution is evaluated on real-world grammars as well as on randomly generated ones. Introduction Lexicalized grammar formalisms and more specifically Lexicalized Tree Adjoining Grammars LTAGs give a lexical account of phenomena which cannot be considered as purely syntactic Schabes et al 1990 . A formalism is said to be lexicalized if it is composed of structures or rules associated with each lexical item and operations to derive new structures from these elementary ones. The choice of the lexical anchor of a rule is supposed to be determined on purely linguistic grounds. This is the linguistic side of lexicalization which links to each lexical head a set of minimal and complete sttuctures. But lexicalization also has a computational aspect because parsing algorithms for lexicalized grammars can take advantage of lexical links through a two-step sttategy Schabes and Joshi 1990 . The first step is the selection of the set of rules or elementary structures associated with the lexical items in the input sentence1. In the second step the parser uses the rules filtered by the first step. The two kinds of anchors corresponding to these two aspects of lexicalization can be considered separately The linguistic anchors are used to access the grammar update the data gather together items with similar structures organize the grammar into a hierarchy. The computational anchors are used to .

crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.