tailieunhanh - Báo cáo khoa học: "Optimal Head-Driven Parsing Complexity for Linear Context-Free Rewriting Systems"
We study the problem of finding the best headdriven parsing strategy for Linear ContextFree Rewriting System productions. A headdriven strategy must begin with a specified righthand-side nonterminal (the head) and add the remaining nonterminals one at a time in any order. We show that it is NP-hard to find the best head-driven strategy in terms of either the time or space complexity of parsing. | Optimal Head-Driven Parsing Complexity for Linear Context-Free Rewriting Systems Pierluigi Crescenzi Dip. di Sistemi e Informatica Universita di Firenze Daniel Gildea Computer Science Dept. University of Rochester Andrea Marino Dip. di Sistemi e Informatica Universita di Firenze Gianluca Rossi Dip. di Matematica Universita di Roma Tor Vergata Giorgio Satta Dip. di Ingegneria dell Informazione Universita di Padova Abstract We study the problem of finding the best head-driven parsing strategy for Linear Context-Free Rewriting System productions. A head-driven strategy must begin with a specified righthand-side nonterminal the head and add the remaining nonterminals one at a time in any order. We show that it is NP-hard to find the best head-driven strategy in terms of either the time or space complexity of parsing. 1 Introduction Linear Context-Free Rewriting Systems LCFRSs Vijay-Shankar et al. 1987 constitute a very general grammatical formalism which subsumes context-free grammars CFGs and tree adjoining grammars TAGs as well as the synchronous context-free grammars SCFGs and synchronous tree adjoining grammars STAGs used as models in machine LCFRSs retain the fundamental property of CFGs that grammar nonterminals rewrite independently but allow nonterminals to generate discontinuous phrases that is to generate more than one span in the string being produced. This important feature has been recently exploited by Maier and Sogaard 2008 and Kallmeyer and Maier 2010 for modeling phrase structure treebanks with discontinuous constituents and by Kuhlmann and Satta 2009 for modeling non-projective dependency treebanks. The rules of a LCFRS can be analyzed in terms of the properties of rank and fan-out. Rank is the 1To be more precise SCFGs and STAGs generate languages composed by pair of strings while LCFRSs generate string languages. We can abstract away from this difference by assuming concatenation of components in a string pair. 450 number of .
đang nạp các trang xem trước