tailieunhanh - Báo cáo khoa học: "An Information-Theory-Based Feature Type Analysis for the Modelling of Statistical Parsing"

The paper proposes an information-theorybased method for feature types analysis in probabilistic evaluation modelling for statistical parsing. The basic idea is that we use entropy and conditional entropy to measure whether a feature type grasps some of the information for syntactic structure prediction. Our experiment quantitatively analyzes several feature types’ power for syntactic structure prediction and draws a series of interesting conclusions. | An Information-Theory-Based Feature Type Analysis for the Modelling of Statistical Parsing SUI Zhifang tt ZHAO Jun f Dekai WU f Hong Kong University of Science Technology Department of Computer Science Human Language Technology Center Clear Water Bay Hong Kong Peking University Department of Computer Science Technology Institute of Computational Linguistics Beijing China suizf@ zhaojun@ dekai@ Abstract The paper proposes an information-theorybased method for feature types analysis in probabilistic evaluation modelling for statistical parsing. The basic idea is that we use entropy and conditional entropy to measure whether a feature type grasps some of the information for syntactic structure prediction. Our experiment quantitatively analyzes several feature types power for syntactic structure prediction and draws a series of interesting conclusions. 1 Introduction In the field of statistical parsing various probabilistic evaluation models have been proposed where different models use different feature types Black 1992 Briscoe 1993 Brown 1991 Charniak 1997 Collins 1996 Collins 1997 Magerman 1991 Magerman 1992 Magerman 1995 Eisner 1996 . How to evaluate the different feature types effects for syntactic parsing The paper proposes an information-theory-based feature types analysis model which uses the measures of predictive information quantity predictive information gain predictive information redundancy and predictive information summation to quantitatively analyse the different contextual feature types or feature types combination s predictive power for syntactic structure. In the following Section 2 describes the probabilistic evaluation model for syntactic trees Section 3 proposes an information-theory-based feature type analysis model Section 4 introduces several experimental issues Section 5 quantitatively analyses the different contextual feature types or feature types combination in the view of information theory and draws a .

TÀI LIỆU LIÊN QUAN