tailieunhanh - Báo cáo khoa học: "An Expert Lexicon Approach to Identifying English Phrasal Verbs"

Like single-word verbs, each PV has its own lexical features including subcategorization features that determine its structural patterns [Fraser 1976; Bolinger 1971; Pelli 1976; Shaked 1994], ., look for has syntactic subcategorization and semantic features similar to those of search; carry on shares lexical features with continue. Such lexical features can be represented in the PV lexicon in the same way as those for single-word verbs, but a parser can only use them when the PV is identified. Problems like PVs are regarded as ‘a pain in the neck for NLP’ [Sag et al. 2002]. . | An Expert Lexicon Approach to Identifying English Phrasal Verbs Wei Li Xiuhong Zhang Cheng Niu Yuankai Jiang Rohini Srihari Cymfony Inc. 600 Essjay Road Williamsville NY 14221 USA wei xzhang cniu yjiang rohini @ Abstract Phrasal Verbs are an important feature of the English language. Properly identifying them provides the basis for an English parser to decode the related structures. Phrasal verbs have been a challenge to Natural Language Processing NLP because they sit at the borderline between lexicon and syntax. Traditional NLP frameworks that separate the lexicon module from the parser make it difficult to handle this problem properly. This paper presents a finite state approach that integrates a phrasal verb expert lexicon between shallow parsing and deep parsing to handle morpho-syntactic interaction. With precision recall combined performance benchmarked consistently at the Phrasal Verb identification problem has basically been solved with the presented method. 1 Introduction Any natural language processing NLP system needs to address the issue of handling multiword expressions including Phrasal Verbs PV Sag et al. 2002 Breidt et al. 1996 . This paper presents a proven approach to identifying English PVs based on pattern matching using a formalism called Expert Lexicon. Phrasal Verbs are an important feature of the English language since they form about one third of the English verb vocabulary. 1 Properly 1 For the verb vocabulary of our system based on machine-readable dictionaries and two Phrasal Verb dictionaries phrasal verb entries constitute of the entries. recognizing PVs is an important condition for English parsing. Like single-word verbs each PV has its own lexical features including subcategorization features that determine its structural patterns Fraser 1976 Bolinger 1971 Pelli 1976 Shaked 1994 . look for has syntactic subcategorization and semantic features similar to those of search shares lexical features