tailieunhanh - Báo cáo khoa học: "Three BioNLP Tools Powered by a Biological Lexicon"

In this paper, we demonstrate three NLP applications of the BioLexicon, which is a lexical resource tailored to the biology domain. The applications consist of a dictionary-based POS tagger, a syntactic parser, and query processing for biomedical information retrieval. Biological terminology is a major barrier to the accurate processing of literature within biology domain. In order to address this problem, we have constructed the BioLexicon using both manual and semiautomatic methods. We demonstrate the utility of the biology-oriented lexicon within three separate NLP applications. . | Three BioNLP Tools Powered by a Biological Lexicon Yutaka Sasaki 1 Paul Thompson1 John McNaught 1 2 Sophia Ananiadou1 2 1 School of Computer Science University of Manchester 2 National Centre for Text Mining MIB 131 Princess Street Manchester M1 7DN United Kingdom @ Abstract In this paper we demonstrate three NLP applications of the BioLexicon which is a lexical resource tailored to the biology domain. The applications consist of a dictionary-based POS tagger a syntactic parser and query processing for biomedical information retrieval. Biological terminology is a major barrier to the accurate processing of literature within biology domain. In order to address this problem we have constructed the BioLexicon using both manual and semiautomatic methods. We demonstrate the utility of the biology-oriented lexicon within three separate NLP applications. 1 Introduction Processing of biomedical text can frequently be problematic due to the huge number of technical terms and idiosyncratic usages of those terms. Sometimes general English words are used in different ways or with different meanings in biology literature. There are a number of linguistic resources that can be use to improve the quality of biological text processing. WordNet Fellbaum 1998 and the NLP Specialist Lexicon1 are dictionaries commonly used within biomedical NLP. WordNet is a general English thesaurus which additionally covers biological terms. However since WordNet is not targeted at the biology domain many biological terms and derivational relations are missing. The Specialist Lexicon is a syntactic lexicon of biomedical and general English words providing linguistic information about individual vocabulary items Browne et al 2003 . Whilst it contains a large number of biomedical terms its focus is on medical terms. Therefore some biology-specific terms . molecular biology terms are not the main target of the lexicon. In .

TỪ KHÓA LIÊN QUAN
crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.