tailieunhanh - Scientific report: "A Best Practices Guided Development Environment for Information Extraction"

WizIE: A Best Practices Guided Development Environment for Information Extraction

Yunyao Li, Laura Chiticariu, Huahai Yang, Frederick R. Reiss, Arnaldo Carreno-fuentes
IBM Research - Almaden, 650 Harry Road, San Jose, CA 95120
{yunyaoli, chiti, hyang, frreiss, acarren}@

Abstract

Information extraction (IE) is becoming a critical building block in many enterprise applications. To satisfy the increasing text analytics demands of these applications, it is crucial to enable developers with a general computer science background to develop high-quality IE extractors. In this demonstration, we present WizIE, an IE development environment intended to reduce the development life cycle and enable developers with little or no linguistic background to write high-quality IE rules. WizIE provides an integrated wizard-like environment that guides IE developers step by step throughout the entire development process, based on best practices synthesized from the experience of expert developers. In addition, WizIE reduces the manual effort involved in performing key IE development tasks by offering automatic result explanation and rule discovery functionality. Preliminary results indicate that WizIE is a step forward towards enabling extractor development for novice IE developers.

1 Introduction

Information Extraction (IE) refers to the problem of extracting structured information from unstructured or semi-structured text. It has been well studied by the Natural Language Processing research community for a long time. In recent years, IE has emerged as a critical building block in a wide range of enterprise applications, including financial risk analysis, social media analytics, and regulatory compliance, among many others. An important practical challenge driven by the use of IE in these applications is usability (Chiticariu et al., 2010c): specifically, how to enable the ease of development and maintenance of high-quality information extraction rules, also known as annotators or .
