tailieunhanh - Báo cáo khoa học: "The Corpora Management System Based on Java and Oracle Technologies"

The paper discusses the corpora management system (CMS) design that uses Java and Oracle9i DBMS to support strategic corpora analysis. We present the pilot webbased CMS to support linguists in their daily work. The system offers facilities to assist linguists and internet users as they search for relevant material, and then classify and annotate this material. | The Corpora Management System Based on Java and Oracle Technologies Serge Yablonsky Petersburg Transport University Computer Department Moscow av. 9 190031 Russia Russicon Company Kazanskaya str. 56 190000 Russia serge yablonsky@hotmail. com http Abstract The paper discusses the corpora management system CMS design that uses Java and Oracle9i DBMS to support strategic corpora analysis. We present the pilot webbased CMS to support linguists ìn theừ daily work. The system offers facilities to assist linguists and internet users as they search for relevant material and then classify and annotate this material. 1 Introduction There s a wide class of documental management solutions and products that fall under the rubric corpora and text mining . They are similar to data mining solutions in that they deal with large volumes of data but the difference between the two technology solutions is that while data mining extracts analyzes and summarizes numerical structured data text mining handles large volumes of unstructured text-based data. Document systems with large-scale linguistic annotation are used by a wide range of research and commercial applications. This paper presents a web-based text corpora development system CMS that focuses on the development of UML-specifications architecture and actual implementations of DBMS tools to support strategic corpora analysis. We present the basic features of a prototype corpora management system under development intended to support linguists in their daily work. The system offers facilities to assist linguists and internet users as they search for relevant material and then classify and annotate this material in a repository. The CMS is implemented using Java and commercial DBMS OracleQi. 2 System Overview The Corpora management system combines Java XML XSL HTML and Oracle9i components Yablonsky . 2002 . The system was by adapting existing and new DBMS .

TỪ KHÓA LIÊN QUAN
crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.