Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "The ACL Anthology Searchbench"
Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ
Tải xuống
We describe a novel application for structured search in scientific digital libraries. The ACL Anthology Searchbench is meant to become a publicly available research tool to query the content of the ACL Anthology. The application provides search in both its bibliographic metadata and semantically analyzed full textual content. | The ACL Anthology Searchbench Ulrich Schafer Bernd Kiefer Christian Spurk Jorg Steffen Rui Wang Language Technology Lab German Research Center for Artificial Intelligence DFKI D-66123 Saarbrucken Germany ulrich.schaefer kiefer cspurk Steffen wang.rui @dfki.de http www.dfki.de lt Abstract We describe a novel application for structured search in scientific digital libraries. The ACL Anthology Searchbench is meant to become a publicly available research tool to query the content of the ACL Anthology. The application provides search in both its bibliographic metadata and semantically analyzed full textual content. By combining these two features very efficient and focused queries are possible. At the same time the application serves as a showcase for the recent progress in natural language processing NLP research and language technology. The system currently indexes the textual content of 7 500 anthology papers from 2002-2009 with predicateargument-like semantic structures. It also provides useful search filters based on bibliographic metadata. It will be extended to provide the full anthology content and enhanced functionality based on further NLP techniques. 1 Introduction and Motivation Scientists in all disciplines nowadays are faced with a flood of new publications every day. In addition more and more publications from the past become digitally available and thus even increase the amount. Finding relevant information and avoiding duplication of work have become urgent issues to be addressed by the scientific community. The organization and preservation of scientific knowledge in scientific publications vulgo text documents thwarts these efforts. From a viewpoint of 7 a computer scientist scientific papers are just unstructured information . At least in our own scientific community Computational Linguistics it is generally assumed that NLP could help to support search in such document collections. The ACL Anthology1 is a comprehensive electronic collection of .