tailieunhanh - English Access To Hindi Information

We present C*ST*RD, a cross-language information delivery system that supports cross-language information retrieval, information space visualization and navigation, machine translation, and text summarization of single documents and clusters of documents. C*ST*RD was assembled and trained within one month, in the context of DARPA's Surprise Language Exercise, that selected as source a heretofore unstudied language, Hindi. Given the brief time, we could not create deep Hindi capabilities for all the modules, but instead experimented with combining | English Access To Hindi Information Cross-lingual C ST RD English Access to Hindi Information ANTON LEUSKI CHIN-YEW LIN LIANG ZHOU ULRICH GERMANN FRANZ JOSEF OCH and EDUARD HOVY Information Sciences Institute University of Southern California We present C ST RD a cross-language information delivery system that supports cross-language information retrieval information space visualization and navigation machine translation and text summarization of single documents and clusters of documents. C ST RD was assembled and trained within one month in the context of DARPA s Surprise Language Exercise that selected as source a heretofore unstudied language Hindi. Given the brief time we could not create deep Hindi capabilities for all the modules but instead experimented with combining shallow Hindi capabilities or even English-only modules into one integrated system. Various possible configurations with different tradeoffs in processing speed and ease of use enable the rapid deployment of C ST RD to new languages under various conditions. Categories and Subject Descriptors Artificial Intelligence Natural Language Processing machine translation text analysis languagegeneration Information Storage and Retrieval Information Search and Retrieval General Terms Design Experimentation Human Factors Languages Management Performance Additional Key Words and Phrases Cross-Language Information Retrieval Hindi-to-English Machine Translation Information Retrieval and Information Space Navigation Single- and MultiDocument Text Summarization Headline Generation 1. INTRODUCTION The goal of DARPA s 2003 TIDES Surprise Language Exercise was to test the Human Language Technology community s ability to rapidly create language tools for previously unresearched languages. We focused our attention on the task of providing human access to information that is available only in a language of which the user has little or no knowledge. During 29 days in June members of ISI s Natural .

crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.