tailieunhanh - Báo cáo khoa học: "Language Resources Factory: case study on the acquisition of Translation Memories"

This paper demonstrates a novel distributed architecture to facilitate the acquisition of Language Resources. We build a factory that automates the stages involved in the acquisition, production, updating and maintenance of these resources. The factory is designed as a platform where functionalities are deployed as web services, which can be combined in complex acquisition chains using workflows. We show a case study, which acquires a Translation Memory for a given pair of languages and a domain using web services for crawling, sentence alignment and conversion to TMX. . | Language Resources Factory case study on the acquisition of Translation Memories Marc Poch UPF Barcelona Spain Antonio Toral DCU Dublin Ireland atoral@ Nuria Bel UPF Barcelona Spain Abstract This paper demonstrates a novel distributed architecture to facilitate the acquisition of Language Resources. We build a factory that automates the stages involved in the acquisition production updating and maintenance of these resources. The factory is designed as a platform where functionalities are deployed as web services which can be combined in complex acquisition chains using workflows. We show a case study which acquires a Translation Memory for a given pair of languages and a domain using web services for crawling sentence alignment and conversion to TMX. 1 Introduction A fundamental issue for many tasks in the field of Computational Linguistics and Language Technologies in general is the lack of Language Resources LRs to tackle them successfully especially for some languages and domains. It is the so-called LRs bottleneck. Our objective is to build a factory of LRs that automates the stages involved in the acquisition production updating and maintenance of LRs required by Machine Translation MT and by other applications based on Language Technologies. This automation will significantly cut down the required cost time and human effort. These reductions are the only way to guarantee the continuous supply of LRs that Language Technologies demand in a multilingual world. We would like to thank the developers of Soaplab Tav-erna myExperiment and Biocatalogue for solving our questions and attending our requests. This research has been partially funded by the EU project PANACEA 7FP-ICT-248064 . 2 Web Services and Workflows The factory is designed as a platform of web services WSs where the users can create and use these services directly or combine them in more complex chains. These chains are called workflows and can

TỪ KHÓA LIÊN QUAN
TÀI LIỆU MỚI ĐĂNG
31    254    0    01-05-2024
37    158    0    01-05-2024
75    138    0    01-05-2024
33    125    0    01-05-2024
2    110    0    01-05-2024
crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.