tailieunhanh - Báo cáo khoa học: "Loosely Tree-Based Alignment for Machine Translation"

We augment a model of translation based on re-ordering nodes in syntactic trees in order to allow alignments not conforming to the original tree structure, while keeping computational complexity polynomial in the sentence length. This is done by adding a new subtree cloning operation to either tree-to-string or tree-to-tree alignment algorithms. | Loosely Tree-Based Alignment for Machine Translation Daniel Gildea University of Pennsylvania dgildea@ Abstract We augment a model of translation based on re-ordering nodes in syntactic trees in order to allow alignments not conforming to the original tree structure while keeping computational complexity polynomial in the sentence length. This is done by adding a new subtree cloning operation to either tree-to-string or tree-to-tree alignment algorithms. 1 Introduction Systems for automatic translation between languages have been divided into transfer-based approaches which rely on interpreting the source string into an abstract semantic representation from which text is generated in the target language and statistical approaches pioneered by Brown et al. 1990 which estimate parameters for a model of word-to-word correspondences and word re-orderings directly from large corpora of parallel bilingual text. Only recently have hybrid approaches begun to emerge which apply probabilistic models to a structured representation of the source text. Wu 1997 showed that restricting word-level alignments between sentence pairs to observe syntactic bracketing constraints significantly reduces the complexity of the alignment problem and allows a polynomial-time solution. Alshawi et al. 2000 also induce parallel tree structures from unbracketed parallel text modeling the generation of each node s children with a finite-state transducer. Yamada and Knight 2001 present an algorithm for estimating probabilistic parameters for a similar model which represents translation as a sequence of re-ordering operations over children of nodes in a syntactic tree using automatic parser output for the initial tree structures. The use of explicit syntactic information for the target language in this model has led to excellent translation results Yamada and Knight 2002 and raises the prospect of training a statistical system using syntactic information for both sides of the parallel .

crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.