Exploring Document-Level Neural Machine Translation for English-Vietnamese

2020 7th NAFOSTED Conference on Information and Computer Science (NICS) Pub Date : 2020-11-26 DOI:10.1109/NICS51282.2020.9335843

D. Truong, Thang H. Nguyen-Vo, Long H. B. Nguyen, D. Dinh

引用次数: 0

Abstract

In Neural Machine Translation, the Transformer model has proven to be the state-of-the-art in multiple translation tasks. However, as a Seq2seq model, it can not abstract the contextual information when translating a document from one to another language. In the translation process, there are cases where, without the surrounding contextual information from consecutive sentences, an individual sentence causes ambiguity translations. The document-level approach makes the translation much more coherent and fluent by conserving the connectivity between sentences in the whole document to improve the quality of translation and human readability. Recent works show that models that are able to encapsulate these contextual information gain better results and evaluation than conventional sentence-level models. This paper conducts experiments and analyzes various context-aware models specifically in English-Vietnamese translation tasks.

查看原文本刊更多论文

英语-越南语的文档级神经机器翻译探索

在神经机器翻译中，Transformer模型已被证明是最先进的翻译任务。然而，作为Seq2seq模型，它不能在将文档从一种语言翻译成另一种语言时抽象上下文信息。在翻译过程中，经常会出现由于没有从连续句子中获取上下文信息而导致单个句子产生歧义翻译的情况。文档级方法通过保留整个文档中句子之间的连通性，使翻译更加连贯和流畅，从而提高翻译质量和人类的可读性。最近的研究表明，能够封装这些上下文信息的模型比传统的句子级模型获得更好的结果和评估。本文对英汉越语翻译任务中的各种语境感知模型进行了实验和分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 7th NAFOSTED Conference on Information and Computer Science (NICS)

自引率

0.00%

发文量