Recursive annotations for attention-based neural machine translation

2017 International Conference on Asian Language Processing (IALP) Pub Date : 2017-12-01 DOI:10.1109/IALP.2017.8300570

S. Ye, Wu Guo

引用次数: 2

Abstract

The last few years have witnessed the success of attention-based Neural Machine Translation (NMT), and many of variant models have been used to improve the performance. Most of the proposed attention-based NMT models encode the source sentence into a sequence of annotations which are kept fixed for the following steps. In this paper, we conjecture that the use of fixed annotations is the bottleneck in improving the performance ofconventional attention-based NMT. To tackle this shortcoming, we propose a novel model for attention-based NMT, which is intended to update the source annotations recursively when generating the target word at each time step. Experimental results show that the proposed approach achieves significant performance improvement over multiple test sets.

查看原文本刊更多论文

基于注意力的神经机器翻译递归注释

近年来，基于注意力的神经机器翻译(NMT)取得了巨大的成功，许多不同的模型被用来提高其翻译性能。大多数提出的基于注意力的NMT模型将源句子编码为一系列注释，这些注释在接下来的步骤中保持固定。在本文中，我们推测固定注释的使用是提高传统的基于注意力的神经网络机器翻译性能的瓶颈。为了解决这一缺点，我们提出了一种新的基于注意力的NMT模型，该模型旨在在每个时间步生成目标词时递归地更新源注释。实验结果表明，该方法在多个测试集上取得了显著的性能提升。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 International Conference on Asian Language Processing (IALP)

自引率

0.00%

发文量