{"title":"A Mongolian-Chinese neural machine translation model based on Transformer’s two-branch gating structure","authors":"Genmao Zhang, Yonghong Tian, Jia Hao, Junjin Zhang","doi":"10.1109/IIP57348.2022.00085","DOIUrl":null,"url":null,"abstract":"We propose a new two-branch gating structure for the Transformer. The attention computation is split into two branches: one retains the attention mechanism to capture global information, while the other uses dynamic convolution to capture local information. The two branch outputs are fused through a gating mechanism that replaces both the attention mechanism and the feed-forward network, giving the model fewer parameters and a stronger ability to capture information. Experimental results show a BLEU-4 improvement of 3.07 over the Transformer baseline.","PeriodicalId":412907,"journal":{"name":"2022 4th International Conference on Intelligent Information Processing (IIP)","volume":"131 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Intelligent Information Processing (IIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIP57348.2022.00085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
We propose a new two-branch gating structure for the Transformer. The attention computation is split into two branches: one retains the attention mechanism to capture global information, while the other uses dynamic convolution to capture local information. The two branch outputs are fused through a gating mechanism that replaces both the attention mechanism and the feed-forward network, giving the model fewer parameters and a stronger ability to capture information. Experimental results show a BLEU-4 improvement of 3.07 over the Transformer baseline.
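The abstract's two-branch design can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it uses a single-head self-attention branch for global context, a fixed softmax-normalized depthwise kernel as a simplified stand-in for dynamic convolution on the local branch, and a per-position sigmoid gate to fuse the two. All function names and weight shapes here are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_branch(x, Wq, Wk, Wv):
    # Global branch: single-head scaled dot-product self-attention.
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    scores = q @ k.T / np.sqrt(q.shape[-1])
    return softmax(scores, axis=-1) @ v

def conv_branch(x, kernel):
    # Local branch: depthwise convolution with a softmax-normalized
    # kernel shared across positions (simplified dynamic convolution).
    T, d = x.shape
    k = kernel.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))
    w = softmax(kernel, axis=0)          # (k,) normalized kernel weights
    out = np.zeros_like(x)
    for t in range(T):
        out[t] = (w[:, None] * xp[t:t + k]).sum(axis=0)
    return out

def gated_fusion(x, Wq, Wk, Wv, kernel, Wg):
    # Fuse the two branches with a per-position, per-channel sigmoid gate,
    # in place of a separate attention sublayer + feed-forward network.
    a = attention_branch(x, Wq, Wk, Wv)
    c = conv_branch(x, kernel)
    g = 1.0 / (1.0 + np.exp(-(x @ Wg)))  # gate in (0, 1)
    return g * a + (1.0 - g) * c
```

Because the gate interpolates between the attention output and the convolution output channel-wise, the block keeps the input shape `(T, d)`, so it can slot into an encoder layer where the attention sublayer would otherwise sit.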