英语-越南语的文档级神经机器翻译探索

D. Truong, Thang H. Nguyen-Vo, Long H. B. Nguyen, D. Dinh
{"title":"英语-越南语的文档级神经机器翻译探索","authors":"D. Truong, Thang H. Nguyen-Vo, Long H. B. Nguyen, D. Dinh","doi":"10.1109/NICS51282.2020.9335843","DOIUrl":null,"url":null,"abstract":"In Neural Machine Translation, the Transformer model has proven to be the state-of-the-art in multiple translation tasks. However, as a Seq2seq model, it can not abstract the contextual information when translating a document from one to another language. In the translation process, there are cases where, without the surrounding contextual information from consecutive sentences, an individual sentence causes ambiguity translations. The document-level approach makes the translation much more coherent and fluent by conserving the connectivity between sentences in the whole document to improve the quality of translation and human readability. Recent works show that models that are able to encapsulate these contextual information gain better results and evaluation than conventional sentence-level models. This paper conducts experiments and analyzes various context-aware models specifically in English-Vietnamese translation tasks.","PeriodicalId":308944,"journal":{"name":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"76 11","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring Document-Level Neural Machine Translation for English-Vietnamese\",\"authors\":\"D. Truong, Thang H. Nguyen-Vo, Long H. B. Nguyen, D. Dinh\",\"doi\":\"10.1109/NICS51282.2020.9335843\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In Neural Machine Translation, the Transformer model has proven to be the state-of-the-art in multiple translation tasks. However, as a Seq2seq model, it can not abstract the contextual information when translating a document from one to another language. In the translation process, there are cases where, without the surrounding contextual information from consecutive sentences, an individual sentence causes ambiguity translations. The document-level approach makes the translation much more coherent and fluent by conserving the connectivity between sentences in the whole document to improve the quality of translation and human readability. Recent works show that models that are able to encapsulate these contextual information gain better results and evaluation than conventional sentence-level models. This paper conducts experiments and analyzes various context-aware models specifically in English-Vietnamese translation tasks.\",\"PeriodicalId\":308944,\"journal\":{\"name\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"volume\":\"76 11\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NICS51282.2020.9335843\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS51282.2020.9335843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在神经机器翻译中,Transformer模型已被证明是最先进的翻译任务。然而,作为Seq2seq模型,它不能在将文档从一种语言翻译成另一种语言时抽象上下文信息。在翻译过程中,经常会出现由于没有从连续句子中获取上下文信息而导致单个句子产生歧义翻译的情况。文档级方法通过保留整个文档中句子之间的连通性,使翻译更加连贯和流畅,从而提高翻译质量和人类的可读性。最近的研究表明,能够封装这些上下文信息的模型比传统的句子级模型获得更好的结果和评估。本文对英汉越语翻译任务中的各种语境感知模型进行了实验和分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Exploring Document-Level Neural Machine Translation for English-Vietnamese
In Neural Machine Translation, the Transformer model has proven to be the state-of-the-art in multiple translation tasks. However, as a Seq2seq model, it can not abstract the contextual information when translating a document from one to another language. In the translation process, there are cases where, without the surrounding contextual information from consecutive sentences, an individual sentence causes ambiguity translations. The document-level approach makes the translation much more coherent and fluent by conserving the connectivity between sentences in the whole document to improve the quality of translation and human readability. Recent works show that models that are able to encapsulate these contextual information gain better results and evaluation than conventional sentence-level models. This paper conducts experiments and analyzes various context-aware models specifically in English-Vietnamese translation tasks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信