D. Truong, Thang H. Nguyen-Vo, Long H. B. Nguyen, D. Dinh
{"title":"Exploring Document-Level Neural Machine Translation for English-Vietnamese","authors":"D. Truong, Thang H. Nguyen-Vo, Long H. B. Nguyen, D. Dinh","doi":"10.1109/NICS51282.2020.9335843","DOIUrl":null,"url":null,"abstract":"In Neural Machine Translation, the Transformer model has proven to be the state-of-the-art in multiple translation tasks. However, as a Seq2seq model, it can not abstract the contextual information when translating a document from one to another language. In the translation process, there are cases where, without the surrounding contextual information from consecutive sentences, an individual sentence causes ambiguity translations. The document-level approach makes the translation much more coherent and fluent by conserving the connectivity between sentences in the whole document to improve the quality of translation and human readability. Recent works show that models that are able to encapsulate these contextual information gain better results and evaluation than conventional sentence-level models. This paper conducts experiments and analyzes various context-aware models specifically in English-Vietnamese translation tasks.","PeriodicalId":308944,"journal":{"name":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"76 11","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS51282.2020.9335843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In Neural Machine Translation, the Transformer model has proven to be the state-of-the-art in multiple translation tasks. However, as a Seq2seq model, it can not abstract the contextual information when translating a document from one to another language. In the translation process, there are cases where, without the surrounding contextual information from consecutive sentences, an individual sentence causes ambiguity translations. The document-level approach makes the translation much more coherent and fluent by conserving the connectivity between sentences in the whole document to improve the quality of translation and human readability. Recent works show that models that are able to encapsulate these contextual information gain better results and evaluation than conventional sentence-level models. This paper conducts experiments and analyzes various context-aware models specifically in English-Vietnamese translation tasks.