D. Truong, Thang H. Nguyen-Vo, Long H. B. Nguyen, D. Dinh
{"title":"英语-越南语的文档级神经机器翻译探索","authors":"D. Truong, Thang H. Nguyen-Vo, Long H. B. Nguyen, D. Dinh","doi":"10.1109/NICS51282.2020.9335843","DOIUrl":null,"url":null,"abstract":"In Neural Machine Translation, the Transformer model has proven to be the state-of-the-art in multiple translation tasks. However, as a Seq2seq model, it can not abstract the contextual information when translating a document from one to another language. In the translation process, there are cases where, without the surrounding contextual information from consecutive sentences, an individual sentence causes ambiguity translations. The document-level approach makes the translation much more coherent and fluent by conserving the connectivity between sentences in the whole document to improve the quality of translation and human readability. Recent works show that models that are able to encapsulate these contextual information gain better results and evaluation than conventional sentence-level models. This paper conducts experiments and analyzes various context-aware models specifically in English-Vietnamese translation tasks.","PeriodicalId":308944,"journal":{"name":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"76 11","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring Document-Level Neural Machine Translation for English-Vietnamese\",\"authors\":\"D. Truong, Thang H. Nguyen-Vo, Long H. B. Nguyen, D. Dinh\",\"doi\":\"10.1109/NICS51282.2020.9335843\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In Neural Machine Translation, the Transformer model has proven to be the state-of-the-art in multiple translation tasks. However, as a Seq2seq model, it can not abstract the contextual information when translating a document from one to another language. In the translation process, there are cases where, without the surrounding contextual information from consecutive sentences, an individual sentence causes ambiguity translations. The document-level approach makes the translation much more coherent and fluent by conserving the connectivity between sentences in the whole document to improve the quality of translation and human readability. Recent works show that models that are able to encapsulate these contextual information gain better results and evaluation than conventional sentence-level models. This paper conducts experiments and analyzes various context-aware models specifically in English-Vietnamese translation tasks.\",\"PeriodicalId\":308944,\"journal\":{\"name\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"volume\":\"76 11\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NICS51282.2020.9335843\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS51282.2020.9335843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring Document-Level Neural Machine Translation for English-Vietnamese
In Neural Machine Translation, the Transformer model has proven to be the state-of-the-art in multiple translation tasks. However, as a Seq2seq model, it can not abstract the contextual information when translating a document from one to another language. In the translation process, there are cases where, without the surrounding contextual information from consecutive sentences, an individual sentence causes ambiguity translations. The document-level approach makes the translation much more coherent and fluent by conserving the connectivity between sentences in the whole document to improve the quality of translation and human readability. Recent works show that models that are able to encapsulate these contextual information gain better results and evaluation than conventional sentence-level models. This paper conducts experiments and analyzes various context-aware models specifically in English-Vietnamese translation tasks.