{"title":"Research on Unsupervised Domain Adaptation System for Machine Translation","authors":"Menghua Jiang, Shiyu Zhao","doi":"10.1109/INSAI56792.2022.00043","DOIUrl":null,"url":null,"abstract":"With the intensive research on neural networks and deep learning, neural machine translation proposed in recent years has greatly improved translation quality and gradually replaced traditional statistical-based machine translation. Although neural machine translation can achieve very high translation accuracy with translation models trained on resource-rich outer domains, it tends to perform poorly in other inner domains where resources are scarce. Domain adaptation is one approach to enhance its performance, which uses resource-rich domains to help improve the translation quality of machine translation in resource-scarce domains. In this paper, we try to build a parallel corpus in the inner domain by using the parallel corpus in the outer domain to better train the translation model, which is eventually invoked by the system to realize the translation. The experimental results show that the accuracy of the machine translation system is significantly improved when translating data from the inner domain after applying this method.","PeriodicalId":318264,"journal":{"name":"2022 2nd International Conference on Networking Systems of AI (INSAI)","volume":"101 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Networking Systems of AI (INSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INSAI56792.2022.00043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the intensive research on neural networks and deep learning, neural machine translation proposed in recent years has greatly improved translation quality and gradually replaced traditional statistical-based machine translation. Although neural machine translation can achieve very high translation accuracy with translation models trained on resource-rich outer domains, it tends to perform poorly in other inner domains where resources are scarce. Domain adaptation is one approach to enhance its performance, which uses resource-rich domains to help improve the translation quality of machine translation in resource-scarce domains. In this paper, we try to build a parallel corpus in the inner domain by using the parallel corpus in the outer domain to better train the translation model, which is eventually invoked by the system to realize the translation. The experimental results show that the accuracy of the machine translation system is significantly improved when translating data from the inner domain after applying this method.