基于领域数据适应的在线英语到阿拉伯语翻译增强

Khaled Ibrahim Mohamed, M. Rashwan, M. Fakhr, Mostafa Abdel Azim
{"title":"基于领域数据适应的在线英语到阿拉伯语翻译增强","authors":"Khaled Ibrahim Mohamed, M. Rashwan, M. Fakhr, Mostafa Abdel Azim","doi":"10.1109/ICCTA32607.2013.9529753","DOIUrl":null,"url":null,"abstract":"the aim of this paper is to introduce a new technique that enhances online translation from English to Arabic for a specific domain. This enhancement is achieved by training a new \"Arabic online engine translation\" to \"Arabic manual translation\" model that corrects common errors in the online translation. This paper focuses on two popular online translation engines which are Google and Bing. The translation enhancement is done in two steps. First step is training the translation of each online engine on Moses toolkit separately to generate translation model that corrects the output of online translation engine, the result of this step is corrected Google translation and corrected Bing translation which is better than the original translation. Second step is choosing one sentence from the two corrected sentence using a language model trained on the manual Arabic translation which gives more enhancement than step 1.Two metrics are used to evaluate proposed model enhancement, BLEU Score (Bilingual Evaluation Understudy) and NIST Score (National Institute of Standards and Technology). The proposed model has obtained 32.44% BLEU Score over Google and 40.91% BLEU Score over Bing for LDC2004T14 dataset, 18.08% BLEU Score over Google and 26.55% BLEU Score over Bing for Distinct LDC2004T14 dataset, and 10.69% BLEU Score over Google and 9.94% BLEU Score over Bing for MEDAR dataset.","PeriodicalId":405465,"journal":{"name":"2013 23rd International Conference on Computer Theory and Applications (ICCTA)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Online English to Arabic Translation Enhancement by Domain Data Adaptation\",\"authors\":\"Khaled Ibrahim Mohamed, M. Rashwan, M. Fakhr, Mostafa Abdel Azim\",\"doi\":\"10.1109/ICCTA32607.2013.9529753\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"the aim of this paper is to introduce a new technique that enhances online translation from English to Arabic for a specific domain. This enhancement is achieved by training a new \\\"Arabic online engine translation\\\" to \\\"Arabic manual translation\\\" model that corrects common errors in the online translation. This paper focuses on two popular online translation engines which are Google and Bing. The translation enhancement is done in two steps. First step is training the translation of each online engine on Moses toolkit separately to generate translation model that corrects the output of online translation engine, the result of this step is corrected Google translation and corrected Bing translation which is better than the original translation. Second step is choosing one sentence from the two corrected sentence using a language model trained on the manual Arabic translation which gives more enhancement than step 1.Two metrics are used to evaluate proposed model enhancement, BLEU Score (Bilingual Evaluation Understudy) and NIST Score (National Institute of Standards and Technology). The proposed model has obtained 32.44% BLEU Score over Google and 40.91% BLEU Score over Bing for LDC2004T14 dataset, 18.08% BLEU Score over Google and 26.55% BLEU Score over Bing for Distinct LDC2004T14 dataset, and 10.69% BLEU Score over Google and 9.94% BLEU Score over Bing for MEDAR dataset.\",\"PeriodicalId\":405465,\"journal\":{\"name\":\"2013 23rd International Conference on Computer Theory and Applications (ICCTA)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 23rd International Conference on Computer Theory and Applications (ICCTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCTA32607.2013.9529753\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 23rd International Conference on Computer Theory and Applications (ICCTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCTA32607.2013.9529753","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文的目的是介绍一种新的技术来增强特定领域的英语到阿拉伯语的在线翻译。这种增强是通过训练一个新的“阿拉伯语在线引擎翻译”到“阿拉伯语手动翻译”模型来实现的,该模型可以纠正在线翻译中的常见错误。本文主要介绍了两种流行的在线翻译引擎:Google和Bing。翻译增强分两个步骤完成。第一步是在摩西工具包上分别训练每个在线引擎的翻译,生成翻译模型,修正在线翻译引擎的输出,这一步的结果是修正谷歌翻译和修正必应翻译,比原翻译更好。第二步是使用经过人工阿拉伯语翻译训练的语言模型从两个更正的句子中选择一个句子,该模型比第一步提供了更多的增强。两个指标用于评估提出的模型增强,BLEU分数(双语评估替补)和NIST分数(国家标准与技术研究所)。所提出的模型在LDC2004T14数据集上获得了比Google高32.44%的BLEU Score和比Bing高40.91%的BLEU Score,在Distinct LDC2004T14数据集上获得了比Google高18.08%的BLEU Score和比Bing高26.55%的BLEU Score,在MEDAR数据集上获得了比Google高10.69%的BLEU Score和比Bing高9.94%的BLEU Score。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Online English to Arabic Translation Enhancement by Domain Data Adaptation
the aim of this paper is to introduce a new technique that enhances online translation from English to Arabic for a specific domain. This enhancement is achieved by training a new "Arabic online engine translation" to "Arabic manual translation" model that corrects common errors in the online translation. This paper focuses on two popular online translation engines which are Google and Bing. The translation enhancement is done in two steps. First step is training the translation of each online engine on Moses toolkit separately to generate translation model that corrects the output of online translation engine, the result of this step is corrected Google translation and corrected Bing translation which is better than the original translation. Second step is choosing one sentence from the two corrected sentence using a language model trained on the manual Arabic translation which gives more enhancement than step 1.Two metrics are used to evaluate proposed model enhancement, BLEU Score (Bilingual Evaluation Understudy) and NIST Score (National Institute of Standards and Technology). The proposed model has obtained 32.44% BLEU Score over Google and 40.91% BLEU Score over Bing for LDC2004T14 dataset, 18.08% BLEU Score over Google and 26.55% BLEU Score over Bing for Distinct LDC2004T14 dataset, and 10.69% BLEU Score over Google and 9.94% BLEU Score over Bing for MEDAR dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信