统计机器翻译系统组合的词重排对齐

Maoxi Li, Chengqing Zong
{"title":"统计机器翻译系统组合的词重排对齐","authors":"Maoxi Li, Chengqing Zong","doi":"10.1109/CHINSL.2008.ECP.80","DOIUrl":null,"url":null,"abstract":"Word alignment is a basic and critical process in the Statistical Machine Translation (SMT). The previous work on word alignment mainly focuses on the training process to get the word mapping relation between the source sentences and target sentences. However, the word alignment for combination of SMT system outputs is also important, which aims to find the word correspondence between alternative translation hypotheses of a source language sentence. Unfortunately, it does not attract so much attention in SMT research. In this paper, we propose a novel word alignment approach to effectively address the word alignment between sentences with different valid word orders, which changes the order of the word sequences (called word reordering) of the output hypotheses to make the word order more exactly match the alignment reference. We present experimental results on the IWSLT'2008 challenge tasks with the combination of four state-of-the-art SMT systems outputs. The results show that our approach significantly improves the performance of the system combination.","PeriodicalId":291958,"journal":{"name":"2008 6th International Symposium on Chinese Spoken Language Processing","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Word Reordering Alignment for Combination of Statistical Machine Translation Systems\",\"authors\":\"Maoxi Li, Chengqing Zong\",\"doi\":\"10.1109/CHINSL.2008.ECP.80\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Word alignment is a basic and critical process in the Statistical Machine Translation (SMT). The previous work on word alignment mainly focuses on the training process to get the word mapping relation between the source sentences and target sentences. However, the word alignment for combination of SMT system outputs is also important, which aims to find the word correspondence between alternative translation hypotheses of a source language sentence. Unfortunately, it does not attract so much attention in SMT research. In this paper, we propose a novel word alignment approach to effectively address the word alignment between sentences with different valid word orders, which changes the order of the word sequences (called word reordering) of the output hypotheses to make the word order more exactly match the alignment reference. We present experimental results on the IWSLT'2008 challenge tasks with the combination of four state-of-the-art SMT systems outputs. The results show that our approach significantly improves the performance of the system combination.\",\"PeriodicalId\":291958,\"journal\":{\"name\":\"2008 6th International Symposium on Chinese Spoken Language Processing\",\"volume\":\"40 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-12-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 6th International Symposium on Chinese Spoken Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CHINSL.2008.ECP.80\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 6th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CHINSL.2008.ECP.80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

摘要

词对齐是统计机器翻译(SMT)中一个基本而关键的过程。以往关于词对齐的研究主要集中在训练过程中获取源句和目标句之间的词映射关系。然而,SMT系统输出组合的词对齐也很重要,其目的是找到源语言句子的不同翻译假设之间的词对应关系。遗憾的是,这在贴片技术的研究中并没有引起足够的重视。在本文中,我们提出了一种新的词对齐方法来有效地解决不同有效词序的句子之间的词对齐问题,该方法通过改变输出假设的词序列顺序(称为词重排序)来使词序更精确地匹配对齐参考。我们介绍了IWSLT 2008年挑战任务的实验结果,结合了四个最先进的SMT系统输出。结果表明,我们的方法显著提高了系统组合的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Word Reordering Alignment for Combination of Statistical Machine Translation Systems
Word alignment is a basic and critical process in the Statistical Machine Translation (SMT). The previous work on word alignment mainly focuses on the training process to get the word mapping relation between the source sentences and target sentences. However, the word alignment for combination of SMT system outputs is also important, which aims to find the word correspondence between alternative translation hypotheses of a source language sentence. Unfortunately, it does not attract so much attention in SMT research. In this paper, we propose a novel word alignment approach to effectively address the word alignment between sentences with different valid word orders, which changes the order of the word sequences (called word reordering) of the output hypotheses to make the word order more exactly match the alignment reference. We present experimental results on the IWSLT'2008 challenge tasks with the combination of four state-of-the-art SMT systems outputs. The results show that our approach significantly improves the performance of the system combination.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信