我们能打败谷歌翻译吗?

Proceedings of the ITI 2012 34th International Conference on Information Technology Interfaces Pub Date : 2012-06-25 DOI:10.2498/iti.2012.0411

Marija Brkic Bakaric, Bozena Basic Mikulic, M. Matetić

{"title":"我们能打败谷歌翻译吗?","authors":"Marija Brkic Bakaric, Bozena Basic Mikulic, M. Matetić","doi":"10.2498/iti.2012.0411","DOIUrl":null,"url":null,"abstract":"This paper presents a machine translation evaluation study for Croatian-English language pair. In-domain and out-of-domain translations from Croatian into English have been obtained from Google Translate, our own statistical machine translation system LegTran, and from a professional translator. These translations have been evaluated by six different automatic metrics. The gains obtained from increasing the number of reference translations have been explored and measured. System level correlation between automatic evaluation metrics is given and the significance of the results is discussed. Bootstrapping, approximate randomization and the sign test have been used for confidence intervals and hypothesis testing.","PeriodicalId":135105,"journal":{"name":"Proceedings of the ITI 2012 34th International Conference on Information Technology Interfaces","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Can we beat google translate?\",\"authors\":\"Marija Brkic Bakaric, Bozena Basic Mikulic, M. Matetić\",\"doi\":\"10.2498/iti.2012.0411\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a machine translation evaluation study for Croatian-English language pair. In-domain and out-of-domain translations from Croatian into English have been obtained from Google Translate, our own statistical machine translation system LegTran, and from a professional translator. These translations have been evaluated by six different automatic metrics. The gains obtained from increasing the number of reference translations have been explored and measured. System level correlation between automatic evaluation metrics is given and the significance of the results is discussed. Bootstrapping, approximate randomization and the sign test have been used for confidence intervals and hypothesis testing.\",\"PeriodicalId\":135105,\"journal\":{\"name\":\"Proceedings of the ITI 2012 34th International Conference on Information Technology Interfaces\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the ITI 2012 34th International Conference on Information Technology Interfaces\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2498/iti.2012.0411\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ITI 2012 34th International Conference on Information Technology Interfaces","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2498/iti.2012.0411","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

本文对克罗地亚语-英语语言对进行了机器翻译评价研究。从克罗地亚语到英语的域内和域外翻译来自谷歌翻译，我们自己的统计机器翻译系统LegTran和专业翻译。这些翻译已经通过六个不同的自动度量进行了评估。从增加参考翻译的数量中获得的收益已经被探索和测量。给出了自动评价指标之间的系统级相关性，并讨论了结果的意义。自举，近似随机化和符号检验已用于置信区间和假设检验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Can we beat google translate?

This paper presents a machine translation evaluation study for Croatian-English language pair. In-domain and out-of-domain translations from Croatian into English have been obtained from Google Translate, our own statistical machine translation system LegTran, and from a professional translator. These translations have been evaluated by six different automatic metrics. The gains obtained from increasing the number of reference translations have been explored and measured. System level correlation between automatic evaluation metrics is given and the significance of the results is discussed. Bootstrapping, approximate randomization and the sign test have been used for confidence intervals and hypothesis testing.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the ITI 2012 34th International Conference on Information Technology Interfaces

自引率

0.00%

发文量