English to Hindi Machine Transliteration System at NEWS 2009

NEWS@IJCNLP Pub Date : 2009-08-07 DOI:10.3115/1699705.1699726

Amitava Das, Asif Ekbal, Tapabrata Mondal, Sivaji Bandyopadhyay

引用次数: 28

Abstract

This paper reports about our work in the NEWS 2009 Machine Transliteration Shared Task held as part of ACL-IJCNLP 2009. We submitted one standard run and two non-standard runs for English to Hindi transliteration. The modified joint source-channel model has been used along with a number of alternatives. The system has been trained on the NEWS 2009 Machine Transliteration Shared Task datasets. For standard run, the system demonstrated an accuracy of 0.471 and the mean F-Score of 0.861. The non-standard runs yielded the accuracy and mean F-scores of 0.389 and 0.831 respectively in the first one and 0.384 and 0.828 respectively in the second one. The non-standard runs resulted in substantially worse performance than the standard run. The reasons for this are the ranking algorithm used for the output and the types of tokens present in the test set.

查看原文本刊更多论文

英语到印地语机器音译系统在新闻2009

本文报道了我们在作为ACL-IJCNLP 2009的一部分举行的NEWS 2009机器音译共享任务中的工作。我们提交了一个标准运行和两个非标准运行的英语到印地语音译。改进的联合源-通道模型与许多替代模型一起被使用。该系统已在NEWS 2009机器音译共享任务数据集上进行了训练。标准运行时，系统的准确率为0.471，平均F-Score为0.861。第一次非标准运行的准确性和平均f分数分别为0.389和0.831，第二次运行的准确性和平均f分数分别为0.384和0.828。非标准运行导致的性能比标准运行差得多。其原因是用于输出的排序算法和测试集中出现的令牌类型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

NEWS@IJCNLP

自引率

0.00%

发文量