Y. Heryadi, B. Wijanarko, Dina Fitria Murad, C. Tho, Kiyota Hashimoto
{"title":"Indonesian-Sundanese Language Machine Translation using Bidirectional Long Short-term Memory Model","authors":"Y. Heryadi, B. Wijanarko, Dina Fitria Murad, C. Tho, Kiyota Hashimoto","doi":"10.1109/ICCoSITE57641.2023.10127691","DOIUrl":null,"url":null,"abstract":"Translating a language to another language has become instrumental when peoples interact with other people who speak a different language. However, the language translation is not an easy computation task when there is a language-resource gap. This paper presents empirical results on the performance of two models: the Long Short-term Memory and the Bidirectional Long Short-term Memory models as machine language translation models involving Bahasa Indonesia and the Sundanese language. The empiric results showed that the Bidirectional Long Short-term Memory model achieves higher performance as a language translator from the Sundanese language to Bahasa Indonesia and vice versa (0.95 and 0.95 average training accuracy respectively; and 0.90 and 0.89 average testing BLEU scores respectively) than the Long Short-term Memory model as a language translator from the Sundanese language to Bahasa Indonesia and vice versa (0.93 and 0.92 average training accuracy respectively; and 0.91 and 0.88 average testing BLEU scores). These results validate some previously reported studies that claim the Bidirectional Long Short-term Memory model potentially outperform the Long Short-term Memory model when it is used to process a sequence dataset.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCoSITE57641.2023.10127691","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Translating a language to another language has become instrumental when peoples interact with other people who speak a different language. However, the language translation is not an easy computation task when there is a language-resource gap. This paper presents empirical results on the performance of two models: the Long Short-term Memory and the Bidirectional Long Short-term Memory models as machine language translation models involving Bahasa Indonesia and the Sundanese language. The empiric results showed that the Bidirectional Long Short-term Memory model achieves higher performance as a language translator from the Sundanese language to Bahasa Indonesia and vice versa (0.95 and 0.95 average training accuracy respectively; and 0.90 and 0.89 average testing BLEU scores respectively) than the Long Short-term Memory model as a language translator from the Sundanese language to Bahasa Indonesia and vice versa (0.93 and 0.92 average training accuracy respectively; and 0.91 and 0.88 average testing BLEU scores). These results validate some previously reported studies that claim the Bidirectional Long Short-term Memory model potentially outperform the Long Short-term Memory model when it is used to process a sequence dataset.
当人们与说不同语言的人交流时,将一种语言翻译成另一种语言已经成为一种工具。然而,在存在语言资源缺口的情况下,语言翻译并不是一项简单的计算任务。本文以印尼语和巽他语为研究对象,对两种机器语言翻译模型:长短期记忆模型和双向长短期记忆模型的性能进行了实证研究。实证结果表明,双向长短期记忆模型在Sundanese语和Bahasa Indonesia -反之的语言翻译中获得了更高的表现(平均训练准确率分别为0.95和0.95);和平均测试BLEU分数分别为0.90和0.89)比长短期记忆模型作为语言翻译从巽他语到印尼语,反之亦然(平均训练准确率分别为0.93和0.92;BLEU平均分分别为0.91和0.88)。这些结果验证了先前报道的一些研究,这些研究声称双向长短期记忆模型在处理序列数据集时可能优于长短期记忆模型。