Indonesian-Sundanese Language Machine Translation using Bidirectional Long Short-term Memory Model

2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE) Pub Date : 2023-02-16 DOI:10.1109/ICCoSITE57641.2023.10127691

Y. Heryadi, B. Wijanarko, Dina Fitria Murad, C. Tho, Kiyota Hashimoto

{"title":"Indonesian-Sundanese Language Machine Translation using Bidirectional Long Short-term Memory Model","authors":"Y. Heryadi, B. Wijanarko, Dina Fitria Murad, C. Tho, Kiyota Hashimoto","doi":"10.1109/ICCoSITE57641.2023.10127691","DOIUrl":null,"url":null,"abstract":"Translating a language to another language has become instrumental when peoples interact with other people who speak a different language. However, the language translation is not an easy computation task when there is a language-resource gap. This paper presents empirical results on the performance of two models: the Long Short-term Memory and the Bidirectional Long Short-term Memory models as machine language translation models involving Bahasa Indonesia and the Sundanese language. The empiric results showed that the Bidirectional Long Short-term Memory model achieves higher performance as a language translator from the Sundanese language to Bahasa Indonesia and vice versa (0.95 and 0.95 average training accuracy respectively; and 0.90 and 0.89 average testing BLEU scores respectively) than the Long Short-term Memory model as a language translator from the Sundanese language to Bahasa Indonesia and vice versa (0.93 and 0.92 average training accuracy respectively; and 0.91 and 0.88 average testing BLEU scores). These results validate some previously reported studies that claim the Bidirectional Long Short-term Memory model potentially outperform the Long Short-term Memory model when it is used to process a sequence dataset.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCoSITE57641.2023.10127691","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Translating a language to another language has become instrumental when peoples interact with other people who speak a different language. However, the language translation is not an easy computation task when there is a language-resource gap. This paper presents empirical results on the performance of two models: the Long Short-term Memory and the Bidirectional Long Short-term Memory models as machine language translation models involving Bahasa Indonesia and the Sundanese language. The empiric results showed that the Bidirectional Long Short-term Memory model achieves higher performance as a language translator from the Sundanese language to Bahasa Indonesia and vice versa (0.95 and 0.95 average training accuracy respectively; and 0.90 and 0.89 average testing BLEU scores respectively) than the Long Short-term Memory model as a language translator from the Sundanese language to Bahasa Indonesia and vice versa (0.93 and 0.92 average training accuracy respectively; and 0.91 and 0.88 average testing BLEU scores). These results validate some previously reported studies that claim the Bidirectional Long Short-term Memory model potentially outperform the Long Short-term Memory model when it is used to process a sequence dataset.

查看原文本刊更多论文

基于双向长短期记忆模型的印尼语-巽他语机器翻译

当人们与说不同语言的人交流时，将一种语言翻译成另一种语言已经成为一种工具。然而，在存在语言资源缺口的情况下，语言翻译并不是一项简单的计算任务。本文以印尼语和巽他语为研究对象，对两种机器语言翻译模型:长短期记忆模型和双向长短期记忆模型的性能进行了实证研究。实证结果表明，双向长短期记忆模型在Sundanese语和Bahasa Indonesia -反之的语言翻译中获得了更高的表现(平均训练准确率分别为0.95和0.95);和平均测试BLEU分数分别为0.90和0.89)比长短期记忆模型作为语言翻译从巽他语到印尼语，反之亦然(平均训练准确率分别为0.93和0.92;BLEU平均分分别为0.91和0.88)。这些结果验证了先前报道的一些研究，这些研究声称双向长短期记忆模型在处理序列数据集时可能优于长短期记忆模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)

自引率

0.00%

发文量