Win Zaw, Ye Kyaw Thu, S. S. Moe, N. Oo, T. Supnithi
{"title":"Statistical Machine Translation, Ripple Down Rules and Hidden Markov Model for Burmese Romanization","authors":"Win Zaw, Ye Kyaw Thu, S. S. Moe, N. Oo, T. Supnithi","doi":"10.1109/ICAIT51105.2020.9261807","DOIUrl":null,"url":null,"abstract":"Burmese Romanization is the phonetic translation of Burmese (Myanmar) language into their phonetic Latin script. Burmese Romanization is aimed the reader who is unfamiliar with the Burmese script to pronounce the Burmese language reasonably accurately. We developed a 15K Burmese Romanization parallel corpus based on the practical usage of several Romanizations of native Myanmar people. The experiments were performed using different SMT approaches (PBSMT, HPBSMT, OSM), Ripple Down Rules (RDR) and Hidden Markov Model (HMM) approaches to the task of Burmese Romanization. The results show that the OSM approach achieves the highest performance in Burmese to Romanized Burmese direction. On the other hand, HMM approach gives the highest results in Romanized Burmese to Burmese.","PeriodicalId":173291,"journal":{"name":"2020 International Conference on Advanced Information Technologies (ICAIT)","volume":"128 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Advanced Information Technologies (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIT51105.2020.9261807","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Burmese Romanization is the phonetic translation of Burmese (Myanmar) language into their phonetic Latin script. Burmese Romanization is aimed the reader who is unfamiliar with the Burmese script to pronounce the Burmese language reasonably accurately. We developed a 15K Burmese Romanization parallel corpus based on the practical usage of several Romanizations of native Myanmar people. The experiments were performed using different SMT approaches (PBSMT, HPBSMT, OSM), Ripple Down Rules (RDR) and Hidden Markov Model (HMM) approaches to the task of Burmese Romanization. The results show that the OSM approach achieves the highest performance in Burmese to Romanized Burmese direction. On the other hand, HMM approach gives the highest results in Romanized Burmese to Burmese.
缅甸罗马化是缅甸语的语音翻译成他们的语音拉丁字母。缅甸罗马化是针对不熟悉缅甸文字的读者合理准确地发音缅甸语。基于缅甸人的几种罗马化的实际使用,我们开发了一个15K缅甸罗马化平行语料库。实验采用不同的SMT方法(PBSMT、HPBSMT、OSM)、Ripple Down Rules (RDR)和Hidden Markov Model (HMM)方法对缅甸语罗马化任务进行了研究。结果表明,OSM方法在缅语到罗马化缅语方向上的表现最好。另一方面,HMM方法在罗马化缅甸语到缅甸语中给出了最高的结果。