Thazin Myint Oo, T. Tanprasert, Ye Kyaw Thu, T. Supnithi
{"title":"缅甸方言翻译的音节到音节和词到词转换器","authors":"Thazin Myint Oo, T. Tanprasert, Ye Kyaw Thu, T. Supnithi","doi":"10.1109/iSAI-NLP56921.2022.9960259","DOIUrl":null,"url":null,"abstract":"Weighted Finite State Transducers (WFST) can be very efficient to implement Burmese dialects translation. We illustrate this on two Burmese dialect language pairs, Burmese-Beik and Burmese-Rakhine. In this study, we examine syllable and word segmentation schemes and their effect on alignment and transducing between dialect language pairs. We performed alignments with Anymalign, fastalign, pialign, Hieralign, eflomal and GIZA ++ approaches and implemented WFST based machine translation system with OpenFst library. From the overall results, syllable segmentation achieved higher BLEU and chrF scores for Burmese-Rakhine and Rakhine-Burmese translations. However, word segmentation achieved better translation performance for Burmese-Beik and Beik-Burmese translation directions. Alignment techniques fast align, Hieralign, eflomal and GIZA ++ are working well for low-resource Burmese dialects.","PeriodicalId":399019,"journal":{"name":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Syllable-to-Syllable and Word-to-Word Transducers for Burmese Dialect Translation\",\"authors\":\"Thazin Myint Oo, T. Tanprasert, Ye Kyaw Thu, T. Supnithi\",\"doi\":\"10.1109/iSAI-NLP56921.2022.9960259\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Weighted Finite State Transducers (WFST) can be very efficient to implement Burmese dialects translation. We illustrate this on two Burmese dialect language pairs, Burmese-Beik and Burmese-Rakhine. In this study, we examine syllable and word segmentation schemes and their effect on alignment and transducing between dialect language pairs. We performed alignments with Anymalign, fastalign, pialign, Hieralign, eflomal and GIZA ++ approaches and implemented WFST based machine translation system with OpenFst library. From the overall results, syllable segmentation achieved higher BLEU and chrF scores for Burmese-Rakhine and Rakhine-Burmese translations. However, word segmentation achieved better translation performance for Burmese-Beik and Beik-Burmese translation directions. Alignment techniques fast align, Hieralign, eflomal and GIZA ++ are working well for low-resource Burmese dialects.\",\"PeriodicalId\":399019,\"journal\":{\"name\":\"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)\",\"volume\":\"116 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iSAI-NLP56921.2022.9960259\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSAI-NLP56921.2022.9960259","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Syllable-to-Syllable and Word-to-Word Transducers for Burmese Dialect Translation
Weighted Finite State Transducers (WFST) can be very efficient to implement Burmese dialects translation. We illustrate this on two Burmese dialect language pairs, Burmese-Beik and Burmese-Rakhine. In this study, we examine syllable and word segmentation schemes and their effect on alignment and transducing between dialect language pairs. We performed alignments with Anymalign, fastalign, pialign, Hieralign, eflomal and GIZA ++ approaches and implemented WFST based machine translation system with OpenFst library. From the overall results, syllable segmentation achieved higher BLEU and chrF scores for Burmese-Rakhine and Rakhine-Burmese translations. However, word segmentation achieved better translation performance for Burmese-Beik and Beik-Burmese translation directions. Alignment techniques fast align, Hieralign, eflomal and GIZA ++ are working well for low-resource Burmese dialects.