Thai to Isarn dialect machine translation using rule-based and example-based
Paweena Unlee, Pusadee Seresangtakul
2016 13th International Joint Conference on Computer Science and Software Engineering (JCSSE), 2016-07-01
DOI: 10.1109/JCSSE.2016.7748892 (https://doi.org/10.1109/JCSSE.2016.7748892)
Citation count: 5
Abstract
This paper presents a Thai-to-Isarn dialect machine translation system, named ThaiIsarn - MT, that combines rule-based and example-based approaches. To develop the system, a Thai-Isarn dialect dictionary and a database of linguistic rules were constructed. The dictionary contains 8,050 Thai words. In addition, an example bilingual corpus was constructed. Since Thai is a non-segmented language, each input sentence is first segmented into a sequence of words using the longest matching algorithm. Each Thai word is then looked up in the dictionary to find the corresponding Isarn word. For words that may have several meanings, the rule-based and example-based components are used to select the most suitable meaning. To evaluate the performance of the proposed system, 854 Thai sentences were translated by the system. The results show a translation accuracy of 82.44%.
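The segmentation step can be illustrated with a minimal sketch of greedy longest matching. This is not the authors' implementation: the function name `segment`, the three-word toy vocabulary, and the single-character fallback for unknown text are all assumptions made for illustration, and the real system would draw on its 8,050-word dictionary.

```python
def segment(text, vocab):
    """Greedy longest-matching segmentation (illustrative sketch).

    At each position, take the longest dictionary word that starts there.
    If no dictionary word matches, emit one character and move on
    (this fallback is an assumption, not taken from the paper).
    """
    max_len = max((len(w) for w in vocab), default=1)
    words = []
    i = 0
    while i < len(text):
        match = None
        # Try the longest candidate first, then shrink the window.
        for end in range(min(len(text), i + max_len), i, -1):
            if text[i:end] in vocab:
                match = text[i:end]
                break
        if match is None:
            match = text[i]  # unknown character: emit it on its own
        words.append(match)
        i += len(match)
    return words


# Toy vocabulary (hypothetical): "I", "eat", "rice", "sticky rice".
toy_vocab = {"ฉัน", "กิน", "ข้าว", "ข้าวเหนียว"}

print(segment("ฉันกินข้าวเหนียว", toy_vocab))
```

In this sketch the longer entry "ข้าวเหนียว" is preferred over its prefix "ข้าว" because candidates are tried from longest to shortest. Each resulting word would then be looked up in the Thai-Isarn dictionary, with the rule-based and example-based components choosing among multiple candidate meanings.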