Text Normalization for Indonesian Text-to-Speech (TTS) using Rule-Based Approach: A Dataset and Preliminary Study
Nana Mulyana Maghfur, Muhammad Okky Ibrohim, Junaedi Fahmi, Achmad Satria Putera, Oskar Riandi
2021 4th International Conference of Computer and Informatics Engineering (IC2IE), 2021-09-14
DOI: 10.1109/ic2ie53219.2021.9649417
Citations: 1
Abstract
Text-to-Speech (TTS) is a technology that is now widely used for both academic/non-commercial and industry/commercial purposes. Some researchers in the TTS field add a text normalization step to the TTS input in order to improve TTS performance. In this paper, we present a rule-based approach for building an Indonesian text normalization dataset that pairs each raw text with its spoken form, with the aim of improving Indonesian TTS performance. We construct a set of rules for normalizing Indonesian text as input to a TTS system. Using those rules, we generated a dataset and corrected it manually, giving us a gold standard for text normalization of Indonesian TTS input. Our results show that a rule-based approach can perform well at normalizing text for Indonesian TTS, achieving a Word Error Rate (WER) of 0.0805.
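To make the two ideas in the abstract concrete, the following is a minimal sketch of one normalization rule and of how WER is conventionally computed. It is not the authors' rule set: the functions `number_to_words`, `normalize`, and `wer` are hypothetical names introduced here, and the single rule shown (spelling out standalone numbers from 0 to 999) is an assumed example of the kind of rule such a system might contain. WER is taken in its usual form, the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words, so a WER of 0.0805 corresponds to roughly 8 word errors per 100 reference words.

```python
import re

# Indonesian words for the digits 0-9.
UNITS = ["nol", "satu", "dua", "tiga", "empat", "lima",
         "enam", "tujuh", "delapan", "sembilan"]


def number_to_words(n: int) -> str:
    """Spell out 0..999 in Indonesian (illustrative rule, not the paper's)."""
    if not 0 <= n <= 999:
        raise ValueError("this sketch only handles 0..999")
    if n < 10:
        return UNITS[n]
    if n == 10:
        return "sepuluh"
    if n == 11:
        return "sebelas"
    if n < 20:
        return UNITS[n - 10] + " belas"
    if n < 100:
        tens, rest = divmod(n, 10)
        words = UNITS[tens] + " puluh"
        return words if rest == 0 else words + " " + UNITS[rest]
    hundreds, rest = divmod(n, 100)
    words = "seratus" if hundreds == 1 else UNITS[hundreds] + " ratus"
    return words if rest == 0 else words + " " + number_to_words(rest)


def normalize(text: str) -> str:
    """Replace each standalone 1-3 digit number with its spoken form."""
    return re.sub(r"\b\d{1,3}\b",
                  lambda m: number_to_words(int(m.group())), text)


def wer(reference: str, hypothesis: str) -> float:
    """Word Error Rate: word-level edit distance / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    spoken = normalize("Saya punya 25 buku")
    print(spoken)  # Saya punya dua puluh lima buku
    print(wer("Saya punya dua puluh lima buku", spoken))  # 0.0
```

In a setup like the one the abstract describes, the manually corrected gold-standard spoken form would play the role of `reference` and the rule output the role of `hypothesis`. A real rule set would also need to cover larger numbers, dates, currency, abbreviations, and similar non-standard tokens, none of which this sketch attempts.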