Nana Mulyana Maghfur, Muhammad Okky Ibrohim, Junaedi Fahmi, Achmad Satria Putera, Oskar Riandi
{"title":"基于规则方法的印尼语文本到语音(TTS)文本规范化:数据集和初步研究","authors":"Nana Mulyana Maghfur, Muhammad Okky Ibrohim, Junaedi Fahmi, Achmad Satria Putera, Oskar Riandi","doi":"10.1109/ic2ie53219.2021.9649417","DOIUrl":null,"url":null,"abstract":"Text-to-Speech (TTS) is a technology that is currently widely used for several purposes both for academic/ non-commercial and industry/commercial purposes. In several cases, some researchers on the TTS field adding a text normalization process for normalizing text that will be used for TTS input to enhance the TTS performance itself. In this paper, we present a rule-based approach to make an Indonesian text normalization dataset that has a raw text and a spoken form of it for enhancing Indonesian Text-to-Speech (TTS) performance. We conduct a set of rule-based for normalizing Indonesian text as an input for the TTS system. Using those rule-based, we generated a dataset and correct it manually so that we have a gold standard for text normalization for Indonesian TTS input. Our approach shows a rule-based can give a good performance for normalizing text for Indonesian TTS with 0.0805 of Word Error Rate (WER).","PeriodicalId":178443,"journal":{"name":"2021 4th International Conference of Computer and Informatics Engineering (IC2IE)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Text Normalization for Indonesian Text-to-Speech (TTS) using Rule-Based Approach: A Dataset and Preliminary Study\",\"authors\":\"Nana Mulyana Maghfur, Muhammad Okky Ibrohim, Junaedi Fahmi, Achmad Satria Putera, Oskar Riandi\",\"doi\":\"10.1109/ic2ie53219.2021.9649417\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text-to-Speech (TTS) is a technology that is currently widely used for several purposes both for academic/ non-commercial and industry/commercial purposes. In several cases, some researchers on the TTS field adding a text normalization process for normalizing text that will be used for TTS input to enhance the TTS performance itself. In this paper, we present a rule-based approach to make an Indonesian text normalization dataset that has a raw text and a spoken form of it for enhancing Indonesian Text-to-Speech (TTS) performance. We conduct a set of rule-based for normalizing Indonesian text as an input for the TTS system. Using those rule-based, we generated a dataset and correct it manually so that we have a gold standard for text normalization for Indonesian TTS input. Our approach shows a rule-based can give a good performance for normalizing text for Indonesian TTS with 0.0805 of Word Error Rate (WER).\",\"PeriodicalId\":178443,\"journal\":{\"name\":\"2021 4th International Conference of Computer and Informatics Engineering (IC2IE)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 4th International Conference of Computer and Informatics Engineering (IC2IE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ic2ie53219.2021.9649417\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Conference of Computer and Informatics Engineering (IC2IE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ic2ie53219.2021.9649417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text Normalization for Indonesian Text-to-Speech (TTS) using Rule-Based Approach: A Dataset and Preliminary Study
Text-to-Speech (TTS) is a technology that is currently widely used for several purposes both for academic/ non-commercial and industry/commercial purposes. In several cases, some researchers on the TTS field adding a text normalization process for normalizing text that will be used for TTS input to enhance the TTS performance itself. In this paper, we present a rule-based approach to make an Indonesian text normalization dataset that has a raw text and a spoken form of it for enhancing Indonesian Text-to-Speech (TTS) performance. We conduct a set of rule-based for normalizing Indonesian text as an input for the TTS system. Using those rule-based, we generated a dataset and correct it manually so that we have a gold standard for text normalization for Indonesian TTS input. Our approach shows a rule-based can give a good performance for normalizing text for Indonesian TTS with 0.0805 of Word Error Rate (WER).