Text Normalization for Indonesian Text-to-Speech (TTS) using Rule-Based Approach: A Dataset and Preliminary Study

Nana Mulyana Maghfur, Muhammad Okky Ibrohim, Junaedi Fahmi, Achmad Satria Putera, Oskar Riandi
{"title":"Text Normalization for Indonesian Text-to-Speech (TTS) using Rule-Based Approach: A Dataset and Preliminary Study","authors":"Nana Mulyana Maghfur, Muhammad Okky Ibrohim, Junaedi Fahmi, Achmad Satria Putera, Oskar Riandi","doi":"10.1109/ic2ie53219.2021.9649417","DOIUrl":null,"url":null,"abstract":"Text-to-Speech (TTS) is a technology that is currently widely used for several purposes both for academic/ non-commercial and industry/commercial purposes. In several cases, some researchers on the TTS field adding a text normalization process for normalizing text that will be used for TTS input to enhance the TTS performance itself. In this paper, we present a rule-based approach to make an Indonesian text normalization dataset that has a raw text and a spoken form of it for enhancing Indonesian Text-to-Speech (TTS) performance. We conduct a set of rule-based for normalizing Indonesian text as an input for the TTS system. Using those rule-based, we generated a dataset and correct it manually so that we have a gold standard for text normalization for Indonesian TTS input. Our approach shows a rule-based can give a good performance for normalizing text for Indonesian TTS with 0.0805 of Word Error Rate (WER).","PeriodicalId":178443,"journal":{"name":"2021 4th International Conference of Computer and Informatics Engineering (IC2IE)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Conference of Computer and Informatics Engineering (IC2IE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ic2ie53219.2021.9649417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Text-to-Speech (TTS) is a technology that is currently widely used for several purposes both for academic/ non-commercial and industry/commercial purposes. In several cases, some researchers on the TTS field adding a text normalization process for normalizing text that will be used for TTS input to enhance the TTS performance itself. In this paper, we present a rule-based approach to make an Indonesian text normalization dataset that has a raw text and a spoken form of it for enhancing Indonesian Text-to-Speech (TTS) performance. We conduct a set of rule-based for normalizing Indonesian text as an input for the TTS system. Using those rule-based, we generated a dataset and correct it manually so that we have a gold standard for text normalization for Indonesian TTS input. Our approach shows a rule-based can give a good performance for normalizing text for Indonesian TTS with 0.0805 of Word Error Rate (WER).
基于规则方法的印尼语文本到语音(TTS)文本规范化:数据集和初步研究
文本到语音(TTS)是目前广泛用于学术/非商业和工业/商业目的的几种技术。在一些情况下,一些研究人员在TTS领域增加了一个文本规范化过程,将文本规范化用于TTS输入,以提高TTS本身的性能。在本文中,我们提出了一种基于规则的方法来制作具有原始文本和口语形式的印尼语文本规范化数据集,以增强印尼语文本到语音(TTS)的性能。我们执行了一组基于规则的规则,用于规范化作为TTS系统输入的印度尼西亚文本。使用这些基于规则的方法,我们生成了一个数据集,并手动对其进行校正,这样我们就有了印尼语TTS输入文本规范化的黄金标准。我们的方法表明,基于规则的方法可以在0.0805的单词错误率(WER)下为印尼语TTS文本规范化提供良好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信