基于离散小波变换和统计特征的印尼语音节特征提取与分类

Domy Kristomo, Risanuri Hidayat, I. Soesanti
{"title":"基于离散小波变换和统计特征的印尼语音节特征提取与分类","authors":"Domy Kristomo, Risanuri Hidayat, I. Soesanti","doi":"10.1109/ICSTC.2016.7877353","DOIUrl":null,"url":null,"abstract":"The major problem of most speech recognition systems is their unsatisfactory effectiveness (impact to recognition rate), efficiency (feature vector dimension), shift variance, and robustness in noisy condition. Feature extraction plays a very important role in the speech recognition process, because a better feature is good for improving the recognition rate. This paper presents a speech feature extraction by combining Discrete Wavelet Transform (DWT) and statistical method for recognizing the syllables sound in the Indonesian language. Three different mother wavelet transforms combined with statistical method (DWT-Statistical) are used as a feature extraction method. Multi-layer perceptron is used as a classifier after feature extraction process. This research aims to find the best properties in effectiveness and efficiency on performing feature extraction of each syllable sound to be applied in the speech recognition method on the intelligent systems. Experiments, in this study, show that the proposed method which uses 29 features set is applicable for feature extraction and classification of the Indonesian syllable. The results show that the average of recognition rate for the DWT-Statistical at the 7th level decomposition by using mother wavelet of Haar, Daubechies 2, and Coiflet 2 are 57.77%, 71.11%, and 67.77%, respectively.","PeriodicalId":228650,"journal":{"name":"2016 2nd International Conference on Science and Technology-Computer (ICST)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Feature extraction and classification of the Indonesian syllables using Discrete Wavelet Transform and statistical features\",\"authors\":\"Domy Kristomo, Risanuri Hidayat, I. Soesanti\",\"doi\":\"10.1109/ICSTC.2016.7877353\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The major problem of most speech recognition systems is their unsatisfactory effectiveness (impact to recognition rate), efficiency (feature vector dimension), shift variance, and robustness in noisy condition. Feature extraction plays a very important role in the speech recognition process, because a better feature is good for improving the recognition rate. This paper presents a speech feature extraction by combining Discrete Wavelet Transform (DWT) and statistical method for recognizing the syllables sound in the Indonesian language. Three different mother wavelet transforms combined with statistical method (DWT-Statistical) are used as a feature extraction method. Multi-layer perceptron is used as a classifier after feature extraction process. This research aims to find the best properties in effectiveness and efficiency on performing feature extraction of each syllable sound to be applied in the speech recognition method on the intelligent systems. Experiments, in this study, show that the proposed method which uses 29 features set is applicable for feature extraction and classification of the Indonesian syllable. The results show that the average of recognition rate for the DWT-Statistical at the 7th level decomposition by using mother wavelet of Haar, Daubechies 2, and Coiflet 2 are 57.77%, 71.11%, and 67.77%, respectively.\",\"PeriodicalId\":228650,\"journal\":{\"name\":\"2016 2nd International Conference on Science and Technology-Computer (ICST)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 2nd International Conference on Science and Technology-Computer (ICST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSTC.2016.7877353\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Science and Technology-Computer (ICST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSTC.2016.7877353","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

大多数语音识别系统的主要问题是它们的有效性(对识别率的影响)、效率(特征向量维数)、移位方差和噪声条件下的鲁棒性不理想。特征提取在语音识别过程中起着非常重要的作用,好的特征有利于提高语音识别率。提出了一种结合离散小波变换和统计方法的印尼语语音特征提取方法,用于印尼语音节语音识别。采用三种不同的母小波变换结合统计方法(DWT-Statistical)作为特征提取方法。经过特征提取后,使用多层感知器作为分类器。本研究的目的是寻找对每个音节语音进行特征提取的最佳有效性和效率,并将其应用于智能系统的语音识别方法中。实验表明,本文提出的使用29个特征集的方法适用于印尼语音节的特征提取和分类。结果表明:采用Haar、Daubechies 2和Coiflet 2母小波进行7级分解的DWT-Statistical平均识别率分别为57.77%、71.11%和67.77%;
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Feature extraction and classification of the Indonesian syllables using Discrete Wavelet Transform and statistical features
The major problem of most speech recognition systems is their unsatisfactory effectiveness (impact to recognition rate), efficiency (feature vector dimension), shift variance, and robustness in noisy condition. Feature extraction plays a very important role in the speech recognition process, because a better feature is good for improving the recognition rate. This paper presents a speech feature extraction by combining Discrete Wavelet Transform (DWT) and statistical method for recognizing the syllables sound in the Indonesian language. Three different mother wavelet transforms combined with statistical method (DWT-Statistical) are used as a feature extraction method. Multi-layer perceptron is used as a classifier after feature extraction process. This research aims to find the best properties in effectiveness and efficiency on performing feature extraction of each syllable sound to be applied in the speech recognition method on the intelligent systems. Experiments, in this study, show that the proposed method which uses 29 features set is applicable for feature extraction and classification of the Indonesian syllable. The results show that the average of recognition rate for the DWT-Statistical at the 7th level decomposition by using mother wavelet of Haar, Daubechies 2, and Coiflet 2 are 57.77%, 71.11%, and 67.77%, respectively.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信