基于小波、人义熵和AR-PSD特征的音节语音分类

2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA) Pub Date : 2017-03-01 DOI:10.1109/CSPA.2017.8064931

Domy Kristomo, Risanuri Hidayat, I. Soesanti

{"title":"基于小波、人义熵和AR-PSD特征的音节语音分类","authors":"Domy Kristomo, Risanuri Hidayat, I. Soesanti","doi":"10.1109/CSPA.2017.8064931","DOIUrl":null,"url":null,"abstract":"Feature extraction plays a very important role in the speech classification process because a better feature is good for improving the classification rate. This paper presents a speech feature extraction method by using Discrete Wavelet Transform (DWT) at 7th level of decomposition with mother wavelet of Dau-bechies 2, Renyi Entropy (RE), Autoregressive Power Spectral Density (AR-PSD), Statistical, as well as the combination of each method for extracting and classifying the certain Indonesian velar-vowel and alveolar-vowel syllables. Five different features set used in this study, namely the combination features of DWT and statistical (WS), RE, the combination of AR-PSD and Statistical (PSDS), the combination of PSDS and the selected features of RE (RPSDS), and the combination of DWT, RE, and AR-PSD (WRPSDS). Each syllable is segmented at a certain length to form a consonant-vowel. Multi-layer perceptron is used as a classifier after feature extraction process. The results show that the rank of the average recognition rate are WRPSDS, WS, RPSDS, PSDS, and RE, respectively.","PeriodicalId":445522,"journal":{"name":"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Classification of the syllables sound using wavelet, Renyi entropy and AR-PSD features\",\"authors\":\"Domy Kristomo, Risanuri Hidayat, I. Soesanti\",\"doi\":\"10.1109/CSPA.2017.8064931\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature extraction plays a very important role in the speech classification process because a better feature is good for improving the classification rate. This paper presents a speech feature extraction method by using Discrete Wavelet Transform (DWT) at 7th level of decomposition with mother wavelet of Dau-bechies 2, Renyi Entropy (RE), Autoregressive Power Spectral Density (AR-PSD), Statistical, as well as the combination of each method for extracting and classifying the certain Indonesian velar-vowel and alveolar-vowel syllables. Five different features set used in this study, namely the combination features of DWT and statistical (WS), RE, the combination of AR-PSD and Statistical (PSDS), the combination of PSDS and the selected features of RE (RPSDS), and the combination of DWT, RE, and AR-PSD (WRPSDS). Each syllable is segmented at a certain length to form a consonant-vowel. Multi-layer perceptron is used as a classifier after feature extraction process. The results show that the rank of the average recognition rate are WRPSDS, WS, RPSDS, PSDS, and RE, respectively.\",\"PeriodicalId\":445522,\"journal\":{\"name\":\"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSPA.2017.8064931\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPA.2017.8064931","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

特征提取在语音分类过程中起着非常重要的作用，好的特征有利于提高分类率。本文提出了一种利用7级离散小波变换(DWT)与母小波分解、Renyi熵(RE)、自回归功率谱密度(AR-PSD)和统计学相结合的方法提取印尼语部分velar-元音和alveal -元音音节的语音特征提取方法。本研究使用了五种不同的特征集，分别是DWT与statistical的组合特征(WS)、RE、AR-PSD与statistical的组合特征(PSDS)、PSDS与RE的选择特征(RPSDS)、DWT、RE和AR-PSD的组合特征(WRPSDS)。每个音节被分割成一定的长度，形成一个辅音-元音。经过特征提取后，使用多层感知器作为分类器。结果表明:平均识别率排名依次为WRPSDS、WS、RPSDS、PSDS、RE。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Classification of the syllables sound using wavelet, Renyi entropy and AR-PSD features

Feature extraction plays a very important role in the speech classification process because a better feature is good for improving the classification rate. This paper presents a speech feature extraction method by using Discrete Wavelet Transform (DWT) at 7th level of decomposition with mother wavelet of Dau-bechies 2, Renyi Entropy (RE), Autoregressive Power Spectral Density (AR-PSD), Statistical, as well as the combination of each method for extracting and classifying the certain Indonesian velar-vowel and alveolar-vowel syllables. Five different features set used in this study, namely the combination features of DWT and statistical (WS), RE, the combination of AR-PSD and Statistical (PSDS), the combination of PSDS and the selected features of RE (RPSDS), and the combination of DWT, RE, and AR-PSD (WRPSDS). Each syllable is segmented at a certain length to form a consonant-vowel. Multi-layer perceptron is used as a classifier after feature extraction process. The results show that the rank of the average recognition rate are WRPSDS, WS, RPSDS, PSDS, and RE, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA)

自引率

0.00%

发文量