{"title":"基于离散小波变换和统计特征的印尼语音节特征提取与分类","authors":"Domy Kristomo, Risanuri Hidayat, I. Soesanti","doi":"10.1109/ICSTC.2016.7877353","DOIUrl":null,"url":null,"abstract":"The major problem of most speech recognition systems is their unsatisfactory effectiveness (impact to recognition rate), efficiency (feature vector dimension), shift variance, and robustness in noisy condition. Feature extraction plays a very important role in the speech recognition process, because a better feature is good for improving the recognition rate. This paper presents a speech feature extraction by combining Discrete Wavelet Transform (DWT) and statistical method for recognizing the syllables sound in the Indonesian language. Three different mother wavelet transforms combined with statistical method (DWT-Statistical) are used as a feature extraction method. Multi-layer perceptron is used as a classifier after feature extraction process. This research aims to find the best properties in effectiveness and efficiency on performing feature extraction of each syllable sound to be applied in the speech recognition method on the intelligent systems. Experiments, in this study, show that the proposed method which uses 29 features set is applicable for feature extraction and classification of the Indonesian syllable. The results show that the average of recognition rate for the DWT-Statistical at the 7th level decomposition by using mother wavelet of Haar, Daubechies 2, and Coiflet 2 are 57.77%, 71.11%, and 67.77%, respectively.","PeriodicalId":228650,"journal":{"name":"2016 2nd International Conference on Science and Technology-Computer (ICST)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Feature extraction and classification of the Indonesian syllables using Discrete Wavelet Transform and statistical features\",\"authors\":\"Domy Kristomo, Risanuri Hidayat, I. Soesanti\",\"doi\":\"10.1109/ICSTC.2016.7877353\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The major problem of most speech recognition systems is their unsatisfactory effectiveness (impact to recognition rate), efficiency (feature vector dimension), shift variance, and robustness in noisy condition. Feature extraction plays a very important role in the speech recognition process, because a better feature is good for improving the recognition rate. This paper presents a speech feature extraction by combining Discrete Wavelet Transform (DWT) and statistical method for recognizing the syllables sound in the Indonesian language. Three different mother wavelet transforms combined with statistical method (DWT-Statistical) are used as a feature extraction method. Multi-layer perceptron is used as a classifier after feature extraction process. This research aims to find the best properties in effectiveness and efficiency on performing feature extraction of each syllable sound to be applied in the speech recognition method on the intelligent systems. Experiments, in this study, show that the proposed method which uses 29 features set is applicable for feature extraction and classification of the Indonesian syllable. The results show that the average of recognition rate for the DWT-Statistical at the 7th level decomposition by using mother wavelet of Haar, Daubechies 2, and Coiflet 2 are 57.77%, 71.11%, and 67.77%, respectively.\",\"PeriodicalId\":228650,\"journal\":{\"name\":\"2016 2nd International Conference on Science and Technology-Computer (ICST)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 2nd International Conference on Science and Technology-Computer (ICST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSTC.2016.7877353\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Science and Technology-Computer (ICST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSTC.2016.7877353","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature extraction and classification of the Indonesian syllables using Discrete Wavelet Transform and statistical features
The major problem of most speech recognition systems is their unsatisfactory effectiveness (impact to recognition rate), efficiency (feature vector dimension), shift variance, and robustness in noisy condition. Feature extraction plays a very important role in the speech recognition process, because a better feature is good for improving the recognition rate. This paper presents a speech feature extraction by combining Discrete Wavelet Transform (DWT) and statistical method for recognizing the syllables sound in the Indonesian language. Three different mother wavelet transforms combined with statistical method (DWT-Statistical) are used as a feature extraction method. Multi-layer perceptron is used as a classifier after feature extraction process. This research aims to find the best properties in effectiveness and efficiency on performing feature extraction of each syllable sound to be applied in the speech recognition method on the intelligent systems. Experiments, in this study, show that the proposed method which uses 29 features set is applicable for feature extraction and classification of the Indonesian syllable. The results show that the average of recognition rate for the DWT-Statistical at the 7th level decomposition by using mother wavelet of Haar, Daubechies 2, and Coiflet 2 are 57.77%, 71.11%, and 67.77%, respectively.