{"title":"基于差分进化算法的最优MFCC特征提取用于说话人识别","authors":"Mohsen Sadeghi, H. Marvi","doi":"10.1109/ICSPIS.2017.8311610","DOIUrl":null,"url":null,"abstract":"Speech is the most commonly and widely used form of communication and interaction between humans. The interfacing system, which is an automatic speaker recognition system, requires modeling to receive input data in the form of a feature with a minimum number and learn through this data. The purpose of this paper is to extract the optimal number of Mel-Frequency Cepstral Coefficients (MFCC) features without reducing the recognition accuracy for speaker recognition application. For this purpose, an algorithm has been proposed in which the Differential Evolution (EA) optimizer and also the probabilistic neural network (PNN) classifier are used to achieve this goal. After implementing this algorithm in MATLAB software, it was observed that the number of MFCC features, which so far had at least 13 for each frame, was reduced to 5 per frame, without any recognition accuracy being reduced.","PeriodicalId":380266,"journal":{"name":"2017 3rd Iranian Conference on Intelligent Systems and Signal Processing (ICSPIS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Optimal MFCC features extraction by differential evolution algorithm for speaker recognition\",\"authors\":\"Mohsen Sadeghi, H. Marvi\",\"doi\":\"10.1109/ICSPIS.2017.8311610\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech is the most commonly and widely used form of communication and interaction between humans. The interfacing system, which is an automatic speaker recognition system, requires modeling to receive input data in the form of a feature with a minimum number and learn through this data. The purpose of this paper is to extract the optimal number of Mel-Frequency Cepstral Coefficients (MFCC) features without reducing the recognition accuracy for speaker recognition application. For this purpose, an algorithm has been proposed in which the Differential Evolution (EA) optimizer and also the probabilistic neural network (PNN) classifier are used to achieve this goal. After implementing this algorithm in MATLAB software, it was observed that the number of MFCC features, which so far had at least 13 for each frame, was reduced to 5 per frame, without any recognition accuracy being reduced.\",\"PeriodicalId\":380266,\"journal\":{\"name\":\"2017 3rd Iranian Conference on Intelligent Systems and Signal Processing (ICSPIS)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 3rd Iranian Conference on Intelligent Systems and Signal Processing (ICSPIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSPIS.2017.8311610\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 3rd Iranian Conference on Intelligent Systems and Signal Processing (ICSPIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPIS.2017.8311610","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Optimal MFCC features extraction by differential evolution algorithm for speaker recognition
Speech is the most commonly and widely used form of communication and interaction between humans. The interfacing system, which is an automatic speaker recognition system, requires modeling to receive input data in the form of a feature with a minimum number and learn through this data. The purpose of this paper is to extract the optimal number of Mel-Frequency Cepstral Coefficients (MFCC) features without reducing the recognition accuracy for speaker recognition application. For this purpose, an algorithm has been proposed in which the Differential Evolution (EA) optimizer and also the probabilistic neural network (PNN) classifier are used to achieve this goal. After implementing this algorithm in MATLAB software, it was observed that the number of MFCC features, which so far had at least 13 for each frame, was reduced to 5 per frame, without any recognition accuracy being reduced.