{"title":"Optimal MFCC features extraction by differential evolution algorithm for speaker recognition","authors":"Mohsen Sadeghi, H. Marvi","doi":"10.1109/ICSPIS.2017.8311610","DOIUrl":null,"url":null,"abstract":"Speech is the most commonly and widely used form of communication and interaction between humans. The interfacing system, which is an automatic speaker recognition system, requires modeling to receive input data in the form of a feature with a minimum number and learn through this data. The purpose of this paper is to extract the optimal number of Mel-Frequency Cepstral Coefficients (MFCC) features without reducing the recognition accuracy for speaker recognition application. For this purpose, an algorithm has been proposed in which the Differential Evolution (EA) optimizer and also the probabilistic neural network (PNN) classifier are used to achieve this goal. After implementing this algorithm in MATLAB software, it was observed that the number of MFCC features, which so far had at least 13 for each frame, was reduced to 5 per frame, without any recognition accuracy being reduced.","PeriodicalId":380266,"journal":{"name":"2017 3rd Iranian Conference on Intelligent Systems and Signal Processing (ICSPIS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 3rd Iranian Conference on Intelligent Systems and Signal Processing (ICSPIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPIS.2017.8311610","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
Speech is the most commonly and widely used form of communication and interaction between humans. The interfacing system, which is an automatic speaker recognition system, requires modeling to receive input data in the form of a feature with a minimum number and learn through this data. The purpose of this paper is to extract the optimal number of Mel-Frequency Cepstral Coefficients (MFCC) features without reducing the recognition accuracy for speaker recognition application. For this purpose, an algorithm has been proposed in which the Differential Evolution (EA) optimizer and also the probabilistic neural network (PNN) classifier are used to achieve this goal. After implementing this algorithm in MATLAB software, it was observed that the number of MFCC features, which so far had at least 13 for each frame, was reduced to 5 per frame, without any recognition accuracy being reduced.