{"title":"Speaker recognition based on MFCC and BP neural networks","authors":"Yi Wang, B. Lawlor","doi":"10.1109/ISSC.2017.7983644","DOIUrl":null,"url":null,"abstract":"Speaker recognition has been developed over many years and it comes with many different methods. MFCC is one of more the successful methods due to it being generally modeled on the human auditory system. It represents high success rate of recognition and strong robustness against noise in the lower frequency regions. However, in the higher frequency regions, it captures speaker characteristics information less effectively. In recent years, Artificial Neural Networks have become popular. This paper presents a speaker recognition method based on MFCC and Back-Propagation Neural Networks. Experimental studies have proven that the recognition rate is successful when the number of questionable speakers is not very larger. When the number of speakers increases, the rate of recognition decreases. The potential problems and solutions are discussed, the number of training samples must be larger than the number of network model weights, 2–10 times. When the number of speakers increases, the number of training samples required also increases significantly.","PeriodicalId":170320,"journal":{"name":"2017 28th Irish Signals and Systems Conference (ISSC)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 28th Irish Signals and Systems Conference (ISSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSC.2017.7983644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26
Abstract
Speaker recognition has been developed over many years and it comes with many different methods. MFCC is one of more the successful methods due to it being generally modeled on the human auditory system. It represents high success rate of recognition and strong robustness against noise in the lower frequency regions. However, in the higher frequency regions, it captures speaker characteristics information less effectively. In recent years, Artificial Neural Networks have become popular. This paper presents a speaker recognition method based on MFCC and Back-Propagation Neural Networks. Experimental studies have proven that the recognition rate is successful when the number of questionable speakers is not very larger. When the number of speakers increases, the rate of recognition decreases. The potential problems and solutions are discussed, the number of training samples must be larger than the number of network model weights, 2–10 times. When the number of speakers increases, the number of training samples required also increases significantly.