CNN Trained Speaker Recognition System in Electric Vehicles

2022 International Virtual Conference on Power Engineering Computing and Control: Developments in Electric Vehicles and Energy Sector for Sustainable Future (PECCON) Pub Date : 2022-05-05 DOI:10.1109/PECCON55017.2022.9851029

Pavani Budiga, B. B, Gourimahadevi Gunisetty, Nalini Devi Moka, G. Reddy

{"title":"CNN Trained Speaker Recognition System in Electric Vehicles","authors":"Pavani Budiga, B. B, Gourimahadevi Gunisetty, Nalini Devi Moka, G. Reddy","doi":"10.1109/PECCON55017.2022.9851029","DOIUrl":null,"url":null,"abstract":"Speaker recognition is the technique of determining a person's identity based on their voice features. Speaker recognition modules are now included in several commercial products because of the speaker recognition revolution. One such application is in electric vehicles, where a speaker recognition system is used for voice authentication in unlocking the vehicle. The performance was affected due to the background noise in the existing model which was improved using the proposed Least Mean Square (LMS) filter and Kalman filter. For reducing background noise, the LMS filter performed much better, while the Kalman filter performed better for Additive White Gaussian Noise (AWGN). In this work, Features of a speech are extracted using Mel Frequency Cepstral Coefficient (MFCC) which is trained on Convolutional Neural Network (CNN) classifier algorithm employing 16000 PCM speech samples dataset. Recognizing speakers from different recording conditions creates numerous challenges for the system. The recognition accuracy increased to 92.8%. Superior results were obtained using the presented MFCC-CNN model with filtering approaches. Hence the experimental results conveys that the implemented model for external noises in speaker recognition system is better.","PeriodicalId":129147,"journal":{"name":"2022 International Virtual Conference on Power Engineering Computing and Control: Developments in Electric Vehicles and Energy Sector for Sustainable Future (PECCON)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Virtual Conference on Power Engineering Computing and Control: Developments in Electric Vehicles and Energy Sector for Sustainable Future (PECCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PECCON55017.2022.9851029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Speaker recognition is the technique of determining a person's identity based on their voice features. Speaker recognition modules are now included in several commercial products because of the speaker recognition revolution. One such application is in electric vehicles, where a speaker recognition system is used for voice authentication in unlocking the vehicle. The performance was affected due to the background noise in the existing model which was improved using the proposed Least Mean Square (LMS) filter and Kalman filter. For reducing background noise, the LMS filter performed much better, while the Kalman filter performed better for Additive White Gaussian Noise (AWGN). In this work, Features of a speech are extracted using Mel Frequency Cepstral Coefficient (MFCC) which is trained on Convolutional Neural Network (CNN) classifier algorithm employing 16000 PCM speech samples dataset. Recognizing speakers from different recording conditions creates numerous challenges for the system. The recognition accuracy increased to 92.8%. Superior results were obtained using the presented MFCC-CNN model with filtering approaches. Hence the experimental results conveys that the implemented model for external noises in speaker recognition system is better.

查看原文本刊更多论文

CNN训练的电动汽车说话人识别系统

说话人识别是一种根据一个人的声音特征来确定其身份的技术。由于扬声器识别革命，扬声器识别模块现在包含在几个商业产品中。其中一个应用是电动汽车，在解锁车辆时使用扬声器识别系统进行语音认证。利用最小均方滤波和卡尔曼滤波对已有模型中受背景噪声影响的性能进行了改进。在降低背景噪声方面，LMS滤波器表现更好，而卡尔曼滤波器对加性高斯白噪声(AWGN)表现更好。在这项工作中，使用Mel频率倒谱系数(MFCC)提取语音特征，该特征在使用16000个PCM语音样本数据集的卷积神经网络(CNN)分类器算法上进行训练。识别来自不同录音条件的说话者给系统带来了许多挑战。识别准确率提高到92.8%。采用MFCC-CNN模型和滤波方法，得到了较好的结果。实验结果表明，所实现的模型对说话人识别系统中的外部噪声处理效果较好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Virtual Conference on Power Engineering Computing and Control: Developments in Electric Vehicles and Energy Sector for Sustainable Future (PECCON)

自引率

0.00%

发文量