CNN Trained Speaker Recognition System in Electric Vehicles

Pavani Budiga, B. B, Gourimahadevi Gunisetty, Nalini Devi Moka, G. Reddy
{"title":"CNN Trained Speaker Recognition System in Electric Vehicles","authors":"Pavani Budiga, B. B, Gourimahadevi Gunisetty, Nalini Devi Moka, G. Reddy","doi":"10.1109/PECCON55017.2022.9851029","DOIUrl":null,"url":null,"abstract":"Speaker recognition is the technique of determining a person's identity based on their voice features. Speaker recognition modules are now included in several commercial products because of the speaker recognition revolution. One such application is in electric vehicles, where a speaker recognition system is used for voice authentication in unlocking the vehicle. The performance was affected due to the background noise in the existing model which was improved using the proposed Least Mean Square (LMS) filter and Kalman filter. For reducing background noise, the LMS filter performed much better, while the Kalman filter performed better for Additive White Gaussian Noise (AWGN). In this work, Features of a speech are extracted using Mel Frequency Cepstral Coefficient (MFCC) which is trained on Convolutional Neural Network (CNN) classifier algorithm employing 16000 PCM speech samples dataset. Recognizing speakers from different recording conditions creates numerous challenges for the system. The recognition accuracy increased to 92.8%. Superior results were obtained using the presented MFCC-CNN model with filtering approaches. Hence the experimental results conveys that the implemented model for external noises in speaker recognition system is better.","PeriodicalId":129147,"journal":{"name":"2022 International Virtual Conference on Power Engineering Computing and Control: Developments in Electric Vehicles and Energy Sector for Sustainable Future (PECCON)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Virtual Conference on Power Engineering Computing and Control: Developments in Electric Vehicles and Energy Sector for Sustainable Future (PECCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PECCON55017.2022.9851029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Speaker recognition is the technique of determining a person's identity based on their voice features. Speaker recognition modules are now included in several commercial products because of the speaker recognition revolution. One such application is in electric vehicles, where a speaker recognition system is used for voice authentication in unlocking the vehicle. The performance was affected due to the background noise in the existing model which was improved using the proposed Least Mean Square (LMS) filter and Kalman filter. For reducing background noise, the LMS filter performed much better, while the Kalman filter performed better for Additive White Gaussian Noise (AWGN). In this work, Features of a speech are extracted using Mel Frequency Cepstral Coefficient (MFCC) which is trained on Convolutional Neural Network (CNN) classifier algorithm employing 16000 PCM speech samples dataset. Recognizing speakers from different recording conditions creates numerous challenges for the system. The recognition accuracy increased to 92.8%. Superior results were obtained using the presented MFCC-CNN model with filtering approaches. Hence the experimental results conveys that the implemented model for external noises in speaker recognition system is better.
CNN训练的电动汽车说话人识别系统
说话人识别是一种根据一个人的声音特征来确定其身份的技术。由于扬声器识别革命,扬声器识别模块现在包含在几个商业产品中。其中一个应用是电动汽车,在解锁车辆时使用扬声器识别系统进行语音认证。利用最小均方滤波和卡尔曼滤波对已有模型中受背景噪声影响的性能进行了改进。在降低背景噪声方面,LMS滤波器表现更好,而卡尔曼滤波器对加性高斯白噪声(AWGN)表现更好。在这项工作中,使用Mel频率倒谱系数(MFCC)提取语音特征,该特征在使用16000个PCM语音样本数据集的卷积神经网络(CNN)分类器算法上进行训练。识别来自不同录音条件的说话者给系统带来了许多挑战。识别准确率提高到92.8%。采用MFCC-CNN模型和滤波方法,得到了较好的结果。实验结果表明,所实现的模型对说话人识别系统中的外部噪声处理效果较好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信