基于MFCC和DTW及信号处理包的语音识别系统

Tazwar Muttaqi, S. Mousavinezhad, S. Mahamud
{"title":"基于MFCC和DTW及信号处理包的语音识别系统","authors":"Tazwar Muttaqi, S. Mousavinezhad, S. Mahamud","doi":"10.1109/EIT.2018.8500256","DOIUrl":null,"url":null,"abstract":"User identification proof framework is essential for securing data from illicit access. To build a robust user identification system using voice, a new system is proposed to identify users using Mel-Scale Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) along with a package of digital signal processing. Human voice is a sign of boundless data. Precise voice recognition requires computerized processing. Proposed method extracts unique features from a voice signal by MFCC and DTW to compare the components between two signals with the aid of some efficient signal processing such as filtering, signal alignment, removing unvoiced part, amplitude normalization and zero-part removal. All these steps work perfectly for accurate voice signal recognition. Based on the similarity between voice signals, it distinguishes different users and grant access to the secured area for multiple users which could be substantial for internal security for any classified organization or nation.","PeriodicalId":188414,"journal":{"name":"2018 IEEE International Conference on Electro/Information Technology (EIT)","volume":"92 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"User Identification System Using Biometrics Speaker Recognition by MFCC and DTW Along with Signal Processing Package\",\"authors\":\"Tazwar Muttaqi, S. Mousavinezhad, S. Mahamud\",\"doi\":\"10.1109/EIT.2018.8500256\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"User identification proof framework is essential for securing data from illicit access. To build a robust user identification system using voice, a new system is proposed to identify users using Mel-Scale Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) along with a package of digital signal processing. Human voice is a sign of boundless data. Precise voice recognition requires computerized processing. Proposed method extracts unique features from a voice signal by MFCC and DTW to compare the components between two signals with the aid of some efficient signal processing such as filtering, signal alignment, removing unvoiced part, amplitude normalization and zero-part removal. All these steps work perfectly for accurate voice signal recognition. Based on the similarity between voice signals, it distinguishes different users and grant access to the secured area for multiple users which could be substantial for internal security for any classified organization or nation.\",\"PeriodicalId\":188414,\"journal\":{\"name\":\"2018 IEEE International Conference on Electro/Information Technology (EIT)\",\"volume\":\"92 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE International Conference on Electro/Information Technology (EIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIT.2018.8500256\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Electro/Information Technology (EIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIT.2018.8500256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

摘要

用户身份证明框架对于防止非法访问数据至关重要。为了构建鲁棒的语音用户识别系统,提出了一种基于Mel-Scale Frequency Cepstral Coefficients (MFCC)和Dynamic Time Warping (DTW)的语音用户识别系统。人的声音是无限数据的标志。精确的声音识别需要计算机处理。该方法通过对语音信号进行滤波、信号对准、去浊音部分、幅度归一化和去零部分等有效的信号处理,通过MFCC和DTW提取语音信号的独特特征,比较两种信号的分量。所有这些步骤都完美地实现了准确的语音信号识别。基于语音信号之间的相似性,它可以区分不同的用户,并为多个用户授予访问安全区域的权限,这对于任何机密组织或国家的内部安全都是至关重要的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
User Identification System Using Biometrics Speaker Recognition by MFCC and DTW Along with Signal Processing Package
User identification proof framework is essential for securing data from illicit access. To build a robust user identification system using voice, a new system is proposed to identify users using Mel-Scale Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) along with a package of digital signal processing. Human voice is a sign of boundless data. Precise voice recognition requires computerized processing. Proposed method extracts unique features from a voice signal by MFCC and DTW to compare the components between two signals with the aid of some efficient signal processing such as filtering, signal alignment, removing unvoiced part, amplitude normalization and zero-part removal. All these steps work perfectly for accurate voice signal recognition. Based on the similarity between voice signals, it distinguishes different users and grant access to the secured area for multiple users which could be substantial for internal security for any classified organization or nation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信