User Identification System Using Biometrics Speaker Recognition by MFCC and DTW Along with Signal Processing Package

2018 IEEE International Conference on Electro/Information Technology (EIT). Pub Date: 2018-05-03. DOI: 10.1109/EIT.2018.8500256

Tazwar Muttaqi, S. Mousavinezhad, S. Mahamud

Citations: 7

Abstract

A user identification framework is essential for protecting data against unauthorized access. To build a robust voice-based user identification system, a new system is proposed that identifies users with Mel-Scale Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW), supported by a package of digital signal processing steps. The human voice carries a large amount of information, and precise voice recognition requires computerized processing. The proposed method extracts distinctive features from a voice signal with MFCC and uses DTW to compare the features of two signals, aided by signal processing steps such as filtering, signal alignment, removal of unvoiced parts, amplitude normalization, and zero-part removal. According to the authors, these steps together enable accurate voice signal recognition. Based on the similarity between voice signals, the system distinguishes between users and grants multiple users access to a secured area, which could be valuable for the internal security of a classified organization or nation.
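The comparison stage described in the abstract can be illustrated with a short Python sketch. It is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the silence threshold, and the acceptance rule in `is_same_speaker` are hypothetical, and the feature vectors are assumed to come from any MFCC extractor, which is not shown here. The sketch covers amplitude normalization, trimming of near-zero samples at both ends, and a classic DTW cost between two feature sequences.

```python
import math


def normalize_amplitude(signal):
    """Scale samples so the largest absolute value is 1.0."""
    peak = max((abs(s) for s in signal), default=0.0)
    if peak == 0.0:
        return list(signal)
    return [s / peak for s in signal]


def trim_silence(signal, threshold=0.02):
    """Drop leading and trailing samples with magnitude below threshold.

    The threshold value is an assumption chosen for illustration.
    """
    active = [i for i, s in enumerate(signal) if abs(s) >= threshold]
    if not active:
        return []
    return list(signal[active[0]:active[-1] + 1])


def euclidean(a, b):
    """Distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(seq_a, seq_b):
    """Classic DTW cost between two sequences of feature vectors.

    Runs in O(len(seq_a) * len(seq_b)) time with no window constraint.
    """
    n, m = len(seq_a), len(seq_b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = euclidean(seq_a[i - 1], seq_b[j - 1])
            # Extend the cheapest of the three allowed predecessor cells.
            cost[i][j] = d + min(cost[i - 1][j],
                                 cost[i][j - 1],
                                 cost[i - 1][j - 1])
    return cost[n][m]


def is_same_speaker(features_a, features_b, threshold):
    """Hypothetical decision rule: accept when the DTW cost is below threshold."""
    return dtw_distance(features_a, features_b) < threshold
```

Because DTW lets one frame align with several frames of the other sequence, two utterances that differ only in speaking rate can still yield a low cost. In practice the acceptance threshold would have to be tuned on enrolled recordings.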

