{"title":"Identification of Speaker from Disguised Voice Using MFCC Feature Extraction, Chi-Square and Classification Technique","authors":"Mahesh K. Singh","doi":"10.1007/s11277-024-11542-0","DOIUrl":null,"url":null,"abstract":"<p>The purpose of this manuscript is to show that certain acoustic features can be used to recognize the disguised speech of unknown speakers. As the name implies, forensic speaker identification entails the use of scientific techniques to ascertain an unknown speaker’s identity during an inquiry. This study aims to provide a voice recognition method that works well. To distinguish between speech and background noise in each frame, chi-square tests are utilized. The estimated background noise is continuously modified to achieve this. Chi-square noise estimations are then obtained once background noise has initially been reduced. The observed signal distribution and the estimated noise distribution are compared using a second chi-square test, this time using a different approach. For the frame to be labelled as noise, the chi-square test scores must be close together. Mel-frequency cepstrum coefficient (MFCC), features are grouped as three-dimensional features. The correlation coefficient characteristics of speech are coupled with the different MFCC feature extraction technique. The feature-based classification is done with support vector machine (SVM) classifiers and k-nearest neighbor (k-NN) classification technique. Classification results show that applying these unique features in an SVM classifier boosts classification accuracy.</p>","PeriodicalId":23827,"journal":{"name":"Wireless Personal Communications","volume":"138 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wireless Personal Communications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11277-024-11542-0","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The purpose of this manuscript is to show that certain acoustic features can be used to recognize the disguised speech of unknown speakers. As the name implies, forensic speaker identification entails the use of scientific techniques to ascertain an unknown speaker’s identity during an inquiry. This study aims to provide a voice recognition method that works well. To distinguish between speech and background noise in each frame, chi-square tests are utilized. The estimated background noise is continuously modified to achieve this. Chi-square noise estimations are then obtained once background noise has initially been reduced. The observed signal distribution and the estimated noise distribution are compared using a second chi-square test, this time using a different approach. For the frame to be labelled as noise, the chi-square test scores must be close together. Mel-frequency cepstrum coefficient (MFCC), features are grouped as three-dimensional features. The correlation coefficient characteristics of speech are coupled with the different MFCC feature extraction technique. The feature-based classification is done with support vector machine (SVM) classifiers and k-nearest neighbor (k-NN) classification technique. Classification results show that applying these unique features in an SVM classifier boosts classification accuracy.
期刊介绍:
The Journal on Mobile Communication and Computing ...
Publishes tutorial, survey, and original research papers addressing mobile communications and computing;
Investigates theoretical, engineering, and experimental aspects of radio communications, voice, data, images, and multimedia;
Explores propagation, system models, speech and image coding, multiple access techniques, protocols, performance evaluation, radio local area networks, and networking and architectures, etc.;
98% of authors who answered a survey reported that they would definitely publish or probably publish in the journal again.
Wireless Personal Communications is an archival, peer reviewed, scientific and technical journal addressing mobile communications and computing. It investigates theoretical, engineering, and experimental aspects of radio communications, voice, data, images, and multimedia. A partial list of topics included in the journal is: propagation, system models, speech and image coding, multiple access techniques, protocols performance evaluation, radio local area networks, and networking and architectures.
In addition to the above mentioned areas, the journal also accepts papers that deal with interdisciplinary aspects of wireless communications along with: big data and analytics, business and economy, society, and the environment.
The journal features five principal types of papers: full technical papers, short papers, technical aspects of policy and standardization, letters offering new research thoughts and experimental ideas, and invited papers on important and emerging topics authored by renowned experts.