Speaker verification with multi-run ICA based speech enhancement

Ahmed Kamil Hasan Al-Ali, David Dean, B. Senadji, Mahsa Baktash, V. Chandran
{"title":"Speaker verification with multi-run ICA based speech enhancement","authors":"Ahmed Kamil Hasan Al-Ali, David Dean, B. Senadji, Mahsa Baktash, V. Chandran","doi":"10.1109/ICSPCS.2017.8270505","DOIUrl":null,"url":null,"abstract":"Forensic speaker verification systems show severe performance degradation in the presence of noise when the signal to noise ratio (SNR) is low. A possible solution to this problem is the use of multi-run independent component analysis (ICA) to reduce the effect of noise from the noisy speech signals. Previous works have used multi-run ICA in biosignal application; however, the effectiveness of multi-run ICA on noisy speaker verification has not been investigated yet. In this paper, we use multi-run ICA to enhance the noisy speech signals by choosing the highest signal to interference ratio (SIR) of the mixing matrix from different mixing matrices generated by iterating the fast ICA algorithm for several times. We use a combination of feature-warped mel frequency cepstral coefficients (MFCCs) and feature-warped MFCC extracted from the discrete wavelet transform (DWT) of the enhanced speech signals as the feature extraction. A state-of-the-art identity vector (i-vector) probabilistic linear discriminant analysis (PLDA) was used as a classifier in this paper. Experimental results demonstrate that the proposed method with multi-run ICA achieves high improvements in equal error rate (EER) of 66.68%, 69.24% and 70.78% over the baseline noisy speaker verification system, when the test speech signals are corrupted with CAR, STREET, and HOME noises respectively at −10 dB SNR.","PeriodicalId":268205,"journal":{"name":"2017 11th International Conference on Signal Processing and Communication Systems (ICSPCS)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 11th International Conference on Signal Processing and Communication Systems (ICSPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPCS.2017.8270505","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Forensic speaker verification systems show severe performance degradation in the presence of noise when the signal to noise ratio (SNR) is low. A possible solution to this problem is the use of multi-run independent component analysis (ICA) to reduce the effect of noise from the noisy speech signals. Previous works have used multi-run ICA in biosignal application; however, the effectiveness of multi-run ICA on noisy speaker verification has not been investigated yet. In this paper, we use multi-run ICA to enhance the noisy speech signals by choosing the highest signal to interference ratio (SIR) of the mixing matrix from different mixing matrices generated by iterating the fast ICA algorithm for several times. We use a combination of feature-warped mel frequency cepstral coefficients (MFCCs) and feature-warped MFCC extracted from the discrete wavelet transform (DWT) of the enhanced speech signals as the feature extraction. A state-of-the-art identity vector (i-vector) probabilistic linear discriminant analysis (PLDA) was used as a classifier in this paper. Experimental results demonstrate that the proposed method with multi-run ICA achieves high improvements in equal error rate (EER) of 66.68%, 69.24% and 70.78% over the baseline noisy speaker verification system, when the test speech signals are corrupted with CAR, STREET, and HOME noises respectively at −10 dB SNR.
基于多运行ICA语音增强的说话人验证
当信噪比较低时,法庭说话人验证系统在噪声存在下会出现严重的性能下降。一种可能的解决方案是使用多运行独立分量分析(ICA)来减少噪声语音信号的噪声影响。以往的研究已将多路ICA应用于生物信号;然而,多路ICA在噪声说话人验证中的有效性尚未得到研究。在本文中,我们通过多次迭代快速ICA算法生成的不同混频矩阵中选择混频矩阵的最高信噪比(SIR)来增强带噪声的语音信号。我们使用从增强语音信号的离散小波变换(DWT)中提取的特征扭曲的频率倒谱系数(MFCCs)和特征扭曲的MFCC作为特征提取。本文采用最先进的单位向量(i-vector)概率线性判别分析(PLDA)作为分类器。实验结果表明,当测试语音信号分别被CAR、STREET和HOME噪声(信噪比为- 10 dB)破坏时,该方法的等错误率(EER)比基线噪声扬声器验证系统提高了66.68%、69.24%和70.78%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信