基于多运行ICA语音增强的说话人验证

2017 11th International Conference on Signal Processing and Communication Systems (ICSPCS) Pub Date : 2017-12-01 DOI:10.1109/ICSPCS.2017.8270505

Ahmed Kamil Hasan Al-Ali, David Dean, B. Senadji, Mahsa Baktash, V. Chandran

{"title":"基于多运行ICA语音增强的说话人验证","authors":"Ahmed Kamil Hasan Al-Ali, David Dean, B. Senadji, Mahsa Baktash, V. Chandran","doi":"10.1109/ICSPCS.2017.8270505","DOIUrl":null,"url":null,"abstract":"Forensic speaker verification systems show severe performance degradation in the presence of noise when the signal to noise ratio (SNR) is low. A possible solution to this problem is the use of multi-run independent component analysis (ICA) to reduce the effect of noise from the noisy speech signals. Previous works have used multi-run ICA in biosignal application; however, the effectiveness of multi-run ICA on noisy speaker verification has not been investigated yet. In this paper, we use multi-run ICA to enhance the noisy speech signals by choosing the highest signal to interference ratio (SIR) of the mixing matrix from different mixing matrices generated by iterating the fast ICA algorithm for several times. We use a combination of feature-warped mel frequency cepstral coefficients (MFCCs) and feature-warped MFCC extracted from the discrete wavelet transform (DWT) of the enhanced speech signals as the feature extraction. A state-of-the-art identity vector (i-vector) probabilistic linear discriminant analysis (PLDA) was used as a classifier in this paper. Experimental results demonstrate that the proposed method with multi-run ICA achieves high improvements in equal error rate (EER) of 66.68%, 69.24% and 70.78% over the baseline noisy speaker verification system, when the test speech signals are corrupted with CAR, STREET, and HOME noises respectively at −10 dB SNR.","PeriodicalId":268205,"journal":{"name":"2017 11th International Conference on Signal Processing and Communication Systems (ICSPCS)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Speaker verification with multi-run ICA based speech enhancement\",\"authors\":\"Ahmed Kamil Hasan Al-Ali, David Dean, B. Senadji, Mahsa Baktash, V. Chandran\",\"doi\":\"10.1109/ICSPCS.2017.8270505\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Forensic speaker verification systems show severe performance degradation in the presence of noise when the signal to noise ratio (SNR) is low. A possible solution to this problem is the use of multi-run independent component analysis (ICA) to reduce the effect of noise from the noisy speech signals. Previous works have used multi-run ICA in biosignal application; however, the effectiveness of multi-run ICA on noisy speaker verification has not been investigated yet. In this paper, we use multi-run ICA to enhance the noisy speech signals by choosing the highest signal to interference ratio (SIR) of the mixing matrix from different mixing matrices generated by iterating the fast ICA algorithm for several times. We use a combination of feature-warped mel frequency cepstral coefficients (MFCCs) and feature-warped MFCC extracted from the discrete wavelet transform (DWT) of the enhanced speech signals as the feature extraction. A state-of-the-art identity vector (i-vector) probabilistic linear discriminant analysis (PLDA) was used as a classifier in this paper. Experimental results demonstrate that the proposed method with multi-run ICA achieves high improvements in equal error rate (EER) of 66.68%, 69.24% and 70.78% over the baseline noisy speaker verification system, when the test speech signals are corrupted with CAR, STREET, and HOME noises respectively at −10 dB SNR.\",\"PeriodicalId\":268205,\"journal\":{\"name\":\"2017 11th International Conference on Signal Processing and Communication Systems (ICSPCS)\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 11th International Conference on Signal Processing and Communication Systems (ICSPCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSPCS.2017.8270505\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 11th International Conference on Signal Processing and Communication Systems (ICSPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPCS.2017.8270505","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

当信噪比较低时，法庭说话人验证系统在噪声存在下会出现严重的性能下降。一种可能的解决方案是使用多运行独立分量分析(ICA)来减少噪声语音信号的噪声影响。以往的研究已将多路ICA应用于生物信号;然而，多路ICA在噪声说话人验证中的有效性尚未得到研究。在本文中，我们通过多次迭代快速ICA算法生成的不同混频矩阵中选择混频矩阵的最高信噪比(SIR)来增强带噪声的语音信号。我们使用从增强语音信号的离散小波变换(DWT)中提取的特征扭曲的频率倒谱系数(MFCCs)和特征扭曲的MFCC作为特征提取。本文采用最先进的单位向量(i-vector)概率线性判别分析(PLDA)作为分类器。实验结果表明，当测试语音信号分别被CAR、STREET和HOME噪声(信噪比为- 10 dB)破坏时，该方法的等错误率(EER)比基线噪声扬声器验证系统提高了66.68%、69.24%和70.78%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Speaker verification with multi-run ICA based speech enhancement

Forensic speaker verification systems show severe performance degradation in the presence of noise when the signal to noise ratio (SNR) is low. A possible solution to this problem is the use of multi-run independent component analysis (ICA) to reduce the effect of noise from the noisy speech signals. Previous works have used multi-run ICA in biosignal application; however, the effectiveness of multi-run ICA on noisy speaker verification has not been investigated yet. In this paper, we use multi-run ICA to enhance the noisy speech signals by choosing the highest signal to interference ratio (SIR) of the mixing matrix from different mixing matrices generated by iterating the fast ICA algorithm for several times. We use a combination of feature-warped mel frequency cepstral coefficients (MFCCs) and feature-warped MFCC extracted from the discrete wavelet transform (DWT) of the enhanced speech signals as the feature extraction. A state-of-the-art identity vector (i-vector) probabilistic linear discriminant analysis (PLDA) was used as a classifier in this paper. Experimental results demonstrate that the proposed method with multi-run ICA achieves high improvements in equal error rate (EER) of 66.68%, 69.24% and 70.78% over the baseline noisy speaker verification system, when the test speech signals are corrupted with CAR, STREET, and HOME noises respectively at −10 dB SNR.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 11th International Conference on Signal Processing and Communication Systems (ICSPCS)

自引率

0.00%

发文量