使用i-vector方法的余弦距离评分的说话人验证

Musab T. S. Al-Kaltakchi, R. Al-Nima, Mahmood Alfathe, Mohammed A. M. Abdullah
{"title":"使用i-vector方法的余弦距离评分的说话人验证","authors":"Musab T. S. Al-Kaltakchi, R. Al-Nima, Mahmood Alfathe, Mohammed A. M. Abdullah","doi":"10.1109/CSASE48920.2020.9142088","DOIUrl":null,"url":null,"abstract":"In this paper, a robust yet simple speaker verification system is implemented. The speaker verification system is investigated employing the i-vector approach with the Cosine Distance Scoring (CDS) for system classification. In addition, to measure the system performance, Equal Error Rate (EER), Detection Error Trade-off (DET) Curve, Receiver Operating Characteristic (ROC) curve as well as Detection Cost Function (DCF) were utilized. Experimental results are conducted on the TMIT database using 64 randomly selected speakers. The proposed system utilizes the Mel Frequency Cepstral Coefficients (MFCC) and Power Normalized Cepstral Coefficients (PNCC) for feature extraction. In addition, features normalization methods such as Feature Warping (FW) and Cepstral Mean-Variance Normalization (CMVN) are used in order to mitigate channel effect noise. The speakers are modeled with the i-vector while CDS is used for classification. Experimental results demonstrate that the proposed system achieved promising results while being computationally efficient.","PeriodicalId":254581,"journal":{"name":"2020 International Conference on Computer Science and Software Engineering (CSASE)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Speaker Verification Using Cosine Distance Scoring with i-vector Approach\",\"authors\":\"Musab T. S. Al-Kaltakchi, R. Al-Nima, Mahmood Alfathe, Mohammed A. M. Abdullah\",\"doi\":\"10.1109/CSASE48920.2020.9142088\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a robust yet simple speaker verification system is implemented. The speaker verification system is investigated employing the i-vector approach with the Cosine Distance Scoring (CDS) for system classification. In addition, to measure the system performance, Equal Error Rate (EER), Detection Error Trade-off (DET) Curve, Receiver Operating Characteristic (ROC) curve as well as Detection Cost Function (DCF) were utilized. Experimental results are conducted on the TMIT database using 64 randomly selected speakers. The proposed system utilizes the Mel Frequency Cepstral Coefficients (MFCC) and Power Normalized Cepstral Coefficients (PNCC) for feature extraction. In addition, features normalization methods such as Feature Warping (FW) and Cepstral Mean-Variance Normalization (CMVN) are used in order to mitigate channel effect noise. The speakers are modeled with the i-vector while CDS is used for classification. Experimental results demonstrate that the proposed system achieved promising results while being computationally efficient.\",\"PeriodicalId\":254581,\"journal\":{\"name\":\"2020 International Conference on Computer Science and Software Engineering (CSASE)\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Computer Science and Software Engineering (CSASE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSASE48920.2020.9142088\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Computer Science and Software Engineering (CSASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSASE48920.2020.9142088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

本文实现了一个鲁棒且简单的说话人验证系统。采用i向量方法和余弦距离评分(CDS)进行系统分类,研究了说话人验证系统。此外,利用等错误率(EER)、检测误差权衡(DET)曲线、受试者工作特征(ROC)曲线和检测成本函数(DCF)来衡量系统性能。实验结果是在TMIT数据库上随机选择64位说话人进行的。该系统利用Mel频率倒谱系数(MFCC)和功率归一化倒谱系数(PNCC)进行特征提取。此外,还采用特征扭曲(FW)和倒谱均值方差归一化(CMVN)等特征归一化方法来减轻信道效应噪声。扬声器使用i向量建模,而使用CDS进行分类。实验结果表明,该系统在计算效率高的同时取得了良好的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Speaker Verification Using Cosine Distance Scoring with i-vector Approach
In this paper, a robust yet simple speaker verification system is implemented. The speaker verification system is investigated employing the i-vector approach with the Cosine Distance Scoring (CDS) for system classification. In addition, to measure the system performance, Equal Error Rate (EER), Detection Error Trade-off (DET) Curve, Receiver Operating Characteristic (ROC) curve as well as Detection Cost Function (DCF) were utilized. Experimental results are conducted on the TMIT database using 64 randomly selected speakers. The proposed system utilizes the Mel Frequency Cepstral Coefficients (MFCC) and Power Normalized Cepstral Coefficients (PNCC) for feature extraction. In addition, features normalization methods such as Feature Warping (FW) and Cepstral Mean-Variance Normalization (CMVN) are used in order to mitigate channel effect noise. The speakers are modeled with the i-vector while CDS is used for classification. Experimental results demonstrate that the proposed system achieved promising results while being computationally efficient.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信