Enhanced speaker recognition based on score level fusion of AHS and HMM

T. Islam, S. Mangayyagari, R. Sankar
{"title":"Enhanced speaker recognition based on score level fusion of AHS and HMM","authors":"T. Islam, S. Mangayyagari, R. Sankar","doi":"10.1109/SECON.2007.342843","DOIUrl":null,"url":null,"abstract":"Speaker recognition history dates back to some four decades, and yet it has not been reliable enough to be considered as a standalone security system. This paper focuses on the enhancement of speaker recognition through fusion of likelihood scores generated by arithmetic harmonic sphericity (AHS) and hidden Markov model (HMM) techniques. Due to the contrastive nature of AHS and HMM, we have observed a significant performance improvement of 22% and 6% true acceptance rate at 5% false acceptance rate, when this fusion technique was evaluated on two different datasets - YOHO and USF multimodal biometric dataset, respectively. Performance enhancement has been achieved on both the datasets, however performance on YOHO was comparatively higher than that on USF dataset, owing to the fact that USF dataset is a noisy outdoor dataset whereas YOHO is an indoor dataset.","PeriodicalId":423683,"journal":{"name":"Proceedings 2007 IEEE SoutheastCon","volume":"215 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 2007 IEEE SoutheastCon","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SECON.2007.342843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Speaker recognition history dates back to some four decades, and yet it has not been reliable enough to be considered as a standalone security system. This paper focuses on the enhancement of speaker recognition through fusion of likelihood scores generated by arithmetic harmonic sphericity (AHS) and hidden Markov model (HMM) techniques. Due to the contrastive nature of AHS and HMM, we have observed a significant performance improvement of 22% and 6% true acceptance rate at 5% false acceptance rate, when this fusion technique was evaluated on two different datasets - YOHO and USF multimodal biometric dataset, respectively. Performance enhancement has been achieved on both the datasets, however performance on YOHO was comparatively higher than that on USF dataset, owing to the fact that USF dataset is a noisy outdoor dataset whereas YOHO is an indoor dataset.
基于AHS和HMM评分融合的增强说话人识别
说话人识别的历史可以追溯到大约40年前,但它还不够可靠,不能被视为一个独立的安全系统。本文研究了将算术调和球度(AHS)和隐马尔可夫模型(HMM)生成的似然分数融合在一起,增强说话人识别。由于AHS和HMM的对比性质,我们观察到,当这种融合技术分别在YOHO和USF多模态生物特征数据集上进行评估时,在5%的错误接受率下,其性能显著提高了22%和6%的真实接受率。两种数据集的性能都得到了提高,但由于USF数据集是一个有噪声的室外数据集,而YOHO数据集是一个室内数据集,因此YOHO数据集的性能相对高于USF数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信