一种新的双说话人分割说话人识别系统中的说话人变化检测方法

M. Bazyar, R. Sudirman
{"title":"一种新的双说话人分割说话人识别系统中的说话人变化检测方法","authors":"M. Bazyar, R. Sudirman","doi":"10.1109/ISCAIE.2014.7010226","DOIUrl":null,"url":null,"abstract":"Speaker change detection is done in many speaker and speech identification applications that the speech is from two speakers. However, the standard metric-based methods performance is not suitable and stable owing to the amid window distance calculation stability. Therefore, a new method is proposed to improve the stability and enhance the performance of the system according to speakers' characteristics using between window correlations. Moreover, reference speaker models set that shows the space of the entire speaker model are trained in this approach. A metric is defined as the between window correlation of scores likelihood vectors versus the reference models. The Peak and Valley information and gender information are also used. In this paper, we look at telephone conversations where it is known a priori that there are two speakers, but the identity of the speakers is not known. Experiments over Farsdat Database show better performance In comparison with the GLR and the BIC approaches. This new approach has more effect rather than the GLR and the BIC approaches in the broader value of defined thresholds.","PeriodicalId":385258,"journal":{"name":"2014 IEEE Symposium on Computer Applications and Industrial Electronics (ISCAIE)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A new speaker change detection method in a speaker identification system for two-speakers segmentation\",\"authors\":\"M. Bazyar, R. Sudirman\",\"doi\":\"10.1109/ISCAIE.2014.7010226\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speaker change detection is done in many speaker and speech identification applications that the speech is from two speakers. However, the standard metric-based methods performance is not suitable and stable owing to the amid window distance calculation stability. Therefore, a new method is proposed to improve the stability and enhance the performance of the system according to speakers' characteristics using between window correlations. Moreover, reference speaker models set that shows the space of the entire speaker model are trained in this approach. A metric is defined as the between window correlation of scores likelihood vectors versus the reference models. The Peak and Valley information and gender information are also used. In this paper, we look at telephone conversations where it is known a priori that there are two speakers, but the identity of the speakers is not known. Experiments over Farsdat Database show better performance In comparison with the GLR and the BIC approaches. This new approach has more effect rather than the GLR and the BIC approaches in the broader value of defined thresholds.\",\"PeriodicalId\":385258,\"journal\":{\"name\":\"2014 IEEE Symposium on Computer Applications and Industrial Electronics (ISCAIE)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-04-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IEEE Symposium on Computer Applications and Industrial Electronics (ISCAIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCAIE.2014.7010226\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Symposium on Computer Applications and Industrial Electronics (ISCAIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCAIE.2014.7010226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

说话人变化检测在许多说话人和语音识别应用中都是这样做的,即语音来自两个说话人。然而,基于度量的标准方法由于窗间距离计算的不稳定性,导致性能不稳定。为此,提出了一种根据说话人的特点,利用窗间相关性提高系统稳定性和性能的新方法。此外,该方法还训练了显示整个说话人模型空间的参考说话人模型集。度量被定义为分数似然向量与参考模型之间的窗口相关性。还使用了峰谷信息和性别信息。在这篇论文中,我们研究了电话交谈,其中先验地知道有两个说话人,但说话人的身份是未知的。在Farsdat数据库上进行的实验表明,与GLR和BIC方法相比,该方法具有更好的性能。在定义阈值的更广范围内,这种新方法比GLR和BIC方法更有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A new speaker change detection method in a speaker identification system for two-speakers segmentation
Speaker change detection is done in many speaker and speech identification applications that the speech is from two speakers. However, the standard metric-based methods performance is not suitable and stable owing to the amid window distance calculation stability. Therefore, a new method is proposed to improve the stability and enhance the performance of the system according to speakers' characteristics using between window correlations. Moreover, reference speaker models set that shows the space of the entire speaker model are trained in this approach. A metric is defined as the between window correlation of scores likelihood vectors versus the reference models. The Peak and Valley information and gender information are also used. In this paper, we look at telephone conversations where it is known a priori that there are two speakers, but the identity of the speakers is not known. Experiments over Farsdat Database show better performance In comparison with the GLR and the BIC approaches. This new approach has more effect rather than the GLR and the BIC approaches in the broader value of defined thresholds.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信