Speaker Recognition on Mono-Channel Telephony Recordings

Y. Solewicz, Noa Cohen, Johan Rohdin, S. Madikeri, Jan "Honza" Černocký
The Speaker and Language Recognition Workshop, published 2022-06-28. DOI: 10.21437/odyssey.2022-27 (https://doi.org/10.21437/odyssey.2022-27)

Abstract

Conversations stored as mono recordings are a common problem in many real-world speaker recognition applications. In this paper, we focus on investigative scenarios in which a number of mono telephone conversations are available for a speaker of interest. For example, a human operator may have verified that the speaker is present in these conversations. We propose several approaches for automatically creating enrollment models for the speaker of interest from such data. We then use the enrollment models to search for appearances of the speaker of interest in other calls. We analyze the performance of the different methods on two datasets that match our scenario, one from a simulated case and one from a real case. We show that even simple methods that require no tunable settings can perform well in these challenging and unpredicted scenarios. Nevertheless, bigger databases should be used to confirm these findings. The meth-