Speaker Recognition on Mono-Channel Telephony Recordings

The Speaker and Language Recognition Workshop Pub Date : 2022-06-28 DOI:10.21437/odyssey.2022-27

Y. Solewicz, Noa Cohen, Johan Rohdin, S. Madikeri, Jan ”Honza” Čercnocký

引用次数: 0

Abstract

Conversations stored as mono data is a common problem in many real world speaker recognition applications. In this paper, we focus on investigative scenarios, where a number of mono telephone conversations are available for a speaker of interest. For example, a human operator may have verified that the speaker is present in these conversations. We propose several approaches for automatically creating enrollment models for the speaker of interest from such data. We then use the enrollment models to search for appearances of the speaker of interest in other calls. We analyze the performance of the different method on two dataset that matches our scenario, one is from a simulated case and one is from a real case. and real databases. We show that even simple methods not requiring tunable settings can perform well in these challenging and unpredicted scenarios. Nevertheless, bigger databases should be used to confirm these findings. The meth-198

查看原文本刊更多论文

单声道电话录音的说话人识别

在许多现实世界的说话人识别应用中，以单数据形式存储的对话是一个常见的问题。在本文中，我们专注于调查场景，其中许多单声道电话对话可用于感兴趣的说话者。例如，操作员可能已经验证了说话者是否存在于这些对话中。我们提出了几种方法来根据这些数据为感兴趣的说话者自动创建注册模型。然后，我们使用注册模型来搜索感兴趣的发言人在其他电话中的出现情况。我们在两个与我们的场景相匹配的数据集上分析了不同方法的性能，一个来自模拟案例，一个来自真实案例。和真正的数据库。我们表明，即使是不需要可调设置的简单方法，也可以在这些具有挑战性和不可预测的场景中表现良好。然而，应该使用更大的数据库来证实这些发现。冰毒- 198

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The Speaker and Language Recognition Workshop

自引率

0.00%

发文量