Co-channel speaker identification using usable speech extraction based on multi-pitch tracking

2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). Pub Date : 2003-04-06 DOI:10.1109/ICASSP.2003.1202330

Yang Shao, Deliang Wang

引用次数: 52

Abstract

Recently, usable speech criteria have been proposed to extract minimally corrupted speech for speaker identification (SID) in co-channel speech. In this paper, we propose a new usable speech extraction method to improve the SID performance under the co-channel situation based on the pitch information obtained from a robust multi-pitch tracking algorithm [2]. The idea is to retain the speech segments that have only one pitch detected and remove the others. The system is evaluated on co-channel speech and results show a significant improvement across various target to interferer ratios (TIR) for speaker identification.

查看原文本刊更多论文

基于多音高跟踪的可用语音提取的同信道说话人识别

近年来，人们提出了可用的语音标准来提取最小损坏语音，用于同信道语音的说话人识别(SID)。在本文中，我们提出了一种新的基于鲁棒多基音跟踪算法获得的基音信息的可用语音提取方法来提高同信道情况下的SID性能[2]。这个想法是保留只检测到一个音高的语音片段，并删除其他的。该系统在同信道语音上进行了评估，结果表明，在不同的目标干扰比(TIR)下，该系统在说话人识别方面有了显著的改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

自引率

0.00%

发文量