A privacy-preserving and language-independent speaking detecting and speaker diarization approach for spontaneous conversation using microphones

2012 IEEE 11th International Conference on Signal Processing Pub Date : 2012-10-01 DOI:10.1109/ICOSP.2012.6491534

Ni Zhang, Y. Yaginuma

引用次数: 2

Abstract

Conversation conveys important social signals of human interaction that indicates interest, service-awareness, persuasiveness, etc. In this paper, the authors employ the most common setting of using microphones to capture spontaneous conversation, and introduce a privacy-preserving and language-independent speech processing approach that can detect speaking and separate speakers in high accuracy for such setting. Experimental results have validated that the approach can deliver accurate speaking recognition results in Japanese, English and Chinese conversation, and can be processed in real time applications.

查看原文本刊更多论文

一种使用麦克风进行自发对话的隐私保护和语言独立的说话检测和说话人拨号方法

谈话传达了人类互动的重要社会信号，表明兴趣、服务意识、说服力等。在本文中，作者采用了最常见的使用麦克风来捕捉自发对话的设置，并引入了一种隐私保护和语言无关的语音处理方法，可以在这种设置中高精度地检测说话和分离说话者。实验结果表明，该方法可以在日语、英语和汉语会话中提供准确的语音识别结果，并可用于实时应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 IEEE 11th International Conference on Signal Processing

自引率

0.00%

发文量