Text independent automatic speaker recognition using selforganizing maps

Conference Record of the 2004 IEEE Industry Applications Conference, 2004. 39th IAS Annual Meeting. Pub Date : 2004-11-01 DOI:10.1109/IAS.2004.1348670

A.T. Mafra, M. Simões

引用次数: 9

Abstract

This work presents one implementation of an automatic speaker recognition system, based on selforganizing map (SOM) neural networks. The voice of each speaker is modeled by a SOM, trained to specialize in the quantization of feature vectors (MFCCs) extracted from his voice. When a test sample is presented, it is quantized by all SpMs, that compete for the speaker: the SOM with smallest quantization error defines the speaker. The system was tested on a speaker identification task over a 14 speaker set, with phrases from three phonetically balanced sets and one variable answer set. The results comprovate the method's efficiency.

查看原文本刊更多论文

使用自组织地图的文本独立自动说话人识别

本文提出了一种基于自组织映射(SOM)神经网络的自动说话人识别系统。每个说话人的声音都由SOM建模，SOM专门训练从说话人的声音中提取的特征向量(mfccc)的量化。当给出一个测试样本时，它被所有争夺扬声器的spm量化:量化误差最小的SOM定义扬声器。该系统在一个14人的说话人识别任务中进行了测试，其中包括来自三个语音平衡集和一个变量答案集的短语。结果证明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Conference Record of the 2004 IEEE Industry Applications Conference, 2004. 39th IAS Annual Meeting.

自引率

0.00%

发文量