Relevant Feature Selection for Audio-Visual Speech Recognition

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI:10.1109/MMSP.2007.4412847

Thomas Drugman, Mihai Gurban, J. Thiran

引用次数: 26

Abstract

We present a feature selection method based on information theoretic measures, targeted at multimodal signal processing, showing how we can quantitatively assess the relevance of features from different modalities. We are able to find the features with the highest amount of information relevant for the recognition task, and at the same having minimal redundancy. Our application is audio-visual speech recognition, and in particular selecting relevant visual features. Experimental results show that our method outperforms other feature selection algorithms from the literature by improving recognition accuracy even with a significantly reduced number of features.

查看原文本刊更多论文

视听语音识别的相关特征选择

我们提出了一种基于信息理论的特征选择方法，针对多模态信号处理，展示了我们如何定量评估来自不同模态的特征的相关性。我们能够找到与识别任务相关的信息量最大的特征，同时具有最小的冗余。我们的应用是视听语音识别，特别是选择相关的视觉特征。实验结果表明，该方法在显著减少特征数量的情况下也能提高识别精度，优于文献中的其他特征选择算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2007 IEEE 9th Workshop on Multimedia Signal Processing

自引率

0.00%

发文量