Audio-Visual Speech Classification based on Absent Class Detection

2020 28th European Signal Processing Conference (EUSIPCO) Pub Date : 2021-01-24 DOI:10.23919/Eusipco47968.2020.9287615

G. D. Sad, J. Gómez

引用次数: 1

Abstract

In the present paper, a novel method for Audio-Visual Speech Recognition is introduced, aiming to minimize the intra-class errors. Based on a novel training procedure, the Complementary Models are introduced. These models aim to detect the absence of a class, in contrast to traditional models that aim to detect the presence of a class. In the proposed method, traditional models are employed in the first stage of a cascade scheme, and then the proposed complementary models are used to make the final decision on the recognition results. Experimental results in all the scenarios evaluated (different inputs modalities, three databases, four classifiers, and acoustic noisy conditions), show that a good performance is achieved with the proposed scheme. Also, better results than other reported methods in the literature over two public databases are achieved.

查看原文本刊更多论文

基于缺席类检测的视听语音分类

本文提出了一种新的视听语音识别方法，旨在最大限度地减少类内误差。基于一种新的训练过程，引入了互补模型。这些模型的目标是检测类的缺失，而传统模型的目标是检测类的存在。在该方法中，首先在级联方案的第一阶段使用传统模型，然后使用所提出的补充模型对识别结果进行最终决策。在不同输入方式、三种数据库、四种分类器和噪声条件下的实验结果表明，该方法具有良好的性能。此外，在两个公共数据库上取得了比其他文献报道的方法更好的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 28th European Signal Processing Conference (EUSIPCO)

自引率

0.00%

发文量