Preliminary experiments on the robustness of biologically motivated features for DNN-based ASR

2015 4th International Work Conference on Bioinspired Intelligence (IWOBI) Pub Date : 2015-06-10 DOI:10.1109/IWOBI.2015.7160162

F. de-la-Calle-Silos, F. J. Valverde-Albacete, A. Gallardo-Antolín, Carmen Peláez-Moreno

引用次数: 0

Abstract

A perceptually motivated feature extraction method based on mimicking the masking properties of the cochlea has been recently found to provide enhanced performance when applied to conventional speech recognition back-ends. On the other hand, the introduction of Deep Neural Network (DNN) based acoustic models has produced dramatic improvements in performance. In particular, we found that Deep Maxout Networks, a modification of DNNs' feed-forward architecture that uses a max-out activation function, provides enhanced robustness to environmental noise. In this paper, we present preliminary experiments on the combination of these two elements that already show how the DMN-based back-end is capable of taking advantage of these auditorily inspired features making the whole system more robust and also suggesting that human-like representations of speech keep playing an important role in DNN-based automatic speech recognition systems.

查看原文本刊更多论文

生物动机特征在基于dnn的ASR中的鲁棒性初步实验

一种基于模仿耳蜗掩蔽特性的感知动机特征提取方法在应用于传统语音识别后端时提供了更好的性能。另一方面，基于深度神经网络(DNN)声学模型的引入在性能上产生了巨大的改进。特别是，我们发现深度最大输出网络(Deep Maxout Networks)是对dnn前馈架构的一种改进，它使用了最大输出激活函数，增强了对环境噪声的鲁棒性。在本文中，我们提出了关于这两个元素组合的初步实验，这些实验已经表明基于dnn的后端如何能够利用这些听觉启发的特征使整个系统更加鲁棒，并且还表明类人语音表示在基于dnn的自动语音识别系统中继续发挥重要作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 4th International Work Conference on Bioinspired Intelligence (IWOBI)

自引率

0.00%

发文量