Construction of bottle-body autoencoder and its application to audio signal classification

2016 International Conference on Audio, Language and Image Processing (ICALIP). Pub Date: 2016-07-01. DOI: 10.1109/ICALIP.2016.7846541

Jichen Yang, Qianhua He, Min Cai, Yanxiong Li, Hai Jin

Citations: 1

Abstract

To extract effective audio features with an autoencoder, this paper presents a bottle-body autoencoder. Unlike the traditional bottle-neck autoencoder, it is constructed from restricted Boltzmann machines with the same number of neurons at every layer. The bottle-body feature, obtained by using the pseudo-inverse method to initialize the weights, is applied to audio signal classification. Evaluated on the BBC Sound Effects Library, the proposed approach improves classification accuracy by 14.90% and 16.20% over traditional Mel-frequency cepstral coefficients and the bottle-neck feature, respectively.
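The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes linear layers for clarity, uses random weights as a stand-in for RBM pre-training, and assumes that "pseudo-inverse initialization" means setting each decoder weight matrix to the Moore-Penrose pseudo-inverse of the matching encoder matrix. The function names (`build_equal_width_autoencoder`, `encode`, `decode`) are hypothetical.

```python
import numpy as np


def build_equal_width_autoencoder(dim, n_layers, seed=0):
    """Return (encoder, decoder) weight lists where every layer has `dim` units."""
    rng = np.random.default_rng(seed)
    # Assumption: random weights stand in for RBM-pretrained encoder weights.
    encoder = [rng.normal(scale=0.1, size=(dim, dim)) for _ in range(n_layers)]
    # Assumption: each decoder matrix starts as the pseudo-inverse of the
    # corresponding encoder matrix, applied in reverse layer order.
    decoder = [np.linalg.pinv(w) for w in reversed(encoder)]
    return encoder, decoder


def encode(x, encoder):
    """Map inputs to the hidden representation (linear layers, for illustration)."""
    h = x
    for w in encoder:
        h = h @ w
    return h


def decode(h, decoder):
    """Map the hidden representation back to the input space."""
    x_hat = h
    for w in decoder:
        x_hat = x_hat @ w
    return x_hat


# Usage: 4 frames of an 8-dimensional feature vector (made-up sizes).
encoder, decoder = build_equal_width_autoencoder(dim=8, n_layers=2)
frames = np.random.default_rng(1).normal(size=(4, 8))
features = encode(frames, encoder)        # same width as the input
reconstruction = decode(features, decoder)
```

Because every layer has the same width, the hidden representation keeps the input dimensionality instead of compressing it as a bottle-neck layer would. In this linear sketch the pseudo-inverse decoder reconstructs the input up to numerical error; with nonlinear units it would only serve as a starting point for training.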



