Chien-Yao Wang, Yu-Hao Chin, Tzu-Chiang Tai, D. Gunawan, Jia-Ching Wang
{"title":"Automatic recognition of audio event using dynamic local binary patterns","authors":"Chien-Yao Wang, Yu-Hao Chin, Tzu-Chiang Tai, D. Gunawan, Jia-Ching Wang","doi":"10.1109/ICCE-TW.2015.7216879","DOIUrl":null,"url":null,"abstract":"This work proposes an automatic recognition system for recognizing audio events. First, an audio signal is converted into a spectrogram by short time Fourier transform. The acoustic background noises in the spectrogram are reduced by box filtering. The contrast of the spectrogram is then enhanced by VAR operation. With the enhanced spectrogram, this work further proposes a novel dynamic local binary pattern (DLBP) feature based on human auditory system. Finally, the DLBP features are fed to multi-class support vector machines to achieve the audio event recognition. The experimental results on 16 classes of audio events demonstrate the performance of the proposed audio event recognition system.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Consumer Electronics - Taiwan","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-TW.2015.7216879","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This work proposes an automatic recognition system for recognizing audio events. First, an audio signal is converted into a spectrogram by short time Fourier transform. The acoustic background noises in the spectrogram are reduced by box filtering. The contrast of the spectrogram is then enhanced by VAR operation. With the enhanced spectrogram, this work further proposes a novel dynamic local binary pattern (DLBP) feature based on human auditory system. Finally, the DLBP features are fed to multi-class support vector machines to achieve the audio event recognition. The experimental results on 16 classes of audio events demonstrate the performance of the proposed audio event recognition system.