Chinnavat Jatturas, Sornsawan Chokkoedsakul, Pisitpong Devahasting Na Avudhva, Sukit Pankaew, Cherdkul Sopavanit, W. Asdornwised
{"title":"Feature-based and Deep Learning-based Classification of Environmental Sound","authors":"Chinnavat Jatturas, Sornsawan Chokkoedsakul, Pisitpong Devahasting Na Avudhva, Sukit Pankaew, Cherdkul Sopavanit, W. Asdornwised","doi":"10.1109/icce-asia46551.2019.8942209","DOIUrl":null,"url":null,"abstract":"In this paper, we perform comparison techniques for environmental sound classification with multilayer perceptron (MLP) and support vector machine (SVM), and deep learning using new machine learning platforms, i.e., Scikit-Iearn and Tensorflow, respectively. For feature-based classification, principal component analysis of short-time Fourier transform is used as our feature as the front end to MLP and SVM. For deep learning-based classification, convolution+pooling layers is acting as feature extractor from the input image, while fully connected layer will act as a classifier. Our experimental results show that our proposed deep neural network (DNN) models outperform the feature-based sound classification algorithms and the original deep learning work [1].","PeriodicalId":117814,"journal":{"name":"2019 IEEE International Conference on Consumer Electronics - Asia (ICCE-Asia)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Consumer Electronics - Asia (ICCE-Asia)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icce-asia46551.2019.8942209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper, we perform comparison techniques for environmental sound classification with multilayer perceptron (MLP) and support vector machine (SVM), and deep learning using new machine learning platforms, i.e., Scikit-Iearn and Tensorflow, respectively. For feature-based classification, principal component analysis of short-time Fourier transform is used as our feature as the front end to MLP and SVM. For deep learning-based classification, convolution+pooling layers is acting as feature extractor from the input image, while fully connected layer will act as a classifier. Our experimental results show that our proposed deep neural network (DNN) models outperform the feature-based sound classification algorithms and the original deep learning work [1].