{"title":"Environmental sound classification based on time-frequency representation","authors":"Khine Zar Thwe, Nu War","doi":"10.1109/SNPD.2017.8022729","DOIUrl":null,"url":null,"abstract":"This paper proposes a feature extraction method for environmental sound event classification based on time-frequency representation such as spectrogram. There are three portions to perform environmental classification. Firstly, the input signal is converted into spectrogram image with time-frequency representation using short time Fourier transforms. Secondly, this spectrogram is used to extract features with local binary pattern of three different radius and neighborhood sizes. The three distinct features resulted from local binary pattern based on spectrogram are concatenated and used as one feature vector. Finally, multi support vector machine is used for classification of environmental sound event. Evaluation is tested on ESC-10 dataset.","PeriodicalId":186094,"journal":{"name":"2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","volume":"252 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNPD.2017.8022729","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
This paper proposes a feature extraction method for environmental sound event classification based on time-frequency representation such as spectrogram. There are three portions to perform environmental classification. Firstly, the input signal is converted into spectrogram image with time-frequency representation using short time Fourier transforms. Secondly, this spectrogram is used to extract features with local binary pattern of three different radius and neighborhood sizes. The three distinct features resulted from local binary pattern based on spectrogram are concatenated and used as one feature vector. Finally, multi support vector machine is used for classification of environmental sound event. Evaluation is tested on ESC-10 dataset.