Usman Haider, M. Hanif, Hiroki Kobayashi, L. Parajuli, Daisuké Shimotoku, Ahmar Rashid, Sonia Safeer
{"title":"Bioacoustics Signal Classification Using Hybrid Feature Space with Machine Learning","authors":"Usman Haider, M. Hanif, Hiroki Kobayashi, L. Parajuli, Daisuké Shimotoku, Ahmar Rashid, Sonia Safeer","doi":"10.1109/ICCAE56788.2023.10111384","DOIUrl":null,"url":null,"abstract":"The vocal sounds emitted by animals and birds possess distinctive signatures. They are vital for acoustic monitoring to extract useful ecological data and track biodiversity. Recently, automated bioacoustics classification drew attention from the research community due to its diverse application. To that aim, we present a novel classification method for acoustic data by fusing optimally selected signal features. The proposed method extracts the distinctive statistical features, calculated using coefficients of Discrete Wavelet Transform (DWT), and fuses them with the descriptive features estimated using the Mel-Frequency Cepstral Coefficients (MFCC). The combined feature set is then passed to different machine learning classifiers, including Random Forest (RF), Support Vector Machine (SVM), and k-Nearest Neighbors (kNN) for sound classification of different animals and birds. The evaluation results show that the proposed method improves the classification accuracy and achieved high precision on all classifiers.","PeriodicalId":406112,"journal":{"name":"2023 15th International Conference on Computer and Automation Engineering (ICCAE)","volume":"343 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 15th International Conference on Computer and Automation Engineering (ICCAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAE56788.2023.10111384","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The vocal sounds emitted by animals and birds possess distinctive signatures. They are vital for acoustic monitoring to extract useful ecological data and track biodiversity. Recently, automated bioacoustics classification drew attention from the research community due to its diverse application. To that aim, we present a novel classification method for acoustic data by fusing optimally selected signal features. The proposed method extracts the distinctive statistical features, calculated using coefficients of Discrete Wavelet Transform (DWT), and fuses them with the descriptive features estimated using the Mel-Frequency Cepstral Coefficients (MFCC). The combined feature set is then passed to different machine learning classifiers, including Random Forest (RF), Support Vector Machine (SVM), and k-Nearest Neighbors (kNN) for sound classification of different animals and birds. The evaluation results show that the proposed method improves the classification accuracy and achieved high precision on all classifiers.