Carlos A. Galindo Meza, Juan A. del Hoyo Ontiveros, P. López-Meyer
{"title":"Classification of Respiration Sounds Using Deep Pre-trained Audio Embeddings","authors":"Carlos A. Galindo Meza, Juan A. del Hoyo Ontiveros, P. López-Meyer","doi":"10.1109/LA-CCI48322.2021.9769831","DOIUrl":null,"url":null,"abstract":"In this work we present the use of an end-to-end deep learning based pre-trained audio embeddings generator, and apply it to the purpose of classification of respiration sounds. With this approach, there is no need to pre-compute spectral representations, e.g. MFCC or filterbanks, since the classification model uses raw audio as the input. Transfer learning was used to train an audio classifier for sounds of respiratory cycles as defined in the ICBHI 2017 challenge. The results on this dataset show that this end-to-end model represents a viable alternative to more common spectral-based classifiers, while achieving state-of-the-art performance.","PeriodicalId":431041,"journal":{"name":"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LA-CCI48322.2021.9769831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this work we present the use of an end-to-end deep learning based pre-trained audio embeddings generator, and apply it to the purpose of classification of respiration sounds. With this approach, there is no need to pre-compute spectral representations, e.g. MFCC or filterbanks, since the classification model uses raw audio as the input. Transfer learning was used to train an audio classifier for sounds of respiratory cycles as defined in the ICBHI 2017 challenge. The results on this dataset show that this end-to-end model represents a viable alternative to more common spectral-based classifiers, while achieving state-of-the-art performance.