{"title":"Cough Sound Based Deep Learning Models for Diagnosis of COVID-19 Using Statistical Features and Time-Frequency Spectrum.","authors":"Jina Kim, Jinseok Lee","doi":"10.1109/EMBC53108.2024.10781593","DOIUrl":null,"url":null,"abstract":"<p><p>This paper presents a deep learning model that can classify COVID-19 patients through cough sounds. The cough sound data were selected from the Cambridge data set which is a crowedsourced data set collected from the Cambridge COVID-19 sounds application. Virufy and Coswara data sets were also selected for external testing. For the sound waveform, we extracted Variable frequency complex demodulation (VFCDM) image and applied to Xception, which is selected as the pre-trained model. Then we extracted zero crossing rate (ZCR), and spectral roll-off (SR), spectral centroid (SC), spectral bandwidth (SB), and concatenated them to the output node of the model. Results were evaluated by using area under receiver operating curve. Cambridge data set: 0.9346, Virufy data set: 0.9244, Coswara data set: 0.8250.</p>","PeriodicalId":72237,"journal":{"name":"Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference","volume":"2024 ","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EMBC53108.2024.10781593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents a deep learning model that can classify COVID-19 patients through cough sounds. The cough sound data were selected from the Cambridge data set which is a crowedsourced data set collected from the Cambridge COVID-19 sounds application. Virufy and Coswara data sets were also selected for external testing. For the sound waveform, we extracted Variable frequency complex demodulation (VFCDM) image and applied to Xception, which is selected as the pre-trained model. Then we extracted zero crossing rate (ZCR), and spectral roll-off (SR), spectral centroid (SC), spectral bandwidth (SB), and concatenated them to the output node of the model. Results were evaluated by using area under receiver operating curve. Cambridge data set: 0.9346, Virufy data set: 0.9244, Coswara data set: 0.8250.