COVID-19 Diagnosis from Cough Acoustics using ConvNets and Data Augmentation

Saranga Kingkor Mahanta, Darsh Kaushik, Shubham Jain, Hoang Van Truong, K. Guha
{"title":"COVID-19 Diagnosis from Cough Acoustics using ConvNets and Data Augmentation","authors":"Saranga Kingkor Mahanta, Darsh Kaushik, Shubham Jain, Hoang Van Truong, K. Guha","doi":"10.1109/icacfct53978.2021.9837350","DOIUrl":null,"url":null,"abstract":"With the periodic rise and fall of COVID-19 and countries being inflicted by its waves, an efficient, economic, and effortless diagnosis procedure for the virus has been the utmost need of the hour. Amongst the infected subjects, the asymptomatic ones need not be entirely free of symptoms caused by the virus. They might not show any observable symptoms like the symptomatic subjects, but they may differ from uninfected ones in the way they cough. These differences in the coughing sounds are minute and indiscernible to the human ear, however, these can be captured using machine learning models. In this paper, we present a deep learning approach to analyze the acoustic dataset provided in Track 1 of the DiCOVA 2021 Challenge containing cough sound recordings belonging to both COVID-19 positive and negative examples. To perform the classification we propose a ConvNet model. It achieved an AUC score percentage of 72.23 on a blind test set provided in the challenge for an unbiased evaluation of the models. Moreover, the ConvNet model incorporated with Data Augmentation further increased the AUC score percentage from 72.23 to 87.07. It also outperformed the DiCOVA 2021 Challenge’s baseline model by 23% thus, claiming the top position on the DiCOVA 2021 Challenge leaderboard. This paper proposes the use of Mel Frequency Cepstral Coefficients as the input features to the proposed model.","PeriodicalId":312952,"journal":{"name":"2021 First International Conference on Advances in Computing and Future Communication Technologies (ICACFCT)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 First International Conference on Advances in Computing and Future Communication Technologies (ICACFCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icacfct53978.2021.9837350","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

With the periodic rise and fall of COVID-19 and countries being inflicted by its waves, an efficient, economic, and effortless diagnosis procedure for the virus has been the utmost need of the hour. Amongst the infected subjects, the asymptomatic ones need not be entirely free of symptoms caused by the virus. They might not show any observable symptoms like the symptomatic subjects, but they may differ from uninfected ones in the way they cough. These differences in the coughing sounds are minute and indiscernible to the human ear, however, these can be captured using machine learning models. In this paper, we present a deep learning approach to analyze the acoustic dataset provided in Track 1 of the DiCOVA 2021 Challenge containing cough sound recordings belonging to both COVID-19 positive and negative examples. To perform the classification we propose a ConvNet model. It achieved an AUC score percentage of 72.23 on a blind test set provided in the challenge for an unbiased evaluation of the models. Moreover, the ConvNet model incorporated with Data Augmentation further increased the AUC score percentage from 72.23 to 87.07. It also outperformed the DiCOVA 2021 Challenge’s baseline model by 23% thus, claiming the top position on the DiCOVA 2021 Challenge leaderboard. This paper proposes the use of Mel Frequency Cepstral Coefficients as the input features to the proposed model.
基于卷积神经网络和数据增强的咳嗽声学诊断COVID-19
随着COVID-19的周期性起伏和各国受到其浪潮的影响,一种高效,经济,轻松的病毒诊断程序已成为当务之急。在感染者中,无症状者不一定完全没有病毒引起的症状。他们可能不会像有症状的人那样表现出任何可观察到的症状,但他们咳嗽的方式可能与未感染的人不同。这些咳嗽声音的差异很小,人耳无法察觉,然而,这些差异可以用机器学习模型捕捉到。在本文中,我们提出了一种深度学习方法来分析DiCOVA 2021挑战赛Track 1中提供的声学数据集,其中包含属于COVID-19阳性和阴性示例的咳嗽录音。为了进行分类,我们提出了一个卷积神经网络模型。在挑战中提供的盲测试集上,它的AUC得分百分比为72.23,用于对模型进行无偏评估。此外,结合Data Augmentation的ConvNet模型进一步将AUC得分百分比从72.23提高到87.07。它的表现也比DiCOVA 2021挑战赛的基准模型高出23%,因此在DiCOVA 2021挑战赛排行榜上排名第一。本文提出使用Mel频率倒谱系数作为该模型的输入特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信