频谱图窗口比较：使用卷积神经网络识别咳嗽声

IJCCS Indonesian Journal of Computing and Cybernetics Systems Pub Date : 2022-07-31 DOI:10.22146/ijccs.75697

D. Fudholi, Muhammad Auzan, Novia Arum Sari

{"title":"频谱图窗口比较：使用卷积神经网络识别咳嗽声","authors":"D. Fudholi, Muhammad Auzan, Novia Arum Sari","doi":"10.22146/ijccs.75697","DOIUrl":null,"url":null,"abstract":" Cough is one of the most common symptoms of diseases, especially respiratory diseases. Quick cough detection can be the key to the current pandemic of COVID-19. Good cough recognition is the one that uses non-intrusive tools such as a mobile phone microphone that does not disable human activities like stick sensors. To do sound-only detection, Deep Learning current best method Convolutional Neural Network (CNN) is used. However, CNN needs image input while sound input differs (one dimension rather than two). An extra process is needed, converting sound data to image data using a spectrogram. When building a spectrogram, there is a question about the best size. This research will compare the spectrogram's size, called Spectrogram Window, by the performance. The result is that windows with 4 seconds have the highest F1-score performance at 92.9%. Therefore, a window of around 4 seconds will perform better for sound recognition problems.","PeriodicalId":31625,"journal":{"name":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Spectrogram Window Comparison: Cough Sound Recognition using Convolutional Neural Network\",\"authors\":\"D. Fudholi, Muhammad Auzan, Novia Arum Sari\",\"doi\":\"10.22146/ijccs.75697\",\"DOIUrl\":null,\"url\":null,\"abstract\":\" Cough is one of the most common symptoms of diseases, especially respiratory diseases. Quick cough detection can be the key to the current pandemic of COVID-19. Good cough recognition is the one that uses non-intrusive tools such as a mobile phone microphone that does not disable human activities like stick sensors. To do sound-only detection, Deep Learning current best method Convolutional Neural Network (CNN) is used. However, CNN needs image input while sound input differs (one dimension rather than two). An extra process is needed, converting sound data to image data using a spectrogram. When building a spectrogram, there is a question about the best size. This research will compare the spectrogram's size, called Spectrogram Window, by the performance. The result is that windows with 4 seconds have the highest F1-score performance at 92.9%. Therefore, a window of around 4 seconds will perform better for sound recognition problems.\",\"PeriodicalId\":31625,\"journal\":{\"name\":\"IJCCS Indonesian Journal of Computing and Cybernetics Systems\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IJCCS Indonesian Journal of Computing and Cybernetics Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.22146/ijccs.75697\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IJCCS Indonesian Journal of Computing and Cybernetics Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22146/ijccs.75697","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

咳嗽是疾病，尤其是呼吸道疾病最常见的症状之一。快速检测咳嗽可能是当前COVID-19大流行的关键。良好的咳嗽识别是使用非侵入性工具(如手机麦克风)，不会禁用手杖传感器等人类活动。为了进行声音检测，使用了深度学习当前最好的方法卷积神经网络(CNN)。但是CNN需要的是图像输入，而声音输入是不同的(一维而不是二维)。需要一个额外的过程，使用频谱图将声音数据转换为图像数据。在构建谱图时，有一个关于最佳尺寸的问题。本研究将通过性能比较谱图的大小，称为谱图窗口。结果是4秒的窗口具有最高的f1得分性能，为92.9%。因此，大约4秒的窗口将更好地解决声音识别问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Spectrogram Window Comparison: Cough Sound Recognition using Convolutional Neural Network

Cough is one of the most common symptoms of diseases, especially respiratory diseases. Quick cough detection can be the key to the current pandemic of COVID-19. Good cough recognition is the one that uses non-intrusive tools such as a mobile phone microphone that does not disable human activities like stick sensors. To do sound-only detection, Deep Learning current best method Convolutional Neural Network (CNN) is used. However, CNN needs image input while sound input differs (one dimension rather than two). An extra process is needed, converting sound data to image data using a spectrogram. When building a spectrogram, there is a question about the best size. This research will compare the spectrogram's size, called Spectrogram Window, by the performance. The result is that windows with 4 seconds have the highest F1-score performance at 92.9%. Therefore, a window of around 4 seconds will perform better for sound recognition problems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IJCCS Indonesian Journal of Computing and Cybernetics Systems

自引率

0.00%

发文量

审稿时长

12 weeks