实时场景下COVID-19检测相关语音特征识别

Sougatam Das, Bishal Nahak, K. Nathwani
{"title":"实时场景下COVID-19检测相关语音特征识别","authors":"Sougatam Das, Bishal Nahak, K. Nathwani","doi":"10.1109/ICONAT53423.2022.9725840","DOIUrl":null,"url":null,"abstract":"The world's biggest pandemic, COVID-19, has shown its lethal impact on human life. The current diagnostics methods are reverse transcription-polymerase chain reaction (RT-PCR) and rapid diagnostic assays have several bottlenecks in terms of the nature of sample collection as it needs some laboratory experts and careful handling of the potentially infectious samples. However, one of the non-invasive ways of diagnostics is to focus on speech modality, which has been paid less attention, during the detection of COVID-19. Hence in this work, the speech features, particularly temporal and spectral features have been studied for COVID-19 detection. The temporal features used in this work are Short-Time Energy, Long-Term Log Energy Variation (LTLEV) Zero Crossing Count (ZCC) and Pitch etc. On the other hand, the spectral features used herein are Power Spectral Density, Average Power, Mel-Frequency Cepstral Coefficients, Group delay spectrum, Spectral Entropy etc. Such spectral and temporal speech features have not been analyzed in the identification of COVID-19 symptoms to the best of authors knowledge. Further, this paper has shown the impact of COVID-19 on a real time human voice, analyzed using speech processing techniques, and shown their efficacy in detecting COVID-19. These features are safe, comparatively faster, cost-effective, and require fewer complexities. Our article will motivate the scientific community to use such features for further research in the collective battle against COVID-19.","PeriodicalId":377501,"journal":{"name":"2022 International Conference for Advancement in Technology (ICONAT)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification of Speech Features relevant for COVID-19 Detection in Real Time Scenario\",\"authors\":\"Sougatam Das, Bishal Nahak, K. Nathwani\",\"doi\":\"10.1109/ICONAT53423.2022.9725840\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The world's biggest pandemic, COVID-19, has shown its lethal impact on human life. The current diagnostics methods are reverse transcription-polymerase chain reaction (RT-PCR) and rapid diagnostic assays have several bottlenecks in terms of the nature of sample collection as it needs some laboratory experts and careful handling of the potentially infectious samples. However, one of the non-invasive ways of diagnostics is to focus on speech modality, which has been paid less attention, during the detection of COVID-19. Hence in this work, the speech features, particularly temporal and spectral features have been studied for COVID-19 detection. The temporal features used in this work are Short-Time Energy, Long-Term Log Energy Variation (LTLEV) Zero Crossing Count (ZCC) and Pitch etc. On the other hand, the spectral features used herein are Power Spectral Density, Average Power, Mel-Frequency Cepstral Coefficients, Group delay spectrum, Spectral Entropy etc. Such spectral and temporal speech features have not been analyzed in the identification of COVID-19 symptoms to the best of authors knowledge. Further, this paper has shown the impact of COVID-19 on a real time human voice, analyzed using speech processing techniques, and shown their efficacy in detecting COVID-19. These features are safe, comparatively faster, cost-effective, and require fewer complexities. Our article will motivate the scientific community to use such features for further research in the collective battle against COVID-19.\",\"PeriodicalId\":377501,\"journal\":{\"name\":\"2022 International Conference for Advancement in Technology (ICONAT)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference for Advancement in Technology (ICONAT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICONAT53423.2022.9725840\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference for Advancement in Technology (ICONAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICONAT53423.2022.9725840","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

世界上最大的流行病COVID-19已经显示出对人类生活的致命影响。目前的诊断方法是逆转录聚合酶链反应(RT-PCR),快速诊断分析在样本收集的性质方面存在几个瓶颈,因为它需要一些实验室专家和对潜在传染性样本的谨慎处理。然而,在COVID-19的检测过程中,关注言语形态是一种非侵入性的诊断方法,而这一方法一直受到较少的关注。因此,在这项工作中,我们研究了语音特征,特别是时间和频谱特征,以用于COVID-19检测。在这项工作中使用的时间特征是短时能量、长期对数能量变化(LTLEV)、过零计数(ZCC)和俯仰等。另一方面,本文使用的频谱特征有功率谱密度、平均功率、Mel-Frequency倒谱系数、群延迟谱、谱熵等。据作者所知,在识别COVID-19症状时,还没有分析过这种频谱和时间语音特征。此外,本文还展示了COVID-19对实时人声的影响,并使用语音处理技术进行分析,并展示了其在COVID-19检测中的有效性。这些特性是安全的、相对较快的、经济有效的,并且需要较少的复杂性。我们的文章将激励科学界利用这些特征进行进一步的研究,共同抗击COVID-19。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identification of Speech Features relevant for COVID-19 Detection in Real Time Scenario
The world's biggest pandemic, COVID-19, has shown its lethal impact on human life. The current diagnostics methods are reverse transcription-polymerase chain reaction (RT-PCR) and rapid diagnostic assays have several bottlenecks in terms of the nature of sample collection as it needs some laboratory experts and careful handling of the potentially infectious samples. However, one of the non-invasive ways of diagnostics is to focus on speech modality, which has been paid less attention, during the detection of COVID-19. Hence in this work, the speech features, particularly temporal and spectral features have been studied for COVID-19 detection. The temporal features used in this work are Short-Time Energy, Long-Term Log Energy Variation (LTLEV) Zero Crossing Count (ZCC) and Pitch etc. On the other hand, the spectral features used herein are Power Spectral Density, Average Power, Mel-Frequency Cepstral Coefficients, Group delay spectrum, Spectral Entropy etc. Such spectral and temporal speech features have not been analyzed in the identification of COVID-19 symptoms to the best of authors knowledge. Further, this paper has shown the impact of COVID-19 on a real time human voice, analyzed using speech processing techniques, and shown their efficacy in detecting COVID-19. These features are safe, comparatively faster, cost-effective, and require fewer complexities. Our article will motivate the scientific community to use such features for further research in the collective battle against COVID-19.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信