Multiple Channels Model Based on Mel Spectrogram for Classifying Abnormalities in Lung Sound

IF 0.5 Q4 ENGINEERING, BIOMEDICAL
Pham Thi Viet Huong, Le Duc Thinh, Phung Van Kien, Tran Anh Vu
Journal: Journal of Biomimetics, Biomaterials and Biomedical Engineering
Volume: 40 16
Published: 2023-11-03
DOI: 10.4028/p-21pucq (https://doi.org/10.4028/p-21pucq)
Citations: 0

Abstract

Lung sound analysis plays an important role in the assessment and diagnosis of respiratory conditions and diseases. It provides valuable information about the functioning of the respiratory system, including the airways, lungs, and associated structures. By analyzing the characteristics of lung sounds, healthcare professionals can gain insight into the presence of abnormalities such as airway obstructions, lung diseases, and respiratory infections. In this paper, a multiple-channel model for processing and classifying abnormalities in lung sound is proposed, which utilizes the characteristics of the Mel spectrogram and Empirical Mode Decomposition (EMD). Unlike previous research, which converts the lung sound directly into a scalogram or spectrogram, this paper focuses on the pre-processing of the original audio signal. The pre-processing step includes denoising, resampling, padding, and augmentation, which improve the quality of the input signal. Finally, the multiple-channel input is fed into the VGG16 deep learning model to classify the abnormalities in lung sound as wheezes, crackles, or both. The model is trained and tested on the benchmark ICBHI dataset. The proposed model shows better performance than state-of-the-art methods.
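
The abstract does not specify how the channels are constructed, so the following is only a minimal sketch of the general idea: pad each signal to a fixed length, convert it to a log-Mel spectrogram, and stack the results into a multi-channel image that a CNN such as VGG16 could consume. All function names, the 4 kHz sampling rate, the frame and filter sizes, and the choice of one spectrogram per component signal are assumptions made for illustration. The EMD, denoising, and augmentation steps are not implemented here; the synthetic sinusoids merely stand in for component signals such as those an EMD routine might return.

```python
import numpy as np

def pad_or_truncate(signal, target_len):
    """Zero-pad (or cut) a 1-D signal so every clip has the same length."""
    signal = np.asarray(signal, dtype=np.float64)
    if len(signal) >= target_len:
        return signal[:target_len]
    return np.pad(signal, (0, target_len - len(signal)), mode="constant")

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters spaced evenly on the Mel scale.

    Returns an array of shape (n_mels, n_fft // 2 + 1).
    """
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    fb = np.zeros((n_mels, len(fft_freqs)))
    for i in range(n_mels):
        lo, mid, hi = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(signal, sr, n_fft=256, hop=128, n_mels=32):
    """Log-power Mel spectrogram in dB, shape (n_mels, n_frames)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2    # (n_frames, n_fft//2 + 1)
    mel = mel_filterbank(sr, n_fft, n_mels) @ power.T   # (n_mels, n_frames)
    return 10.0 * np.log10(mel + 1e-10)                 # small floor avoids log(0)

def stack_channels(signals, sr, target_len):
    """One log-Mel image per signal, stacked as (channels, n_mels, n_frames)."""
    return np.stack(
        [log_mel_spectrogram(pad_or_truncate(s, target_len), sr) for s in signals]
    )

# Usage with synthetic data (assumed values, not taken from the paper).
sr = 4000                                  # assumed sampling rate in Hz
t = np.arange(2 * sr) / sr                 # a 2-second clip
low = np.sin(2 * np.pi * 200 * t)          # stand-in for one component signal
high = 0.5 * np.sin(2 * np.pi * 600 * t)   # stand-in for another component
raw = low + high
image = stack_channels([raw, low, high], sr, target_len=3 * sr)
print(image.shape)
```

Stacking along a leading channel axis yields a three-channel array, which is the input depth that image networks such as VGG16 conventionally expect. In practice the spatial dimensions would still need to be resized to whatever input size the chosen network requires.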
Source journal: CiteScore 1.40; self-citation rate 14.30%; articles published 73