Alyaa Hamel Sfayyih , Nasri Sulaiman , Ahmad H. Sabry
{"title":"Non-invasive diagnosis of lung diseases via multimodal feature extraction from breathing audio and chest dynamics","authors":"Alyaa Hamel Sfayyih , Nasri Sulaiman , Ahmad H. Sabry","doi":"10.1016/j.compbiomed.2025.110182","DOIUrl":null,"url":null,"abstract":"<div><div>Early and accurate diagnosis of lung diseases is crucial for effective treatment. While traditional methods have limitations, audio analysis offers a promising non-invasive approach. However, existing studies often rely solely on acoustic features, neglecting valuable information contained in visual cues like chest wall dynamics. This research proposes a novel multimodal approach that integrates both audio and visual modalities to enhance lung disease detection. By extracting and fusing features from both modalities, we aim to capture a more comprehensive representation of lung health. The proposed deep learning model, trained on a dataset of audio and video recordings, achieved a validation accuracy of 92.02 %. The model effectively leverages features such as pitch, MFCCs, and breathing audio envelopes, along with visual cues from chest wall dynamics, to accurately classify different lung disease categories. This multimodal approach offers several advantages, including improved accuracy, robustness to noise and variability, and the potential for early disease detection. By addressing the limitations of single-modality approaches, this research contributes to the development of more effective and accessible lung disease diagnostic tools.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"191 ","pages":"Article 110182"},"PeriodicalIF":7.0000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525005335","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Early and accurate diagnosis of lung diseases is crucial for effective treatment. While traditional methods have limitations, audio analysis offers a promising non-invasive approach. However, existing studies often rely solely on acoustic features, neglecting valuable information contained in visual cues like chest wall dynamics. This research proposes a novel multimodal approach that integrates both audio and visual modalities to enhance lung disease detection. By extracting and fusing features from both modalities, we aim to capture a more comprehensive representation of lung health. The proposed deep learning model, trained on a dataset of audio and video recordings, achieved a validation accuracy of 92.02 %. The model effectively leverages features such as pitch, MFCCs, and breathing audio envelopes, along with visual cues from chest wall dynamics, to accurately classify different lung disease categories. This multimodal approach offers several advantages, including improved accuracy, robustness to noise and variability, and the potential for early disease detection. By addressing the limitations of single-modality approaches, this research contributes to the development of more effective and accessible lung disease diagnostic tools.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.