Sufyan O. Zaben , Wan Mohd Nazmee Wan Zainon , Ahmad H. Sabry
{"title":"Machine learning-based methods for detecting respiratory abnormalities using audio and visual analysis: A review","authors":"Sufyan O. Zaben , Wan Mohd Nazmee Wan Zainon , Ahmad H. Sabry","doi":"10.1016/j.rineng.2025.104744","DOIUrl":null,"url":null,"abstract":"<div><div>Respiratory abnormalities pose a significant health burden, often demanding timely and accurate diagnosis. Traditional methods have limitations, prompting exploration of novel approaches. This review explores the potential of machine learning, leveraging both audio and visual analysis, for detecting respiratory abnormalities. We explore various methods employed, analyzing audio features like MFCCs and spectral energy, and exploring diverse visual features like chest wall motion and depth maps. Different machine learning techniques, including CNNs and RNNs, are discussed, highlighting their applications in detecting specific conditions like asthma and COPD. We systematically evaluate their performance, analyzing strengths and limitations of both audio-based and visual-based approaches. Further, we explore the potential of multimodal analysis, combining audio and visual information for enhanced performance. Reviewing existing studies, we assess the advantages gained over unimodal methods. Finally, we explore the clinical implications and future directions, discussing potential applications like remote monitoring and early diagnosis, while acknowledging remaining challenges and ethical considerations. By comprehensively examining the landscape of machine learning for audio and visual analysis in respiratory abnormality detection, this review offers valuable insights for researchers and clinicians, ultimately accelerating the translation of these promising methods into real-world practice.</div></div>","PeriodicalId":36919,"journal":{"name":"Results in Engineering","volume":"26 ","pages":"Article 104744"},"PeriodicalIF":6.0000,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Results in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590123025008217","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Respiratory abnormalities pose a significant health burden, often demanding timely and accurate diagnosis. Traditional methods have limitations, prompting exploration of novel approaches. This review explores the potential of machine learning, leveraging both audio and visual analysis, for detecting respiratory abnormalities. We explore various methods employed, analyzing audio features like MFCCs and spectral energy, and exploring diverse visual features like chest wall motion and depth maps. Different machine learning techniques, including CNNs and RNNs, are discussed, highlighting their applications in detecting specific conditions like asthma and COPD. We systematically evaluate their performance, analyzing strengths and limitations of both audio-based and visual-based approaches. Further, we explore the potential of multimodal analysis, combining audio and visual information for enhanced performance. Reviewing existing studies, we assess the advantages gained over unimodal methods. Finally, we explore the clinical implications and future directions, discussing potential applications like remote monitoring and early diagnosis, while acknowledging remaining challenges and ethical considerations. By comprehensively examining the landscape of machine learning for audio and visual analysis in respiratory abnormality detection, this review offers valuable insights for researchers and clinicians, ultimately accelerating the translation of these promising methods into real-world practice.