Evaluation of Deep Learning Methods for Pulmonary Disease Classification.

IF 1.1 4区医学 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Current Medical Imaging Reviews Pub Date : 2025-07-18 DOI:10.2174/0115734056388107250710120917

Ajay Pal Singh, Ankita Nigam, Gaurav Garg

{"title":"Evaluation of Deep Learning Methods for Pulmonary Disease Classification.","authors":"Ajay Pal Singh, Ankita Nigam, Gaurav Garg","doi":"10.2174/0115734056388107250710120917","DOIUrl":null,"url":null,"abstract":"Introduction: Driven by environmental pollution and the rise in infectious diseases, the increasing prevalence of lung conditions demands advancements in diagnostic techniques.Materials and methods: This study explores the use of various features, such as spectrograms, chromograms, and Mel Frequency Cepstral Coefficients (MFCC), to extract crucial information from auscultation recordings. It addresses challenges through filter-based audio enhancement methods. The primary goal is to improve disease detection accuracy by leveraging convolutional neural networks (CNNs) for feature extraction and dense neural networks for classification.Results: While deep learning models like CNNs and Recurrent Neural Network (RNN) outperform traditional machine learning models such as Sequence Vector Machine, K-Nearest Neighbours (KNN) and random forest with accuracies ranging from 70% to 85%. The combination of CNN, RNN, and long short-term memory achieved an accuracy of 88%. By integrating MFCC, Chroma Short-Term Fourier Transform (STFT), and spectrogram features with a CNN-based classifier, the proposed multi-feature deep learning model achieved the highest accuracy of 92%, surpassing all other methods.Discussion: The study effectively addresses key issues, including the overrepresentation of Chronic Obstructive Pulmonary Disease (COPD) samples over Lower Respiratory Tract Infections (LRTI) and Upper Respiratory Tract Infections (URTI) which hampers generalization across test audio samples.Conclusion: The proposed methodology caters common challenges like background noise in recordings, and the limited and imbalanced nature of datasets. These findings pave the way for enhanced clinical applications, showcasing the transformative potential of multi-feature deep learning methods in the classification of pulmonary diseases.","PeriodicalId":54215,"journal":{"name":"Current Medical Imaging Reviews","volume":" ","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Medical Imaging Reviews","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2174/0115734056388107250710120917","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: Driven by environmental pollution and the rise in infectious diseases, the increasing prevalence of lung conditions demands advancements in diagnostic techniques.

Materials and methods: This study explores the use of various features, such as spectrograms, chromograms, and Mel Frequency Cepstral Coefficients (MFCC), to extract crucial information from auscultation recordings. It addresses challenges through filter-based audio enhancement methods. The primary goal is to improve disease detection accuracy by leveraging convolutional neural networks (CNNs) for feature extraction and dense neural networks for classification.

Results: While deep learning models like CNNs and Recurrent Neural Network (RNN) outperform traditional machine learning models such as Sequence Vector Machine, K-Nearest Neighbours (KNN) and random forest with accuracies ranging from 70% to 85%. The combination of CNN, RNN, and long short-term memory achieved an accuracy of 88%. By integrating MFCC, Chroma Short-Term Fourier Transform (STFT), and spectrogram features with a CNN-based classifier, the proposed multi-feature deep learning model achieved the highest accuracy of 92%, surpassing all other methods.

Discussion: The study effectively addresses key issues, including the overrepresentation of Chronic Obstructive Pulmonary Disease (COPD) samples over Lower Respiratory Tract Infections (LRTI) and Upper Respiratory Tract Infections (URTI) which hampers generalization across test audio samples.

Conclusion: The proposed methodology caters common challenges like background noise in recordings, and the limited and imbalanced nature of datasets. These findings pave the way for enhanced clinical applications, showcasing the transformative potential of multi-feature deep learning methods in the classification of pulmonary diseases.

查看原文本刊更多论文

肺部疾病分类的深度学习方法评价。

导言：由于环境污染和传染病的增加，肺部疾病的日益流行要求诊断技术的进步。材料和方法：本研究探讨了各种特征的使用，如频谱图、色谱图和Mel频率倒谱系数（MFCC），从听诊记录中提取关键信息。它通过基于滤波器的音频增强方法来解决挑战。主要目标是通过利用卷积神经网络（cnn）进行特征提取和密集神经网络进行分类来提高疾病检测的准确性。结果：深度学习模型如cnn和递归神经网络（RNN）优于传统的机器学习模型，如序列向量机、k近邻（KNN）和随机森林，准确率在70%到85%之间。CNN、RNN和长短期记忆的组合达到了88%的准确率。通过将MFCC、色度短时傅里叶变换（STFT）和频谱图特征与基于cnn的分类器相结合，所提出的多特征深度学习模型达到了92%的最高准确率，超过了所有其他方法。讨论：该研究有效地解决了关键问题，包括慢性阻塞性肺疾病（COPD）样本超过下呼吸道感染（LRTI）和上呼吸道感染（URTI），这阻碍了测试音频样本的泛化。结论：所提出的方法满足了常见的挑战，如录音中的背景噪声，以及数据集的有限性和不平衡性。这些发现为增强临床应用铺平了道路，展示了多特征深度学习方法在肺部疾病分类中的变革潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Current Medical Imaging Reviews 医学-核医学

CiteScore

2.60

自引率

0.00%

发文量

246

审稿时长

1 months

期刊介绍： Current Medical Imaging Reviews publishes frontier review articles, original research articles, drug clinical trial studies and guest edited thematic issues on all the latest advances on medical imaging dedicated to clinical research. All relevant areas are covered by the journal, including advances in the diagnosis, instrumentation and therapeutic applications related to all modern medical imaging techniques. The journal is essential reading for all clinicians and researchers involved in medical imaging and diagnosis.