{"title":"生成对抗网络增强数据改进心音异常检测","authors":"Shaunak Chakraborty , Prishita Kochhar , Shruti Patil , Ketan Kotecha , Shilpa Gite , Ganeshsree Selvachandran , Swagatam Das","doi":"10.1016/j.compbiomed.2025.110623","DOIUrl":null,"url":null,"abstract":"<div><div>The PhysioNet/Computing in Cardiology (CinC) Challenge 2016 dataset has driven significant advancements in automated heart sound analysis using machine learning (ML) and deep learning (DL). However, these efforts are constrained by the dataset's limited size and severe class imbalance, particularly the underrepresentation of coronary artery disease (CAD) cases. This study addresses these limitations by employing generative adversarial networks (GANs) to synthesize realistic CAD-like heart sound segments, augmenting existing datasets to improve classification performance. A Progressive Wasserstein GAN architecture was implemented to generate high-quality audio segments that accurately capture CAD heart sounds' spectral and temporal characteristics. The quality of synthetic audio was assessed using the Fréchet Audio Distance (FAD), achieving scores of 1.43 and 2.23 when compared to reference CAD and healthy samples, respectively. Novel post-processing steps, including bandpass filtering, further enhanced the fidelity of the synthetic samples. By augmenting the imbalanced heart sound dataset with these samples, we observed substantial improvements in the performance of five classification models. The GAN-augmented training set outperformed traditional augmentation and cost-sensitive learning methods, demonstrating superior sensitivity, specificity, and precision. This study highlights the potential of GAN-based data augmentation to address critical challenges in medical audio datasets. It offers a scalable and cost-effective solution for improving the generalizability and robustness of heart sound classification models, paving the way for enhanced diagnostic tools in biomedical signal processing.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"195 ","pages":"Article 110623"},"PeriodicalIF":6.3000,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generative adversarial network augmented data for improved heart sound abnormality detection\",\"authors\":\"Shaunak Chakraborty , Prishita Kochhar , Shruti Patil , Ketan Kotecha , Shilpa Gite , Ganeshsree Selvachandran , Swagatam Das\",\"doi\":\"10.1016/j.compbiomed.2025.110623\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The PhysioNet/Computing in Cardiology (CinC) Challenge 2016 dataset has driven significant advancements in automated heart sound analysis using machine learning (ML) and deep learning (DL). However, these efforts are constrained by the dataset's limited size and severe class imbalance, particularly the underrepresentation of coronary artery disease (CAD) cases. This study addresses these limitations by employing generative adversarial networks (GANs) to synthesize realistic CAD-like heart sound segments, augmenting existing datasets to improve classification performance. A Progressive Wasserstein GAN architecture was implemented to generate high-quality audio segments that accurately capture CAD heart sounds' spectral and temporal characteristics. The quality of synthetic audio was assessed using the Fréchet Audio Distance (FAD), achieving scores of 1.43 and 2.23 when compared to reference CAD and healthy samples, respectively. Novel post-processing steps, including bandpass filtering, further enhanced the fidelity of the synthetic samples. By augmenting the imbalanced heart sound dataset with these samples, we observed substantial improvements in the performance of five classification models. The GAN-augmented training set outperformed traditional augmentation and cost-sensitive learning methods, demonstrating superior sensitivity, specificity, and precision. This study highlights the potential of GAN-based data augmentation to address critical challenges in medical audio datasets. It offers a scalable and cost-effective solution for improving the generalizability and robustness of heart sound classification models, paving the way for enhanced diagnostic tools in biomedical signal processing.</div></div>\",\"PeriodicalId\":10578,\"journal\":{\"name\":\"Computers in biology and medicine\",\"volume\":\"195 \",\"pages\":\"Article 110623\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in biology and medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010482525009746\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525009746","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
PhysioNet/Computing in Cardiology (CinC) Challenge 2016数据集在使用机器学习(ML)和深度学习(DL)的自动心音分析方面取得了重大进展。然而,这些努力受到数据集规模有限和严重类别不平衡的限制,特别是冠状动脉疾病(CAD)病例的代表性不足。本研究通过使用生成对抗网络(gan)来合成逼真的类似cad的心音片段,增强现有数据集以提高分类性能,从而解决了这些限制。采用渐进式Wasserstein GAN架构生成高质量音频片段,准确捕捉CAD心音的频谱和时间特征。使用fr音频距离(FAD)评估合成音频的质量,与参考CAD和健康样本相比,合成音频的得分分别为1.43和2.23。新颖的后处理步骤,包括带通滤波,进一步提高了合成样品的保真度。通过使用这些样本增强不平衡心音数据集,我们观察到五种分类模型的性能有了实质性的改善。gan增强训练集优于传统的增强和成本敏感学习方法,表现出更高的灵敏度、特异性和精度。这项研究强调了基于gan的数据增强在解决医疗音频数据集中的关键挑战方面的潜力。它为提高心音分类模型的通用性和鲁棒性提供了一种可扩展且经济高效的解决方案,为增强生物医学信号处理中的诊断工具铺平了道路。
Generative adversarial network augmented data for improved heart sound abnormality detection
The PhysioNet/Computing in Cardiology (CinC) Challenge 2016 dataset has driven significant advancements in automated heart sound analysis using machine learning (ML) and deep learning (DL). However, these efforts are constrained by the dataset's limited size and severe class imbalance, particularly the underrepresentation of coronary artery disease (CAD) cases. This study addresses these limitations by employing generative adversarial networks (GANs) to synthesize realistic CAD-like heart sound segments, augmenting existing datasets to improve classification performance. A Progressive Wasserstein GAN architecture was implemented to generate high-quality audio segments that accurately capture CAD heart sounds' spectral and temporal characteristics. The quality of synthetic audio was assessed using the Fréchet Audio Distance (FAD), achieving scores of 1.43 and 2.23 when compared to reference CAD and healthy samples, respectively. Novel post-processing steps, including bandpass filtering, further enhanced the fidelity of the synthetic samples. By augmenting the imbalanced heart sound dataset with these samples, we observed substantial improvements in the performance of five classification models. The GAN-augmented training set outperformed traditional augmentation and cost-sensitive learning methods, demonstrating superior sensitivity, specificity, and precision. This study highlights the potential of GAN-based data augmentation to address critical challenges in medical audio datasets. It offers a scalable and cost-effective solution for improving the generalizability and robustness of heart sound classification models, paving the way for enhanced diagnostic tools in biomedical signal processing.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.