CycleGuardian：一个基于改进的深度聚类和对比学习的呼吸声音自动分类框架

IF 4.6 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Complex & Intelligent Systems Pub Date : 2025-03-03 DOI:10.1007/s40747-025-01800-4

Yun Chu, Qiuhao Wang, Enze Zhou, Ling Fu, Qian Liu, Gang Zheng

{"title":"CycleGuardian：一个基于改进的深度聚类和对比学习的呼吸声音自动分类框架","authors":"Yun Chu, Qiuhao Wang, Enze Zhou, Ling Fu, Qian Liu, Gang Zheng","doi":"10.1007/s40747-025-01800-4","DOIUrl":null,"url":null,"abstract":"Auscultation plays a pivotal role in early respiratory and pulmonary disease diagnosis. Despite the emergence of deep learning-based methods for automatic respiratory sound classification post-Covid-19, limited datasets impede performance enhancement. Distinguishing between normal and abnormal respiratory sounds poses challenges due to the coexistence of normal respiratory components and noise components in both types. Moreover, different abnormal respiratory sounds exhibit similar anomalous features, hindering their differentiation. Besides, existing state-of-the-art models suffer from excessive parameter size, impeding deployment on resource-constrained mobile platforms. To address these issues, we design a lightweight network CycleGuardian and propose a framework based on an improved deep clustering and contrastive learning. We first generate a hybrid spectrogram for feature diversity and grouping spectrograms to facilitate intermittent abnormal sound capture. Then, CycleGuardian integrates a deep clustering module with a similarity-constrained clustering component to improve the ability to capture abnormal features and a contrastive learning module with group mixing for enhanced abnormal feature discernment. Multi-objective optimization enhances overall performance during training. In experiments, we use the ICBHI2017 dataset, following the official split method and without any pre-trained weights, our method achieves Sp: 82.06\\(\\%\\), Se: 44.47\\(\\%\\), and Score: 63.26\\(\\%\\) with a network model size of 38 M. Compared to the current model, our method leads by nearly 7\\(\\%\\), achieving the current best performances. Additionally, we deploy the network on Android devices, showcasing a comprehensive intelligent respiratory sound auscultation system.","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"52 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CycleGuardian: a framework for automatic respiratory sound classification based on improved deep clustering and contrastive learning\",\"authors\":\"Yun Chu, Qiuhao Wang, Enze Zhou, Ling Fu, Qian Liu, Gang Zheng\",\"doi\":\"10.1007/s40747-025-01800-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Auscultation plays a pivotal role in early respiratory and pulmonary disease diagnosis. Despite the emergence of deep learning-based methods for automatic respiratory sound classification post-Covid-19, limited datasets impede performance enhancement. Distinguishing between normal and abnormal respiratory sounds poses challenges due to the coexistence of normal respiratory components and noise components in both types. Moreover, different abnormal respiratory sounds exhibit similar anomalous features, hindering their differentiation. Besides, existing state-of-the-art models suffer from excessive parameter size, impeding deployment on resource-constrained mobile platforms. To address these issues, we design a lightweight network CycleGuardian and propose a framework based on an improved deep clustering and contrastive learning. We first generate a hybrid spectrogram for feature diversity and grouping spectrograms to facilitate intermittent abnormal sound capture. Then, CycleGuardian integrates a deep clustering module with a similarity-constrained clustering component to improve the ability to capture abnormal features and a contrastive learning module with group mixing for enhanced abnormal feature discernment. Multi-objective optimization enhances overall performance during training. In experiments, we use the ICBHI2017 dataset, following the official split method and without any pre-trained weights, our method achieves Sp: 82.06\\\\(\\\\%\\\\), Se: 44.47\\\\(\\\\%\\\\), and Score: 63.26\\\\(\\\\%\\\\) with a network model size of 38 M. Compared to the current model, our method leads by nearly 7\\\\(\\\\%\\\\), achieving the current best performances. Additionally, we deploy the network on Android devices, showcasing a comprehensive intelligent respiratory sound auscultation system.\",\"PeriodicalId\":10524,\"journal\":{\"name\":\"Complex & Intelligent Systems\",\"volume\":\"52 1\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Complex & Intelligent Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s40747-025-01800-4\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01800-4","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

听诊在呼吸和肺部疾病的早期诊断中起着关键作用。尽管出现了基于深度学习的呼吸声音自动分类方法，但有限的数据集阻碍了性能的提高。由于正常呼吸成分和噪声成分共存，正常呼吸声和异常呼吸声的区分面临挑战。此外，不同的异常呼吸音表现出相似的异常特征，阻碍了它们的分化。此外，现有最先进的模型存在参数大小过大的问题，阻碍了在资源受限的移动平台上的部署。为了解决这些问题，我们设计了一个轻量级网络CycleGuardian，并提出了一个基于改进的深度聚类和对比学习的框架。我们首先生成特征多样性和分组谱图的混合谱图，以方便间歇性异常声音捕获。然后，CycleGuardian集成了一个具有相似性约束的聚类组件的深度聚类模块，以提高异常特征的捕获能力，以及一个具有组混合的对比学习模块，以增强异常特征的识别能力。多目标优化提高了训练过程中的整体性能。在实验中，我们使用ICBHI2017数据集，遵循官方的分割方法，在没有任何预训练权值的情况下，我们的方法在网络模型大小为38 m的情况下实现了Sp: 82.06 \(\%\), Se: 44.47 \(\%\)和Score: 63.26 \(\%\)，与目前的模型相比，我们的方法领先了近7 \(\%\)，达到了目前的最佳性能。此外，我们将网络部署在Android设备上，展示了一个全面的智能呼吸听诊系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CycleGuardian: a framework for automatic respiratory sound classification based on improved deep clustering and contrastive learning

Auscultation plays a pivotal role in early respiratory and pulmonary disease diagnosis. Despite the emergence of deep learning-based methods for automatic respiratory sound classification post-Covid-19, limited datasets impede performance enhancement. Distinguishing between normal and abnormal respiratory sounds poses challenges due to the coexistence of normal respiratory components and noise components in both types. Moreover, different abnormal respiratory sounds exhibit similar anomalous features, hindering their differentiation. Besides, existing state-of-the-art models suffer from excessive parameter size, impeding deployment on resource-constrained mobile platforms. To address these issues, we design a lightweight network CycleGuardian and propose a framework based on an improved deep clustering and contrastive learning. We first generate a hybrid spectrogram for feature diversity and grouping spectrograms to facilitate intermittent abnormal sound capture. Then, CycleGuardian integrates a deep clustering module with a similarity-constrained clustering component to improve the ability to capture abnormal features and a contrastive learning module with group mixing for enhanced abnormal feature discernment. Multi-objective optimization enhances overall performance during training. In experiments, we use the ICBHI2017 dataset, following the official split method and without any pre-trained weights, our method achieves Sp: 82.06\(\%\), Se: 44.47\(\%\), and Score: 63.26\(\%\) with a network model size of 38 M. Compared to the current model, our method leads by nearly 7\(\%\), achieving the current best performances. Additionally, we deploy the network on Android devices, showcasing a comprehensive intelligent respiratory sound auscultation system.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Complex & Intelligent Systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

9.60

自引率

10.30%

发文量

297

期刊介绍： Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.