有限数据条件下定量诊断的扩散增强对比学习框架

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics Pub Date : 2025-10-06 DOI:10.1016/j.aei.2025.103930

Kexin Yin , Chunjun Chen , Fengyu Ou , Boyuan Mu , Lu Yang , Yaowen Zhang

{"title":"有限数据条件下定量诊断的扩散增强对比学习框架","authors":"Kexin Yin , Chunjun Chen , Fengyu Ou , Boyuan Mu , Lu Yang , Yaowen Zhang","doi":"10.1016/j.aei.2025.103930","DOIUrl":null,"url":null,"abstract":"<div><div>Quantitative diagnosis of bearing faults is essential for ensuring the safe and reliable operation of mechanical transmission systems, especially under complex and variable operating conditions. However, in real-world scenarios, collecting sufficient labeled fault data remains a major challenge, hindering accurate fault classification and severity estimation. While diffusion models have shown strong potential in data generation, existing methods primarily use them to expand training sets without explicitly modeling fault semantics or guiding the learning process. To address these limitations, we propose a novel Diffusion-Augmented Contrastive Learning (DiCL) framework for quantitative bearing fault diagnosis under limited data conditions. First, a fault-controllable denoising diffusion probabilistic model (DDPM) is developed to generate class-conditional synthetic signals across various fault types and severity levels. These synthetic samples are further used to construct compound fault labels that reflect complex or multi-fault conditions. Second, a dual-branch contrastive learning strategy is adopted, where two input samples are jointly processed to form contrastive pairs. This mechanism enables the feature extraction network to learn fault-discriminative representations by reinforcing shared fault characteristics and suppressing irrelevant variations. Third, a cycle-consistency constraint is introduced via a composite loss function to enforce semantic alignment among samples of the same fault class. The proposed DiCL framework is evaluated on two bearing fault datasets: the Paderborn University bearing dataset and a laboratory-scale mechanical transmission system dataset. Experimental results demonstrate that DiCL achieves high-fidelity data generation and superior diagnostic performance. Notably, even with only 5% of the fault training set, DiCL attains over 80% classification accuracy on both benchmark datasets, significantly outperforming state-of-the-art baseline methods.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"69 ","pages":"Article 103930"},"PeriodicalIF":9.9000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Diffusion-augmented contrastive learning framework for quantitative diagnosis under limited data conditions\",\"authors\":\"Kexin Yin , Chunjun Chen , Fengyu Ou , Boyuan Mu , Lu Yang , Yaowen Zhang\",\"doi\":\"10.1016/j.aei.2025.103930\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Quantitative diagnosis of bearing faults is essential for ensuring the safe and reliable operation of mechanical transmission systems, especially under complex and variable operating conditions. However, in real-world scenarios, collecting sufficient labeled fault data remains a major challenge, hindering accurate fault classification and severity estimation. While diffusion models have shown strong potential in data generation, existing methods primarily use them to expand training sets without explicitly modeling fault semantics or guiding the learning process. To address these limitations, we propose a novel Diffusion-Augmented Contrastive Learning (DiCL) framework for quantitative bearing fault diagnosis under limited data conditions. First, a fault-controllable denoising diffusion probabilistic model (DDPM) is developed to generate class-conditional synthetic signals across various fault types and severity levels. These synthetic samples are further used to construct compound fault labels that reflect complex or multi-fault conditions. Second, a dual-branch contrastive learning strategy is adopted, where two input samples are jointly processed to form contrastive pairs. This mechanism enables the feature extraction network to learn fault-discriminative representations by reinforcing shared fault characteristics and suppressing irrelevant variations. Third, a cycle-consistency constraint is introduced via a composite loss function to enforce semantic alignment among samples of the same fault class. The proposed DiCL framework is evaluated on two bearing fault datasets: the Paderborn University bearing dataset and a laboratory-scale mechanical transmission system dataset. Experimental results demonstrate that DiCL achieves high-fidelity data generation and superior diagnostic performance. Notably, even with only 5% of the fault training set, DiCL attains over 80% classification accuracy on both benchmark datasets, significantly outperforming state-of-the-art baseline methods.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"69 \",\"pages\":\"Article 103930\"},\"PeriodicalIF\":9.9000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034625008237\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625008237","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

轴承故障的定量诊断是保证机械传动系统安全可靠运行的必要条件，特别是在复杂多变的工况下。然而，在现实场景中，收集足够的标记故障数据仍然是一个主要挑战，阻碍了准确的故障分类和严重程度估计。虽然扩散模型在数据生成中显示出强大的潜力，但现有的方法主要使用它们来扩展训练集，而没有明确地建模错误语义或指导学习过程。为了解决这些限制，我们提出了一种新的扩散增强对比学习（DiCL）框架，用于有限数据条件下的轴承故障定量诊断。首先，建立了一种故障可控去噪扩散概率模型（DDPM），用于生成不同故障类型和严重程度的类条件合成信号。这些合成样本进一步用于构建反映复杂或多故障条件的复合故障标签。其次，采用双分支对比学习策略，对两个输入样本进行联合处理，形成对比对。该机制使特征提取网络能够通过强化共享的故障特征和抑制不相关的变化来学习故障判别表示。第三，通过复合损失函数引入循环一致性约束，以强制同一故障类样本之间的语义对齐。提出的DiCL框架在两个轴承故障数据集上进行了评估：帕德博恩大学轴承数据集和实验室规模的机械传动系统数据集。实验结果表明，DiCL能够实现高保真的数据生成和优越的诊断性能。值得注意的是，即使只有5%的故障训练集，DiCL在两个基准数据集上的分类准确率都超过80%，显著优于最先进的基线方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Diffusion-augmented contrastive learning framework for quantitative diagnosis under limited data conditions

Quantitative diagnosis of bearing faults is essential for ensuring the safe and reliable operation of mechanical transmission systems, especially under complex and variable operating conditions. However, in real-world scenarios, collecting sufficient labeled fault data remains a major challenge, hindering accurate fault classification and severity estimation. While diffusion models have shown strong potential in data generation, existing methods primarily use them to expand training sets without explicitly modeling fault semantics or guiding the learning process. To address these limitations, we propose a novel Diffusion-Augmented Contrastive Learning (DiCL) framework for quantitative bearing fault diagnosis under limited data conditions. First, a fault-controllable denoising diffusion probabilistic model (DDPM) is developed to generate class-conditional synthetic signals across various fault types and severity levels. These synthetic samples are further used to construct compound fault labels that reflect complex or multi-fault conditions. Second, a dual-branch contrastive learning strategy is adopted, where two input samples are jointly processed to form contrastive pairs. This mechanism enables the feature extraction network to learn fault-discriminative representations by reinforcing shared fault characteristics and suppressing irrelevant variations. Third, a cycle-consistency constraint is introduced via a composite loss function to enforce semantic alignment among samples of the same fault class. The proposed DiCL framework is evaluated on two bearing fault datasets: the Paderborn University bearing dataset and a laboratory-scale mechanical transmission system dataset. Experimental results demonstrate that DiCL achieves high-fidelity data generation and superior diagnostic performance. Notably, even with only 5% of the fault training set, DiCL attains over 80% classification accuracy on both benchmark datasets, significantly outperforming state-of-the-art baseline methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Advanced Engineering Informatics 工程技术-工程：综合

CiteScore

12.40

自引率

18.20%

发文量

292

审稿时长

45 days

期刊介绍： Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.