Kexin Yin , Chunjun Chen , Fengyu Ou , Boyuan Mu , Lu Yang , Yaowen Zhang
{"title":"有限数据条件下定量诊断的扩散增强对比学习框架","authors":"Kexin Yin , Chunjun Chen , Fengyu Ou , Boyuan Mu , Lu Yang , Yaowen Zhang","doi":"10.1016/j.aei.2025.103930","DOIUrl":null,"url":null,"abstract":"<div><div>Quantitative diagnosis of bearing faults is essential for ensuring the safe and reliable operation of mechanical transmission systems, especially under complex and variable operating conditions. However, in real-world scenarios, collecting sufficient labeled fault data remains a major challenge, hindering accurate fault classification and severity estimation. While diffusion models have shown strong potential in data generation, existing methods primarily use them to expand training sets without explicitly modeling fault semantics or guiding the learning process. To address these limitations, we propose a novel Diffusion-Augmented Contrastive Learning (DiCL) framework for quantitative bearing fault diagnosis under limited data conditions. First, a fault-controllable denoising diffusion probabilistic model (DDPM) is developed to generate class-conditional synthetic signals across various fault types and severity levels. These synthetic samples are further used to construct compound fault labels that reflect complex or multi-fault conditions. Second, a dual-branch contrastive learning strategy is adopted, where two input samples are jointly processed to form contrastive pairs. This mechanism enables the feature extraction network to learn fault-discriminative representations by reinforcing shared fault characteristics and suppressing irrelevant variations. Third, a cycle-consistency constraint is introduced via a composite loss function to enforce semantic alignment among samples of the same fault class. The proposed DiCL framework is evaluated on two bearing fault datasets: the Paderborn University bearing dataset and a laboratory-scale mechanical transmission system dataset. Experimental results demonstrate that DiCL achieves high-fidelity data generation and superior diagnostic performance. Notably, even with only 5% of the fault training set, DiCL attains over 80% classification accuracy on both benchmark datasets, significantly outperforming state-of-the-art baseline methods.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"69 ","pages":"Article 103930"},"PeriodicalIF":9.9000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Diffusion-augmented contrastive learning framework for quantitative diagnosis under limited data conditions\",\"authors\":\"Kexin Yin , Chunjun Chen , Fengyu Ou , Boyuan Mu , Lu Yang , Yaowen Zhang\",\"doi\":\"10.1016/j.aei.2025.103930\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Quantitative diagnosis of bearing faults is essential for ensuring the safe and reliable operation of mechanical transmission systems, especially under complex and variable operating conditions. However, in real-world scenarios, collecting sufficient labeled fault data remains a major challenge, hindering accurate fault classification and severity estimation. While diffusion models have shown strong potential in data generation, existing methods primarily use them to expand training sets without explicitly modeling fault semantics or guiding the learning process. To address these limitations, we propose a novel Diffusion-Augmented Contrastive Learning (DiCL) framework for quantitative bearing fault diagnosis under limited data conditions. First, a fault-controllable denoising diffusion probabilistic model (DDPM) is developed to generate class-conditional synthetic signals across various fault types and severity levels. These synthetic samples are further used to construct compound fault labels that reflect complex or multi-fault conditions. Second, a dual-branch contrastive learning strategy is adopted, where two input samples are jointly processed to form contrastive pairs. This mechanism enables the feature extraction network to learn fault-discriminative representations by reinforcing shared fault characteristics and suppressing irrelevant variations. Third, a cycle-consistency constraint is introduced via a composite loss function to enforce semantic alignment among samples of the same fault class. The proposed DiCL framework is evaluated on two bearing fault datasets: the Paderborn University bearing dataset and a laboratory-scale mechanical transmission system dataset. Experimental results demonstrate that DiCL achieves high-fidelity data generation and superior diagnostic performance. Notably, even with only 5% of the fault training set, DiCL attains over 80% classification accuracy on both benchmark datasets, significantly outperforming state-of-the-art baseline methods.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"69 \",\"pages\":\"Article 103930\"},\"PeriodicalIF\":9.9000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034625008237\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625008237","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Diffusion-augmented contrastive learning framework for quantitative diagnosis under limited data conditions
Quantitative diagnosis of bearing faults is essential for ensuring the safe and reliable operation of mechanical transmission systems, especially under complex and variable operating conditions. However, in real-world scenarios, collecting sufficient labeled fault data remains a major challenge, hindering accurate fault classification and severity estimation. While diffusion models have shown strong potential in data generation, existing methods primarily use them to expand training sets without explicitly modeling fault semantics or guiding the learning process. To address these limitations, we propose a novel Diffusion-Augmented Contrastive Learning (DiCL) framework for quantitative bearing fault diagnosis under limited data conditions. First, a fault-controllable denoising diffusion probabilistic model (DDPM) is developed to generate class-conditional synthetic signals across various fault types and severity levels. These synthetic samples are further used to construct compound fault labels that reflect complex or multi-fault conditions. Second, a dual-branch contrastive learning strategy is adopted, where two input samples are jointly processed to form contrastive pairs. This mechanism enables the feature extraction network to learn fault-discriminative representations by reinforcing shared fault characteristics and suppressing irrelevant variations. Third, a cycle-consistency constraint is introduced via a composite loss function to enforce semantic alignment among samples of the same fault class. The proposed DiCL framework is evaluated on two bearing fault datasets: the Paderborn University bearing dataset and a laboratory-scale mechanical transmission system dataset. Experimental results demonstrate that DiCL achieves high-fidelity data generation and superior diagnostic performance. Notably, even with only 5% of the fault training set, DiCL attains over 80% classification accuracy on both benchmark datasets, significantly outperforming state-of-the-art baseline methods.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.