{"title":"基于样本加权反事实的不平衡故障诊断","authors":"Wei Zheng, Chunfei Gu, Hao Pan, Xin Fan, Sixiang Fu, Xiaoheng Ji","doi":"10.1016/j.compchemeng.2025.109434","DOIUrl":null,"url":null,"abstract":"<div><div>In fault detection tasks, the scarcity of fault samples often leads models to learn primarily from normal samples, resulting in biased predictions. To address class imbalance, this study introduces a data augmentation framework. It combines sample weighting based on stable learning with counterfactual generation. First, sample weighting is applied to enhance the model’s ability to capture true feature-outcome causal relationships. Then, a clustering algorithm selects high-weight samples with substantial distribution differences. Based on these representative normal samples, a multi-objective counterfactual generation method synthesizes fault samples under physical constraints, while weighted feature importance is used to identify key diagnostic features. The proposed approach effectively alleviates data imbalance in complex industrial fault diagnosis, improving both the accuracy and interpretability of fault detection. Experiments conducted on the Tennessee Eastman Process and the Multi-phase Flow Process show that our method significantly improves fault diagnosis performance under imbalanced data conditions.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"204 ","pages":"Article 109434"},"PeriodicalIF":3.9000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Imbalanced fault diagnosis based on sample-weighted counterfactual\",\"authors\":\"Wei Zheng, Chunfei Gu, Hao Pan, Xin Fan, Sixiang Fu, Xiaoheng Ji\",\"doi\":\"10.1016/j.compchemeng.2025.109434\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In fault detection tasks, the scarcity of fault samples often leads models to learn primarily from normal samples, resulting in biased predictions. To address class imbalance, this study introduces a data augmentation framework. It combines sample weighting based on stable learning with counterfactual generation. First, sample weighting is applied to enhance the model’s ability to capture true feature-outcome causal relationships. Then, a clustering algorithm selects high-weight samples with substantial distribution differences. Based on these representative normal samples, a multi-objective counterfactual generation method synthesizes fault samples under physical constraints, while weighted feature importance is used to identify key diagnostic features. The proposed approach effectively alleviates data imbalance in complex industrial fault diagnosis, improving both the accuracy and interpretability of fault detection. Experiments conducted on the Tennessee Eastman Process and the Multi-phase Flow Process show that our method significantly improves fault diagnosis performance under imbalanced data conditions.</div></div>\",\"PeriodicalId\":286,\"journal\":{\"name\":\"Computers & Chemical Engineering\",\"volume\":\"204 \",\"pages\":\"Article 109434\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-10-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Chemical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0098135425004375\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425004375","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Imbalanced fault diagnosis based on sample-weighted counterfactual
In fault detection tasks, the scarcity of fault samples often leads models to learn primarily from normal samples, resulting in biased predictions. To address class imbalance, this study introduces a data augmentation framework. It combines sample weighting based on stable learning with counterfactual generation. First, sample weighting is applied to enhance the model’s ability to capture true feature-outcome causal relationships. Then, a clustering algorithm selects high-weight samples with substantial distribution differences. Based on these representative normal samples, a multi-objective counterfactual generation method synthesizes fault samples under physical constraints, while weighted feature importance is used to identify key diagnostic features. The proposed approach effectively alleviates data imbalance in complex industrial fault diagnosis, improving both the accuracy and interpretability of fault detection. Experiments conducted on the Tennessee Eastman Process and the Multi-phase Flow Process show that our method significantly improves fault diagnosis performance under imbalanced data conditions.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.