基于混合深度主动学习的低预算场景下高效数据驱动故障诊断

IF 11 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL
Gyeongho Kim , Jae Gyeong Choi , Sujin Jeon , Soyeon Park , Sunghoon Lim
{"title":"基于混合深度主动学习的低预算场景下高效数据驱动故障诊断","authors":"Gyeongho Kim ,&nbsp;Jae Gyeong Choi ,&nbsp;Sujin Jeon ,&nbsp;Soyeon Park ,&nbsp;Sunghoon Lim","doi":"10.1016/j.ress.2025.111637","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate fault diagnosis using deep learning (DL) has become essential for effective quality control, maintenance, and process automation in various industrial processes. However, an efficient labeling strategy is required because constructing large-scale labeled datasets to train DL-based predictive models entails considerable cost and labor. While active learning (AL) has been a prominent solution for efficient data labeling in fault diagnosis, existing AL approaches are unsuitable in practice due to low-budget scenarios where there is insufficient labeled data to train the model stably. In this regard, this work proposes a novel method, called a hybrid deep active learning for low-budget (HDAL-LB) scenarios, that addresses emerging challenges in the label-scarce regime to perform efficient fault diagnosis. First, self-supervised learning is performed with a deep stacked residual variational auto-encoder to efficiently initialize an encoder for latent feature extraction. Second, an evidential learning-based training technique is developed to enable a cost-efficient generation of calibrated predictive uncertainty. Third, a hybrid query selection is systematically formulated under a combinatorial optimization framework, utilizing both uncertainty and data diversity for deep AL. The efficacy of the proposed method (i.e., HDAL-LB) in fault diagnosis is validated through four case studies, utilizing three public benchmark datasets and one private real-world dataset. The comprehensive experimental results demonstrate the superior performance of HDAL-LB under low-budget scenarios compared to existing baseline and state-of-the-art (SOTA) AL methods. Furthermore, extensive ablation studies demonstrate that HDAL-LB consistently exhibits effective fault diagnosis performance across various experimental settings, highlighting its label efficiency and practical applicability in real-world practice.</div></div>","PeriodicalId":54500,"journal":{"name":"Reliability Engineering & System Safety","volume":"266 ","pages":"Article 111637"},"PeriodicalIF":11.0000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards efficient data-driven fault diagnosis under low-budget scenarios via hybrid deep active learning\",\"authors\":\"Gyeongho Kim ,&nbsp;Jae Gyeong Choi ,&nbsp;Sujin Jeon ,&nbsp;Soyeon Park ,&nbsp;Sunghoon Lim\",\"doi\":\"10.1016/j.ress.2025.111637\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate fault diagnosis using deep learning (DL) has become essential for effective quality control, maintenance, and process automation in various industrial processes. However, an efficient labeling strategy is required because constructing large-scale labeled datasets to train DL-based predictive models entails considerable cost and labor. While active learning (AL) has been a prominent solution for efficient data labeling in fault diagnosis, existing AL approaches are unsuitable in practice due to low-budget scenarios where there is insufficient labeled data to train the model stably. In this regard, this work proposes a novel method, called a hybrid deep active learning for low-budget (HDAL-LB) scenarios, that addresses emerging challenges in the label-scarce regime to perform efficient fault diagnosis. First, self-supervised learning is performed with a deep stacked residual variational auto-encoder to efficiently initialize an encoder for latent feature extraction. Second, an evidential learning-based training technique is developed to enable a cost-efficient generation of calibrated predictive uncertainty. Third, a hybrid query selection is systematically formulated under a combinatorial optimization framework, utilizing both uncertainty and data diversity for deep AL. The efficacy of the proposed method (i.e., HDAL-LB) in fault diagnosis is validated through four case studies, utilizing three public benchmark datasets and one private real-world dataset. The comprehensive experimental results demonstrate the superior performance of HDAL-LB under low-budget scenarios compared to existing baseline and state-of-the-art (SOTA) AL methods. Furthermore, extensive ablation studies demonstrate that HDAL-LB consistently exhibits effective fault diagnosis performance across various experimental settings, highlighting its label efficiency and practical applicability in real-world practice.</div></div>\",\"PeriodicalId\":54500,\"journal\":{\"name\":\"Reliability Engineering & System Safety\",\"volume\":\"266 \",\"pages\":\"Article 111637\"},\"PeriodicalIF\":11.0000,\"publicationDate\":\"2025-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Reliability Engineering & System Safety\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0951832025008373\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reliability Engineering & System Safety","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0951832025008373","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0

摘要

在各种工业过程中,使用深度学习(DL)进行准确的故障诊断已经成为有效的质量控制、维护和过程自动化的必要条件。然而,由于构建大规模标记数据集来训练基于dl的预测模型需要大量的成本和劳动力,因此需要一种有效的标记策略。虽然主动学习(AL)已经成为故障诊断中有效标记数据的重要解决方案,但由于低预算场景下标记数据不足,无法稳定训练模型,现有的AL方法在实践中并不适用。在这方面,这项工作提出了一种新的方法,称为低预算(HDAL-LB)场景的混合深度主动学习,该方法解决了标签稀缺制度中执行有效故障诊断的新挑战。首先,使用深度堆叠残差变分自编码器进行自监督学习,有效地初始化编码器以提取潜在特征。其次,开发了一种基于证据学习的训练技术,以经济高效地生成校准的预测不确定性。第三,利用深度人工智能的不确定性和数据多样性,在组合优化框架下系统地制定了混合查询选择。利用三个公共基准数据集和一个私有真实数据集,通过四个案例研究验证了所提出的方法(即HDAL-LB)在故障诊断中的有效性。综合实验结果表明,与现有的基线和最先进的(SOTA)人工智能方法相比,HDAL-LB在低预算场景下具有优越的性能。此外,广泛的消融研究表明,HDAL-LB在各种实验设置中始终表现出有效的故障诊断性能,突出了其标签效率和在现实世界实践中的实际适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Towards efficient data-driven fault diagnosis under low-budget scenarios via hybrid deep active learning
Accurate fault diagnosis using deep learning (DL) has become essential for effective quality control, maintenance, and process automation in various industrial processes. However, an efficient labeling strategy is required because constructing large-scale labeled datasets to train DL-based predictive models entails considerable cost and labor. While active learning (AL) has been a prominent solution for efficient data labeling in fault diagnosis, existing AL approaches are unsuitable in practice due to low-budget scenarios where there is insufficient labeled data to train the model stably. In this regard, this work proposes a novel method, called a hybrid deep active learning for low-budget (HDAL-LB) scenarios, that addresses emerging challenges in the label-scarce regime to perform efficient fault diagnosis. First, self-supervised learning is performed with a deep stacked residual variational auto-encoder to efficiently initialize an encoder for latent feature extraction. Second, an evidential learning-based training technique is developed to enable a cost-efficient generation of calibrated predictive uncertainty. Third, a hybrid query selection is systematically formulated under a combinatorial optimization framework, utilizing both uncertainty and data diversity for deep AL. The efficacy of the proposed method (i.e., HDAL-LB) in fault diagnosis is validated through four case studies, utilizing three public benchmark datasets and one private real-world dataset. The comprehensive experimental results demonstrate the superior performance of HDAL-LB under low-budget scenarios compared to existing baseline and state-of-the-art (SOTA) AL methods. Furthermore, extensive ablation studies demonstrate that HDAL-LB consistently exhibits effective fault diagnosis performance across various experimental settings, highlighting its label efficiency and practical applicability in real-world practice.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Reliability Engineering & System Safety
Reliability Engineering & System Safety 管理科学-工程:工业
CiteScore
15.20
自引率
39.50%
发文量
621
审稿时长
67 days
期刊介绍: Elsevier publishes Reliability Engineering & System Safety in association with the European Safety and Reliability Association and the Safety Engineering and Risk Analysis Division. The international journal is devoted to developing and applying methods to enhance the safety and reliability of complex technological systems, like nuclear power plants, chemical plants, hazardous waste facilities, space systems, offshore and maritime systems, transportation systems, constructed infrastructure, and manufacturing plants. The journal normally publishes only articles that involve the analysis of substantive problems related to the reliability of complex systems or present techniques and/or theoretical results that have a discernable relationship to the solution of such problems. An important aim is to balance academic material and practical applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信