Xiang Cheng, Jun Kit Chaw, Shafrida Sahrani, Mei Choo Ang, Saraswathy Shamini Gunasekaran, Moamin A. Mahmoud, Halimah Badioze Zaman, Yanfeng Zhao, Fuchen Ren
Title: An adaptive dual distillation framework for efficient remaining useful life prediction
Journal: Complex & Intelligent Systems (JCR Q1, Computer Science, Artificial Intelligence)
DOI: 10.1007/s40747-025-01886-w
Published: 2025-04-22
Citations: 0
Abstract
Predicting the Remaining Useful Life (RUL) of industrial equipment is essential for proactive maintenance and health assessment, particularly under the computational constraints of edge devices. While deep learning methods, such as Long Short-Term Memory (LSTM) networks, excel at modeling complex time series, their high computational cost often restricts real-time deployment. To address this challenge, we present an Adaptive Dual Distillation Framework (A-DDF) that transfers knowledge from a large LSTM teacher model to a lightweight bidirectional Gated Recurrent Unit (GRU) student model. Soft-target distillation refines predictive distributions to provide robust supervision, while our correlation-based feature alignment preserves inter-feature relationships and prevents information loss. An adaptive weighting mechanism balances these two distillation strategies, enabling the student model to maintain high predictive accuracy while reducing model complexity. We validate our approach on NASA's C-MAPSS dataset, which includes diverse operating conditions. A-DDF outperforms previous methods, achieving a 12% decrease in relative error (MAPE) and improving prediction accuracy and stability. Ablation experiments show that the dual distillation strategy improves predictive accuracy, surpassing single-distillation approaches. Notably, the student model achieves a 5.34-fold compression rate, reducing parameters by 83%, while maintaining or exceeding the performance of the LSTM teacher model. These results highlight A-DDF's potential for efficient, high-accuracy predictive maintenance on edge devices. Comparisons with mainstream benchmarks confirm A-DDF's superior performance across datasets. Finally, generality and quantization experiments validate its broad applicability and deployability. The proposed method emphasizes reducing model size without sacrificing performance, making it ideal for real-world predictive maintenance scenarios and intelligence-driven manufacturing applications.
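The dual objective described in the abstract can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: the MSE form of the soft-target loss, the Frobenius-style correlation-alignment loss, and the fixed weight `alpha` are all assumptions, since the abstract does not give the exact loss definitions or the adaptive weighting rule (which the paper adapts during training rather than fixing).

```python
import numpy as np

def soft_target_loss(student_pred, teacher_pred):
    # Soft-target distillation: match the student's RUL predictions
    # to the teacher's (here a simple MSE between the two).
    return np.mean((student_pred - teacher_pred) ** 2)

def correlation_alignment_loss(student_feat, teacher_feat):
    # Correlation-based feature alignment: compare the feature
    # correlation matrices of student and teacher representations,
    # so inter-feature relationships are preserved.
    cs = np.corrcoef(student_feat, rowvar=False)
    ct = np.corrcoef(teacher_feat, rowvar=False)
    return np.mean((cs - ct) ** 2)

def adaptive_dual_loss(student_pred, teacher_pred,
                       student_feat, teacher_feat, alpha):
    # alpha in [0, 1] balances the two distillation strategies;
    # in A-DDF this weight is adapted during training, here it is
    # passed in as a fixed hyperparameter for illustration.
    l_soft = soft_target_loss(student_pred, teacher_pred)
    l_corr = correlation_alignment_loss(student_feat, teacher_feat)
    return alpha * l_soft + (1 - alpha) * l_corr
```

With identical student and teacher outputs both terms vanish, and increasing `alpha` shifts emphasis from feature alignment toward prediction matching.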
Journal description:
Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.