Shousheng Ding , Lei Meng , Jie Shang , Chen Jiang , Haobo Qiu , Liang Gao
{"title":"A soft actor-critic reinforcement learning-based method for remaining useful life prediction","authors":"Shousheng Ding , Lei Meng , Jie Shang , Chen Jiang , Haobo Qiu , Liang Gao","doi":"10.1016/j.ress.2025.111121","DOIUrl":null,"url":null,"abstract":"<div><div>Remaining useful life (RUL) prediction techniques play a crucial role in manufacturing equipment condition management and maintenance planning. Currently, data-driven deep learning methods have made significant advancements in this field. However, traditional approaches have not adequately considered the temporal correlations in both sensor data and RUL prediction values during the degradation process of equipment. The existing reinforcement learning (RL) methods face challenges such as lacking of sufficient lifespan variation information in the state variables, ignorance of dynamic changes in prediction error in the reward function design, and adoption of fixed interaction termination conditions that can't effectively promote the agent's learning of device degradation information. Therefore, this paper proposes a RL model based on the soft actor-critic (SAC) algorithm. Firstly, an autoencoder is employed to extract key features from the data collected by sensors. Subsequently, these key features, along with multi-dimensional lifespan features containing information from multiple historical time steps, are utilized to construct the state variables in RL. Next, a reward function is formulated taking into account error gradients. Finally, a progressive early stopping method is proposed to train the model. Extensive experiments are conducted on the CMAPSS dataset and XJTU-SY bearing dataset, and the proposed method demonstrates higher prediction accuracy compared to mainstream approaches.</div></div>","PeriodicalId":54500,"journal":{"name":"Reliability Engineering & System Safety","volume":"261 ","pages":"Article 111121"},"PeriodicalIF":9.4000,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reliability Engineering & System Safety","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0951832025003229","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
Remaining useful life (RUL) prediction techniques play a crucial role in manufacturing equipment condition management and maintenance planning. Currently, data-driven deep learning methods have made significant advancements in this field. However, traditional approaches have not adequately considered the temporal correlations in both sensor data and RUL prediction values during the degradation process of equipment. The existing reinforcement learning (RL) methods face challenges such as lacking of sufficient lifespan variation information in the state variables, ignorance of dynamic changes in prediction error in the reward function design, and adoption of fixed interaction termination conditions that can't effectively promote the agent's learning of device degradation information. Therefore, this paper proposes a RL model based on the soft actor-critic (SAC) algorithm. Firstly, an autoencoder is employed to extract key features from the data collected by sensors. Subsequently, these key features, along with multi-dimensional lifespan features containing information from multiple historical time steps, are utilized to construct the state variables in RL. Next, a reward function is formulated taking into account error gradients. Finally, a progressive early stopping method is proposed to train the model. Extensive experiments are conducted on the CMAPSS dataset and XJTU-SY bearing dataset, and the proposed method demonstrates higher prediction accuracy compared to mainstream approaches.
期刊介绍:
Elsevier publishes Reliability Engineering & System Safety in association with the European Safety and Reliability Association and the Safety Engineering and Risk Analysis Division. The international journal is devoted to developing and applying methods to enhance the safety and reliability of complex technological systems, like nuclear power plants, chemical plants, hazardous waste facilities, space systems, offshore and maritime systems, transportation systems, constructed infrastructure, and manufacturing plants. The journal normally publishes only articles that involve the analysis of substantive problems related to the reliability of complex systems or present techniques and/or theoretical results that have a discernable relationship to the solution of such problems. An important aim is to balance academic material and practical applications.