Xiaofeng Liu, Zheng Zhao, Daiping Wei, Fan Yang, Lin Bo, Jun Luo
{"title":"A fault diagnosis strategy for gearboxes based on reinforcement learning with balanced reward and cost- equilibrium sampling","authors":"Xiaofeng Liu, Zheng Zhao, Daiping Wei, Fan Yang, Lin Bo, Jun Luo","doi":"10.1016/j.aei.2025.103946","DOIUrl":null,"url":null,"abstract":"<div><div>To address the challenge of deep reinforcement learning exhibiting low accuracy in data-imbalanced gearbox fault diagnosis, this paper proposes a <em>Reinforcement Learning Model with Cost-sensitive Sampling and Balanced Reward</em> (RLM-CSBR) from the perspectives of feature representation, data utilization, and learning strategy. To tackle the lack of minority-class feature patterns, a multi-level convolutional deep integrated Q-network is constructed to fully explore deep discriminative features from imbalanced data, thereby maximizing feature-perceptive information. To mitigate model training bias, a balanced reward strategy based on the sample missing rate is designed; this strategy not only guides the agent to prioritize the exploration and learning of sample-scarce categories but also ensures the utilization efficiency of majority-class samples. To solve the problem of insufficient model fitting for minority-class samples, a novel cost-equilibrium matrix is incorporated into the prioritized experience replay mechanism, which prioritizes the selection of high-value experiences learned from critical minority-class samples during training.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"69 ","pages":"Article 103946"},"PeriodicalIF":9.9000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625008390","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
To address the challenge of deep reinforcement learning exhibiting low accuracy in data-imbalanced gearbox fault diagnosis, this paper proposes a Reinforcement Learning Model with Cost-sensitive Sampling and Balanced Reward (RLM-CSBR) from the perspectives of feature representation, data utilization, and learning strategy. To tackle the lack of minority-class feature patterns, a multi-level convolutional deep integrated Q-network is constructed to fully explore deep discriminative features from imbalanced data, thereby maximizing feature-perceptive information. To mitigate model training bias, a balanced reward strategy based on the sample missing rate is designed; this strategy not only guides the agent to prioritize the exploration and learning of sample-scarce categories but also ensures the utilization efficiency of majority-class samples. To solve the problem of insufficient model fitting for minority-class samples, a novel cost-equilibrium matrix is incorporated into the prioritized experience replay mechanism, which prioritizes the selection of high-value experiences learned from critical minority-class samples during training.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.