Towards a Deep Reinforcement Learning based approach for real time decision making and resource allocation for Prognostics and Health Management applications
{"title":"Towards a Deep Reinforcement Learning based approach for real time decision making and resource allocation for Prognostics and Health Management applications","authors":"Ricardo Ludeke, P. S. Heyns","doi":"10.1109/ICPHM57936.2023.10194168","DOIUrl":null,"url":null,"abstract":"Industrial operational environments are stochastic and can have complex system dynamics which introduce multiple levels of uncertainty. This uncertainty can lead to sub-optimal decision making and resource allocation. Digitalization and automation of production equipment and the maintenance environment enable predictive maintenance, which means that equipment can be stopped for maintenance at the optimal time instant. Resource constraints in maintenance capacity could however result in further undesired downtime if maintenance cannot be performed when scheduled. In this paper the use of a multi-agent deep reinforcement learning based approach for decision making is investigated to determine the optimal maintenance scheduling policy for a fleet of assets where there are maintenance resource constraints. By considering the underlying system dynamics of maintenance capacity, as well as the health state of individual assets, a near-optimal decision making policy is found that increases equipment availability while also maximizing maintenance capacity. The proposed solution is compared to a run-to-failure corrective maintenance strategy, a constant interval preventive maintenance strategy and a condition based predictive maintenance strategy. The proposed approach outperformed traditional maintenance strategies across several asset and operational maintenance performance metrics. It is concluded that deep reinforcement learning based decision making for asset health management and resource allocation is more effective than human based decision making.","PeriodicalId":169274,"journal":{"name":"2023 IEEE International Conference on Prognostics and Health Management (ICPHM)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Prognostics and Health Management (ICPHM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPHM57936.2023.10194168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Industrial operational environments are stochastic and can have complex system dynamics which introduce multiple levels of uncertainty. This uncertainty can lead to sub-optimal decision making and resource allocation. Digitalization and automation of production equipment and the maintenance environment enable predictive maintenance, which means that equipment can be stopped for maintenance at the optimal time instant. Resource constraints in maintenance capacity could however result in further undesired downtime if maintenance cannot be performed when scheduled. In this paper the use of a multi-agent deep reinforcement learning based approach for decision making is investigated to determine the optimal maintenance scheduling policy for a fleet of assets where there are maintenance resource constraints. By considering the underlying system dynamics of maintenance capacity, as well as the health state of individual assets, a near-optimal decision making policy is found that increases equipment availability while also maximizing maintenance capacity. The proposed solution is compared to a run-to-failure corrective maintenance strategy, a constant interval preventive maintenance strategy and a condition based predictive maintenance strategy. The proposed approach outperformed traditional maintenance strategies across several asset and operational maintenance performance metrics. It is concluded that deep reinforcement learning based decision making for asset health management and resource allocation is more effective than human based decision making.