{"title":"A multi-objective deep reinforcement learning method for intelligent scheduling of wind-solar-hydro-battery complementary generation systems","authors":"Yuanyu Ge, Jun Xie, Jiaqi Chang, Shuo Feng","doi":"10.1016/j.ijepes.2025.110635","DOIUrl":null,"url":null,"abstract":"<div><div>Renewable energy resources are rapidly developing to pursue carbon neutrality. However, integrating these sources poses challenges due to their randomness, volatility, and spatial–temporal mismatch of energy-electricity demand. One effective solution is the complementary operation and bundled external delivery of wind, solar, battery storage, and cascade hydropower. However, the uncertainty of wind and solar and the complexity of multi-energy coupling systems increase the difficulty of power scheduling, making it challenging for traditional methods to overcome these obstacles. Thus, this work presents an intelligent scheduling method based on multi-objective deep reinforcement learning (MODRL) for the wind-solar-hydro-battery complementary system (WSHBCS). Firstly, a multi-objective scheduling model is established. Meanwhile, this work proposes a MODRL framework to learn the scheduling strategy for multiple competing objectives. Then, the scheduling problem is developed into a Markov decision process (MDP), in which the state, action, and reward functions of the WSHBCS are designed accordingly. Finally, a multi-policy twin delayed deep deterministic policy gradient (MPTD3) method is put forth to achieve intelligent decisions in continuous action spaces. Simulation results indicate that the proposed intelligent scheduling method effectively achieves multi-objective scheduling for the WSHBCS, outperforming traditional heuristic optimization methods in terms of multi-objective optimization performance, uncertainty adaptability, and solution time.</div></div>","PeriodicalId":50326,"journal":{"name":"International Journal of Electrical Power & Energy Systems","volume":"167 ","pages":"Article 110635"},"PeriodicalIF":5.0000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Electrical Power & Energy Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0142061525001863","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Renewable energy resources are rapidly developing to pursue carbon neutrality. However, integrating these sources poses challenges due to their randomness, volatility, and spatial–temporal mismatch of energy-electricity demand. One effective solution is the complementary operation and bundled external delivery of wind, solar, battery storage, and cascade hydropower. However, the uncertainty of wind and solar and the complexity of multi-energy coupling systems increase the difficulty of power scheduling, making it challenging for traditional methods to overcome these obstacles. Thus, this work presents an intelligent scheduling method based on multi-objective deep reinforcement learning (MODRL) for the wind-solar-hydro-battery complementary system (WSHBCS). Firstly, a multi-objective scheduling model is established. Meanwhile, this work proposes a MODRL framework to learn the scheduling strategy for multiple competing objectives. Then, the scheduling problem is developed into a Markov decision process (MDP), in which the state, action, and reward functions of the WSHBCS are designed accordingly. Finally, a multi-policy twin delayed deep deterministic policy gradient (MPTD3) method is put forth to achieve intelligent decisions in continuous action spaces. Simulation results indicate that the proposed intelligent scheduling method effectively achieves multi-objective scheduling for the WSHBCS, outperforming traditional heuristic optimization methods in terms of multi-objective optimization performance, uncertainty adaptability, and solution time.
期刊介绍:
The journal covers theoretical developments in electrical power and energy systems and their applications. The coverage embraces: generation and network planning; reliability; long and short term operation; expert systems; neural networks; object oriented systems; system control centres; database and information systems; stock and parameter estimation; system security and adequacy; network theory, modelling and computation; small and large system dynamics; dynamic model identification; on-line control including load and switching control; protection; distribution systems; energy economics; impact of non-conventional systems; and man-machine interfaces.
As well as original research papers, the journal publishes short contributions, book reviews and conference reports. All papers are peer-reviewed by at least two referees.