Day-ahead optimal dispatching of hybrid power system based on deep reinforcement learning

IF 1.3 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Cognitive Computation and Systems Pub Date : 2022-08-22 DOI:10.1049/ccs2.12068

Yakun Shi, Chaoxu Mu, Yi Hao, Shiqian Ma, Na Xu, Zhiqiang Chong

{"title":"Day-ahead optimal dispatching of hybrid power system based on deep reinforcement learning","authors":"Yakun Shi, Chaoxu Mu, Yi Hao, Shiqian Ma, Na Xu, Zhiqiang Chong","doi":"10.1049/ccs2.12068","DOIUrl":null,"url":null,"abstract":"<p>The problem of optimal dispatching of a power system containing a high proportion of renewable energy is of great significance for the realisation of new energy consumption and the economic and reliable operation of the power system. For the solution of non-linear, non-convex, multi-objective problems for the optimal operation design of a power system with wind and photovoltaic access, traditional methods have difficulties in terms of computational real-time and iterative convergence. To address this issue, a deep reinforcement learning-based optimal scheduling method for the hybrid power system is proposed, which enables continuous action control to obtain an optimal scheduling strategy through the interaction between the agent and the hybrid power system. Firstly, a mathematical description of the optimal scheduling problem containing wind power and photovoltaic power system is presented, and the state space, action space, and reward function of the agent are designed. Secondly, the basic framework of the deep reinforcement learning optimal scheduling model is constructed, and the basic principles of the twin delayed deep deterministic policy gradient algorithm are introduced. Finally, the effectiveness of the deep reinforcement learning model for day-ahead optimal scheduling of the hybrid power system is verified by means of an arithmetic analysis of the modified New England 39-bus system.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"4 4","pages":"351-361"},"PeriodicalIF":1.3000,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12068","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Computation and Systems","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/ccs2.12068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 1

Abstract

The problem of optimal dispatching of a power system containing a high proportion of renewable energy is of great significance for the realisation of new energy consumption and the economic and reliable operation of the power system. For the solution of non-linear, non-convex, multi-objective problems for the optimal operation design of a power system with wind and photovoltaic access, traditional methods have difficulties in terms of computational real-time and iterative convergence. To address this issue, a deep reinforcement learning-based optimal scheduling method for the hybrid power system is proposed, which enables continuous action control to obtain an optimal scheduling strategy through the interaction between the agent and the hybrid power system. Firstly, a mathematical description of the optimal scheduling problem containing wind power and photovoltaic power system is presented, and the state space, action space, and reward function of the agent are designed. Secondly, the basic framework of the deep reinforcement learning optimal scheduling model is constructed, and the basic principles of the twin delayed deep deterministic policy gradient algorithm are introduced. Finally, the effectiveness of the deep reinforcement learning model for day-ahead optimal scheduling of the hybrid power system is verified by means of an arithmetic analysis of the modified New England 39-bus system.

Abstract Image

查看原文本刊更多论文

基于深度强化学习的混合电力系统日前最优调度

高比例可再生能源电力系统的优化调度问题，对于实现新能源消纳和电力系统经济可靠运行具有重要意义。对于具有风电和光伏接入的电力系统的非线性、非凸、多目标优化设计问题，传统方法在计算实时性和迭代收敛性方面存在困难。针对这一问题，提出了一种基于深度强化学习的混合电力系统最优调度方法，使连续动作控制通过智能体与混合电力系统的相互作用获得最优调度策略。首先，对包含风电和光伏发电系统的最优调度问题进行数学描述，设计agent的状态空间、动作空间和奖励函数;其次，构造了深度强化学习最优调度模型的基本框架，介绍了双延迟深度确定性策略梯度算法的基本原理;最后，通过对改进后的新英格兰39总线系统的算法分析，验证了深度强化学习模型对混合动力系统日前优化调度的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊