Day-ahead optimal dispatching of hybrid power system based on deep reinforcement learning

IF 1.2 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yakun Shi, Chaoxu Mu, Yi Hao, Shiqian Ma, Na Xu, Zhiqiang Chong
{"title":"Day-ahead optimal dispatching of hybrid power system based on deep reinforcement learning","authors":"Yakun Shi,&nbsp;Chaoxu Mu,&nbsp;Yi Hao,&nbsp;Shiqian Ma,&nbsp;Na Xu,&nbsp;Zhiqiang Chong","doi":"10.1049/ccs2.12068","DOIUrl":null,"url":null,"abstract":"<p>The problem of optimal dispatching of a power system containing a high proportion of renewable energy is of great significance for the realisation of new energy consumption and the economic and reliable operation of the power system. For the solution of non-linear, non-convex, multi-objective problems for the optimal operation design of a power system with wind and photovoltaic access, traditional methods have difficulties in terms of computational real-time and iterative convergence. To address this issue, a deep reinforcement learning-based optimal scheduling method for the hybrid power system is proposed, which enables continuous action control to obtain an optimal scheduling strategy through the interaction between the agent and the hybrid power system. Firstly, a mathematical description of the optimal scheduling problem containing wind power and photovoltaic power system is presented, and the state space, action space, and reward function of the agent are designed. Secondly, the basic framework of the deep reinforcement learning optimal scheduling model is constructed, and the basic principles of the twin delayed deep deterministic policy gradient algorithm are introduced. Finally, the effectiveness of the deep reinforcement learning model for day-ahead optimal scheduling of the hybrid power system is verified by means of an arithmetic analysis of the modified New England 39-bus system.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"4 4","pages":"351-361"},"PeriodicalIF":1.2000,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12068","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Computation and Systems","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/ccs2.12068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 1

Abstract

The problem of optimal dispatching of a power system containing a high proportion of renewable energy is of great significance for the realisation of new energy consumption and the economic and reliable operation of the power system. For the solution of non-linear, non-convex, multi-objective problems for the optimal operation design of a power system with wind and photovoltaic access, traditional methods have difficulties in terms of computational real-time and iterative convergence. To address this issue, a deep reinforcement learning-based optimal scheduling method for the hybrid power system is proposed, which enables continuous action control to obtain an optimal scheduling strategy through the interaction between the agent and the hybrid power system. Firstly, a mathematical description of the optimal scheduling problem containing wind power and photovoltaic power system is presented, and the state space, action space, and reward function of the agent are designed. Secondly, the basic framework of the deep reinforcement learning optimal scheduling model is constructed, and the basic principles of the twin delayed deep deterministic policy gradient algorithm are introduced. Finally, the effectiveness of the deep reinforcement learning model for day-ahead optimal scheduling of the hybrid power system is verified by means of an arithmetic analysis of the modified New England 39-bus system.

Abstract Image

基于深度强化学习的混合电力系统日前最优调度
高比例可再生能源电力系统的优化调度问题,对于实现新能源消纳和电力系统经济可靠运行具有重要意义。对于具有风电和光伏接入的电力系统的非线性、非凸、多目标优化设计问题,传统方法在计算实时性和迭代收敛性方面存在困难。针对这一问题,提出了一种基于深度强化学习的混合电力系统最优调度方法,使连续动作控制通过智能体与混合电力系统的相互作用获得最优调度策略。首先,对包含风电和光伏发电系统的最优调度问题进行数学描述,设计agent的状态空间、动作空间和奖励函数;其次,构造了深度强化学习最优调度模型的基本框架,介绍了双延迟深度确定性策略梯度算法的基本原理;最后,通过对改进后的新英格兰39总线系统的算法分析,验证了深度强化学习模型对混合动力系统日前优化调度的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cognitive Computation and Systems
Cognitive Computation and Systems Computer Science-Computer Science Applications
CiteScore
2.50
自引率
0.00%
发文量
39
审稿时长
10 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信