智能电网分布式经济调度的多智能体深度强化学习算法

Lifu Ding, Zhiyun Lin, Gangfeng Yan
{"title":"智能电网分布式经济调度的多智能体深度强化学习算法","authors":"Lifu Ding, Zhiyun Lin, Gangfeng Yan","doi":"10.1109/IECON43393.2020.9255238","DOIUrl":null,"url":null,"abstract":"With the development of large-scale power grids, the issue of distributed economic dispatch has received considerable critical attention. However, due to the existence of some effects such as valve-point effects, the nonconvex objective function remains a major challenge for the distributed optimization problem. This paper proposes a cooperative deep reinforcement learning algorithm for distributed economic dispatch with the nonconvex objective function. In the distributed algorithm, all nodes obtain the value of actions by observing the environment and update state-action-value function in coordination with local neighbors. The state-action-value function is approximated by a neural network, which allows the algorithm to be used for large and continuous state spaces. The advantages of the algorithm are demonstrated through several case studies.","PeriodicalId":13045,"journal":{"name":"IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society","volume":"78 1","pages":"3529-3534"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-agent Deep Reinforcement Learning Algorithm for Distributed Economic Dispatch in Smart Grid\",\"authors\":\"Lifu Ding, Zhiyun Lin, Gangfeng Yan\",\"doi\":\"10.1109/IECON43393.2020.9255238\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of large-scale power grids, the issue of distributed economic dispatch has received considerable critical attention. However, due to the existence of some effects such as valve-point effects, the nonconvex objective function remains a major challenge for the distributed optimization problem. This paper proposes a cooperative deep reinforcement learning algorithm for distributed economic dispatch with the nonconvex objective function. In the distributed algorithm, all nodes obtain the value of actions by observing the environment and update state-action-value function in coordination with local neighbors. The state-action-value function is approximated by a neural network, which allows the algorithm to be used for large and continuous state spaces. The advantages of the algorithm are demonstrated through several case studies.\",\"PeriodicalId\":13045,\"journal\":{\"name\":\"IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society\",\"volume\":\"78 1\",\"pages\":\"3529-3534\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IECON43393.2020.9255238\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IECON43393.2020.9255238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

随着大规模电网的发展,分布式经济调度问题越来越受到人们的重视。然而,由于阀点效应等影响的存在,非凸目标函数仍然是分布式优化问题的主要挑战。提出了一种具有非凸目标函数的分布式经济调度协同深度强化学习算法。在分布式算法中,所有节点通过观察环境获取动作值,并与局部邻居协调更新状态-动作-值函数。状态-动作-值函数由神经网络逼近,使得该算法适用于大的连续状态空间。通过实例分析,证明了该算法的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-agent Deep Reinforcement Learning Algorithm for Distributed Economic Dispatch in Smart Grid
With the development of large-scale power grids, the issue of distributed economic dispatch has received considerable critical attention. However, due to the existence of some effects such as valve-point effects, the nonconvex objective function remains a major challenge for the distributed optimization problem. This paper proposes a cooperative deep reinforcement learning algorithm for distributed economic dispatch with the nonconvex objective function. In the distributed algorithm, all nodes obtain the value of actions by observing the environment and update state-action-value function in coordination with local neighbors. The state-action-value function is approximated by a neural network, which allows the algorithm to be used for large and continuous state spaces. The advantages of the algorithm are demonstrated through several case studies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信