智能电网分布式经济调度的多智能体深度强化学习算法

IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society Pub Date : 2020-10-18 DOI:10.1109/IECON43393.2020.9255238

Lifu Ding, Zhiyun Lin, Gangfeng Yan

{"title":"智能电网分布式经济调度的多智能体深度强化学习算法","authors":"Lifu Ding, Zhiyun Lin, Gangfeng Yan","doi":"10.1109/IECON43393.2020.9255238","DOIUrl":null,"url":null,"abstract":"With the development of large-scale power grids, the issue of distributed economic dispatch has received considerable critical attention. However, due to the existence of some effects such as valve-point effects, the nonconvex objective function remains a major challenge for the distributed optimization problem. This paper proposes a cooperative deep reinforcement learning algorithm for distributed economic dispatch with the nonconvex objective function. In the distributed algorithm, all nodes obtain the value of actions by observing the environment and update state-action-value function in coordination with local neighbors. The state-action-value function is approximated by a neural network, which allows the algorithm to be used for large and continuous state spaces. The advantages of the algorithm are demonstrated through several case studies.","PeriodicalId":13045,"journal":{"name":"IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society","volume":"78 1","pages":"3529-3534"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-agent Deep Reinforcement Learning Algorithm for Distributed Economic Dispatch in Smart Grid\",\"authors\":\"Lifu Ding, Zhiyun Lin, Gangfeng Yan\",\"doi\":\"10.1109/IECON43393.2020.9255238\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of large-scale power grids, the issue of distributed economic dispatch has received considerable critical attention. However, due to the existence of some effects such as valve-point effects, the nonconvex objective function remains a major challenge for the distributed optimization problem. This paper proposes a cooperative deep reinforcement learning algorithm for distributed economic dispatch with the nonconvex objective function. In the distributed algorithm, all nodes obtain the value of actions by observing the environment and update state-action-value function in coordination with local neighbors. The state-action-value function is approximated by a neural network, which allows the algorithm to be used for large and continuous state spaces. The advantages of the algorithm are demonstrated through several case studies.\",\"PeriodicalId\":13045,\"journal\":{\"name\":\"IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society\",\"volume\":\"78 1\",\"pages\":\"3529-3534\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IECON43393.2020.9255238\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IECON43393.2020.9255238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着大规模电网的发展，分布式经济调度问题越来越受到人们的重视。然而，由于阀点效应等影响的存在，非凸目标函数仍然是分布式优化问题的主要挑战。提出了一种具有非凸目标函数的分布式经济调度协同深度强化学习算法。在分布式算法中，所有节点通过观察环境获取动作值，并与局部邻居协调更新状态-动作-值函数。状态-动作-值函数由神经网络逼近，使得该算法适用于大的连续状态空间。通过实例分析，证明了该算法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multi-agent Deep Reinforcement Learning Algorithm for Distributed Economic Dispatch in Smart Grid

With the development of large-scale power grids, the issue of distributed economic dispatch has received considerable critical attention. However, due to the existence of some effects such as valve-point effects, the nonconvex objective function remains a major challenge for the distributed optimization problem. This paper proposes a cooperative deep reinforcement learning algorithm for distributed economic dispatch with the nonconvex objective function. In the distributed algorithm, all nodes obtain the value of actions by observing the environment and update state-action-value function in coordination with local neighbors. The state-action-value function is approximated by a neural network, which allows the algorithm to be used for large and continuous state spaces. The advantages of the algorithm are demonstrated through several case studies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IECON 2020 The 46th Annual Conference of the IEEE Industrial Electronics Society

自引率

0.00%

发文量