Qian Xu, Chutian Yu, Xiang Yuan, Zao Fu, Hongzhe Liu
{"title":"Distributed Q-learning Algorithm for Economic Dispatch of Smart Grid with Unknown Cost Functions","authors":"Qian Xu, Chutian Yu, Xiang Yuan, Zao Fu, Hongzhe Liu","doi":"10.1109/CAC57257.2022.10055962","DOIUrl":null,"url":null,"abstract":"In this paper, a distributed Q-learning algorithm is studied to solve the economic dispatch (ED) problem in smart grid. To tackle the ED problem in the presence of unknown cost functions, most of the existing methods are designed based on the global information of generation units, which would suffer from potential network attack. To conquer such limitation, the distributed Q-leaning algorithm consisting of distributed communication and reinforcement learning (RL) is proposed, where no global information is allowed to used, but information exchange among neighboring generation units can be available. In distributed Q-learning, each generation unit learns the local action-value function and collaborates to optimize the ED problem. Finally, the convergence and optimality of the proposed algorithm are proven, and the numerical simulation results demonstrate the effectiveness of the algorithm.","PeriodicalId":287137,"journal":{"name":"2022 China Automation Congress (CAC)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 China Automation Congress (CAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAC57257.2022.10055962","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, a distributed Q-learning algorithm is studied to solve the economic dispatch (ED) problem in smart grid. To tackle the ED problem in the presence of unknown cost functions, most of the existing methods are designed based on the global information of generation units, which would suffer from potential network attack. To conquer such limitation, the distributed Q-leaning algorithm consisting of distributed communication and reinforcement learning (RL) is proposed, where no global information is allowed to used, but information exchange among neighboring generation units can be available. In distributed Q-learning, each generation unit learns the local action-value function and collaborates to optimize the ED problem. Finally, the convergence and optimality of the proposed algorithm are proven, and the numerical simulation results demonstrate the effectiveness of the algorithm.