Yakun Shi, Chaoxu Mu, Yi Hao, Shiqian Ma, Na Xu, Zhiqiang Chong
{"title":"基于深度强化学习的混合电力系统日前最优调度","authors":"Yakun Shi, Chaoxu Mu, Yi Hao, Shiqian Ma, Na Xu, Zhiqiang Chong","doi":"10.1049/ccs2.12068","DOIUrl":null,"url":null,"abstract":"<p>The problem of optimal dispatching of a power system containing a high proportion of renewable energy is of great significance for the realisation of new energy consumption and the economic and reliable operation of the power system. For the solution of non-linear, non-convex, multi-objective problems for the optimal operation design of a power system with wind and photovoltaic access, traditional methods have difficulties in terms of computational real-time and iterative convergence. To address this issue, a deep reinforcement learning-based optimal scheduling method for the hybrid power system is proposed, which enables continuous action control to obtain an optimal scheduling strategy through the interaction between the agent and the hybrid power system. Firstly, a mathematical description of the optimal scheduling problem containing wind power and photovoltaic power system is presented, and the state space, action space, and reward function of the agent are designed. Secondly, the basic framework of the deep reinforcement learning optimal scheduling model is constructed, and the basic principles of the twin delayed deep deterministic policy gradient algorithm are introduced. Finally, the effectiveness of the deep reinforcement learning model for day-ahead optimal scheduling of the hybrid power system is verified by means of an arithmetic analysis of the modified New England 39-bus system.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"4 4","pages":"351-361"},"PeriodicalIF":1.2000,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12068","citationCount":"1","resultStr":"{\"title\":\"Day-ahead optimal dispatching of hybrid power system based on deep reinforcement learning\",\"authors\":\"Yakun Shi, Chaoxu Mu, Yi Hao, Shiqian Ma, Na Xu, Zhiqiang Chong\",\"doi\":\"10.1049/ccs2.12068\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The problem of optimal dispatching of a power system containing a high proportion of renewable energy is of great significance for the realisation of new energy consumption and the economic and reliable operation of the power system. For the solution of non-linear, non-convex, multi-objective problems for the optimal operation design of a power system with wind and photovoltaic access, traditional methods have difficulties in terms of computational real-time and iterative convergence. To address this issue, a deep reinforcement learning-based optimal scheduling method for the hybrid power system is proposed, which enables continuous action control to obtain an optimal scheduling strategy through the interaction between the agent and the hybrid power system. Firstly, a mathematical description of the optimal scheduling problem containing wind power and photovoltaic power system is presented, and the state space, action space, and reward function of the agent are designed. Secondly, the basic framework of the deep reinforcement learning optimal scheduling model is constructed, and the basic principles of the twin delayed deep deterministic policy gradient algorithm are introduced. Finally, the effectiveness of the deep reinforcement learning model for day-ahead optimal scheduling of the hybrid power system is verified by means of an arithmetic analysis of the modified New England 39-bus system.</p>\",\"PeriodicalId\":33652,\"journal\":{\"name\":\"Cognitive Computation and Systems\",\"volume\":\"4 4\",\"pages\":\"351-361\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2022-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12068\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cognitive Computation and Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/ccs2.12068\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Computation and Systems","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/ccs2.12068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Day-ahead optimal dispatching of hybrid power system based on deep reinforcement learning
The problem of optimal dispatching of a power system containing a high proportion of renewable energy is of great significance for the realisation of new energy consumption and the economic and reliable operation of the power system. For the solution of non-linear, non-convex, multi-objective problems for the optimal operation design of a power system with wind and photovoltaic access, traditional methods have difficulties in terms of computational real-time and iterative convergence. To address this issue, a deep reinforcement learning-based optimal scheduling method for the hybrid power system is proposed, which enables continuous action control to obtain an optimal scheduling strategy through the interaction between the agent and the hybrid power system. Firstly, a mathematical description of the optimal scheduling problem containing wind power and photovoltaic power system is presented, and the state space, action space, and reward function of the agent are designed. Secondly, the basic framework of the deep reinforcement learning optimal scheduling model is constructed, and the basic principles of the twin delayed deep deterministic policy gradient algorithm are introduced. Finally, the effectiveness of the deep reinforcement learning model for day-ahead optimal scheduling of the hybrid power system is verified by means of an arithmetic analysis of the modified New England 39-bus system.