{"title":"基于混合动作空间强化学习的配电网电压/无功控制","authors":"Yuan Zhou;Yizhou Peng;Leijiao Ge;Luyang Hou;Ying Wang;Hongxia Niu","doi":"10.35833/MPCE.2024.000643","DOIUrl":null,"url":null,"abstract":"In volt/var control (VVC) for active distribution networks, it is essential to integrate traditional voltage regulation devices with modern smart photovoltaic inverters to prevent voltage violations. However, model-based multi-device VVC methods rely on accurate system models for decision-making, which can be challenging due to the extensive modeling workload. To tackle the complexities of multi-device cooperation in VVC, this paper proposes a two-timescale VVC method based on reinforcement learning with hybrid action space, termed the hybrid action representation twin delayed deep deterministic policy gradient (HAR-TD3) method. This method simultaneously manages traditional discrete voltage regulation devices, which operate on a slower timescale, and smart continuous voltage regulation devices, which function on a faster timescale. To enable effective collaboration between the different action spaces of these devices, we propose a variational auto-encoder based hybrid action reconstruction network. This network captures the interdependencies of hybrid actions by embedding both discrete and continuous actions into the latent representation space and subsequently decoding them for action reconstruction. The proposed method is validated on IEEE 33-bus, 69-bus, and 123-bus distribution networks. Numerical results indicate that the proposed method successfully coordinates discrete and continuous voltage regulation devices, achieving fewer voltage violations compared with state-of-the-art reinforcement learning methods.","PeriodicalId":51326,"journal":{"name":"Journal of Modern Power Systems and Clean Energy","volume":"13 4","pages":"1261-1273"},"PeriodicalIF":6.1000,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10834450","citationCount":"0","resultStr":"{\"title\":\"Two-Timescale Volt/var Control Based on Reinforcement Learning with Hybrid Action Space for Distribution Networks\",\"authors\":\"Yuan Zhou;Yizhou Peng;Leijiao Ge;Luyang Hou;Ying Wang;Hongxia Niu\",\"doi\":\"10.35833/MPCE.2024.000643\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In volt/var control (VVC) for active distribution networks, it is essential to integrate traditional voltage regulation devices with modern smart photovoltaic inverters to prevent voltage violations. However, model-based multi-device VVC methods rely on accurate system models for decision-making, which can be challenging due to the extensive modeling workload. To tackle the complexities of multi-device cooperation in VVC, this paper proposes a two-timescale VVC method based on reinforcement learning with hybrid action space, termed the hybrid action representation twin delayed deep deterministic policy gradient (HAR-TD3) method. This method simultaneously manages traditional discrete voltage regulation devices, which operate on a slower timescale, and smart continuous voltage regulation devices, which function on a faster timescale. To enable effective collaboration between the different action spaces of these devices, we propose a variational auto-encoder based hybrid action reconstruction network. This network captures the interdependencies of hybrid actions by embedding both discrete and continuous actions into the latent representation space and subsequently decoding them for action reconstruction. The proposed method is validated on IEEE 33-bus, 69-bus, and 123-bus distribution networks. Numerical results indicate that the proposed method successfully coordinates discrete and continuous voltage regulation devices, achieving fewer voltage violations compared with state-of-the-art reinforcement learning methods.\",\"PeriodicalId\":51326,\"journal\":{\"name\":\"Journal of Modern Power Systems and Clean Energy\",\"volume\":\"13 4\",\"pages\":\"1261-1273\"},\"PeriodicalIF\":6.1000,\"publicationDate\":\"2025-01-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10834450\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Modern Power Systems and Clean Energy\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10834450/\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Modern Power Systems and Clean Energy","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10834450/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Two-Timescale Volt/var Control Based on Reinforcement Learning with Hybrid Action Space for Distribution Networks
In volt/var control (VVC) for active distribution networks, it is essential to integrate traditional voltage regulation devices with modern smart photovoltaic inverters to prevent voltage violations. However, model-based multi-device VVC methods rely on accurate system models for decision-making, which can be challenging due to the extensive modeling workload. To tackle the complexities of multi-device cooperation in VVC, this paper proposes a two-timescale VVC method based on reinforcement learning with hybrid action space, termed the hybrid action representation twin delayed deep deterministic policy gradient (HAR-TD3) method. This method simultaneously manages traditional discrete voltage regulation devices, which operate on a slower timescale, and smart continuous voltage regulation devices, which function on a faster timescale. To enable effective collaboration between the different action spaces of these devices, we propose a variational auto-encoder based hybrid action reconstruction network. This network captures the interdependencies of hybrid actions by embedding both discrete and continuous actions into the latent representation space and subsequently decoding them for action reconstruction. The proposed method is validated on IEEE 33-bus, 69-bus, and 123-bus distribution networks. Numerical results indicate that the proposed method successfully coordinates discrete and continuous voltage regulation devices, achieving fewer voltage violations compared with state-of-the-art reinforcement learning methods.
期刊介绍:
Journal of Modern Power Systems and Clean Energy (MPCE), commencing from June, 2013, is a newly established, peer-reviewed and quarterly published journal in English. It is the first international power engineering journal originated in mainland China. MPCE publishes original papers, short letters and review articles in the field of modern power systems with focus on smart grid technology and renewable energy integration, etc.