{"title":"配电网光伏集成动态重构的层次深度强化学习","authors":"Yanjuan Wu , Qing Li , Jianniang Qiu","doi":"10.1016/j.epsr.2025.111587","DOIUrl":null,"url":null,"abstract":"<div><div>In the context of the dual-carbon strategy for achieving carbon peak and neutrality, the increasing penetration of new energy sources, particularly distributed photovoltaic (PV) systems, has made the distribution network (DN) environment more complex, leading to serious curtailment issues in modern distribution systems. This paper introduces a voltage deviation index into the reward function of deep reinforcement learning (DRL) and proposes a Hierarchical Deep Reinforcement Learning (HDRL) method, which divides the agent in the Reinforcement Learning (RL) algorithm into high-level and low-level controllers to analyze the operating environment of the DN from different spatial and temporal scales, optimizing the proactive strategy for Dynamic Reconfiguration of Distribution Networks (DNDR). The high-level control model is responsible for formulating global strategies and selecting different switching group actions, while the low-level control model is responsible for executing specific switching actions. This approach incentivizes the agent to reduce the selection of switching actions, thus avoiding the algorithm from falling into local optima in complex environments. Case studies show that the method saves $563.34 in total economic cost and improves voltage quality by 91.71 %. The case studies validate that this method can more effectively enhance the consumption capacity of distributed PV, reduce the phenomenon of curtailment, and improve the operational stability and reliability of the DN.</div></div>","PeriodicalId":50547,"journal":{"name":"Electric Power Systems Research","volume":"246 ","pages":"Article 111587"},"PeriodicalIF":4.2000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hierarchical deep reinforcement learning for dynamic reconfiguration of photovoltaic integration in distribution network\",\"authors\":\"Yanjuan Wu , Qing Li , Jianniang Qiu\",\"doi\":\"10.1016/j.epsr.2025.111587\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In the context of the dual-carbon strategy for achieving carbon peak and neutrality, the increasing penetration of new energy sources, particularly distributed photovoltaic (PV) systems, has made the distribution network (DN) environment more complex, leading to serious curtailment issues in modern distribution systems. This paper introduces a voltage deviation index into the reward function of deep reinforcement learning (DRL) and proposes a Hierarchical Deep Reinforcement Learning (HDRL) method, which divides the agent in the Reinforcement Learning (RL) algorithm into high-level and low-level controllers to analyze the operating environment of the DN from different spatial and temporal scales, optimizing the proactive strategy for Dynamic Reconfiguration of Distribution Networks (DNDR). The high-level control model is responsible for formulating global strategies and selecting different switching group actions, while the low-level control model is responsible for executing specific switching actions. This approach incentivizes the agent to reduce the selection of switching actions, thus avoiding the algorithm from falling into local optima in complex environments. Case studies show that the method saves $563.34 in total economic cost and improves voltage quality by 91.71 %. The case studies validate that this method can more effectively enhance the consumption capacity of distributed PV, reduce the phenomenon of curtailment, and improve the operational stability and reliability of the DN.</div></div>\",\"PeriodicalId\":50547,\"journal\":{\"name\":\"Electric Power Systems Research\",\"volume\":\"246 \",\"pages\":\"Article 111587\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Electric Power Systems Research\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378779625001798\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electric Power Systems Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378779625001798","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Hierarchical deep reinforcement learning for dynamic reconfiguration of photovoltaic integration in distribution network
In the context of the dual-carbon strategy for achieving carbon peak and neutrality, the increasing penetration of new energy sources, particularly distributed photovoltaic (PV) systems, has made the distribution network (DN) environment more complex, leading to serious curtailment issues in modern distribution systems. This paper introduces a voltage deviation index into the reward function of deep reinforcement learning (DRL) and proposes a Hierarchical Deep Reinforcement Learning (HDRL) method, which divides the agent in the Reinforcement Learning (RL) algorithm into high-level and low-level controllers to analyze the operating environment of the DN from different spatial and temporal scales, optimizing the proactive strategy for Dynamic Reconfiguration of Distribution Networks (DNDR). The high-level control model is responsible for formulating global strategies and selecting different switching group actions, while the low-level control model is responsible for executing specific switching actions. This approach incentivizes the agent to reduce the selection of switching actions, thus avoiding the algorithm from falling into local optima in complex environments. Case studies show that the method saves $563.34 in total economic cost and improves voltage quality by 91.71 %. The case studies validate that this method can more effectively enhance the consumption capacity of distributed PV, reduce the phenomenon of curtailment, and improve the operational stability and reliability of the DN.
期刊介绍:
Electric Power Systems Research is an international medium for the publication of original papers concerned with the generation, transmission, distribution and utilization of electrical energy. The journal aims at presenting important results of work in this field, whether in the form of applied research, development of new procedures or components, orginal application of existing knowledge or new designapproaches. The scope of Electric Power Systems Research is broad, encompassing all aspects of electric power systems. The following list of topics is not intended to be exhaustive, but rather to indicate topics that fall within the journal purview.
• Generation techniques ranging from advances in conventional electromechanical methods, through nuclear power generation, to renewable energy generation.
• Transmission, spanning the broad area from UHV (ac and dc) to network operation and protection, line routing and design.
• Substation work: equipment design, protection and control systems.
• Distribution techniques, equipment development, and smart grids.
• The utilization area from energy efficiency to distributed load levelling techniques.
• Systems studies including control techniques, planning, optimization methods, stability, security assessment and insulation coordination.