Jiemai Gao , Siyuan Chen , Shixiong Fan , Jun Zhang , Kezheng Jiang , Jun Hao , David Wenzhong Gao
{"title":"基于物理指导的发电机脱扣控制的保守q学习","authors":"Jiemai Gao , Siyuan Chen , Shixiong Fan , Jun Zhang , Kezheng Jiang , Jun Hao , David Wenzhong Gao","doi":"10.1016/j.ijepes.2025.111181","DOIUrl":null,"url":null,"abstract":"<div><div>Maintaining transient stability following large disturbances is a critical challenge in modern power systems with high renewable energy penetration. Generator tripping control serves as an essential emergency strategy to mitigate rotor angle instability. However, conventional reinforcement learning approaches lack physical interpretability and robustness, limiting their operational acceptance. To address these concerns, this paper proposes a novel Physics-Informed Conservative Q-learning framework for emergency generator tripping. Specifically, the proposed approach integrates physics-informed neural networks (PINN) with conservative Q-learning (CQL) in a modular framework. A two-layer long short-term memory-based PINN, independently trained to predict generator rotor angle trajectories by embedding the governing swing equations to ensure physical consistency. Meanwhile, a convolutional neural network-based CQL agent is employed to learn robust generator tripping control policies, where the PINN outputs are incorporated as auxiliary dynamic features to enhance learning stability and safety. An action masking mechanism guided by physics-informed trajectory clustering further improves policy robustness by restricting decisions to critical generators. The proposed method is validated on a modified IEEE-39 bus system under multiple fault scenarios with different levels of renewable energy integration. Furthermore, to demonstrate scalability and generalization, additional validation is performed on the larger and more complex IEEE-118 bus system. Results from both testbeds show that the proposed approach significantly improves training efficiency, enhances transient control performance, ensures stable deployment behavior, and reduces generator tripping costs.</div></div>","PeriodicalId":50326,"journal":{"name":"International Journal of Electrical Power & Energy Systems","volume":"172 ","pages":"Article 111181"},"PeriodicalIF":5.0000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Conservative Q-learning with physics guidance for generator tripping control\",\"authors\":\"Jiemai Gao , Siyuan Chen , Shixiong Fan , Jun Zhang , Kezheng Jiang , Jun Hao , David Wenzhong Gao\",\"doi\":\"10.1016/j.ijepes.2025.111181\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Maintaining transient stability following large disturbances is a critical challenge in modern power systems with high renewable energy penetration. Generator tripping control serves as an essential emergency strategy to mitigate rotor angle instability. However, conventional reinforcement learning approaches lack physical interpretability and robustness, limiting their operational acceptance. To address these concerns, this paper proposes a novel Physics-Informed Conservative Q-learning framework for emergency generator tripping. Specifically, the proposed approach integrates physics-informed neural networks (PINN) with conservative Q-learning (CQL) in a modular framework. A two-layer long short-term memory-based PINN, independently trained to predict generator rotor angle trajectories by embedding the governing swing equations to ensure physical consistency. Meanwhile, a convolutional neural network-based CQL agent is employed to learn robust generator tripping control policies, where the PINN outputs are incorporated as auxiliary dynamic features to enhance learning stability and safety. An action masking mechanism guided by physics-informed trajectory clustering further improves policy robustness by restricting decisions to critical generators. The proposed method is validated on a modified IEEE-39 bus system under multiple fault scenarios with different levels of renewable energy integration. Furthermore, to demonstrate scalability and generalization, additional validation is performed on the larger and more complex IEEE-118 bus system. Results from both testbeds show that the proposed approach significantly improves training efficiency, enhances transient control performance, ensures stable deployment behavior, and reduces generator tripping costs.</div></div>\",\"PeriodicalId\":50326,\"journal\":{\"name\":\"International Journal of Electrical Power & Energy Systems\",\"volume\":\"172 \",\"pages\":\"Article 111181\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2025-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Electrical Power & Energy Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S014206152500729X\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Electrical Power & Energy Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S014206152500729X","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Conservative Q-learning with physics guidance for generator tripping control
Maintaining transient stability following large disturbances is a critical challenge in modern power systems with high renewable energy penetration. Generator tripping control serves as an essential emergency strategy to mitigate rotor angle instability. However, conventional reinforcement learning approaches lack physical interpretability and robustness, limiting their operational acceptance. To address these concerns, this paper proposes a novel Physics-Informed Conservative Q-learning framework for emergency generator tripping. Specifically, the proposed approach integrates physics-informed neural networks (PINN) with conservative Q-learning (CQL) in a modular framework. A two-layer long short-term memory-based PINN, independently trained to predict generator rotor angle trajectories by embedding the governing swing equations to ensure physical consistency. Meanwhile, a convolutional neural network-based CQL agent is employed to learn robust generator tripping control policies, where the PINN outputs are incorporated as auxiliary dynamic features to enhance learning stability and safety. An action masking mechanism guided by physics-informed trajectory clustering further improves policy robustness by restricting decisions to critical generators. The proposed method is validated on a modified IEEE-39 bus system under multiple fault scenarios with different levels of renewable energy integration. Furthermore, to demonstrate scalability and generalization, additional validation is performed on the larger and more complex IEEE-118 bus system. Results from both testbeds show that the proposed approach significantly improves training efficiency, enhances transient control performance, ensures stable deployment behavior, and reduces generator tripping costs.
期刊介绍:
The journal covers theoretical developments in electrical power and energy systems and their applications. The coverage embraces: generation and network planning; reliability; long and short term operation; expert systems; neural networks; object oriented systems; system control centres; database and information systems; stock and parameter estimation; system security and adequacy; network theory, modelling and computation; small and large system dynamics; dynamic model identification; on-line control including load and switching control; protection; distribution systems; energy economics; impact of non-conventional systems; and man-machine interfaces.
As well as original research papers, the journal publishes short contributions, book reviews and conference reports. All papers are peer-reviewed by at least two referees.