{"title":"电力系统稳定性增强的直接启发式动态规划方法","authors":"Miao Yu, Chao Lu, Yongjun Liu","doi":"10.1109/ACC.2014.6858581","DOIUrl":null,"url":null,"abstract":"In this paper a neural network-based approximate dynamic programming method, namely direct heuristic dynamic programming (direct HDP), is applied to power system stability control. Direct HDP is a learning and approximation based approach to address nonlinear system control under uncertainty. In the present paper, real-time system responses provided by wide area measurement system (WAMS) are used to construct such controllers which are uniquely tailored for the problems under consideration. In addition, the controller learning objective is formulated as a reward function that reflects global characteristics of the power system low frequency oscillation under the consideration of coupling effect among system components. The contribution of the paper includes a convergence proof of the direct HDP algorithm using an LQR framework, as well as case study to illustrate the proposed learning control algorithm. The case study aims at providing a new solution to a difficult large scale system coordination problem where the China Southern Power Grid is used for.","PeriodicalId":369729,"journal":{"name":"2014 American Control Conference","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Direct heuristic dynamic programming method for power system stability enhancement\",\"authors\":\"Miao Yu, Chao Lu, Yongjun Liu\",\"doi\":\"10.1109/ACC.2014.6858581\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper a neural network-based approximate dynamic programming method, namely direct heuristic dynamic programming (direct HDP), is applied to power system stability control. Direct HDP is a learning and approximation based approach to address nonlinear system control under uncertainty. In the present paper, real-time system responses provided by wide area measurement system (WAMS) are used to construct such controllers which are uniquely tailored for the problems under consideration. In addition, the controller learning objective is formulated as a reward function that reflects global characteristics of the power system low frequency oscillation under the consideration of coupling effect among system components. The contribution of the paper includes a convergence proof of the direct HDP algorithm using an LQR framework, as well as case study to illustrate the proposed learning control algorithm. The case study aims at providing a new solution to a difficult large scale system coordination problem where the China Southern Power Grid is used for.\",\"PeriodicalId\":369729,\"journal\":{\"name\":\"2014 American Control Conference\",\"volume\":\"67 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 American Control Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACC.2014.6858581\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 American Control Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACC.2014.6858581","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Direct heuristic dynamic programming method for power system stability enhancement
In this paper a neural network-based approximate dynamic programming method, namely direct heuristic dynamic programming (direct HDP), is applied to power system stability control. Direct HDP is a learning and approximation based approach to address nonlinear system control under uncertainty. In the present paper, real-time system responses provided by wide area measurement system (WAMS) are used to construct such controllers which are uniquely tailored for the problems under consideration. In addition, the controller learning objective is formulated as a reward function that reflects global characteristics of the power system low frequency oscillation under the consideration of coupling effect among system components. The contribution of the paper includes a convergence proof of the direct HDP algorithm using an LQR framework, as well as case study to illustrate the proposed learning control algorithm. The case study aims at providing a new solution to a difficult large scale system coordination problem where the China Southern Power Grid is used for.