T. P. Imthias Ahamed, E. A. Jasmin, Essam A. Al-Ammar
{"title":"强化学习在电力系统调度与控制中的应用","authors":"T. P. Imthias Ahamed, E. A. Jasmin, Essam A. Al-Ammar","doi":"10.1109/ISCI.2011.5958993","DOIUrl":null,"url":null,"abstract":"Reinforcement Learning (RL) has been applied to various scheduling and control problems in power systems in the last decade. However, the area is still in its infancy. In this paper, we present various research works in this area in a unified perspective. In most of the applications, power system problems — control of FACTS devices, reactive power control, Automatic Generation Control, Economic Dispatch, etc — are modeled as a Multistage Decision making Problem and RL is used to solve the MDP. One important point about RL is, it takes considerable amount of time to learn a control strategy. However, RL can learn off line using a simulation model. Once the control strategy is learned decision making can be done almost instantaneously. A major drawback of RL is most of the application does not scale up and much work need to be done. We hope this paper will generate more interest in the area and RL will be utilized to its full potential.","PeriodicalId":166647,"journal":{"name":"2011 IEEE Symposium on Computers & Informatics","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Reinforcement learning in power system scheduling and control: A unified perspective\",\"authors\":\"T. P. Imthias Ahamed, E. A. Jasmin, Essam A. Al-Ammar\",\"doi\":\"10.1109/ISCI.2011.5958993\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement Learning (RL) has been applied to various scheduling and control problems in power systems in the last decade. However, the area is still in its infancy. In this paper, we present various research works in this area in a unified perspective. In most of the applications, power system problems — control of FACTS devices, reactive power control, Automatic Generation Control, Economic Dispatch, etc — are modeled as a Multistage Decision making Problem and RL is used to solve the MDP. One important point about RL is, it takes considerable amount of time to learn a control strategy. However, RL can learn off line using a simulation model. Once the control strategy is learned decision making can be done almost instantaneously. A major drawback of RL is most of the application does not scale up and much work need to be done. We hope this paper will generate more interest in the area and RL will be utilized to its full potential.\",\"PeriodicalId\":166647,\"journal\":{\"name\":\"2011 IEEE Symposium on Computers & Informatics\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE Symposium on Computers & Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCI.2011.5958993\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Symposium on Computers & Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCI.2011.5958993","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement learning in power system scheduling and control: A unified perspective
Reinforcement Learning (RL) has been applied to various scheduling and control problems in power systems in the last decade. However, the area is still in its infancy. In this paper, we present various research works in this area in a unified perspective. In most of the applications, power system problems — control of FACTS devices, reactive power control, Automatic Generation Control, Economic Dispatch, etc — are modeled as a Multistage Decision making Problem and RL is used to solve the MDP. One important point about RL is, it takes considerable amount of time to learn a control strategy. However, RL can learn off line using a simulation model. Once the control strategy is learned decision making can be done almost instantaneously. A major drawback of RL is most of the application does not scale up and much work need to be done. We hope this paper will generate more interest in the area and RL will be utilized to its full potential.