{"title":"基于强化学习的混合电源控制系统","authors":"F. Daniel, A. Rix","doi":"10.1109/SAUPEC/RobMech/PRASA48453.2020.9041138","DOIUrl":null,"url":null,"abstract":"The control system of a hybrid power supply (HPS) is optimised using reinforcement learning. Hybrid power supplies combines two or more power sources. As more sources are integrated, the complexity of the control and energy exchange increases significantly. A model-free Q-learning RL controller is proposed to reduce the loss of power supply and to increase the cost optimisation of system components. The HPS consists of PV power, battery storage, a limited grid supply and a diesel generator. The use of renewable sources is a priority to reduce the amount of fuel usage. The results are compared to two baselines: random action controller and a rule-based controller. Different intervals are also assessed to find an optimal training interval to reduce computational power. The results showed that the RL-controller has the lowest LPS and optimises the use of the system components. This indicates that reinforcement learning-based control is viable and feasible for a HPS.","PeriodicalId":215514,"journal":{"name":"2020 International SAUPEC/RobMech/PRASA Conference","volume":"2019 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning-based Control System of a Hybrid Power Supply\",\"authors\":\"F. Daniel, A. Rix\",\"doi\":\"10.1109/SAUPEC/RobMech/PRASA48453.2020.9041138\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The control system of a hybrid power supply (HPS) is optimised using reinforcement learning. Hybrid power supplies combines two or more power sources. As more sources are integrated, the complexity of the control and energy exchange increases significantly. A model-free Q-learning RL controller is proposed to reduce the loss of power supply and to increase the cost optimisation of system components. The HPS consists of PV power, battery storage, a limited grid supply and a diesel generator. The use of renewable sources is a priority to reduce the amount of fuel usage. The results are compared to two baselines: random action controller and a rule-based controller. Different intervals are also assessed to find an optimal training interval to reduce computational power. The results showed that the RL-controller has the lowest LPS and optimises the use of the system components. This indicates that reinforcement learning-based control is viable and feasible for a HPS.\",\"PeriodicalId\":215514,\"journal\":{\"name\":\"2020 International SAUPEC/RobMech/PRASA Conference\",\"volume\":\"2019 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International SAUPEC/RobMech/PRASA Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SAUPEC/RobMech/PRASA48453.2020.9041138\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International SAUPEC/RobMech/PRASA Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SAUPEC/RobMech/PRASA48453.2020.9041138","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning-based Control System of a Hybrid Power Supply
The control system of a hybrid power supply (HPS) is optimised using reinforcement learning. Hybrid power supplies combines two or more power sources. As more sources are integrated, the complexity of the control and energy exchange increases significantly. A model-free Q-learning RL controller is proposed to reduce the loss of power supply and to increase the cost optimisation of system components. The HPS consists of PV power, battery storage, a limited grid supply and a diesel generator. The use of renewable sources is a priority to reduce the amount of fuel usage. The results are compared to two baselines: random action controller and a rule-based controller. Different intervals are also assessed to find an optimal training interval to reduce computational power. The results showed that the RL-controller has the lowest LPS and optimises the use of the system components. This indicates that reinforcement learning-based control is viable and feasible for a HPS.