{"title":"Intelligent guarantee power supply decision method based on reinforcement learning algorithm","authors":"Milu Zhou, Huijie Sun, Tian Yang, Tingting Li, Qi Hou","doi":"10.1186/s42162-025-00535-3","DOIUrl":null,"url":null,"abstract":"<div><p>Traditional power supply decision methods rely on fixed and rigorous mathematical models, which are difficult to accurately capture the characteristics and changing patterns of new loads, resulting in low prediction accuracy. Therefore, a decision model for guaranteeing power supply is constructed based on an improved proximal policy optimization algorithm, to study the intelligent guarantee power supply decision method. The experimental results show that the stability of the proximal policy optimization algorithms is generally high in all scenarios, especially in fault or anomaly scenarios and low load demand scenarios, which exceeds 110%. Its loss value decreases with the increase of training iterations. At 60 iterations, its loss value reaches the optimal value of 100 and then tends to stabilize. The research results indicate that the intelligent power supply strategy has good feasibility. This decision method helps to improve the stability, efficiency, intelligence level, and ability to respond to emergencies of the power grid.</p></div>","PeriodicalId":538,"journal":{"name":"Energy Informatics","volume":"8 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://energyinformatics.springeropen.com/counter/pdf/10.1186/s42162-025-00535-3","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Informatics","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1186/s42162-025-00535-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Energy","Score":null,"Total":0}
引用次数: 0
Abstract
Traditional power supply decision methods rely on fixed and rigorous mathematical models, which are difficult to accurately capture the characteristics and changing patterns of new loads, resulting in low prediction accuracy. Therefore, a decision model for guaranteeing power supply is constructed based on an improved proximal policy optimization algorithm, to study the intelligent guarantee power supply decision method. The experimental results show that the stability of the proximal policy optimization algorithms is generally high in all scenarios, especially in fault or anomaly scenarios and low load demand scenarios, which exceeds 110%. Its loss value decreases with the increase of training iterations. At 60 iterations, its loss value reaches the optimal value of 100 and then tends to stabilize. The research results indicate that the intelligent power supply strategy has good feasibility. This decision method helps to improve the stability, efficiency, intelligence level, and ability to respond to emergencies of the power grid.