Intelligent guarantee power supply decision method based on reinforcement learning algorithm

Q2 Energy

Energy Informatics Pub Date : 2025-06-10 DOI:10.1186/s42162-025-00535-3

Milu Zhou, Huijie Sun, Tian Yang, Tingting Li, Qi Hou

{"title":"Intelligent guarantee power supply decision method based on reinforcement learning algorithm","authors":"Milu Zhou, Huijie Sun, Tian Yang, Tingting Li, Qi Hou","doi":"10.1186/s42162-025-00535-3","DOIUrl":null,"url":null,"abstract":"<div><p>Traditional power supply decision methods rely on fixed and rigorous mathematical models, which are difficult to accurately capture the characteristics and changing patterns of new loads, resulting in low prediction accuracy. Therefore, a decision model for guaranteeing power supply is constructed based on an improved proximal policy optimization algorithm, to study the intelligent guarantee power supply decision method. The experimental results show that the stability of the proximal policy optimization algorithms is generally high in all scenarios, especially in fault or anomaly scenarios and low load demand scenarios, which exceeds 110%. Its loss value decreases with the increase of training iterations. At 60 iterations, its loss value reaches the optimal value of 100 and then tends to stabilize. The research results indicate that the intelligent power supply strategy has good feasibility. This decision method helps to improve the stability, efficiency, intelligence level, and ability to respond to emergencies of the power grid.</p></div>","PeriodicalId":538,"journal":{"name":"Energy Informatics","volume":"8 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://energyinformatics.springeropen.com/counter/pdf/10.1186/s42162-025-00535-3","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Informatics","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1186/s42162-025-00535-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Energy","Score":null,"Total":0}

引用次数: 0

Abstract

Traditional power supply decision methods rely on fixed and rigorous mathematical models, which are difficult to accurately capture the characteristics and changing patterns of new loads, resulting in low prediction accuracy. Therefore, a decision model for guaranteeing power supply is constructed based on an improved proximal policy optimization algorithm, to study the intelligent guarantee power supply decision method. The experimental results show that the stability of the proximal policy optimization algorithms is generally high in all scenarios, especially in fault or anomaly scenarios and low load demand scenarios, which exceeds 110%. Its loss value decreases with the increase of training iterations. At 60 iterations, its loss value reaches the optimal value of 100 and then tends to stabilize. The research results indicate that the intelligent power supply strategy has good feasibility. This decision method helps to improve the stability, efficiency, intelligence level, and ability to respond to emergencies of the power grid.

查看原文本刊更多论文

基于强化学习算法的智能保障供电决策方法

传统的供电决策方法依赖于固定的、严格的数学模型，难以准确地捕捉新负荷的特征和变化规律，导致预测精度较低。为此，基于改进的近端策略优化算法构建了保障供电决策模型，研究智能保障供电决策方法。实验结果表明，近端策略优化算法在所有场景下的稳定性普遍较高，特别是在故障或异常场景和低负荷需求场景下，稳定性超过110%。其损失值随着训练迭代次数的增加而减小。在60次迭代时，其损耗值达到最优值100，然后趋于稳定。研究结果表明，该智能供电策略具有良好的可行性。这种决策方法有助于提高电网的稳定性、效率、智能化水平和应急能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊