Interval Constrained Multi-Objective Optimization Scheduling Method for Island-Integrated Energy Systems Based on Meta-Learning and Enhanced Proximal Policy Optimization

IF 2.6 3区 工程技术 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS
Dongbao Jia, Ming Cao, Jing Sun, Feimeng Wang, Wei Xu, Yichen Wang
{"title":"Interval Constrained Multi-Objective Optimization Scheduling Method for Island-Integrated Energy Systems Based on Meta-Learning and Enhanced Proximal Policy Optimization","authors":"Dongbao Jia, Ming Cao, Jing Sun, Feimeng Wang, Wei Xu, Yichen Wang","doi":"10.3390/electronics13173579","DOIUrl":null,"url":null,"abstract":"Multiple uncertainties from source–load and energy conversion significantly impact the real-time dispatch of an island integrated energy system (IIES). This paper addresses the day-ahead scheduling problems of IIES under these conditions, aiming to minimize daily economic costs and maximize the output of renewable energies. We introduce an innovative algorithm for Interval Constrained Multi-objective Optimization Problems (ICMOPs), which incorporates meta-learning and an improved Proximal Policy Optimization with Clipped Objective (PPO-CLIP) approach. This algorithm fills a notable gap in the application of DRL to complex ICMOPs within the field. Initially, the multi-objective problem is decomposed into several single-objective problems using a uniform weight decomposition method. A meta-model trained via meta-learning enables fine-tuning to adapt solutions for subsidiary problems once the initial training is complete. Additionally, we enhance the PPO-CLIP framework with a novel strategy that integrates probability shifts and Generalized Advantage Estimation (GAE). In the final stage of scheduling plan selection, a technique for identifying interval turning points is employed to choose the optimal plan from the Pareto solution set. The results demonstrate that the method not only secures excellent scheduling solutions in complex environments through its robust generalization capabilities but also shows significant improvements over interval-constrained multi-objective evolutionary algorithms, such as IP-MOEA, ICMOABC, and IMOMA-II, across multiple multi-objective evaluation metrics including hypervolume (HV), runtime, and uncertainty.","PeriodicalId":11646,"journal":{"name":"Electronics","volume":null,"pages":null},"PeriodicalIF":2.6000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electronics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/electronics13173579","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Multiple uncertainties from source–load and energy conversion significantly impact the real-time dispatch of an island integrated energy system (IIES). This paper addresses the day-ahead scheduling problems of IIES under these conditions, aiming to minimize daily economic costs and maximize the output of renewable energies. We introduce an innovative algorithm for Interval Constrained Multi-objective Optimization Problems (ICMOPs), which incorporates meta-learning and an improved Proximal Policy Optimization with Clipped Objective (PPO-CLIP) approach. This algorithm fills a notable gap in the application of DRL to complex ICMOPs within the field. Initially, the multi-objective problem is decomposed into several single-objective problems using a uniform weight decomposition method. A meta-model trained via meta-learning enables fine-tuning to adapt solutions for subsidiary problems once the initial training is complete. Additionally, we enhance the PPO-CLIP framework with a novel strategy that integrates probability shifts and Generalized Advantage Estimation (GAE). In the final stage of scheduling plan selection, a technique for identifying interval turning points is employed to choose the optimal plan from the Pareto solution set. The results demonstrate that the method not only secures excellent scheduling solutions in complex environments through its robust generalization capabilities but also shows significant improvements over interval-constrained multi-objective evolutionary algorithms, such as IP-MOEA, ICMOABC, and IMOMA-II, across multiple multi-objective evaluation metrics including hypervolume (HV), runtime, and uncertainty.
基于元学习和增强型近端策略优化的岛屿集成能源系统区间约束多目标优化调度方法
源-负载和能量转换的多重不确定性对岛屿综合能源系统(IIES)的实时调度产生了重大影响。本文探讨了这些条件下岛屿综合能源系统的日前调度问题,旨在最小化每日经济成本和最大化可再生能源产出。我们针对区间约束多目标优化问题(ICMOPs)引入了一种创新算法,该算法结合了元学习和改进的 "削目标近端策略优化"(PPO-CLIP)方法。该算法填补了 DRL 在复杂 ICMOP 领域应用的空白。首先,使用统一权重分解法将多目标问题分解为多个单目标问题。通过元学习训练的元模型可以在初始训练完成后进行微调,以调整附属问题的解决方案。此外,我们还利用一种整合了概率转移和广义优势估计(GAE)的新策略来增强 PPO-CLIP 框架。在调度计划选择的最后阶段,我们采用了一种识别区间转折点的技术,从帕累托解集中选择最优计划。结果表明,该方法不仅能通过其强大的泛化能力在复杂环境中确保获得出色的调度解决方案,而且在包括超体积(HV)、运行时间和不确定性在内的多个多目标评价指标方面,与区间约束多目标进化算法(如 IP-MOEA、ICMOABC 和 IMOMA-II)相比也有显著改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Electronics
Electronics Computer Science-Computer Networks and Communications
CiteScore
1.10
自引率
10.30%
发文量
3515
审稿时长
16.71 days
期刊介绍: Electronics (ISSN 2079-9292; CODEN: ELECGJ) is an international, open access journal on the science of electronics and its applications published quarterly online by MDPI.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信