Shuxiang Li , Yongsheng Pang , Zhaorong Huang , Xianghua Chu
{"title":"An offline-online learning framework combining meta-learning and reinforcement learning for evolutionary multi-objective optimization","authors":"Shuxiang Li , Yongsheng Pang , Zhaorong Huang , Xianghua Chu","doi":"10.1016/j.swevo.2025.102037","DOIUrl":null,"url":null,"abstract":"<div><div>Many multi-objective evolutionary algorithms (MOEAs) have been proposed in addressing the multi-objective optimization problems (MOPs). However, the performance of MOEAs varies significantly across various MOPs and there is no single MOEA that performs well on all MOP instances. In addition, existing methods for adaptive MOEA selection still face limitations, which restrict the further optimization for MOPs. To fill these gaps and improve the efficiency of solving MOPs, this study proposes an offline-online learning framework combining meta-learning and reinforcement learning (O<sup>2</sup>-MRL). Instead of proposing a new MOEA or optimizing a strategy, O<sup>2</sup>-MRL solves MOPs by taking full advantage of the existing MOEAs and addresses the limitations of existing MOEA selection methods. O<sup>2</sup>-MRL can adaptively select the appropriate MOEAs for various types of MOPs with different dimensions (Offline) and automatically schedule the selected MOEAs during the optimization process (Online), offering a new idea for optimizing MOPs. To evaluate the performance of the proposed O<sup>2</sup>-MRL, forty-seven benchmark MOPs are used as instances, and nine representative MOEAs are selected for comparison. Comprehensive experiments demonstrate the significant efficiency of O<sup>2</sup>-MRL, as it achieves optimal solutions in 60.28 % of the MOPs across different dimensions and improves the optimization results in 48.23 % of them, with an average improvement of 8.72 %. In addition to maintaining high optimization performance, O<sup>2</sup>-MRL also demonstrates superior convergence speed and stability across various types of MOPs. Two real-world MOPs are employed to evaluate the practicality of O<sup>2</sup>-MRL, and the experimental results indicate that it achieves optimal solutions in both cases.</div></div>","PeriodicalId":48682,"journal":{"name":"Swarm and Evolutionary Computation","volume":"97 ","pages":"Article 102037"},"PeriodicalIF":8.5000,"publicationDate":"2025-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swarm and Evolutionary Computation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210650225001956","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Many multi-objective evolutionary algorithms (MOEAs) have been proposed in addressing the multi-objective optimization problems (MOPs). However, the performance of MOEAs varies significantly across various MOPs and there is no single MOEA that performs well on all MOP instances. In addition, existing methods for adaptive MOEA selection still face limitations, which restrict the further optimization for MOPs. To fill these gaps and improve the efficiency of solving MOPs, this study proposes an offline-online learning framework combining meta-learning and reinforcement learning (O2-MRL). Instead of proposing a new MOEA or optimizing a strategy, O2-MRL solves MOPs by taking full advantage of the existing MOEAs and addresses the limitations of existing MOEA selection methods. O2-MRL can adaptively select the appropriate MOEAs for various types of MOPs with different dimensions (Offline) and automatically schedule the selected MOEAs during the optimization process (Online), offering a new idea for optimizing MOPs. To evaluate the performance of the proposed O2-MRL, forty-seven benchmark MOPs are used as instances, and nine representative MOEAs are selected for comparison. Comprehensive experiments demonstrate the significant efficiency of O2-MRL, as it achieves optimal solutions in 60.28 % of the MOPs across different dimensions and improves the optimization results in 48.23 % of them, with an average improvement of 8.72 %. In addition to maintaining high optimization performance, O2-MRL also demonstrates superior convergence speed and stability across various types of MOPs. Two real-world MOPs are employed to evaluate the practicality of O2-MRL, and the experimental results indicate that it achieves optimal solutions in both cases.
期刊介绍:
Swarm and Evolutionary Computation is a pioneering peer-reviewed journal focused on the latest research and advancements in nature-inspired intelligent computation using swarm and evolutionary algorithms. It covers theoretical, experimental, and practical aspects of these paradigms and their hybrids, promoting interdisciplinary research. The journal prioritizes the publication of high-quality, original articles that push the boundaries of evolutionary computation and swarm intelligence. Additionally, it welcomes survey papers on current topics and novel applications. Topics of interest include but are not limited to: Genetic Algorithms, and Genetic Programming, Evolution Strategies, and Evolutionary Programming, Differential Evolution, Artificial Immune Systems, Particle Swarms, Ant Colony, Bacterial Foraging, Artificial Bees, Fireflies Algorithm, Harmony Search, Artificial Life, Digital Organisms, Estimation of Distribution Algorithms, Stochastic Diffusion Search, Quantum Computing, Nano Computing, Membrane Computing, Human-centric Computing, Hybridization of Algorithms, Memetic Computing, Autonomic Computing, Self-organizing systems, Combinatorial, Discrete, Binary, Constrained, Multi-objective, Multi-modal, Dynamic, and Large-scale Optimization.