{"title":"Enhanced Particle Swarm Optimization via Reinforcement Learning","authors":"Di Wu, G. Wang","doi":"10.1115/detc2020-22519","DOIUrl":null,"url":null,"abstract":"\n Particle swarm optimization (PSO) method is a well-known optimization algorithm, which shows good performance in solving different optimization problems. However, PSO usually suffers from slow convergence. In this paper, a reinforcement learning method is used to enhance PSO in convergence by replacing the uniformly distributed random number in the updating function by a random number generated from a well-selected normal distribution. The mean and variance of the normal distribution are estimated from the current state of each individual through a policy net. The historic behavior of the swarm group is learned to update the policy net and guide the selection of parameters of the normal distribution. The proposed algorithm is tested with numerical test functions and the results show that the convergence rate of PSO can be improved with the proposed Reinforcement Learning method (RL-PSO).","PeriodicalId":415040,"journal":{"name":"Volume 11A: 46th Design Automation Conference (DAC)","volume":"136 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 11A: 46th Design Automation Conference (DAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2020-22519","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Particle swarm optimization (PSO) method is a well-known optimization algorithm, which shows good performance in solving different optimization problems. However, PSO usually suffers from slow convergence. In this paper, a reinforcement learning method is used to enhance PSO in convergence by replacing the uniformly distributed random number in the updating function by a random number generated from a well-selected normal distribution. The mean and variance of the normal distribution are estimated from the current state of each individual through a policy net. The historic behavior of the swarm group is learned to update the policy net and guide the selection of parameters of the normal distribution. The proposed algorithm is tested with numerical test functions and the results show that the convergence rate of PSO can be improved with the proposed Reinforcement Learning method (RL-PSO).