{"title":"Decentralized and Partially Decentralized Reinforcement Learning for Distributed Combinatorial Optimization Problems","authors":"Omkar J. Tilak, S. Mukhopadhyay","doi":"10.1109/ICMLA.2010.64","DOIUrl":null,"url":null,"abstract":"In this paper, we describe a framework for solving computationally hard, distributed function optimization problems using reinforcement learning techniques. In particular, we model a function optimization problem as an identical payoff game played by a team of reinforcement learning agents. The team performs a stochastic search through the domain space of the parameters of the function. However, current game learning algorithms suffer from significant memory requirement, significant communication overhead and slow convergence. To alleviate these problems, we present novel decentralized and partially decentralized reinforcement learning algorithms for the team. Simulation results are presented for the NP-Hard sensor subset selection problem to show that the agents learn locally optimal parameter values and illustrate the advantages of the proposed algorithms.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Ninth International Conference on Machine Learning and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2010.64","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In this paper, we describe a framework for solving computationally hard, distributed function optimization problems using reinforcement learning techniques. In particular, we model a function optimization problem as an identical payoff game played by a team of reinforcement learning agents. The team performs a stochastic search through the domain space of the parameters of the function. However, current game learning algorithms suffer from significant memory requirement, significant communication overhead and slow convergence. To alleviate these problems, we present novel decentralized and partially decentralized reinforcement learning algorithms for the team. Simulation results are presented for the NP-Hard sensor subset selection problem to show that the agents learn locally optimal parameter values and illustrate the advantages of the proposed algorithms.