{"title":"Multi-Agent Reinforcement Learning Reward Engineering via Stochastic Game Evaluation","authors":"A. Kattepur","doi":"10.1109/SNPD54884.2022.10051783","DOIUrl":null,"url":null,"abstract":"With the proliferation of Reinforcement Learning (RL) algorithms across multiple applications, the design of appropriate reward mechanisms that elicit desired behaviours becomes crucial. Reward setting is made more difficult in the multi-agent case where cooperative, competitive or mixed interactions between agents may lead to differing outcomes. In this paper, we formulate the reward engineering of multi-agent reinforcement learning approaches via game theoretic models. This approach is used to analyze the overall team reward when choosing one game theoretic structure over another. An empirical analysis is provided over game theoretic simulators that demonstrate co-operative game rewards improve the rewards by upto 30%. This formulation may be applied to a variety of use cases within logistics, transport and telecommunications domains that employ multi-agent techniques.","PeriodicalId":425462,"journal":{"name":"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","volume":"117 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNPD54884.2022.10051783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the proliferation of Reinforcement Learning (RL) algorithms across multiple applications, the design of appropriate reward mechanisms that elicit desired behaviours becomes crucial. Reward setting is made more difficult in the multi-agent case where cooperative, competitive or mixed interactions between agents may lead to differing outcomes. In this paper, we formulate the reward engineering of multi-agent reinforcement learning approaches via game theoretic models. This approach is used to analyze the overall team reward when choosing one game theoretic structure over another. An empirical analysis is provided over game theoretic simulators that demonstrate co-operative game rewards improve the rewards by upto 30%. This formulation may be applied to a variety of use cases within logistics, transport and telecommunications domains that employ multi-agent techniques.