Chuong Van Nguyen, P. Hoang, Hong-Kyong Kim, H. Ahn
{"title":"Distributed learning in a multi-agent potential game","authors":"Chuong Van Nguyen, P. Hoang, Hong-Kyong Kim, H. Ahn","doi":"10.23919/ICCAS.2017.8204449","DOIUrl":null,"url":null,"abstract":"In a non-cooperative dynamic game, each player participating in a changing environment aims to optimize its actions selfishly. In this paper, we focus our analysis on a class of games, namely dynamic potential game in multiagent systems. The problems of the game with constraints and without constraints are both considered; in both cases, we propose algorithms to learn the Nash equilibrium (NE) in a distributed fashion. The idea of NE learning is relied on two-time-scale dynamics and convex optimization. A numerical example is presented to verify the effectiveness of the proposed methods.","PeriodicalId":140598,"journal":{"name":"2017 17th International Conference on Control, Automation and Systems (ICCAS)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 17th International Conference on Control, Automation and Systems (ICCAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ICCAS.2017.8204449","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
In a non-cooperative dynamic game, each player participating in a changing environment aims to optimize its actions selfishly. In this paper, we focus our analysis on a class of games, namely dynamic potential game in multiagent systems. The problems of the game with constraints and without constraints are both considered; in both cases, we propose algorithms to learn the Nash equilibrium (NE) in a distributed fashion. The idea of NE learning is relied on two-time-scale dynamics and convex optimization. A numerical example is presented to verify the effectiveness of the proposed methods.