{"title":"具有反馈延迟和通信延迟的聚合博弈的次线性动态遗憾","authors":"Pin Liu;Letian Wang;Yue Chen","doi":"10.1109/TCNS.2024.3432250","DOIUrl":null,"url":null,"abstract":"In this article, we investigate aggregative game problems in dynamic environments, where the cost functions of players change over time and are revealed gradually after players select their strategies in given strategy sets. Specifically, after selecting strategies, players observe the corresponding cost functions with feedback delays. Moreover, players receive the information of their neighbors with communication delays in weight-unbalanced digraphs. To handle such aggregative game problems, a robust learning algorithm is proposed to track time-varying Nash equilibria. The upper bounds of dynamic regrets are given to illustrate the performance of the proposed algorithm. We quantify the impact of both feedback delays and communication delays on dynamic regrets and give the framework to analyze the convergence of strategies in the case of delays. The proposed algorithm achieves the sublinearity of dynamic regrets in the absence of delays. A numerical example is given to illustrate the efficiency of the proposed learning algorithm.","PeriodicalId":56023,"journal":{"name":"IEEE Transactions on Control of Network Systems","volume":"12 1","pages":"1071-1079"},"PeriodicalIF":4.0000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sublinear Dynamic Regrets for Aggregative Games With Feedback Delays and Communication Delays\",\"authors\":\"Pin Liu;Letian Wang;Yue Chen\",\"doi\":\"10.1109/TCNS.2024.3432250\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this article, we investigate aggregative game problems in dynamic environments, where the cost functions of players change over time and are revealed gradually after players select their strategies in given strategy sets. Specifically, after selecting strategies, players observe the corresponding cost functions with feedback delays. Moreover, players receive the information of their neighbors with communication delays in weight-unbalanced digraphs. To handle such aggregative game problems, a robust learning algorithm is proposed to track time-varying Nash equilibria. The upper bounds of dynamic regrets are given to illustrate the performance of the proposed algorithm. We quantify the impact of both feedback delays and communication delays on dynamic regrets and give the framework to analyze the convergence of strategies in the case of delays. The proposed algorithm achieves the sublinearity of dynamic regrets in the absence of delays. A numerical example is given to illustrate the efficiency of the proposed learning algorithm.\",\"PeriodicalId\":56023,\"journal\":{\"name\":\"IEEE Transactions on Control of Network Systems\",\"volume\":\"12 1\",\"pages\":\"1071-1079\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Control of Network Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10607957/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Control of Network Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10607957/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Sublinear Dynamic Regrets for Aggregative Games With Feedback Delays and Communication Delays
In this article, we investigate aggregative game problems in dynamic environments, where the cost functions of players change over time and are revealed gradually after players select their strategies in given strategy sets. Specifically, after selecting strategies, players observe the corresponding cost functions with feedback delays. Moreover, players receive the information of their neighbors with communication delays in weight-unbalanced digraphs. To handle such aggregative game problems, a robust learning algorithm is proposed to track time-varying Nash equilibria. The upper bounds of dynamic regrets are given to illustrate the performance of the proposed algorithm. We quantify the impact of both feedback delays and communication delays on dynamic regrets and give the framework to analyze the convergence of strategies in the case of delays. The proposed algorithm achieves the sublinearity of dynamic regrets in the absence of delays. A numerical example is given to illustrate the efficiency of the proposed learning algorithm.
期刊介绍:
The IEEE Transactions on Control of Network Systems is committed to the timely publication of high-impact papers at the intersection of control systems and network science. In particular, the journal addresses research on the analysis, design and implementation of networked control systems, as well as control over networks. Relevant work includes the full spectrum from basic research on control systems to the design of engineering solutions for automatic control of, and over, networks. The topics covered by this journal include: Coordinated control and estimation over networks, Control and computation over sensor networks, Control under communication constraints, Control and performance analysis issues that arise in the dynamics of networks used in application areas such as communications, computers, transportation, manufacturing, Web ranking and aggregation, social networks, biology, power systems, economics, Synchronization of activities across a controlled network, Stability analysis of controlled networks, Analysis of networks as hybrid dynamical systems.