多渠道博弈中的收益控制：影响对手学习进化

IF 9.4 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Cybernetics Pub Date : 2025-01-03 DOI:10.1109/TCYB.2024.3507830

Juan Shi;Chen Chu;Guoxi Fan;Die Hu;Jinzhuo Liu;Zhen Wang;Shuyue Hu

{"title":"多渠道博弈中的收益控制：影响对手学习进化","authors":"Juan Shi;Chen Chu;Guoxi Fan;Die Hu;Jinzhuo Liu;Zhen Wang;Shuyue Hu","doi":"10.1109/TCYB.2024.3507830","DOIUrl":null,"url":null,"abstract":"In this article, we introduce a new theory for payoff control in multichannel learning environments, where agents interact with each other over multiple channels and each channel is a repeated normal form game. We propose two payoff control strategies—partial control and full control—that allow a single agent to set an upper bound to the opponent’s expected payoffs summed across all channels, even if the opponent is a reinforcement learning agent. We prove that a partial (or full) control strategy can be obtained by solving a system of inequalities, and characterize the conditions under which such a partial (or full) control strategy exists. We show that by utilizing these control strategies, the agent can influence the opponent’s learning evolution and direct it toward a desired viable equilibrium. Our experiments confirm the effectiveness of our theory for payoff control in a wide range of multichannel learning environments.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 2","pages":"776-785"},"PeriodicalIF":9.4000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Payoff Control in Multichannel Games: Influencing Opponent Learning Evolution\",\"authors\":\"Juan Shi;Chen Chu;Guoxi Fan;Die Hu;Jinzhuo Liu;Zhen Wang;Shuyue Hu\",\"doi\":\"10.1109/TCYB.2024.3507830\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this article, we introduce a new theory for payoff control in multichannel learning environments, where agents interact with each other over multiple channels and each channel is a repeated normal form game. We propose two payoff control strategies—partial control and full control—that allow a single agent to set an upper bound to the opponent’s expected payoffs summed across all channels, even if the opponent is a reinforcement learning agent. We prove that a partial (or full) control strategy can be obtained by solving a system of inequalities, and characterize the conditions under which such a partial (or full) control strategy exists. We show that by utilizing these control strategies, the agent can influence the opponent’s learning evolution and direct it toward a desired viable equilibrium. Our experiments confirm the effectiveness of our theory for payoff control in a wide range of multichannel learning environments.\",\"PeriodicalId\":13112,\"journal\":{\"name\":\"IEEE Transactions on Cybernetics\",\"volume\":\"55 2\",\"pages\":\"776-785\"},\"PeriodicalIF\":9.4000,\"publicationDate\":\"2025-01-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Cybernetics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10820999/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10820999/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Payoff Control in Multichannel Games: Influencing Opponent Learning Evolution

In this article, we introduce a new theory for payoff control in multichannel learning environments, where agents interact with each other over multiple channels and each channel is a repeated normal form game. We propose two payoff control strategies—partial control and full control—that allow a single agent to set an upper bound to the opponent’s expected payoffs summed across all channels, even if the opponent is a reinforcement learning agent. We prove that a partial (or full) control strategy can be obtained by solving a system of inequalities, and characterize the conditions under which such a partial (or full) control strategy exists. We show that by utilizing these control strategies, the agent can influence the opponent’s learning evolution and direct it toward a desired viable equilibrium. Our experiments confirm the effectiveness of our theory for payoff control in a wide range of multichannel learning environments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Cybernetics COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, CYBERNETICS

CiteScore

25.40

自引率

11.00%

发文量

1869

期刊介绍： The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the transactions welcomes papers on communication and control across machines or machine, human, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.