Buyers Collusion in Incentivized Forwarding Networks: A Multi-Agent Reinforcement Learning Study

IEEE Transactions on Machine Learning in Communications and Networking. Pub Date: 2024-02-12. DOI: 10.1109/TMLCN.2024.3365420

Mostafa Ibrahim; Sabit Ekin; Ali Imran

Volume 2, pages 240-260. Full text: https://ieeexplore.ieee.org/document/10433203/

Citations: 0

Abstract

We present the issue of monetarily incentivized forwarding in a multi-hop mesh network architecture from an economic perspective. Credit-incentivized forwarding and relaying is anticipated to be a simple method of exchanging transmission power and spectrum for connectivity. However, as in any other free market, gateways and forwarding nodes may create an oligopolistic market for the users they serve. In this study, a coalition scheme between buyers aims to counter price control by gateways or by nodes closer to gateways. In a Stackelberg competition game, buyer agents (users) and seller agents (gateways) use reinforcement learning (RL), with decentralized Deep Q-Networks, to decide how to buy and sell forwarding resources. We allow communication links between the buyers with a limited messaging space, without defining a collusion mechanism. The idea is to demonstrate that, through messaging and RL, tacit collusion can emerge between agents in a decentralized setup. The multi-agent reinforcement learning (MARL) system is presented and analyzed from a machine-learning perspective. Moreover, MARL dynamics are discussed via mean-field analysis to better understand the causes of divergence and to make implementation recommendations for such systems. Finally, simulation results show the effects of coordination among the users.
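To make the leader-follower structure in the abstract concrete, the following is a minimal toy sketch and not the paper's method. It replaces the decentralized Deep Q-Networks with an exhaustive search over a small price grid, and replaces learned messaging with a hand-set price cap that all buyers agree to enforce. The valuations, the price grid, and every function name here are hypothetical, chosen only for illustration.

```python
from typing import Callable, Sequence


def independent_buyers(valuations: Sequence[int]) -> Callable[[int], int]:
    """Followers act alone: a buyer purchases whenever price <= its valuation."""
    def demand(price: int) -> int:
        return sum(1 for v in valuations if price <= v)
    return demand


def colluding_buyers(valuations: Sequence[int], price_cap: int) -> Callable[[int], int]:
    """Followers coordinate (assumed rule): everyone boycotts any price above the cap."""
    def demand(price: int) -> int:
        if price > price_cap:
            return 0  # joint boycott, even by buyers who could afford the price
        return sum(1 for v in valuations if price <= v)
    return demand


def leader_best_price(prices: Sequence[int], demand: Callable[[int], int]) -> int:
    """Stackelberg leader: anticipate the followers' response and maximize revenue."""
    return max(prices, key=lambda p: p * demand(p))


def buyer_surplus(valuations: Sequence[int], price: int) -> int:
    """Total surplus of the buyers who purchase at the given price."""
    return sum(v - price for v in valuations if price <= v)


# Hypothetical toy instance: four buyers, integer prices 1..10.
valuations = [4, 6, 8, 10]
prices = list(range(1, 11))

p_free = leader_best_price(prices, independent_buyers(valuations))
p_cap = leader_best_price(prices, colluding_buyers(valuations, price_cap=4))

print("price without coordination:", p_free, "surplus:", buyer_surplus(valuations, p_free))
print("price with coordinated cap:", p_cap, "surplus:", buyer_surplus(valuations, p_cap))
```

In this toy instance the seller's revenue-maximizing price drops once the buyers commit to a common cap, and total buyer surplus rises. In the paper's setting the agents are not given such a rule; the abstract states that coordination is expected to emerge through messaging and RL.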

