A Cooperation-Free Resource Allocation Algorithm Enhanced by Reinforcement Learning for Coexisting IIoTs

2023 IEEE 19th International Conference on Factory Communication Systems (WFCS) Pub Date : 2023-04-26 DOI:10.1109/WFCS57264.2023.10144246

Jialin Zhang, W. Liang, Bo Yang, Huaguang Shi, Qi Wang, Zhibo Pang

{"title":"A Cooperation-Free Resource Allocation Algorithm Enhanced by Reinforcement Learning for Coexisting IIoTs","authors":"Jialin Zhang, W. Liang, Bo Yang, Huaguang Shi, Qi Wang, Zhibo Pang","doi":"10.1109/WFCS57264.2023.10144246","DOIUrl":null,"url":null,"abstract":"The Industrial Internet of Things (IIoTs) plays an important role in various industrial applications, which require multiple time-critical networks to be deployed in the same region. The limited communication resources inevitably incur network coexistence problems. For scenarios where coexisting networks cannot coordinate effectively, the centralized or partial-information-based decentralized resource allocation methods cannot be implemented. To address this concern, we propose a Cooperation-Free Reinforcement Learning (CF-RL) algorithm for the fully distributed resource allocation problem in coexisting IIoT systems. Each network adopts the proposed algorithm to minimize collisions through a trial-and-error approach without any information interaction. To resist the influence of environmental dynamics, each coexisting network learns the state transition probability of the resource block instead of the resource block's position. Moreover, to potentially ensure the overall system performance, each network additionally considers the period offset in the initialization phase and action selection phase, so that the coexisting networks have different preferences for different state transitions. We conduct extensive simulations to verify the convergence performance. Evaluation results show that the CF-RL algorithm almost achieves (more than 99.88%) the effect of centralized resource allocation and has obvious superiorities over other cooperation-free algorithms in terms of the convergence rate, the number of collisions, and the resource utilization ratio.","PeriodicalId":345607,"journal":{"name":"2023 IEEE 19th International Conference on Factory Communication Systems (WFCS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 19th International Conference on Factory Communication Systems (WFCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WFCS57264.2023.10144246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The Industrial Internet of Things (IIoTs) plays an important role in various industrial applications, which require multiple time-critical networks to be deployed in the same region. The limited communication resources inevitably incur network coexistence problems. For scenarios where coexisting networks cannot coordinate effectively, the centralized or partial-information-based decentralized resource allocation methods cannot be implemented. To address this concern, we propose a Cooperation-Free Reinforcement Learning (CF-RL) algorithm for the fully distributed resource allocation problem in coexisting IIoT systems. Each network adopts the proposed algorithm to minimize collisions through a trial-and-error approach without any information interaction. To resist the influence of environmental dynamics, each coexisting network learns the state transition probability of the resource block instead of the resource block's position. Moreover, to potentially ensure the overall system performance, each network additionally considers the period offset in the initialization phase and action selection phase, so that the coexisting networks have different preferences for different state transitions. We conduct extensive simulations to verify the convergence performance. Evaluation results show that the CF-RL algorithm almost achieves (more than 99.88%) the effect of centralized resource allocation and has obvious superiorities over other cooperation-free algorithms in terms of the convergence rate, the number of collisions, and the resource utilization ratio.

查看原文本刊更多论文

基于强化学习的共存物联网无合作资源分配算法

工业物联网(iiot)在各种工业应用中发挥着重要作用，这些应用需要在同一区域部署多个时间关键型网络。有限的通信资源不可避免地产生网络共存问题。对于共存网络无法有效协调的场景，无法实现集中式或部分信息化的分散资源分配方式。为了解决这一问题，我们提出了一种无合作强化学习(CF-RL)算法来解决共存工业物联网系统中完全分布式的资源分配问题。每个网络都采用本文提出的算法，在没有任何信息交互的情况下，通过试错的方法将碰撞最小化。为了抵抗环境动态的影响，每个共存网络学习资源块的状态转移概率，而不是资源块的位置。此外，为了潜在地保证系统的整体性能，每个网络在初始化阶段和动作选择阶段额外考虑周期偏移，使得共存网络对不同的状态转换具有不同的偏好。我们进行了大量的仿真来验证收敛性能。评价结果表明，CF-RL算法几乎达到(99.88%以上)资源集中分配的效果，在收敛速度、碰撞次数、资源利用率等方面都比其他无协作算法有明显的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE 19th International Conference on Factory Communication Systems (WFCS)

自引率

0.00%

发文量