主动飞行- ris辅助移动边缘计算网络的能效优化：一种深度强化学习方法

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Internet of Things Journal Pub Date : 2025-03-31 DOI:10.1109/JIOT.2025.3556246

Wei Chen;Yulong Zou;Jia Zhu;Liangsen Zhai

{"title":"主动飞行- ris辅助移动边缘计算网络的能效优化：一种深度强化学习方法","authors":"Wei Chen;Yulong Zou;Jia Zhu;Liangsen Zhai","doi":"10.1109/JIOT.2025.3556246","DOIUrl":null,"url":null,"abstract":"This article explores the energy efficiency (EE) of a mobile-edge computing (MEC) architecture for Internet of Things (IoT) networks, in which multiple IoT devices (IoTDs) perform local computations while further offloading part of their tasks to the base station (BS)-enabled MEC server utilizing the time division multiple access (TDMA) protocol. To address the radio propagation issues caused by obstructions, we utilize an uncrewed aerial vehicle (UAV)-mounted active reconfigurable intelligent surface (RIS), referred to as a flying-RIS (FRIS), to reflect and even amplify the incident signal from IoTDs to the BS. Since tasks can be executed in parallel on both IoTDs and the BS, the assignment of service timeslots, the UAV trajectory, and the resource allocation for both IoTDs and FRIS should be jointly optimized to maximize system EE. To this end, we reformulate the optimization problem as a markov decision process (MDP) and introduce a deep reinforcement learning (DRL) approach for addressing the formulated problem, called the proximal policy optimization (PPO) based resource allocation with trajectory design and FRIS reflection matrix optimization (PPO-RATDFRO) algorithm. By continuously interacting within the constructed environment, the proposed system iteratively refines its policy to determine the FRIS trajectory and reflection matrix, along with the service timeslots and computation resources for each IoTD. Finally, simulation results demonstrate that the proposed PPO-RATDFRO algorithm significantly enhances EE for all served IoTDs, compared to various benchmark algorithms.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 13","pages":"23563-23576"},"PeriodicalIF":8.9000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Energy-Efficiency Optimization of Active Flying-RIS-Assisted Mobile-Edge Computing Networks: A Deep-Reinforcement-Learning Approach\",\"authors\":\"Wei Chen;Yulong Zou;Jia Zhu;Liangsen Zhai\",\"doi\":\"10.1109/JIOT.2025.3556246\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article explores the energy efficiency (EE) of a mobile-edge computing (MEC) architecture for Internet of Things (IoT) networks, in which multiple IoT devices (IoTDs) perform local computations while further offloading part of their tasks to the base station (BS)-enabled MEC server utilizing the time division multiple access (TDMA) protocol. To address the radio propagation issues caused by obstructions, we utilize an uncrewed aerial vehicle (UAV)-mounted active reconfigurable intelligent surface (RIS), referred to as a flying-RIS (FRIS), to reflect and even amplify the incident signal from IoTDs to the BS. Since tasks can be executed in parallel on both IoTDs and the BS, the assignment of service timeslots, the UAV trajectory, and the resource allocation for both IoTDs and FRIS should be jointly optimized to maximize system EE. To this end, we reformulate the optimization problem as a markov decision process (MDP) and introduce a deep reinforcement learning (DRL) approach for addressing the formulated problem, called the proximal policy optimization (PPO) based resource allocation with trajectory design and FRIS reflection matrix optimization (PPO-RATDFRO) algorithm. By continuously interacting within the constructed environment, the proposed system iteratively refines its policy to determine the FRIS trajectory and reflection matrix, along with the service timeslots and computation resources for each IoTD. Finally, simulation results demonstrate that the proposed PPO-RATDFRO algorithm significantly enhances EE for all served IoTDs, compared to various benchmark algorithms.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 13\",\"pages\":\"23563-23576\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10945770/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10945770/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

本文探讨了用于物联网（IoT）网络的移动边缘计算（MEC）架构的能效（EE），其中多个物联网设备（iotd）执行本地计算，同时利用时分多址（TDMA）协议进一步将部分任务卸载到支持基站（BS）的MEC服务器。为了解决由障碍物引起的无线电传播问题，我们使用了一种无人驾驶飞行器（UAV）安装的主动可重构智能表面（RIS），称为飞行RIS (FRIS)，以反射甚至放大从物联网td到BS的事件信号。由于任务可以在iotd和BS上并行执行，因此需要联合优化iotd和FRIS的服务时点分配、无人机轨迹以及资源分配，以最大化系统EE。为此，我们将优化问题重新表述为马尔可夫决策过程（MDP），并引入了一种深度强化学习（DRL）方法来解决所制定的问题，称为基于近端策略优化（PPO）的资源分配，具有轨迹设计和FRIS反射矩阵优化（PPO- ratdfro）算法。通过在构建的环境中不断地相互作用，所提出的系统迭代地改进其策略，以确定FRIS轨迹和反射矩阵，以及每个IoTD的服务时隙和计算资源。最后，仿真结果表明，与各种基准算法相比，所提出的PPO-RATDFRO算法显著提高了所有服务的iotd的EE。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Energy-Efficiency Optimization of Active Flying-RIS-Assisted Mobile-Edge Computing Networks: A Deep-Reinforcement-Learning Approach

This article explores the energy efficiency (EE) of a mobile-edge computing (MEC) architecture for Internet of Things (IoT) networks, in which multiple IoT devices (IoTDs) perform local computations while further offloading part of their tasks to the base station (BS)-enabled MEC server utilizing the time division multiple access (TDMA) protocol. To address the radio propagation issues caused by obstructions, we utilize an uncrewed aerial vehicle (UAV)-mounted active reconfigurable intelligent surface (RIS), referred to as a flying-RIS (FRIS), to reflect and even amplify the incident signal from IoTDs to the BS. Since tasks can be executed in parallel on both IoTDs and the BS, the assignment of service timeslots, the UAV trajectory, and the resource allocation for both IoTDs and FRIS should be jointly optimized to maximize system EE. To this end, we reformulate the optimization problem as a markov decision process (MDP) and introduce a deep reinforcement learning (DRL) approach for addressing the formulated problem, called the proximal policy optimization (PPO) based resource allocation with trajectory design and FRIS reflection matrix optimization (PPO-RATDFRO) algorithm. By continuously interacting within the constructed environment, the proposed system iteratively refines its policy to determine the FRIS trajectory and reflection matrix, along with the service timeslots and computation resources for each IoTD. Finally, simulation results demonstrate that the proposed PPO-RATDFRO algorithm significantly enhances EE for all served IoTDs, compared to various benchmark algorithms.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.