{"title":"Hybrid quantum-enhanced reinforcement learning for energy-efficient resource allocation in fog-edge computing","authors":"S. Sureka Nithila Princy, Paulraj Ranjith kumar","doi":"10.1007/s10878-025-01336-w","DOIUrl":null,"url":null,"abstract":"<p>The proliferation of Internet of Things (IoT) devices has intensified the need for intelligent, adaptive, and energy-efficient resource management across mobile edge–fog–cloud infrastructures. Conventional optimization approaches often fail to manage the dynamic interplay among fluctuating workloads, energy constraints, and real-time scheduling. To address this, a Hybrid Quantum-Enhanced Reinforcement Learning (HQERL) framework is introduced, unifying quantum-inspired heuristics, swarm intelligence, and reinforcement learning into a co-adaptive sched uling system. HQERL employs a feedback-driven architecture to synchronize exploration, optimization, and policy refinement for enhanced task scheduling and resource control. The Maximum Likelihood Swarm Whale Optimization (MLSWO) module encodes dynamic task and system states using swarm intelligence guided by statistical likelihood, generating information-rich inputs for the learning controller. To prevent premature convergence and expand the scheduling search space, the Quantum Brainstorm Optimization (QBO) component incorporates probabilistic memory and collective learning to diversify scheduling solutions. These enhanced representations and exploratory strategies feed into the Proximal Policy Optimization (PPO) controller, which dynamically adapts resource allocation policies in real time based on system feedback, ensuring resilience to workload shifts. Furthermore, Dynamic Voltage Scaling (DVS) is integrated to improve energy efficiency by adjusting processor voltages and frequencies according to workload demands. This seamless coordination enables HQERL to balance task latency, resource use, and power consumption. Evaluation on the LSApp dataset reveals HQERL yields a 15% energy efficiency gain, 12% makespan reduction, and a 23.3% boost in peak system utility, validating its effectiveness for sustainable IoT resource management.</p>","PeriodicalId":50231,"journal":{"name":"Journal of Combinatorial Optimization","volume":"13 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2025-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Combinatorial Optimization","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10878-025-01336-w","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Citations: 0
Abstract
The proliferation of Internet of Things (IoT) devices has intensified the need for intelligent, adaptive, and energy-efficient resource management across mobile edge–fog–cloud infrastructures. Conventional optimization approaches often fail to manage the dynamic interplay among fluctuating workloads, energy constraints, and real-time scheduling. To address this, a Hybrid Quantum-Enhanced Reinforcement Learning (HQERL) framework is introduced, unifying quantum-inspired heuristics, swarm intelligence, and reinforcement learning into a co-adaptive scheduling system. HQERL employs a feedback-driven architecture to synchronize exploration, optimization, and policy refinement for enhanced task scheduling and resource control. The Maximum Likelihood Swarm Whale Optimization (MLSWO) module encodes dynamic task and system states using swarm intelligence guided by statistical likelihood, generating information-rich inputs for the learning controller. To prevent premature convergence and expand the scheduling search space, the Quantum Brainstorm Optimization (QBO) component incorporates probabilistic memory and collective learning to diversify scheduling solutions. These enhanced representations and exploratory strategies feed into the Proximal Policy Optimization (PPO) controller, which adapts resource allocation policies in real time based on system feedback, ensuring resilience to workload shifts. Furthermore, Dynamic Voltage Scaling (DVS) is integrated to improve energy efficiency by adjusting processor voltages and frequencies according to workload demands. This coordination enables HQERL to balance task latency, resource use, and power consumption. Evaluation on the LSApp dataset reveals that HQERL yields a 15% energy-efficiency gain, a 12% makespan reduction, and a 23.3% boost in peak system utility, validating its effectiveness for sustainable IoT resource management.
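The abstract describes how the PPO controller and DVS cooperate: the controller selects scheduling actions while DVS adjusts processor voltage and frequency, so latency and energy are traded off through a single feedback signal. The minimal Python sketch below illustrates that idea under our own assumptions (the FogNode class, the discrete DVS operating points, the giga-cycle workload, and the 0.6/0.4 cost weights are hypothetical, not taken from the paper); it uses the standard dynamic-power approximation P ≈ C·V²·f and shows the kind of reward a PPO-style learner could maximize.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Illustrative sketch only: the paper's internals are not given in the abstract,
# so the names, constants, and weights below are assumptions.

@dataclass
class FogNode:
    capacitance: float                     # effective switched capacitance (arbitrary units)
    dvs_levels: List[Tuple[float, float]]  # (voltage, frequency in GHz) pairs supported by DVS

    def power(self, level: int) -> float:
        v, f = self.dvs_levels[level]
        return self.capacitance * v * v * f    # dynamic power: C * V^2 * f

    def exec_time(self, gcycles: float, level: int) -> float:
        _, f = self.dvs_levels[level]
        return gcycles / f                     # latency falls as frequency rises


def reward(node: FogNode, task_gcycles: float, level: int,
           w_latency: float = 0.6, w_energy: float = 0.4) -> float:
    """Scalar reward trading task latency against energy for one scheduling decision.

    A PPO controller (as in the abstract) would choose the target node and the
    DVS operating point; only the feedback signal it would receive is shown here.
    """
    t = node.exec_time(task_gcycles, level)
    e = node.power(level) * t                  # energy = power * time
    return -(w_latency * t + w_energy * e)     # higher reward = lower weighted cost


if __name__ == "__main__":
    node = FogNode(capacitance=1.0,
                   dvs_levels=[(0.8, 1.0), (1.0, 1.5), (1.2, 2.0)])
    task = 3.0                                 # workload in giga-cycles (assumed)
    best = max(range(len(node.dvs_levels)), key=lambda lv: reward(node, task, lv))
    print(f"best DVS level: {best}, reward: {reward(node, task, best):.3f}")
```

With these toy numbers the middle DVS level wins: raising the frequency cuts latency but the V² term inflates energy, so the negative weighted cost captures the latency/energy tension the paper's controller is said to manage.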
About the Journal:
The objective of the Journal of Combinatorial Optimization is to advance and promote the theory and applications of combinatorial optimization, an area of research at the intersection of applied mathematics, computer science, and operations research that overlaps with many other areas such as computational complexity, computational biology, VLSI design, communication networks, and management science. It includes complexity analysis and algorithm design for combinatorial optimization problems, numerical experiments, and problem discovery with applications in science and engineering.
The Journal of Combinatorial Optimization publishes refereed papers dealing with all theoretical, computational and applied aspects of combinatorial optimization. It also publishes reviews of appropriate books and special issues of journals.