Deep Reinforcement Learning Based Transmission Scheduling for Sensing Aware Control

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Tiankai Jin;Cailian Chen;Yehan Ma;Xinping Guan
{"title":"Deep Reinforcement Learning Based Transmission Scheduling for Sensing Aware Control","authors":"Tiankai Jin;Cailian Chen;Yehan Ma;Xinping Guan","doi":"10.1109/TASE.2025.3530409","DOIUrl":null,"url":null,"abstract":"Massive field data is wirelessly transmitted to the edge side to facilitate sensing and control in the emerging Industrial Internet of Things (IIoT) systems. Under the expanding transmission scheduling space and dynamic network conditions, balancing control performance and limited transmission resources is a fundamental challenge. For this problem, we propose a novel deep reinforcement learning (DRL)-based transmission scheduling method (DTSM), where sensing performance guarantee is introduced for its criticality in ensuring complete system observation and effective control. Specifically, taking system observability as the key metric, the time slots for multi-sensor data transmission under different control demands are properly reserved with theoretically guaranteed performance. Then, the primal-dual DRL framework is adopted to further improve the overall performance of system control and resource utilization by dynamically scheduling the transmission number of each sensor. The scheduling is based on the real-time states of sensing and wireless network, and the action space is determined according to our reserved time slots. Besides, after primal-dual updates, the scheduling results can satisfy the estimation error-evaluated constraint imposed for the ultimate control effect. Finally, the proposed method is applied to the industrial laminar cooling process and its effectiveness is fully demonstrated. Note to Practitioners—This paper is motivated by the requirement of balancing control performance and scarce transmission resources in industrial automation fields such as steel manufacturing, where massive sensor data is transmitted to the edge side through wireless networks. The expanding transmission scheduling space and dynamic network conditions have led to increased interest in advanced deep reinforcement learning (DRL) methods. However, few previous works have explored the impact of control demands on intelligent transmission scheduling design. For these issues, we propose a novel DRL-based transmission scheduling method (DTSM), where the time slots for multi-sensor data transmission are delicately reserved according to different control demands and dynamic scheduling is realized based on real-time states of sensing and wireless network. The overall performance of system control and resource utilization is improved, and practitioners can easily adjust method parameters to achieve the desired balance between the two aspects according to practical demands. Case studies in the industrial hot rolling process demonstrate the superiority of DTSM. Our future work will consider the joint scheduling of uplink-downlink transmissions and design the collaboration among multiple edge computing nodes (ECNs) to address the limitations of centralized learning methods. Besides, the proposed method can be extended to other industrial applications such as flight control system testing.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"10905-10919"},"PeriodicalIF":6.4000,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10843757/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Massive field data is wirelessly transmitted to the edge side to facilitate sensing and control in the emerging Industrial Internet of Things (IIoT) systems. Under the expanding transmission scheduling space and dynamic network conditions, balancing control performance and limited transmission resources is a fundamental challenge. For this problem, we propose a novel deep reinforcement learning (DRL)-based transmission scheduling method (DTSM), where sensing performance guarantee is introduced for its criticality in ensuring complete system observation and effective control. Specifically, taking system observability as the key metric, the time slots for multi-sensor data transmission under different control demands are properly reserved with theoretically guaranteed performance. Then, the primal-dual DRL framework is adopted to further improve the overall performance of system control and resource utilization by dynamically scheduling the transmission number of each sensor. The scheduling is based on the real-time states of sensing and wireless network, and the action space is determined according to our reserved time slots. Besides, after primal-dual updates, the scheduling results can satisfy the estimation error-evaluated constraint imposed for the ultimate control effect. Finally, the proposed method is applied to the industrial laminar cooling process and its effectiveness is fully demonstrated. Note to Practitioners—This paper is motivated by the requirement of balancing control performance and scarce transmission resources in industrial automation fields such as steel manufacturing, where massive sensor data is transmitted to the edge side through wireless networks. The expanding transmission scheduling space and dynamic network conditions have led to increased interest in advanced deep reinforcement learning (DRL) methods. However, few previous works have explored the impact of control demands on intelligent transmission scheduling design. For these issues, we propose a novel DRL-based transmission scheduling method (DTSM), where the time slots for multi-sensor data transmission are delicately reserved according to different control demands and dynamic scheduling is realized based on real-time states of sensing and wireless network. The overall performance of system control and resource utilization is improved, and practitioners can easily adjust method parameters to achieve the desired balance between the two aspects according to practical demands. Case studies in the industrial hot rolling process demonstrate the superiority of DTSM. Our future work will consider the joint scheduling of uplink-downlink transmissions and design the collaboration among multiple edge computing nodes (ECNs) to address the limitations of centralized learning methods. Besides, the proposed method can be extended to other industrial applications such as flight control system testing.
基于深度强化学习的传感感知控制传输调度
大量现场数据无线传输到边缘侧,以促进新兴工业物联网(IIoT)系统中的传感和控制。在不断扩大的传输调度空间和动态的网络条件下,如何平衡控制性能和有限的传输资源是一个根本性的挑战。针对这一问题,我们提出了一种新的基于深度强化学习(DRL)的传输调度方法(DTSM),其中引入了感知性能保证,因为它对于保证系统的完全观察和有效控制至关重要。具体而言,以系统可观测性为关键指标,合理预留了不同控制需求下多传感器数据传输的时隙,理论上保证了性能。然后,采用原始-对偶DRL框架,通过动态调度各传感器的传输次数,进一步提高系统的整体控制性能和资源利用率。调度基于传感和无线网络的实时状态,行动空间根据我们预留的时隙确定。此外,经过原始对偶更新后的调度结果能够满足为最终控制效果而施加的估计误差评估约束。最后,将该方法应用于工业层流冷却过程,充分证明了其有效性。从业人员注意-本文的动机是在钢铁制造等工业自动化领域中,需要平衡控制性能和稀缺的传输资源,在这些领域中,大量传感器数据通过无线网络传输到边缘侧。不断扩大的传输调度空间和动态的网络条件使得人们对高级深度强化学习(DRL)方法越来越感兴趣。然而,以往的研究很少探讨控制需求对智能输电调度设计的影响。针对这些问题,我们提出了一种新的基于drl的传输调度方法(DTSM),该方法根据不同的控制需求精细地保留多传感器数据传输的时隙,并根据传感和无线网络的实时状态实现动态调度。提高了系统控制和资源利用的整体性能,从业者可以根据实际需求轻松调整方法参数,达到两方面的理想平衡。工业热轧过程的实例研究表明了DTSM的优越性。我们未来的工作将考虑上行-下行传输的联合调度,并设计多个边缘计算节点(ecn)之间的协作,以解决集中式学习方法的局限性。此外,该方法还可以推广到飞行控制系统测试等其他工业应用中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信