Adaptive heuristics for scheduling DNN inferencing on edge and cloud for personalized UAV fleets

IF 6.2 2区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

Future Generation Computer Systems-The International Journal of Escience Pub Date : 2025-05-14 DOI:10.1016/j.future.2025.107874

Suman Raj, Radhika Mittal, Harshil Gupta, Yogesh Simmhan

{"title":"Adaptive heuristics for scheduling DNN inferencing on edge and cloud for personalized UAV fleets","authors":"Suman Raj, Radhika Mittal, Harshil Gupta, Yogesh Simmhan","doi":"10.1016/j.future.2025.107874","DOIUrl":null,"url":null,"abstract":"<div><div>Drone fleets with onboard cameras coupled with computer vision and DNN inferencing models can support diverse applications, from package deliveries to disaster monitoring. One such novel domain is for one or more “buddy” drones to assist Visually Impaired People (VIPs) lead an active lifestyle. Video inferencing tasks from such drones can help both navigate the drone and provide situation awareness to the VIP, and hence have strict execution deadlines. These tasks can execute either on an accelerated edge like Nvidia Jetson linked to the drone, or on a cloud INFerencing-as-a-Service (INFaaS). However, making this decision is a challenge given the latency and cost trade-offs across a stream of deadline-sensitive tasks, in the presence of network and/or compute variability. We propose a deadline-driven heuristic, DEMS-A, to schedule diverse DNN tasks generated continuously to perform inferencing over video segments generated by multiple drones linked to an edge, with the option to execute on the cloud. We use strategies like task dropping, work stealing and migration, and dynamic adaptation to cloud variability, to fully utilize the captive edge with intelligent offloading to the cloud, and guarantee a Quality of Service (QoS), <em>i.e.</em> maximize the utility and the number of tasks completed. We also introduce an additional Quality of Experience (QoE) metric useful to the assistive drone domain, which values the frequency of success for task types to ensure the responsiveness and reliability of the VIP application. We extend our DEMS solution to GEMS to solve this. We evaluate these strategies, using (i) an emulated setup of a fleet of over 80 drones supporting over 25 VIPs, with real DNN models executing on pre-recorded drone video streams, using Jetson Nano edges and AWS Lambda cloud functions, and (ii) a real-world setup of a Tello drone and a Jetson Orin Nano edge accelerator executing a subset of the DNN models on live video feeds and generating drone commands to follow a VIP in real-time. The detailed comparative emulation study shows that our strategies have a task completion rate of up to 88%, up to <span><math><mrow><mn>2</mn><mo>.</mo><mn>7</mn><mo>×</mo></mrow></math></span> higher QoS utility compared to the baselines, a further 16% higher QoS utility while adapting to network variability, and up to 75% higher QoE utility. Our practical validation using real drones exhibits task completion of up to 87% for GEMS and 33% higher total utility of GEMS compared to edge-only and achieves the smoothest trajectory with minimum jerk and lowest yaw error.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"173 ","pages":"Article 107874"},"PeriodicalIF":6.2000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25001694","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Drone fleets with onboard cameras coupled with computer vision and DNN inferencing models can support diverse applications, from package deliveries to disaster monitoring. One such novel domain is for one or more “buddy” drones to assist Visually Impaired People (VIPs) lead an active lifestyle. Video inferencing tasks from such drones can help both navigate the drone and provide situation awareness to the VIP, and hence have strict execution deadlines. These tasks can execute either on an accelerated edge like Nvidia Jetson linked to the drone, or on a cloud INFerencing-as-a-Service (INFaaS). However, making this decision is a challenge given the latency and cost trade-offs across a stream of deadline-sensitive tasks, in the presence of network and/or compute variability. We propose a deadline-driven heuristic, DEMS-A, to schedule diverse DNN tasks generated continuously to perform inferencing over video segments generated by multiple drones linked to an edge, with the option to execute on the cloud. We use strategies like task dropping, work stealing and migration, and dynamic adaptation to cloud variability, to fully utilize the captive edge with intelligent offloading to the cloud, and guarantee a Quality of Service (QoS), i.e. maximize the utility and the number of tasks completed. We also introduce an additional Quality of Experience (QoE) metric useful to the assistive drone domain, which values the frequency of success for task types to ensure the responsiveness and reliability of the VIP application. We extend our DEMS solution to GEMS to solve this. We evaluate these strategies, using (i) an emulated setup of a fleet of over 80 drones supporting over 25 VIPs, with real DNN models executing on pre-recorded drone video streams, using Jetson Nano edges and AWS Lambda cloud functions, and (ii) a real-world setup of a Tello drone and a Jetson Orin Nano edge accelerator executing a subset of the DNN models on live video feeds and generating drone commands to follow a VIP in real-time. The detailed comparative emulation study shows that our strategies have a task completion rate of up to 88%, up to

2.7 \times

higher QoS utility compared to the baselines, a further 16% higher QoS utility while adapting to network variability, and up to 75% higher QoE utility. Our practical validation using real drones exhibits task completion of up to 87% for GEMS and 33% higher total utility of GEMS compared to edge-only and achieves the smoothest trajectory with minimum jerk and lowest yaw error.

查看原文本刊更多论文

个性化无人机机群边缘和云上DNN推理调度的自适应启发式算法

配备机载摄像头的无人机，加上计算机视觉和深度神经网络推理模型，可以支持从包裹递送到灾难监测等各种应用。其中一个新颖的领域是让一架或多架“伙伴”无人机帮助视障人士（vip）过上积极的生活。来自此类无人机的视频推理任务可以帮助无人机导航并为VIP提供情况感知，因此有严格的执行期限。这些任务既可以在与无人机相连的Nvidia Jetson等加速边缘设备上执行，也可以在云推理即服务（INFaaS）上执行。然而，考虑到在网络和/或计算可变性的情况下，跨一系列对截止日期敏感的任务的延迟和成本权衡，做出此决策是一项挑战。我们提出了一种截止日期驱动的启发式算法dem - a，用于调度连续生成的各种DNN任务，以对连接到边缘的多架无人机生成的视频片段进行推理，并可选择在云上执行。我们采用任务丢弃、工作窃取和迁移、动态适应云变化等策略，充分利用专属边缘，智能卸载到云，保证服务质量（QoS），即最大限度地提高效用和完成任务的数量。我们还引入了对辅助无人机领域有用的额外体验质量（QoE）度量，该度量评估任务类型的成功频率，以确保VIP应用程序的响应性和可靠性。我们将我们的dem解决方案扩展到GEMS来解决这个问题。我们评估这些策略，使用(i)超过80架无人机支持超过25名VIP的模拟设置，使用预先录制的无人机视频流执行真实的DNN模型，使用Jetson Nano edge和AWS Lambda云功能，以及（ii）真实世界的设置Tello无人机和Jetson Orin Nano edge加速器执行DNN模型的子集在实时视频馈馈线上并生成无人机命令实时跟踪VIP。详细的比较仿真研究表明，与基线相比，我们的策略具有高达88%的任务完成率，高达2.7倍的QoS效用，在适应网络可变性的同时进一步提高16%的QoS效用，以及高达75%的QoE效用。我们使用真实无人机进行的实际验证显示，GEMS的任务完成率高达87%，与边缘相比，GEMS的总效用提高了33%，并且以最小的抖动和最低的偏航误差实现了最平滑的轨迹。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Future Generation Computer Systems-The International Journal of Escience 工程技术-计算机：理论方法

CiteScore

19.90

自引率

2.70%

发文量

376

审稿时长

10.6 months

期刊介绍： Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications. Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration. Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.