EFFECT-DNN: Energy-efficient Edge Framework for Real-time DNN Inference

Xiaojie Zhang, Motahare Mounesan, S. Debroy
{"title":"EFFECT-DNN: Energy-efficient Edge Framework for Real-time DNN Inference","authors":"Xiaojie Zhang, Motahare Mounesan, S. Debroy","doi":"10.1109/WoWMoM57956.2023.00015","DOIUrl":null,"url":null,"abstract":"Real-time visual computing applications running Deep Neural Networks (DNN) are becoming popular for mission-critical use cases such as, disaster response, tactical scenarios, and medical triage that require establishing ad-hoc edge environments. However, strict latency deadlines of such applications require real-time processing of pre-trained DNN layers (i.e., DNN inference) involving image/video data which is highly challenging to achieve under such resource- constrained edge environments. In this paper, we address the trade-off between end-to-end latency of DNN inference and IoT devices’ energy consumption by proposing ‘EFFECT-DNN’, an energy efficient edge computing framework. The EFFECT-DNN framework aims to strike such balance by employing a collaborative DNN partitioning and task offloading strategy. Such strategy also involves resource allocation from IoT devices and edge servers to satisfy DNN inference deadline requirement even when the network bandwidth is on the lower end, which is often the case for critical use cases. The underlying optimization is formulated as a dynamic Mixed-Integer Nonlinear Programming (MINLP) problem is decoupled and solved by convex optimization and a game-like heuristic algorithm. We evaluate the performance of EFFECT-DNN framework on a hardware testbed and using extensive simulations with real-world DNN s. The results demonstrate that the proposed framework can ensure DNN inference deadline satisfaction with significant (~ 20-30%) device energy savings.","PeriodicalId":132845,"journal":{"name":"2023 IEEE 24th International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 24th International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WoWMoM57956.2023.00015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Real-time visual computing applications running Deep Neural Networks (DNN) are becoming popular for mission-critical use cases such as, disaster response, tactical scenarios, and medical triage that require establishing ad-hoc edge environments. However, strict latency deadlines of such applications require real-time processing of pre-trained DNN layers (i.e., DNN inference) involving image/video data which is highly challenging to achieve under such resource- constrained edge environments. In this paper, we address the trade-off between end-to-end latency of DNN inference and IoT devices’ energy consumption by proposing ‘EFFECT-DNN’, an energy efficient edge computing framework. The EFFECT-DNN framework aims to strike such balance by employing a collaborative DNN partitioning and task offloading strategy. Such strategy also involves resource allocation from IoT devices and edge servers to satisfy DNN inference deadline requirement even when the network bandwidth is on the lower end, which is often the case for critical use cases. The underlying optimization is formulated as a dynamic Mixed-Integer Nonlinear Programming (MINLP) problem is decoupled and solved by convex optimization and a game-like heuristic algorithm. We evaluate the performance of EFFECT-DNN framework on a hardware testbed and using extensive simulations with real-world DNN s. The results demonstrate that the proposed framework can ensure DNN inference deadline satisfaction with significant (~ 20-30%) device energy savings.
EFFECT-DNN:实时DNN推理的节能边缘框架
运行深度神经网络(DNN)的实时视觉计算应用程序在关键任务用例中越来越受欢迎,例如需要建立临时边缘环境的灾难响应、战术场景和医疗分类。然而,此类应用的严格延迟期限要求实时处理涉及图像/视频数据的预训练DNN层(即DNN推理),这在资源受限的边缘环境下是极具挑战性的。在本文中,我们通过提出“EFFECT-DNN”,一种节能的边缘计算框架,解决了DNN推理的端到端延迟和物联网设备能耗之间的权衡。EFFECT-DNN框架旨在通过采用协作DNN分区和任务卸载策略来实现这种平衡。这种策略还涉及到从物联网设备和边缘服务器分配资源,以满足DNN推理截止日期要求,即使网络带宽处于低端,这通常是关键用例的情况。底层优化被表述为一个动态混合整数非线性规划(MINLP)问题,通过凸优化和类博弈启发式算法解耦求解。我们在硬件测试平台上评估了EFFECT-DNN框架的性能,并使用实际DNN进行了大量模拟。结果表明,所提出的框架可以确保DNN推理截止日期的满足,并且显著(~ 20-30%)的设备节能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信