MIRAGE: Machine Learning-based Modeling of Identical Replicas of the Jetson AGX Embedded Platform

2021 IEEE/ACM Symposium on Edge Computing (SEC) Pub Date : 2021-12-01 DOI:10.1145/3453142.3491284

Hassan Halawa, Hazem A. Abdelhafez, M. O. Ahmed, K. Pattabiraman, M. Ripeanu

{"title":"MIRAGE: Machine Learning-based Modeling of Identical Replicas of the Jetson AGX Embedded Platform","authors":"Hassan Halawa, Hazem A. Abdelhafez, M. O. Ahmed, K. Pattabiraman, M. Ripeanu","doi":"10.1145/3453142.3491284","DOIUrl":null,"url":null,"abstract":"A common feature of devices deployed at the edge today is their configurability. The NVIDIA Jetson AGX, for example, has a user-configurable frequency range larger than one order of magnitude for the CPU, the GPU, and the memory controller. Key to make effective use of this configurability is the ability to anticipate the application-level impact of a frequency configuration choice. To this end, this paper presents a novel modeling approach for predicting the runtime and power consumption for convolutional neural net-works (CNNs). This modeling approach is: (i) effective - i.e., makes predictions with low error (models achieve an average relative error of 15.4% for runtime and 14.9% for energy); (ii) efficient - i.e., has a low cost to make predictions; (iii) generic - i.e., supports deploying updated and possibly different deep learning inference models without the need for retraining, and (iv) practical - i.e., requires a low training cost. Three features, all geared towards meeting the challenges of deploying in a real-world environment, set this work apart: (i) the focus on predicting the impact of the frequency configuration choice, (ii) the methodological choice to aggregate predictions at fine (i.e., kernel level) granularity which provides generality; and (iii) taking into account the inter-node variability among nominally identical devices.","PeriodicalId":6779,"journal":{"name":"2021 IEEE/ACM Symposium on Edge Computing (SEC)","volume":"1 1","pages":"26-40"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACM Symposium on Edge Computing (SEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3453142.3491284","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

A common feature of devices deployed at the edge today is their configurability. The NVIDIA Jetson AGX, for example, has a user-configurable frequency range larger than one order of magnitude for the CPU, the GPU, and the memory controller. Key to make effective use of this configurability is the ability to anticipate the application-level impact of a frequency configuration choice. To this end, this paper presents a novel modeling approach for predicting the runtime and power consumption for convolutional neural net-works (CNNs). This modeling approach is: (i) effective - i.e., makes predictions with low error (models achieve an average relative error of 15.4% for runtime and 14.9% for energy); (ii) efficient - i.e., has a low cost to make predictions; (iii) generic - i.e., supports deploying updated and possibly different deep learning inference models without the need for retraining, and (iv) practical - i.e., requires a low training cost. Three features, all geared towards meeting the challenges of deploying in a real-world environment, set this work apart: (i) the focus on predicting the impact of the frequency configuration choice, (ii) the methodological choice to aggregate predictions at fine (i.e., kernel level) granularity which provides generality; and (iii) taking into account the inter-node variability among nominally identical devices.

查看原文本刊更多论文

海市蜃楼:基于机器学习的Jetson AGX嵌入式平台相同复制品建模

如今部署在边缘的设备的一个共同特征是它们的可配置性。例如，NVIDIA Jetson AGX的用户可配置频率范围大于CPU、GPU和内存控制器的一个数量级。有效利用这种可配置性的关键是能够预测频率配置选择对应用程序级的影响。为此，本文提出了一种预测卷积神经网络(cnn)运行时间和功耗的新颖建模方法。这种建模方法是:(i)有效的——即以低误差进行预测(模型在运行时间和能源方面的平均相对误差分别为15.4%和14.9%);(ii)高效——即进行预测的成本低;(iii)通用性——即支持部署更新的和可能不同的深度学习推理模型，而不需要再训练;(iv)实用性——即需要较低的训练成本。三个特点，都是为了应对在现实环境中部署的挑战，使这项工作与众不同:(i)专注于预测频率配置选择的影响，(ii)在精细(即内核级别)粒度上聚合预测的方法选择，提供了通用性;(iii)考虑到名义上相同的设备之间的节点间可变性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE/ACM Symposium on Edge Computing (SEC)

自引率

0.00%

发文量