The Benefit of the Doubt: Uncertainty Aware Sensing for Edge Computing Platforms

2021 IEEE/ACM Symposium on Edge Computing (SEC) Pub Date : 2021-02-11 DOI:10.1145/3453142.3492330

Lorena Qendro, Jagmohan Chauhan, Alberto Gil C. P. Ramos, C. Mascolo

{"title":"The Benefit of the Doubt: Uncertainty Aware Sensing for Edge Computing Platforms","authors":"Lorena Qendro, Jagmohan Chauhan, Alberto Gil C. P. Ramos, C. Mascolo","doi":"10.1145/3453142.3492330","DOIUrl":null,"url":null,"abstract":"Neural networks (NNs) have drastically improved the performance of mobile and embedded applications but lack measures of “reliability” estimation that would enable reasoning over their predictions. Despite the vital importance, especially in areas of human well-being and health, state-of-the-art uncertainty estimation techniques are computationally expensive when applied to resource-constrained devices. We propose an efficient framework for predictive uncertainty estimation in NNs deployed on edge computing platforms with no need for fine-tuning or re-training strategies. To meet the energy and latency requirements of these systems the framework is built from the ground up to provide predictive uncertainty based only on one forward pass and a negligible amount of additional matrix multiplications. Our aim is to enable already trained deep learning models to generate uncertainty estimates on resource-limited devices at inference time focusing on classification tasks. This framework is founded on theoretical developments casting dropout training as approximate inference in Bayesian NNs. Our novel layerwise distribution approximation to the convolution layer cascades through the network, providing uncertainty estimates in one single run which ensures minimal overhead, especially compared with uncertainty techniques that require multiple forwards passes and an equal linear rise in energy and latency requirements making them unsuitable in practice. We demonstrate that it yields better performance and flexibility over previous work based on multilayer perceptrons to obtain uncertainty estimates. Our evaluation with mobile applications datasets on Nvidia Jetson TX2 and Nano shows that our approach not only obtains robust and accurate uncertainty estimations but also outperforms state-of-the-art methods in terms of systems performance, reducing energy consumption (up to 28–folds), keeping the memory overhead at a minimum while still improving accuracy (up to 16%).","PeriodicalId":6779,"journal":{"name":"2021 IEEE/ACM Symposium on Edge Computing (SEC)","volume":"1 1","pages":"214-227"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACM Symposium on Edge Computing (SEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3453142.3492330","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Neural networks (NNs) have drastically improved the performance of mobile and embedded applications but lack measures of “reliability” estimation that would enable reasoning over their predictions. Despite the vital importance, especially in areas of human well-being and health, state-of-the-art uncertainty estimation techniques are computationally expensive when applied to resource-constrained devices. We propose an efficient framework for predictive uncertainty estimation in NNs deployed on edge computing platforms with no need for fine-tuning or re-training strategies. To meet the energy and latency requirements of these systems the framework is built from the ground up to provide predictive uncertainty based only on one forward pass and a negligible amount of additional matrix multiplications. Our aim is to enable already trained deep learning models to generate uncertainty estimates on resource-limited devices at inference time focusing on classification tasks. This framework is founded on theoretical developments casting dropout training as approximate inference in Bayesian NNs. Our novel layerwise distribution approximation to the convolution layer cascades through the network, providing uncertainty estimates in one single run which ensures minimal overhead, especially compared with uncertainty techniques that require multiple forwards passes and an equal linear rise in energy and latency requirements making them unsuitable in practice. We demonstrate that it yields better performance and flexibility over previous work based on multilayer perceptrons to obtain uncertainty estimates. Our evaluation with mobile applications datasets on Nvidia Jetson TX2 and Nano shows that our approach not only obtains robust and accurate uncertainty estimations but also outperforms state-of-the-art methods in terms of systems performance, reducing energy consumption (up to 28–folds), keeping the memory overhead at a minimum while still improving accuracy (up to 16%).

查看原文本刊更多论文

怀疑的好处:边缘计算平台的不确定性感知传感

神经网络(NNs)极大地提高了移动和嵌入式应用程序的性能，但缺乏“可靠性”估计的措施，无法对其预测进行推理。尽管最先进的不确定性估计技术至关重要，特别是在人类福祉和健康领域，但当应用于资源有限的设备时，计算成本很高。我们提出了一种有效的框架，用于部署在边缘计算平台上的神经网络的预测不确定性估计，无需微调或重新训练策略。为了满足这些系统的能量和延迟需求，该框架从头开始构建，以提供仅基于一次前向传递和可忽略不计的额外矩阵乘法的预测不确定性。我们的目标是使已经训练好的深度学习模型能够在集中于分类任务的推理时间对资源有限的设备产生不确定性估计。该框架建立在理论发展的基础上，将辍学训练作为贝叶斯神经网络的近似推理。我们对卷积层的新颖分层分布近似通过网络级联，在一次运行中提供不确定性估计，确保最小的开销，特别是与不确定性技术相比，不确定性技术需要多次向前传递，能量和延迟要求等线性上升，这使得它们不适合实践。我们证明了它比以前基于多层感知器的工作产生更好的性能和灵活性，以获得不确定性估计。我们对Nvidia Jetson TX2和Nano上的移动应用程序数据集进行的评估表明，我们的方法不仅获得了强大而准确的不确定性估计，而且在系统性能方面优于最先进的方法，降低了能耗(高达28倍)，将内存开销保持在最低限度，同时仍然提高了准确性(高达16%)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE/ACM Symposium on Edge Computing (SEC)

自引率

0.00%

发文量