Leveraging Prediction Confidence For Versatile Optimizations to CNNs

Saksham Sharma, Vanshika V Bhargava, Aditya Singh, K. Bhardwaj, Sparsh Mittal

Proceedings of the First International Conference on AI-ML Systems, 2021-10-21. DOI: 10.1145/3486001.3486222
Citations: 0
Abstract
Modern convolutional neural networks (CNNs) incur large computational and energy overheads. In this paper, we propose two techniques for estimating, in the early layers of a CNN, the confidence that a prediction is correct. The first technique uses a statistical approach, whereas the second requires retraining. We argue that prediction-confidence estimation enables diverse optimizations to CNNs, and we demonstrate two of them. First, we predict selected images in the early layers: many images in a dataset are easy and can be classified correctly before the full network has run. This reduces the average computation count, at the cost of a small accuracy loss and a larger parameter count. Second, we predict only those images for which the prediction confidence is high. This reduces coverage, but the accuracy on the images that are predicted is higher. Our results with the VGG16 and ResNet50 CNNs on the Caltech256 dataset show that both techniques are effective. For example, for ResNet50, the first technique reduces accuracy from 71.6% to 69.8% while cutting computations by 14%. Similarly, with the second technique, reducing coverage from 100% to 90% raises accuracy from 71.6% to 75.6%.

Keywords: computer vision, CNN, approximate computing, accuracy-coverage tradeoff, prediction confidence
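The accuracy-coverage tradeoff described above can be illustrated with a minimal sketch. This is not the paper's method; it assumes the simplest confidence estimator, the maximum softmax probability, and rejects any image whose confidence falls below a threshold `tau` (a hypothetical parameter name). Raising `tau` lowers coverage but tends to raise accuracy on the images that are still predicted.

```python
import math

def softmax(logits):
    """Numerically stable softmax for one image's logit vector."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]

def selective_predict(batch_logits, tau):
    """Keep a prediction only when its max softmax probability reaches tau.

    Returns the predicted class per image and a boolean accept mask;
    rejected images would be deferred (e.g. to a larger model or a human).
    """
    preds, accept = [], []
    for logits in batch_logits:
        probs = softmax(logits)
        conf = max(probs)                # confidence = max softmax probability
        preds.append(probs.index(conf))  # predicted class
        accept.append(conf >= tau)       # predict only if confident enough
    return preds, accept

# Toy batch: 3 images, 4 classes. The second row is nearly uniform,
# so its confidence is low and the image is rejected.
logits = [[4.0, 0.1, 0.1, 0.1],
          [1.0, 1.1, 0.9, 1.0],
          [0.2, 0.1, 5.0, 0.3]]
preds, accept = selective_predict(logits, tau=0.5)
coverage = sum(accept) / len(accept)  # fraction of images actually predicted
```

Here two of three images clear the threshold, so coverage is 2/3; accuracy would then be measured only on those accepted images, which is how the paper's 100%-to-90% coverage reduction yields a higher accuracy.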