A 1036 TOp/s/W, 12.2 mW, 2.72 μJ/Inference All Digital TNN Accelerator in 22 nm FDX Technology for TinyML Applications

2022 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS) Pub Date : 2022-04-20 DOI:10.1109/coolchips54332.2022.9772668

Moritz Scherer, Alfio Di Mauro, Georg Rutishauser, Tim Fischer, L. Benini

引用次数: 6

Abstract

Tiny Machine Learning (TinyML) applications impose μJ/Inference constraints, with maximum power consumption of a few tens of mW. It is extremely challenging to meet these requirement at a reasonable accuracy level. In this work, we address this challenge with a flexible, fully digital Ternary Neural Network (TNN) accelerator in a RISC-V-based SoC. The design achieves 2.72 μJ/Inference, 12.2 mW, 3200 Inferences/sec at 0.5 V for a non-trivial 9-layer, 96 channels-per-layer network with CIFAR-10 accuracy of 86 %. The peak energy efficiency is 1036 TOp/s/W, outperforming the state-of-the-art in silicon-proven TinyML accelerators by 1.67x.

查看原文本刊更多论文

用于TinyML应用的1036 TOp/s/W、12.2 mW、2.72 μJ/Inference全数字TNN加速器

微型机器学习(TinyML)应用程序施加μJ/Inference约束，最大功耗为几十mW。在合理的精度水平上满足这些要求是极具挑战性的。在这项工作中，我们在基于risc - v的SoC中使用灵活的全数字三元神经网络(TNN)加速器来解决这一挑战。该设计实现了2.72 μJ/Inference, 12.2 mW, 3200 Inferences/sec，在0.5 V下实现了9层，每层96通道的重要网络，CIFAR-10精度为86%。峰值能量效率为1036 TOp/s/W，比最先进的经硅验证的TinyML加速器高出1.67倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)

自引率

0.00%

发文量