Learn to Learn on Chip: Hardware-aware Meta-learning for Quantized Few-shot Learning at the Edge

2022 IEEE/ACM 7th Symposium on Edge Computing (SEC) Pub Date : 2022-12-01 DOI:10.1109/SEC54971.2022.00009

Nitish Satya Murthy, Peter Vrancx, Nathan Laubeuf, P. Debacker, F. Catthoor, M. Verhelst

{"title":"Learn to Learn on Chip: Hardware-aware Meta-learning for Quantized Few-shot Learning at the Edge","authors":"Nitish Satya Murthy, Peter Vrancx, Nathan Laubeuf, P. Debacker, F. Catthoor, M. Verhelst","doi":"10.1109/SEC54971.2022.00009","DOIUrl":null,"url":null,"abstract":"Recent years have seen a growing trend of deploying deep neural network-based applications on edge devices. Many of these applications, such as biometric identification, activity tracking, user preference learning, etc., require fine-tuning of the trained networks for user personalization. One way to prepare these models to handle new, unseen tasks, is to pre-train them on a distribution of known tasks. This observation has led to increasing research into meta-learning based few-shot learning techniques. However, basic meta-learning approaches do not account for the limited memory and computational resources during on-chip training. We propose a modified meta-learning algorithm that enables quantized fine-tuning to optimally condition the models for on-chip few shot learning. The modification involves the inclusion of target hardware constraints upfront in the meta-learning process. Block floating point datatypes with low precision mantissa bits are utilized in the forward and backward passes, to allow hardware-friendly adaptation. Experiments show that our algorithm provides better initializations than conventional algorithms, more suitable for efficient quantized fine-tuning. This allows the few-shot learner to achieve better convergence, in terms of accuracy and speed. Extensive experiments are also performed to analyze the impact of initialization on quantized fine-tuning and further corroborate the benefits of our method.","PeriodicalId":364062,"journal":{"name":"2022 IEEE/ACM 7th Symposium on Edge Computing (SEC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM 7th Symposium on Edge Computing (SEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SEC54971.2022.00009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recent years have seen a growing trend of deploying deep neural network-based applications on edge devices. Many of these applications, such as biometric identification, activity tracking, user preference learning, etc., require fine-tuning of the trained networks for user personalization. One way to prepare these models to handle new, unseen tasks, is to pre-train them on a distribution of known tasks. This observation has led to increasing research into meta-learning based few-shot learning techniques. However, basic meta-learning approaches do not account for the limited memory and computational resources during on-chip training. We propose a modified meta-learning algorithm that enables quantized fine-tuning to optimally condition the models for on-chip few shot learning. The modification involves the inclusion of target hardware constraints upfront in the meta-learning process. Block floating point datatypes with low precision mantissa bits are utilized in the forward and backward passes, to allow hardware-friendly adaptation. Experiments show that our algorithm provides better initializations than conventional algorithms, more suitable for efficient quantized fine-tuning. This allows the few-shot learner to achieve better convergence, in terms of accuracy and speed. Extensive experiments are also performed to analyze the impact of initialization on quantized fine-tuning and further corroborate the benefits of our method.

查看原文本刊更多论文

学会在芯片上学习:硬件感知的元学习在边缘量化少量学习

近年来，在边缘设备上部署基于深度神经网络的应用程序的趋势日益增长。许多此类应用，如生物识别、活动跟踪、用户偏好学习等，都需要对训练好的网络进行微调，以实现用户个性化。让这些模型准备好处理新的、看不见的任务的一种方法是在已知任务的分布上对它们进行预训练。这一观察结果导致了对基于少量学习技术的元学习的越来越多的研究。然而，基本的元学习方法并没有考虑到芯片上训练过程中有限的内存和计算资源。我们提出了一种改进的元学习算法，该算法可以进行量化微调，以优化片上少镜头学习的模型。修改涉及在元学习过程中预先包含目标硬件约束。在向前和向后传递中使用具有低精度尾数位的块浮点数据类型，以允许硬件友好的适应。实验表明，该算法比传统算法提供了更好的初始化，更适合于有效的量化微调。这使得少射学习器在准确性和速度方面实现更好的收敛。我们还进行了大量的实验来分析初始化对量化微调的影响，并进一步证实了我们的方法的优点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE/ACM 7th Symposium on Edge Computing (SEC)

自引率

0.00%

发文量