Transfer-Once-For-All: AI Model Optimization for Edge

2023 IEEE International Conference on Edge Computing and Communications (EDGE). Pub Date: 2023-03-27. DOI: 10.1109/EDGE60047.2023.00017

Achintya Kundu, L. Wynter, Rhui Dih Lee, L. A. Bathen



Abstract

Weight-sharing neural architecture search aims to optimize a configurable neural network model (supernet) for a variety of deployment scenarios across many devices with different resource constraints. Existing approaches use evolutionary search to extract models of different sizes from a supernet trained on a very large data set, and then fine-tune the extracted models on the typically small, real-world data set of interest. The computational cost of training thus grows linearly with the number of different model deployment scenarios. Hence, we propose Transfer-Once-For-All (TOFA) for supernet-style training on small data sets with constant computational training cost over any number of edge deployment scenarios. Given a task, TOFA obtains custom neural networks, both the topology and the weights, optimized for any number of edge deployment scenarios. To overcome the challenges arising from small data, TOFA utilizes a unified semi-supervised training loss to simultaneously train all subnets within the supernet, coupled with on-the-fly architecture selection at deployment time.
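To make the ideas of weight sharing, deployment-time selection, and training cost concrete, here is a minimal toy sketch in plain Python. It is not the TOFA implementation: the layer sizes, the candidate widths, the parameter-count budget, and all function names are hypothetical choices made for illustration, and the abstract does not specify how TOFA selects architectures or measures resources. The sketch only shows that subnets of different sizes can be slices of one shared set of weights, that a subnet can be picked against a resource budget without further training, and how a per-scenario fine-tuning cost differs from a train-once cost.

```python
import random

# Hypothetical layer sizes for a toy two-layer supernet.
IN_DIM, MAX_WIDTH, OUT_DIM = 8, 16, 4

random.seed(0)
# One shared set of weights. Every subnet is a slice of these matrices,
# so no subnet owns separate parameters.
W1 = [[random.gauss(0, 1) for _ in range(IN_DIM)] for _ in range(MAX_WIDTH)]
W2 = [[random.gauss(0, 1) for _ in range(MAX_WIDTH)] for _ in range(OUT_DIM)]


def subnet_forward(x, width):
    """Forward pass of the subnet that keeps the first `width` hidden units."""
    hidden = [
        max(0.0, sum(w * xi for w, xi in zip(W1[j], x)))  # ReLU activation
        for j in range(width)
    ]
    return [
        sum(W2[k][j] * hidden[j] for j in range(width))
        for k in range(OUT_DIM)
    ]


def param_count(width):
    """Number of weights used by a subnet with `width` hidden units."""
    return width * IN_DIM + OUT_DIM * width


def select_width(budget, candidates=(4, 8, 12, 16)):
    """Pick the widest candidate subnet whose parameter count fits the budget.

    A stand-in for deployment-time selection; the budget here is a parameter
    count, which is an assumption of this sketch.
    """
    feasible = [w for w in candidates if param_count(w) <= budget]
    if not feasible:
        raise ValueError("no candidate subnet fits the budget")
    return max(feasible)


def per_scenario_training_cost(num_scenarios, finetune_cost):
    """Extract-then-fine-tune: one fine-tuning run per deployment scenario."""
    return num_scenarios * finetune_cost


def train_once_cost(num_scenarios, supernet_cost):
    """Train-once: the cost does not depend on the number of scenarios."""
    return supernet_cost
```

With these toy sizes, `select_width(100)` returns 8, because the width-8 subnet uses 96 weights and the width-12 subnet uses 144. Calling `subnet_forward([1.0] * IN_DIM, 8)` then evaluates that subnet directly from the shared matrices, with no per-device retraining step.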

