DiReDi: Distillation and Reverse Distillation for AIoT Applications

Chen Sun, Qing Tong, Wenshuang Yang, Wenqi Zhang
arXiv - CS - Distributed, Parallel, and Cluster Computing. Published 2024-09-12. DOI: arxiv-2409.08308 (https://doi.org/arxiv-2409.08308)
Citations: 0

Abstract

Significant efficiency can typically be achieved by deploying different edge AI models in various real-world scenarios while a few large models manage those edge AI models remotely from cloud servers. However, customizing edge AI models for each user's specific application, or extending current models to new application scenarios, remains a challenge. Inappropriate local training or fine-tuning of edge AI models by users can lead to model malfunction, potentially resulting in legal issues for the manufacturer. To address these issues, this paper proposes a framework called "DiReDi", which combines knowledge DIstillation (KD) and REverse DIstillation (RD). In the initial step, an edge AI model is trained with presumed data and a KD process that uses the cloud AI model on the upper management cloud server. This edge AI model is then dispatched to edge AI devices solely for inference in the user's application scenario. When the user needs to update the edge AI model to better fit the actual scenario, the RD process uses the user's exclusive data to extract knowledge from the edge AI model, namely the difference between the user's preferences and the manufacturer's presumptions. Only the extracted knowledge is reported back to the upper management cloud server to update the cloud AI model, which protects user privacy because none of the user's exclusive data is transmitted. The updated cloud AI model can then update the edge AI model with the extended knowledge. Simulation results demonstrate that the proposed DiReDi framework allows the manufacturer to update the user model by learning new knowledge from the user's actual scenario with private data. The initial redundant knowledge is also reduced, since the retraining emphasizes the user's private data.
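The abstract does not give the loss functions used in DiReDi, so the following is only a minimal sketch of a generic knowledge-distillation loss (the temperature-softened KL divergence commonly attributed to Hinton et al.), included to illustrate what "a KD process using the cloud AI model" typically involves. The function names `softmax` and `kd_loss`, the temperature value, and the example logits are all illustrative assumptions and are not taken from the paper; the paper's reverse distillation step is not reproduced here.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax, shifted by the max for numerical stability."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kd_loss(student_logits, teacher_logits, temperature=2.0):
    """Generic KD loss: KL(teacher || student) on softened outputs, scaled by T^2.

    Assumption: this is the textbook formulation, not necessarily the one
    used in the DiReDi paper.
    """
    p = softmax(teacher_logits, temperature)  # soft targets from the teacher (cloud model)
    q = softmax(student_logits, temperature)  # softened student (edge model) outputs
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


# Hypothetical logits for a 3-class problem.
teacher = [4.0, 1.0, 0.5]
student = [2.0, 1.5, 1.0]
print(kd_loss(student, teacher))  # positive: the student disagrees with the teacher
print(kd_loss(teacher, teacher))  # zero: identical outputs carry no distillation signal
```

In practice this term is usually combined with an ordinary supervised loss on labeled data, and the teacher's weights are kept fixed while the student is trained.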