Deploying On-Device AIGC Inference Services in 6G via Optimal MEC-Device Offloading

IEEE Networking Letters | Pub Date: 2024-11-04 | DOI: 10.1109/LNET.2024.3490954

Changshi Zhou; Weiqi Liu; Tao Han; Nirwan Ansari

IEEE Networking Letters, vol. 6, no. 4, pp. 232-236. Full text: https://ieeexplore.ieee.org/document/10742103/

Citations: 0

Abstract

From AI-assisted art creation to ChatGPT, which is powered by a large language model (LLM), AI-generated content (AIGC) and services are becoming a transformative force. This calls for the telecom industry to embrace the prospects of AIGC services and to face the unique challenges of incorporating generative model services into the AI-native 6G wireless network paradigm. We propose enabling AIGC inference services on mobile devices by optimizing MEC-device computing offloading: a reinforcement-learning-based policy agent minimizes AIGC task latency in a wireless environment with constrained computing resources and limited bandwidth. Simulation results are presented to demonstrate the performance advantage.
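The abstract does not spell out the system model, so the following is only a minimal sketch of the kind of latency trade-off an offloading policy has to weigh, not the authors' formulation. It assumes a binary choice (run the whole task on the device, or upload the input and run it on the MEC server), ignores result download and queuing, and replaces the reinforcement learning agent with a direct comparison of the two latencies. The names `Task`, `local_latency`, `offload_latency`, and `best_action`, and all parameters, are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    """One AIGC inference request (hypothetical, simplified model)."""
    cycles: float      # CPU cycles needed to run the inference
    input_bits: float  # bits that must be uploaded if the task is offloaded


def local_latency(task: Task, device_hz: float) -> float:
    """Seconds to run the task entirely on the mobile device."""
    return task.cycles / device_hz


def offload_latency(task: Task, uplink_bps: float, mec_hz: float) -> float:
    """Seconds to upload the input and then run the task on the MEC server."""
    return task.input_bits / uplink_bps + task.cycles / mec_hz


def best_action(task: Task, device_hz: float, uplink_bps: float, mec_hz: float):
    """Return (action, latency) for whichever option finishes sooner.

    This is a greedy comparison that assumes all rates are known exactly.
    A learned policy would instead estimate the better action from
    observed states and rewards.
    """
    t_local = local_latency(task, device_hz)
    t_offload = offload_latency(task, uplink_bps, mec_hz)
    if t_local <= t_offload:
        return "local", t_local
    return "offload", t_offload
```

For instance, with a task of 2e9 cycles and 8e6 input bits, a 1 GHz device, a 10 GHz MEC server, and a 10 Mbit/s uplink, the sketch picks offloading (about 1.0 s versus 2.0 s locally). At 1 Mbit/s the upload time dominates and local execution wins, which is the bandwidth sensitivity the abstract alludes to.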


Source journal

IEEE Networking Letters

Self-citation rate

0.00%

Articles published