UniLCD:通过强化学习进行统一本地云决策

Kathakoli Sengupta, Zhongkai Shagguan, Sandesh Bharadwaj, Sanjay Arora, Eshed Ohn-Bar, Renato Mancuso
{"title":"UniLCD:通过强化学习进行统一本地云决策","authors":"Kathakoli Sengupta, Zhongkai Shagguan, Sandesh Bharadwaj, Sanjay Arora, Eshed Ohn-Bar, Renato Mancuso","doi":"arxiv-2409.11403","DOIUrl":null,"url":null,"abstract":"Embodied vision-based real-world systems, such as mobile robots, require a\ncareful balance between energy consumption, compute latency, and safety\nconstraints to optimize operation across dynamic tasks and contexts. As local\ncomputation tends to be restricted, offloading the computation, ie, to a remote\nserver, can save local resources while providing access to high-quality\npredictions from powerful and large models. However, the resulting\ncommunication and latency overhead has led to limited usability of cloud models\nin dynamic, safety-critical, real-time settings. To effectively address this\ntrade-off, we introduce UniLCD, a novel hybrid inference framework for enabling\nflexible local-cloud collaboration. By efficiently optimizing a flexible\nrouting module via reinforcement learning and a suitable multi-task objective,\nUniLCD is specifically designed to support the multiple constraints of\nsafety-critical end-to-end mobile systems. We validate the proposed approach\nusing a challenging, crowded navigation task requiring frequent and timely\nswitching between local and cloud operations. UniLCD demonstrates improved\noverall performance and efficiency, by over 35% compared to state-of-the-art\nbaselines based on various split computing and early exit strategies.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":"28 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning\",\"authors\":\"Kathakoli Sengupta, Zhongkai Shagguan, Sandesh Bharadwaj, Sanjay Arora, Eshed Ohn-Bar, Renato Mancuso\",\"doi\":\"arxiv-2409.11403\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Embodied vision-based real-world systems, such as mobile robots, require a\\ncareful balance between energy consumption, compute latency, and safety\\nconstraints to optimize operation across dynamic tasks and contexts. As local\\ncomputation tends to be restricted, offloading the computation, ie, to a remote\\nserver, can save local resources while providing access to high-quality\\npredictions from powerful and large models. However, the resulting\\ncommunication and latency overhead has led to limited usability of cloud models\\nin dynamic, safety-critical, real-time settings. To effectively address this\\ntrade-off, we introduce UniLCD, a novel hybrid inference framework for enabling\\nflexible local-cloud collaboration. By efficiently optimizing a flexible\\nrouting module via reinforcement learning and a suitable multi-task objective,\\nUniLCD is specifically designed to support the multiple constraints of\\nsafety-critical end-to-end mobile systems. We validate the proposed approach\\nusing a challenging, crowded navigation task requiring frequent and timely\\nswitching between local and cloud operations. UniLCD demonstrates improved\\noverall performance and efficiency, by over 35% compared to state-of-the-art\\nbaselines based on various split computing and early exit strategies.\",\"PeriodicalId\":501031,\"journal\":{\"name\":\"arXiv - CS - Robotics\",\"volume\":\"28 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Robotics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.11403\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Robotics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11403","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

基于嵌入式视觉的真实世界系统(如移动机器人)需要在能耗、计算延迟和安全限制之间取得谨慎的平衡,以优化在动态任务和环境中的运行。由于本地计算往往受到限制,因此将计算卸载到远程服务器上可以节省本地资源,同时还能从强大的大型模型中获取高质量的预测结果。然而,由此产生的通信和延迟开销导致云模型在动态、安全关键、实时环境中的可用性受到限制。为了有效解决这一矛盾,我们引入了 UniLCD,这是一种新颖的混合推理框架,用于实现灵活的本地-云协作。通过强化学习和合适的多任务目标对灵活路由模块进行有效优化,UniLCD 专为支持安全关键型端到端移动系统的多重约束而设计。我们利用一项具有挑战性的拥挤导航任务验证了所提出的方法,该任务要求在本地操作和云操作之间频繁、及时地切换。与基于各种分离计算和早期退出策略的先进基线相比,UniLCD 的整体性能和效率提高了 35% 以上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning
Embodied vision-based real-world systems, such as mobile robots, require a careful balance between energy consumption, compute latency, and safety constraints to optimize operation across dynamic tasks and contexts. As local computation tends to be restricted, offloading the computation, ie, to a remote server, can save local resources while providing access to high-quality predictions from powerful and large models. However, the resulting communication and latency overhead has led to limited usability of cloud models in dynamic, safety-critical, real-time settings. To effectively address this trade-off, we introduce UniLCD, a novel hybrid inference framework for enabling flexible local-cloud collaboration. By efficiently optimizing a flexible routing module via reinforcement learning and a suitable multi-task objective, UniLCD is specifically designed to support the multiple constraints of safety-critical end-to-end mobile systems. We validate the proposed approach using a challenging, crowded navigation task requiring frequent and timely switching between local and cloud operations. UniLCD demonstrates improved overall performance and efficiency, by over 35% compared to state-of-the-art baselines based on various split computing and early exit strategies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信