UniLCD：通过强化学习进行统一本地云决策

arXiv - CS - Robotics Pub Date : 2024-09-17 DOI:arxiv-2409.11403

Kathakoli Sengupta, Zhongkai Shagguan, Sandesh Bharadwaj, Sanjay Arora, Eshed Ohn-Bar, Renato Mancuso

{"title":"UniLCD：通过强化学习进行统一本地云决策","authors":"Kathakoli Sengupta, Zhongkai Shagguan, Sandesh Bharadwaj, Sanjay Arora, Eshed Ohn-Bar, Renato Mancuso","doi":"arxiv-2409.11403","DOIUrl":null,"url":null,"abstract":"Embodied vision-based real-world systems, such as mobile robots, require a\ncareful balance between energy consumption, compute latency, and safety\nconstraints to optimize operation across dynamic tasks and contexts. As local\ncomputation tends to be restricted, offloading the computation, ie, to a remote\nserver, can save local resources while providing access to high-quality\npredictions from powerful and large models. However, the resulting\ncommunication and latency overhead has led to limited usability of cloud models\nin dynamic, safety-critical, real-time settings. To effectively address this\ntrade-off, we introduce UniLCD, a novel hybrid inference framework for enabling\nflexible local-cloud collaboration. By efficiently optimizing a flexible\nrouting module via reinforcement learning and a suitable multi-task objective,\nUniLCD is specifically designed to support the multiple constraints of\nsafety-critical end-to-end mobile systems. We validate the proposed approach\nusing a challenging, crowded navigation task requiring frequent and timely\nswitching between local and cloud operations. UniLCD demonstrates improved\noverall performance and efficiency, by over 35% compared to state-of-the-art\nbaselines based on various split computing and early exit strategies.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":"28 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning\",\"authors\":\"Kathakoli Sengupta, Zhongkai Shagguan, Sandesh Bharadwaj, Sanjay Arora, Eshed Ohn-Bar, Renato Mancuso\",\"doi\":\"arxiv-2409.11403\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Embodied vision-based real-world systems, such as mobile robots, require a\\ncareful balance between energy consumption, compute latency, and safety\\nconstraints to optimize operation across dynamic tasks and contexts. As local\\ncomputation tends to be restricted, offloading the computation, ie, to a remote\\nserver, can save local resources while providing access to high-quality\\npredictions from powerful and large models. However, the resulting\\ncommunication and latency overhead has led to limited usability of cloud models\\nin dynamic, safety-critical, real-time settings. To effectively address this\\ntrade-off, we introduce UniLCD, a novel hybrid inference framework for enabling\\nflexible local-cloud collaboration. By efficiently optimizing a flexible\\nrouting module via reinforcement learning and a suitable multi-task objective,\\nUniLCD is specifically designed to support the multiple constraints of\\nsafety-critical end-to-end mobile systems. We validate the proposed approach\\nusing a challenging, crowded navigation task requiring frequent and timely\\nswitching between local and cloud operations. UniLCD demonstrates improved\\noverall performance and efficiency, by over 35% compared to state-of-the-art\\nbaselines based on various split computing and early exit strategies.\",\"PeriodicalId\":501031,\"journal\":{\"name\":\"arXiv - CS - Robotics\",\"volume\":\"28 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Robotics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.11403\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Robotics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11403","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

基于嵌入式视觉的真实世界系统（如移动机器人）需要在能耗、计算延迟和安全限制之间取得谨慎的平衡，以优化在动态任务和环境中的运行。由于本地计算往往受到限制，因此将计算卸载到远程服务器上可以节省本地资源，同时还能从强大的大型模型中获取高质量的预测结果。然而，由此产生的通信和延迟开销导致云模型在动态、安全关键、实时环境中的可用性受到限制。为了有效解决这一矛盾，我们引入了 UniLCD，这是一种新颖的混合推理框架，用于实现灵活的本地-云协作。通过强化学习和合适的多任务目标对灵活路由模块进行有效优化，UniLCD 专为支持安全关键型端到端移动系统的多重约束而设计。我们利用一项具有挑战性的拥挤导航任务验证了所提出的方法，该任务要求在本地操作和云操作之间频繁、及时地切换。与基于各种分离计算和早期退出策略的先进基线相比，UniLCD 的整体性能和效率提高了 35% 以上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning

Embodied vision-based real-world systems, such as mobile robots, require a careful balance between energy consumption, compute latency, and safety constraints to optimize operation across dynamic tasks and contexts. As local computation tends to be restricted, offloading the computation, ie, to a remote server, can save local resources while providing access to high-quality predictions from powerful and large models. However, the resulting communication and latency overhead has led to limited usability of cloud models in dynamic, safety-critical, real-time settings. To effectively address this trade-off, we introduce UniLCD, a novel hybrid inference framework for enabling flexible local-cloud collaboration. By efficiently optimizing a flexible routing module via reinforcement learning and a suitable multi-task objective, UniLCD is specifically designed to support the multiple constraints of safety-critical end-to-end mobile systems. We validate the proposed approach using a challenging, crowded navigation task requiring frequent and timely switching between local and cloud operations. UniLCD demonstrates improved overall performance and efficiency, by over 35% compared to state-of-the-art baselines based on various split computing and early exit strategies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - CS - Robotics

自引率

0.00%

发文量