Autonomous UAV last-mile delivery in urban environments: A survey on deep learning and reinforcement learning solutions

IF 4.6 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Control Engineering Practice Pub Date : 2025-08-14 DOI:10.1016/j.conengprac.2025.106491

Jingrui Guo , Yangyang Zhou , Laurent Burlion , Andrey V. Savkin , Chao Huang

{"title":"Autonomous UAV last-mile delivery in urban environments: A survey on deep learning and reinforcement learning solutions","authors":"Jingrui Guo , Yangyang Zhou , Laurent Burlion , Andrey V. Savkin , Chao Huang","doi":"10.1016/j.conengprac.2025.106491","DOIUrl":null,"url":null,"abstract":"<div><div>This survey investigates the convergence of deep learning (DL) and reinforcement learning (RL) for unmanned aerial vehicle (UAV) applications, particularly in autonomous last-mile delivery. The ongoing growth of e-commerce heightens the need for advanced UAV technologies to overcome urban logistics challenges, including navigation and package delivery. DL and RL offer promising methods for object detection, path planning, and decision-making, enhancing delivery efficiency. However, significant challenges persist, particularly regarding scalability, computational constraints, and adaptation to volatile urban settings. Large UAV fleets and intricate city environments exacerbate scalability issues, while limited onboard processing capacity hinders the use of computationally intensive DL and RL models. Moreover, adapting to unpredictable conditions demands robust navigation strategies. This survey emphasizes hybrid approaches that integrate supervised and reinforcement learning or employ transfer learning to adapt pre-trained models. Techniques like model based RL and domain adaptation are highlighted as potential pathways to improve generalizability and bridge the gap between simulation and real-world deployment.</div></div>","PeriodicalId":50615,"journal":{"name":"Control Engineering Practice","volume":"165 ","pages":"Article 106491"},"PeriodicalIF":4.6000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Control Engineering Practice","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0967066125002539","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

This survey investigates the convergence of deep learning (DL) and reinforcement learning (RL) for unmanned aerial vehicle (UAV) applications, particularly in autonomous last-mile delivery. The ongoing growth of e-commerce heightens the need for advanced UAV technologies to overcome urban logistics challenges, including navigation and package delivery. DL and RL offer promising methods for object detection, path planning, and decision-making, enhancing delivery efficiency. However, significant challenges persist, particularly regarding scalability, computational constraints, and adaptation to volatile urban settings. Large UAV fleets and intricate city environments exacerbate scalability issues, while limited onboard processing capacity hinders the use of computationally intensive DL and RL models. Moreover, adapting to unpredictable conditions demands robust navigation strategies. This survey emphasizes hybrid approaches that integrate supervised and reinforcement learning or employ transfer learning to adapt pre-trained models. Techniques like model based RL and domain adaptation are highlighted as potential pathways to improve generalizability and bridge the gap between simulation and real-world deployment.

查看原文本刊更多论文

自主无人机在城市环境中的最后一英里交付：深度学习和强化学习解决方案的调查

本调查探讨了深度学习（DL）和强化学习（RL）在无人机（UAV）应用中的收敛性，特别是在自动驾驶的最后一英里交付中。电子商务的持续增长增加了对先进无人机技术的需求，以克服包括导航和包裹递送在内的城市物流挑战。深度学习和强化学习为目标检测、路径规划和决策提供了有前途的方法，提高了交付效率。然而，重大的挑战仍然存在，特别是在可扩展性、计算限制和适应多变的城市环境方面。大型无人机机队和复杂的城市环境加剧了可扩展性问题，而有限的机载处理能力阻碍了计算密集型DL和RL模型的使用。此外，适应不可预测的条件需要强大的导航策略。这项调查强调了混合方法，将监督学习和强化学习结合起来，或采用迁移学习来适应预训练的模型。像基于模型的强化学习和领域适应这样的技术被强调为提高泛化能力和弥合模拟与现实世界部署之间差距的潜在途径。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Control Engineering Practice 工程技术-工程：电子与电气

CiteScore

9.20

自引率

12.20%

发文量

183

审稿时长

44 days

期刊介绍： Control Engineering Practice strives to meet the needs of industrial practitioners and industrially related academics and researchers. It publishes papers which illustrate the direct application of control theory and its supporting tools in all possible areas of automation. As a result, the journal only contains papers which can be considered to have made significant contributions to the application of advanced control techniques. It is normally expected that practical results should be included, but where simulation only studies are available, it is necessary to demonstrate that the simulation model is representative of a genuine application. Strictly theoretical papers will find a more appropriate home in Control Engineering Practice''s sister publication, Automatica. It is also expected that papers are innovative with respect to the state of the art and are sufficiently detailed for a reader to be able to duplicate the main results of the paper (supplementary material, including datasets, tables, code and any relevant interactive material can be made available and downloaded from the website). The benefits of the presented methods must be made very clear and the new techniques must be compared and contrasted with results obtained using existing methods. Moreover, a thorough analysis of failures that may happen in the design process and implementation can also be part of the paper. The scope of Control Engineering Practice matches the activities of IFAC. Papers demonstrating the contribution of automation and control in improving the performance, quality, productivity, sustainability, resource and energy efficiency, and the manageability of systems and processes for the benefit of mankind and are relevant to industrial practitioners are most welcome.