Resource-constrained dynamic planning and model-free control in turbulent urban environments

IF 5.2 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Robotics and Autonomous Systems Pub Date : 2025-07-25 DOI:10.1016/j.robot.2025.105130

Thomas Nakken Larsen , Mandar Tabib , Adil Rasheed

{"title":"Resource-constrained dynamic planning and model-free control in turbulent urban environments","authors":"Thomas Nakken Larsen , Mandar Tabib , Adil Rasheed","doi":"10.1016/j.robot.2025.105130","DOIUrl":null,"url":null,"abstract":"<div><div>This work proposes a descriptive digital twin of an urban region with a sufficiently small footprint to run online in resource-constrained quadrotor systems. The digital twin describes the building geometries and turbulent kinetic energy (TKE) within an urban region, enabled via our proposed surrogate model (SM) for TKE reconstruction. This forms the basis of a simulation environment for autonomous path following and collision avoidance using Deep Reinforcement Learning (DRL). A Voronoi-based turbulence-weighted graph (TWG) is developed for safe path planning and is capable of reacting to dynamic changes in wind direction and, consequently, the turbulence field. Lastly, the environment simulates oncoming traffic of dynamic unknown obstacles for a vision-enabled DRL agent to evade. Several DRL agents with different observation and action spaces are trained and evaluated.</div><div>The SM enables rapid reconstruction of the turbulence with an accuracy comparable to full-order methods. The TWG plans safe paths that reduce the worst-case TKE exposure by 44% at the cost of increasing the average path length by 33% compared to a shortest-distance approach. The DRL agents successfully solve the navigation problem with a 100% success rate in the static obstacle scenario, where the minimum clearance between buildings is <span><math><mrow><mn>1</mn><mo>.</mo><mn>0</mn><mi>m</mi></mrow></math></span>. In scenarios with dynamic obstacles, the agent achieves a 79% success rate. Suggestions for further performance and safety improvements in the TWG planner and DRL agents are presented.</div></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"194 ","pages":"Article 105130"},"PeriodicalIF":5.2000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889025002271","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

This work proposes a descriptive digital twin of an urban region with a sufficiently small footprint to run online in resource-constrained quadrotor systems. The digital twin describes the building geometries and turbulent kinetic energy (TKE) within an urban region, enabled via our proposed surrogate model (SM) for TKE reconstruction. This forms the basis of a simulation environment for autonomous path following and collision avoidance using Deep Reinforcement Learning (DRL). A Voronoi-based turbulence-weighted graph (TWG) is developed for safe path planning and is capable of reacting to dynamic changes in wind direction and, consequently, the turbulence field. Lastly, the environment simulates oncoming traffic of dynamic unknown obstacles for a vision-enabled DRL agent to evade. Several DRL agents with different observation and action spaces are trained and evaluated.

The SM enables rapid reconstruction of the turbulence with an accuracy comparable to full-order methods. The TWG plans safe paths that reduce the worst-case TKE exposure by 44% at the cost of increasing the average path length by 33% compared to a shortest-distance approach. The DRL agents successfully solve the navigation problem with a 100% success rate in the static obstacle scenario, where the minimum clearance between buildings is

1.0 m

. In scenarios with dynamic obstacles, the agent achieves a 79% success rate. Suggestions for further performance and safety improvements in the TWG planner and DRL agents are presented.

查看原文本刊更多论文

动荡城市环境中资源约束下的动态规划与无模型控制

这项工作提出了一个描述性的数字孪生城市地区，其足迹足够小，可以在资源受限的四旋翼系统中在线运行。数字孪生体描述了城市区域内的建筑几何形状和湍流动能（TKE），通过我们提出的替代模型（SM）实现了TKE重建。这构成了使用深度强化学习（DRL）进行自主路径跟踪和避免碰撞的模拟环境的基础。基于voronoi的湍流加权图（TWG）用于安全路径规划，能够对风向的动态变化做出反应，从而对湍流场做出反应。最后，该环境模拟迎面而来的动态未知障碍物，使具有视觉功能的DRL代理能够规避。对具有不同观察和动作空间的多个DRL代理进行了训练和评估。SM能够以与全阶方法相当的精度快速重建湍流。与最短距离的方法相比，TWG计划的安全路径将最坏情况下的TKE暴露减少44%，代价是平均路径长度增加33%。在静态障碍物场景下，DRL agent以100%的成功率成功解决了导航问题，其中建筑物之间的最小间隙为1.0m。在有动态障碍物的场景中，智能体的成功率达到79%。提出了进一步改进TWG规划器和DRL代理的性能和安全性的建议。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Robotics and Autonomous Systems 工程技术-机器人学

CiteScore

9.00

自引率

7.00%

发文量

164

审稿时长

4.5 months

期刊介绍： Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems. Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.