Resource-constrained dynamic planning and model-free control in turbulent urban environments

IF 5.2 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Thomas Nakken Larsen , Mandar Tabib , Adil Rasheed
{"title":"Resource-constrained dynamic planning and model-free control in turbulent urban environments","authors":"Thomas Nakken Larsen ,&nbsp;Mandar Tabib ,&nbsp;Adil Rasheed","doi":"10.1016/j.robot.2025.105130","DOIUrl":null,"url":null,"abstract":"<div><div>This work proposes a descriptive digital twin of an urban region with a sufficiently small footprint to run online in resource-constrained quadrotor systems. The digital twin describes the building geometries and turbulent kinetic energy (TKE) within an urban region, enabled via our proposed surrogate model (SM) for TKE reconstruction. This forms the basis of a simulation environment for autonomous path following and collision avoidance using Deep Reinforcement Learning (DRL). A Voronoi-based turbulence-weighted graph (TWG) is developed for safe path planning and is capable of reacting to dynamic changes in wind direction and, consequently, the turbulence field. Lastly, the environment simulates oncoming traffic of dynamic unknown obstacles for a vision-enabled DRL agent to evade. Several DRL agents with different observation and action spaces are trained and evaluated.</div><div>The SM enables rapid reconstruction of the turbulence with an accuracy comparable to full-order methods. The TWG plans safe paths that reduce the worst-case TKE exposure by 44% at the cost of increasing the average path length by 33% compared to a shortest-distance approach. The DRL agents successfully solve the navigation problem with a 100% success rate in the static obstacle scenario, where the minimum clearance between buildings is <span><math><mrow><mn>1</mn><mo>.</mo><mn>0</mn><mi>m</mi></mrow></math></span>. In scenarios with dynamic obstacles, the agent achieves a 79% success rate. Suggestions for further performance and safety improvements in the TWG planner and DRL agents are presented.</div></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"194 ","pages":"Article 105130"},"PeriodicalIF":5.2000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889025002271","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

This work proposes a descriptive digital twin of an urban region with a sufficiently small footprint to run online in resource-constrained quadrotor systems. The digital twin describes the building geometries and turbulent kinetic energy (TKE) within an urban region, enabled via our proposed surrogate model (SM) for TKE reconstruction. This forms the basis of a simulation environment for autonomous path following and collision avoidance using Deep Reinforcement Learning (DRL). A Voronoi-based turbulence-weighted graph (TWG) is developed for safe path planning and is capable of reacting to dynamic changes in wind direction and, consequently, the turbulence field. Lastly, the environment simulates oncoming traffic of dynamic unknown obstacles for a vision-enabled DRL agent to evade. Several DRL agents with different observation and action spaces are trained and evaluated.
The SM enables rapid reconstruction of the turbulence with an accuracy comparable to full-order methods. The TWG plans safe paths that reduce the worst-case TKE exposure by 44% at the cost of increasing the average path length by 33% compared to a shortest-distance approach. The DRL agents successfully solve the navigation problem with a 100% success rate in the static obstacle scenario, where the minimum clearance between buildings is 1.0m. In scenarios with dynamic obstacles, the agent achieves a 79% success rate. Suggestions for further performance and safety improvements in the TWG planner and DRL agents are presented.
动荡城市环境中资源约束下的动态规划与无模型控制
这项工作提出了一个描述性的数字孪生城市地区,其足迹足够小,可以在资源受限的四旋翼系统中在线运行。数字孪生体描述了城市区域内的建筑几何形状和湍流动能(TKE),通过我们提出的替代模型(SM)实现了TKE重建。这构成了使用深度强化学习(DRL)进行自主路径跟踪和避免碰撞的模拟环境的基础。基于voronoi的湍流加权图(TWG)用于安全路径规划,能够对风向的动态变化做出反应,从而对湍流场做出反应。最后,该环境模拟迎面而来的动态未知障碍物,使具有视觉功能的DRL代理能够规避。对具有不同观察和动作空间的多个DRL代理进行了训练和评估。SM能够以与全阶方法相当的精度快速重建湍流。与最短距离的方法相比,TWG计划的安全路径将最坏情况下的TKE暴露减少44%,代价是平均路径长度增加33%。在静态障碍物场景下,DRL agent以100%的成功率成功解决了导航问题,其中建筑物之间的最小间隙为1.0m。在有动态障碍物的场景中,智能体的成功率达到79%。提出了进一步改进TWG规划器和DRL代理的性能和安全性的建议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Robotics and Autonomous Systems
Robotics and Autonomous Systems 工程技术-机器人学
CiteScore
9.00
自引率
7.00%
发文量
164
审稿时长
4.5 months
期刊介绍: Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems. Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信