Decentralized Multi-Robot Navigation Based on Deep Reinforcement Learning and Trajectory Optimization.

IF 3.9 · Zone 3 (Medicine) · JCR Q1 · ENGINEERING, MULTIDISCIPLINARY

Biomimetics Pub Date : 2025-06-04 DOI:10.3390/biomimetics10060366

Yifei Bi, Jianing Luo, Jiwei Zhu, Junxiu Liu, Wei Li



Abstract

Multi-robot systems offer strong decision-making capabilities across many applications, but avoiding collisions during movement remains a critical challenge. Existing decentralized obstacle avoidance strategies, while computationally cheap, often fail to ensure safety. To address this, the paper uses graph neural networks (GNNs) and deep reinforcement learning (DRL): high-dimensional features are aggregated as inputs for reinforcement learning (RL), which generates paths. An artificial potential field (APF) then introduces safety constraints to optimize these trajectories, and a constrained nonlinear optimization method further refines the APF-adjusted paths. The combined method is the GNN-RL-APF-Lagrangian algorithm. Experimental results indicate that combining APF with nonlinear optimization significantly improves the safety and obstacle avoidance capabilities of multi-robot systems in complex environments. The proposed algorithm achieves a 96.43% success rate in sparse obstacle environments and 89.77% in dense obstacle scenarios, improvements of 59% and 60%, respectively, over baseline GNN-RL approaches. The method scales to 30 robots while preserving distributed execution.
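
To make the APF adjustment step more concrete, here is a minimal sketch of how a repulsive potential field can push the waypoints of an already-generated path away from obstacles. It is not the authors' implementation: the abstract gives no formulas, so the sketch assumes the classical repulsive potential U = 0.5 * eta * (1/d - 1/d0)^2 for clearance d < d0, circular obstacles, and a 2D path. The function name `apf_refine` and the parameters `d0`, `eta`, `max_step`, and `iters` are hypothetical, and the GNN-RL path generator and the Lagrangian refinement stage are not modeled.

```python
import math


def apf_refine(path, obstacles, d0=1.0, eta=0.5, max_step=0.2, iters=50):
    """Push interior waypoints away from circular obstacles (illustrative only).

    path:      list of (x, y) waypoints; the first and last stay fixed.
    obstacles: list of (cx, cy, radius) circles.
    d0:        influence distance of the repulsive field (assumed value).
    eta:       repulsive gain (assumed value).
    max_step:  cap on how far a waypoint moves per iteration (assumed value).
    """
    pts = [list(p) for p in path]
    for _ in range(iters):
        for p in pts[1:-1]:
            fx = fy = 0.0
            for cx, cy, r in obstacles:
                dx, dy = p[0] - cx, p[1] - cy
                dist = math.hypot(dx, dy)
                # Unit vector pointing away from the obstacle centre;
                # fall back to +x if the waypoint sits exactly on the centre.
                if dist < 1e-9:
                    ux, uy = 1.0, 0.0
                else:
                    ux, uy = dx / dist, dy / dist
                # Clearance to the obstacle surface, floored to avoid
                # division by zero when the waypoint is inside the obstacle.
                clearance = max(dist - r, 1e-3)
                if clearance < d0:
                    # Magnitude of the negative gradient of the repulsive potential.
                    mag = eta * (1.0 / clearance - 1.0 / d0) / clearance**2
                    fx += mag * ux
                    fy += mag * uy
            norm = math.hypot(fx, fy)
            if norm > 0.0:
                # Step length equals the force magnitude, capped at max_step.
                scale = min(max_step, norm) / norm
                p[0] += fx * scale
                p[1] += fy * scale
    return [tuple(p) for p in pts]


if __name__ == "__main__":
    straight = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0), (4.0, 0.0)]
    for waypoint in apf_refine(straight, [(2.0, 0.1, 0.5)]):
        print(waypoint)
```

In a pipeline like the one the abstract describes, a step of this kind would sit between the learned path generator and the constrained optimizer. The sketch moves only waypoints, so it does not check the segments between them and enforces no smoothness or dynamics constraints.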
