{"title":"MAPPO-ITD3-IMLFQ algorithm for multi-mobile robot path planning","authors":"Likun Hu, Chunyou Wei, Linfei Yin","doi":"10.1016/j.aei.2025.103398","DOIUrl":null,"url":null,"abstract":"<div><div>With the development of robotics, mobile robots (MRs) are widely applied in industrial and agricultural production. Reasonable path planning (PP) algorithms are the prerequisite for multi-mobile robot (MMR) systems to accomplish tasks. However, the existing PP algorithms of MMR systems still have the problems of being unable to dynamically assign tasks, not comprehensively considering the needs of kinematic constraints and dynamic obstacle avoidance, and poorly coordinating path conflicts. This study proposes a multi-agent proximal policy optimization-artificial potential field twin delayed deep deterministic policy gradient-improved multi-level feedback queue (MAPPO-ITD3-IMLFQ) algorithm for the PP of MMR systems. The proposed MAPPO-ITD3-IMLFQ algorithm combines the multi-agent proximal policy optimization (MAPPO) algorithm, the improved twin delayed deep deterministic policy gradient (ITD3) algorithm, and the improved multi-level feedback queue (IMLFQ) algorithm to form a PP algorithm for MMR system. The MRs apply the MAPPO algorithm to calculate task assignment (TA) schemes and provide sub-goal points for ITD3 algorithm. The MRs apply the ITD3 algorithm to calculate the path of the MRs. When the paths of different MRs conflict, the MR applies the IMLFQ algorithm to coordinate the movement of the MRs. The proposed MAPPO-ITD3-IMLFQ algorithm realizes the dynamic TA of the MMR system, meets the kinematic constraints and dynamic obstacle avoidance requirements of MRs, and coordinates path conflicts among the MRs. In this study, the proposed MAPPO-ITD3-IMLFQ algorithm is applied to different environments for the PP of MMRs. Experimental results show that: compared to the Hungarian algorithm and the genetic algorithm, the proposed MAPPO-ITD3-IMLFQ algorithm reduces the time spent on assigned tasks by 75.25 % and 77.44 %, respectively. Compared to the PP algorithms for reinforcement learning, the proposed MAPPO-ITD3-IMLFQ algorithm reduces the length of the planned path by 23.57 % on average.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103398"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625002915","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
With the development of robotics, mobile robots (MRs) are widely applied in industrial and agricultural production. Reasonable path planning (PP) algorithms are the prerequisite for multi-mobile robot (MMR) systems to accomplish tasks. However, the existing PP algorithms of MMR systems still have the problems of being unable to dynamically assign tasks, not comprehensively considering the needs of kinematic constraints and dynamic obstacle avoidance, and poorly coordinating path conflicts. This study proposes a multi-agent proximal policy optimization-artificial potential field twin delayed deep deterministic policy gradient-improved multi-level feedback queue (MAPPO-ITD3-IMLFQ) algorithm for the PP of MMR systems. The proposed MAPPO-ITD3-IMLFQ algorithm combines the multi-agent proximal policy optimization (MAPPO) algorithm, the improved twin delayed deep deterministic policy gradient (ITD3) algorithm, and the improved multi-level feedback queue (IMLFQ) algorithm to form a PP algorithm for MMR system. The MRs apply the MAPPO algorithm to calculate task assignment (TA) schemes and provide sub-goal points for ITD3 algorithm. The MRs apply the ITD3 algorithm to calculate the path of the MRs. When the paths of different MRs conflict, the MR applies the IMLFQ algorithm to coordinate the movement of the MRs. The proposed MAPPO-ITD3-IMLFQ algorithm realizes the dynamic TA of the MMR system, meets the kinematic constraints and dynamic obstacle avoidance requirements of MRs, and coordinates path conflicts among the MRs. In this study, the proposed MAPPO-ITD3-IMLFQ algorithm is applied to different environments for the PP of MMRs. Experimental results show that: compared to the Hungarian algorithm and the genetic algorithm, the proposed MAPPO-ITD3-IMLFQ algorithm reduces the time spent on assigned tasks by 75.25 % and 77.44 %, respectively. Compared to the PP algorithms for reinforcement learning, the proposed MAPPO-ITD3-IMLFQ algorithm reduces the length of the planned path by 23.57 % on average.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.