{"title":"Improved ACO algorithm fused with improved Q-Learning algorithm for Bessel curve global path planning of search and rescue robots","authors":"Wenkai Fang , Zhigao Liao , Yufeng Bai","doi":"10.1016/j.robot.2024.104822","DOIUrl":null,"url":null,"abstract":"<div><div>Addressing issues with traditional ant colony and reinforcement learning algorithms, such as low search efficiency and the tendency to produce insufficiently smooth paths that easily fall into local optima, this paper designs an improved ant colony optimization algorithm fusion with improved Q-Learning (IAC-IQL) algorithm for Bessel curve global path planning of search and rescue (SAR) robots. First, the heuristic function model in the ant colony algorithm is improved, the elite ant search strategy and the adaptive pheromone volatility factor strategy are introduced, and the initial path is searched in realize the motion environment with the help of the improved ant colony algorithm, and the initialized pheromone matrix is constructed. Second, the improved ant colony algorithm and Q-Learning (QL) algorithm are fused by utilizing the similarity between the pheromone matrix in the improved ant colony algorithm and the Q-matrix in the QL algorithm. A heuristic learning evaluation model is designed to dynamically adjust the learning factor and provide guidance for the search path. Additionally, a dynamic adaptive greedy strategy is introduced to balance the exploration and exploitation of the robot in the environment. Finally, the paths are smoothed using third-order Bessel curves to eliminate the problem of excessive steering angles. Through three sets of comparative simulation experiments conducted in Pycharm platform, the effectiveness, superiority, and practicality of the IAC-IQL algorithm were verified. The experimental results demonstrated that the IAC-IQL algorithm integrates the strong search capability of ant colony algorithm and the self-learning characteristics of QL algorithm. SAR robots equipped with the improved IAC-IQL algorithm exhibit significantly enhanced iterative search efficiency in grid simulation environment and image sampling simulation environment. The global path optimization indicators demonstrate high efficiency, and the paths are smoother.</div></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"182 ","pages":"Article 104822"},"PeriodicalIF":4.3000,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889024002069","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Addressing issues with traditional ant colony and reinforcement learning algorithms, such as low search efficiency and the tendency to produce insufficiently smooth paths that easily fall into local optima, this paper designs an improved ant colony optimization algorithm fusion with improved Q-Learning (IAC-IQL) algorithm for Bessel curve global path planning of search and rescue (SAR) robots. First, the heuristic function model in the ant colony algorithm is improved, the elite ant search strategy and the adaptive pheromone volatility factor strategy are introduced, and the initial path is searched in realize the motion environment with the help of the improved ant colony algorithm, and the initialized pheromone matrix is constructed. Second, the improved ant colony algorithm and Q-Learning (QL) algorithm are fused by utilizing the similarity between the pheromone matrix in the improved ant colony algorithm and the Q-matrix in the QL algorithm. A heuristic learning evaluation model is designed to dynamically adjust the learning factor and provide guidance for the search path. Additionally, a dynamic adaptive greedy strategy is introduced to balance the exploration and exploitation of the robot in the environment. Finally, the paths are smoothed using third-order Bessel curves to eliminate the problem of excessive steering angles. Through three sets of comparative simulation experiments conducted in Pycharm platform, the effectiveness, superiority, and practicality of the IAC-IQL algorithm were verified. The experimental results demonstrated that the IAC-IQL algorithm integrates the strong search capability of ant colony algorithm and the self-learning characteristics of QL algorithm. SAR robots equipped with the improved IAC-IQL algorithm exhibit significantly enhanced iterative search efficiency in grid simulation environment and image sampling simulation environment. The global path optimization indicators demonstrate high efficiency, and the paths are smoother.
期刊介绍:
Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems.
Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.