{"title":"Proximity-Based Reward System and Reinforcement Learning for Path Planning","authors":"Marc-Andrė Blais, M. Akhloufi","doi":"10.1109/ICCAE56788.2023.10111485","DOIUrl":null,"url":null,"abstract":"Path planning is an important and complex task in the field of robotics and automation. It consists of finding the optimal path given a starting location, obstacles and a final destination. Reinforcement learning is a trial and error approach that has seen success in the field of path planning. Multiple reinforcement learning algorithms such as Q-learning and SARSA exist and have achieved great results. These algorithms typically use a uniform reward system such that every move, collision and goal return a specific reward. We propose a proximity-based reward system for classical reinforcement learning algorithms on path planning scenarios. We compare our reward systems combined with different optimization techniques and algorithms for path planning. These approaches are compared using the total completion rate for the mazes and average training time. We achieved interesting results with our reward systems and optimization techniques allowing us to decrease the training time.","PeriodicalId":406112,"journal":{"name":"2023 15th International Conference on Computer and Automation Engineering (ICCAE)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 15th International Conference on Computer and Automation Engineering (ICCAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAE56788.2023.10111485","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Path planning is an important and complex task in the field of robotics and automation. It consists of finding the optimal path given a starting location, obstacles and a final destination. Reinforcement learning is a trial and error approach that has seen success in the field of path planning. Multiple reinforcement learning algorithms such as Q-learning and SARSA exist and have achieved great results. These algorithms typically use a uniform reward system such that every move, collision and goal return a specific reward. We propose a proximity-based reward system for classical reinforcement learning algorithms on path planning scenarios. We compare our reward systems combined with different optimization techniques and algorithms for path planning. These approaches are compared using the total completion rate for the mazes and average training time. We achieved interesting results with our reward systems and optimization techniques allowing us to decrease the training time.