{"title":"基于强化学习的地面机器人导航","authors":"Sharanya S, Divya Radhakrishna Varma, Ajay Paul","doi":"10.1109/ACIRS58671.2023.10240285","DOIUrl":null,"url":null,"abstract":"Reinforcement learning (RL) is a machine learning technique that enables an agent to learn optimal behaviors within an environment through a process of trial and error. This is achieved by the agent receiving rewards or punishments based on its actions. In this study, we compare the performance of two RL algorithms, SARSA and Q-Learning, in a simulated environment using the Gazebo Simulator. The goal of the simulation is to navigate a ground robot towards pre-defined goals. By manipulating various training parameters, we investigate the impact on learning speed and robot behavior. To ensure meaningful comparisons, we vary the navigation goal and the complexity of the simulation environment. Through extensive simulations, our results highlight the effectiveness of RL-based navigation for ground robots and offer insights into the influential parameters that significantly affect optimal navigation performance. It underscores the importance of algorithm selection and parameter optimization in achieving optimal navigation performance.","PeriodicalId":148401,"journal":{"name":"2023 8th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Navigation of Ground Robots with Reinforcement Learning\",\"authors\":\"Sharanya S, Divya Radhakrishna Varma, Ajay Paul\",\"doi\":\"10.1109/ACIRS58671.2023.10240285\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement learning (RL) is a machine learning technique that enables an agent to learn optimal behaviors within an environment through a process of trial and error. This is achieved by the agent receiving rewards or punishments based on its actions. In this study, we compare the performance of two RL algorithms, SARSA and Q-Learning, in a simulated environment using the Gazebo Simulator. The goal of the simulation is to navigate a ground robot towards pre-defined goals. By manipulating various training parameters, we investigate the impact on learning speed and robot behavior. To ensure meaningful comparisons, we vary the navigation goal and the complexity of the simulation environment. Through extensive simulations, our results highlight the effectiveness of RL-based navigation for ground robots and offer insights into the influential parameters that significantly affect optimal navigation performance. It underscores the importance of algorithm selection and parameter optimization in achieving optimal navigation performance.\",\"PeriodicalId\":148401,\"journal\":{\"name\":\"2023 8th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 8th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACIRS58671.2023.10240285\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 8th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACIRS58671.2023.10240285","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Navigation of Ground Robots with Reinforcement Learning
Reinforcement learning (RL) is a machine learning technique that enables an agent to learn optimal behaviors within an environment through a process of trial and error. This is achieved by the agent receiving rewards or punishments based on its actions. In this study, we compare the performance of two RL algorithms, SARSA and Q-Learning, in a simulated environment using the Gazebo Simulator. The goal of the simulation is to navigate a ground robot towards pre-defined goals. By manipulating various training parameters, we investigate the impact on learning speed and robot behavior. To ensure meaningful comparisons, we vary the navigation goal and the complexity of the simulation environment. Through extensive simulations, our results highlight the effectiveness of RL-based navigation for ground robots and offer insights into the influential parameters that significantly affect optimal navigation performance. It underscores the importance of algorithm selection and parameter optimization in achieving optimal navigation performance.