{"title":"The application of path planning algorithm based on deep reinforcement learning for mobile robots","authors":"Siyi Tian, Shuo Lei, Qiming Huang, Anyi Huang","doi":"10.1109/cost57098.2022.00084","DOIUrl":null,"url":null,"abstract":"To meet the need for autonomous route planning for tour guide robots in tourist venues, this paper proposes a path planning algorithm based on deep reinforcement learning. The traditional Deep Q-learning Network (DQN) algorithm two defects - overfitting and overestimation. This paper adopts a method that discards the experience pool and treats behavioural values equally, which not only solves the shortcomings of the traditional method, but also satisfies the need for mobile robots to lead tourists on tours through autonomous learning. The paper analyses the principle and process of the method and compares it with the traditional method through experiments to verify that the method outperforms the traditional method in terms of accuracy and speed.","PeriodicalId":135595,"journal":{"name":"2022 International Conference on Culture-Oriented Science and Technology (CoST)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Culture-Oriented Science and Technology (CoST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cost57098.2022.00084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
To meet the need for autonomous route planning for tour guide robots in tourist venues, this paper proposes a path planning algorithm based on deep reinforcement learning. The traditional Deep Q-learning Network (DQN) algorithm two defects - overfitting and overestimation. This paper adopts a method that discards the experience pool and treats behavioural values equally, which not only solves the shortcomings of the traditional method, but also satisfies the need for mobile robots to lead tourists on tours through autonomous learning. The paper analyses the principle and process of the method and compares it with the traditional method through experiments to verify that the method outperforms the traditional method in terms of accuracy and speed.