Hongbin Zhang, Jiliang Luo, Xinjie Lin, KaiCheng Tan, Chunrong Pan
{"title":"Dispatching and Path Planning of Automated Guided Vehicles based on Petri Nets and Deep Reinforcement Learning","authors":"Hongbin Zhang, Jiliang Luo, Xinjie Lin, KaiCheng Tan, Chunrong Pan","doi":"10.1109/ICNSC52481.2021.9702196","DOIUrl":null,"url":null,"abstract":"A formal approach is proposed for scheduling a team of automated guided vehicles (AGVs). First, a team of AGVs and their environment are modeled as a place timed Petri net (P-timed PN). Second, a method is presented to design controlling structures to avoid collisions and reduce deadlocks of P-timed PNs. Third, a multi-AGV path planning problem is formulated as a Markov decision process, and a neural network is trained based on a corresponding reinforcement learning with the PN model to estimate the action reward function for a given Multi-AGV system. Finally, an approach is obtained to dispatch transportation tasks among and to plan routes for AGVs according to rewards calculated by the neural network. An example is utilized to illustrate the proposed methods.","PeriodicalId":129062,"journal":{"name":"2021 IEEE International Conference on Networking, Sensing and Control (ICNSC)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Networking, Sensing and Control (ICNSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNSC52481.2021.9702196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
A formal approach is proposed for scheduling a team of automated guided vehicles (AGVs). First, a team of AGVs and their environment are modeled as a place timed Petri net (P-timed PN). Second, a method is presented to design controlling structures to avoid collisions and reduce deadlocks of P-timed PNs. Third, a multi-AGV path planning problem is formulated as a Markov decision process, and a neural network is trained based on a corresponding reinforcement learning with the PN model to estimate the action reward function for a given Multi-AGV system. Finally, an approach is obtained to dispatch transportation tasks among and to plan routes for AGVs according to rewards calculated by the neural network. An example is utilized to illustrate the proposed methods.