{"title":"基于强化学习算法的电梯群调度系统研究","authors":"Liu Zheng, Shu Guang, Dong Hui","doi":"10.1109/MIC.2013.6758037","DOIUrl":null,"url":null,"abstract":"Elevator group control system (EGCS) is a complex decision-making system, which has characteristics of multi-objective, randomness and nonlinear. It is difficult to adopt precise mathematical models describing. This paper introduces a new elevator dynamic scheduling system based on reinforcement learning algorithm. We trade reinforcement learning algorithm as the way to learn the optimal strategy in the course of interacting with the environment. Average waiting time and average riding time are optimized indicators. Combine with the value iteration algorithm called Q-learning to construct the whole algorithm for elevator group scheduling. The simulation result shows great superior and feasibility for elevator dynamic scheduling system based on reinforcement learning algorithm.","PeriodicalId":404630,"journal":{"name":"Proceedings of 2013 2nd International Conference on Measurement, Information and Control","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Research of elevator group scheduling system based on reinforcement learning algorithm\",\"authors\":\"Liu Zheng, Shu Guang, Dong Hui\",\"doi\":\"10.1109/MIC.2013.6758037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Elevator group control system (EGCS) is a complex decision-making system, which has characteristics of multi-objective, randomness and nonlinear. It is difficult to adopt precise mathematical models describing. This paper introduces a new elevator dynamic scheduling system based on reinforcement learning algorithm. We trade reinforcement learning algorithm as the way to learn the optimal strategy in the course of interacting with the environment. Average waiting time and average riding time are optimized indicators. Combine with the value iteration algorithm called Q-learning to construct the whole algorithm for elevator group scheduling. The simulation result shows great superior and feasibility for elevator dynamic scheduling system based on reinforcement learning algorithm.\",\"PeriodicalId\":404630,\"journal\":{\"name\":\"Proceedings of 2013 2nd International Conference on Measurement, Information and Control\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of 2013 2nd International Conference on Measurement, Information and Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MIC.2013.6758037\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 2013 2nd International Conference on Measurement, Information and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIC.2013.6758037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research of elevator group scheduling system based on reinforcement learning algorithm
Elevator group control system (EGCS) is a complex decision-making system, which has characteristics of multi-objective, randomness and nonlinear. It is difficult to adopt precise mathematical models describing. This paper introduces a new elevator dynamic scheduling system based on reinforcement learning algorithm. We trade reinforcement learning algorithm as the way to learn the optimal strategy in the course of interacting with the environment. Average waiting time and average riding time are optimized indicators. Combine with the value iteration algorithm called Q-learning to construct the whole algorithm for elevator group scheduling. The simulation result shows great superior and feasibility for elevator dynamic scheduling system based on reinforcement learning algorithm.