Sequential Decision Making for Elevator Control

IF 1.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of Advances in Information Technology Pub Date : 2023-01-01 DOI:10.12720/jait.14.5.1124-1131

Emre Oner Tartan, Cebrail Ciflikli

引用次数: 0

Abstract

—In the last decade Reinforcement Learning (RL) has significantly changed the conventional control paradigm in many fields. RL approach is spreading with many applications such as autonomous driving and industry automation. Markov Decision Process (MDP) forms a mathematical idealized basis for RL if the explicit model is available. Dynamic programming allows to find an optimal policy for sequential decision making in a MDP. In this study we consider the elevator control as a sequential decision making problem, describe it as a MDP with finite state space and solve it using dynamic programming. At each decision making time step we aim to take the optimal action to minimize the total of hall call waiting times in the episodic task. We consider a sample 6-floor building and simulate the proposed method in comparison with the conventional Nearest Car Method (NCM).

查看原文本刊更多论文

电梯控制的顺序决策

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊