Rule driven multi objective dynamic scheduling by data envelopment analysis and reinforcement learning

2010 IEEE International Conference on Automation and Logistics Pub Date : 2010-09-23 DOI:10.1109/ICAL.2010.5585316

Xili Chen, X. Hao, H. Lin, Tomohiro Murata

引用次数: 23

Abstract

This paper presents a rule driven method of developing composite dispatching rule for multi objective dynamic scheduling. Data envelopment analysis is adopted to select elementary dispatching rules, where each rule is justified as efficient for optimizing specific operational objectives of interest. The selected rules are subsequently combined into a single composite rule using the weighted aggregation manner. An intelligent agent is trained using reinforcement learning to acquire the scheduling knowledge of assigning the appropriate weighting values for building the composite rule to cope with the WIP fluctuation of a machine. Implementation of the proposed method in a two objective dynamic job shop scheduling problem is demonstrated and the results are satisfactory.

查看原文本刊更多论文

基于数据包络分析和强化学习的规则驱动多目标动态调度

提出了一种规则驱动的多目标动态调度复合调度规则制定方法。采用数据包络分析选择基本调度规则，其中每个规则都被证明是有效的，可以优化感兴趣的特定操作目标。随后，使用加权聚合方式将所选规则组合为单个组合规则。利用强化学习训练智能体，获取分配适当权重值的调度知识，构建复合规则，以应对机器在制品的波动。最后将该方法应用于一个双目标动态作业车间调度问题，得到了满意的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 IEEE International Conference on Automation and Logistics

自引率

0.00%

发文量