{"title":"Rule driven multi objective dynamic scheduling by data envelopment analysis and reinforcement learning","authors":"Xili Chen, X. Hao, H. Lin, Tomohiro Murata","doi":"10.1109/ICAL.2010.5585316","DOIUrl":null,"url":null,"abstract":"This paper presents a rule driven method of developing composite dispatching rule for multi objective dynamic scheduling. Data envelopment analysis is adopted to select elementary dispatching rules, where each rule is justified as efficient for optimizing specific operational objectives of interest. The selected rules are subsequently combined into a single composite rule using the weighted aggregation manner. An intelligent agent is trained using reinforcement learning to acquire the scheduling knowledge of assigning the appropriate weighting values for building the composite rule to cope with the WIP fluctuation of a machine. Implementation of the proposed method in a two objective dynamic job shop scheduling problem is demonstrated and the results are satisfactory.","PeriodicalId":393739,"journal":{"name":"2010 IEEE International Conference on Automation and Logistics","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Automation and Logistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAL.2010.5585316","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23
Abstract
This paper presents a rule driven method of developing composite dispatching rule for multi objective dynamic scheduling. Data envelopment analysis is adopted to select elementary dispatching rules, where each rule is justified as efficient for optimizing specific operational objectives of interest. The selected rules are subsequently combined into a single composite rule using the weighted aggregation manner. An intelligent agent is trained using reinforcement learning to acquire the scheduling knowledge of assigning the appropriate weighting values for building the composite rule to cope with the WIP fluctuation of a machine. Implementation of the proposed method in a two objective dynamic job shop scheduling problem is demonstrated and the results are satisfactory.