Lingbin Ning, Yidong Li, Min Zhou, Haifeng Song, Hai-rong Dong
{"title":"A Deep Reinforcement Learning Approach to High-speed Train Timetable Rescheduling under Disturbances","authors":"Lingbin Ning, Yidong Li, Min Zhou, Haifeng Song, Hai-rong Dong","doi":"10.1109/ITSC.2019.8917180","DOIUrl":null,"url":null,"abstract":"Train timetable rescheduling (TTR) aims to address the recovery of train operation order in reordering and retiming strategies during disturbances. Considering this problem, this paper introduces a deep reinforcement learning (DRL) approach to minimize the average total delay for all trains along the railway line. Specifically, the detailed train operation in block sections and stations is illustrated to establish a learning environment involving its state sets, action sets, and the reward function. The learning agent is responsible for adjusting running times, dwell times and departure sequences for trains and conflicts are resolved simultaneously. Numerical experiments are performed on an adapted timetable carried out on the Beijing-Shanghai high-speed railway line. The experimental results indicate that the proposed approach reduces the average total delay by 46.38% in real time, compared to the First-Come-First-Served (FCFS) method.","PeriodicalId":6717,"journal":{"name":"2019 IEEE Intelligent Transportation Systems Conference (ITSC)","volume":"17 1","pages":"3469-3474"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Intelligent Transportation Systems Conference (ITSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC.2019.8917180","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 37
Abstract
Train timetable rescheduling (TTR) aims to address the recovery of train operation order in reordering and retiming strategies during disturbances. Considering this problem, this paper introduces a deep reinforcement learning (DRL) approach to minimize the average total delay for all trains along the railway line. Specifically, the detailed train operation in block sections and stations is illustrated to establish a learning environment involving its state sets, action sets, and the reward function. The learning agent is responsible for adjusting running times, dwell times and departure sequences for trains and conflicts are resolved simultaneously. Numerical experiments are performed on an adapted timetable carried out on the Beijing-Shanghai high-speed railway line. The experimental results indicate that the proposed approach reduces the average total delay by 46.38% in real time, compared to the First-Come-First-Served (FCFS) method.