Lingbin Ning, Yidong Li, Min Zhou, Haifeng Song, Hai-rong Dong
{"title":"基于深度强化学习的干扰下高速列车时刻表重新调度","authors":"Lingbin Ning, Yidong Li, Min Zhou, Haifeng Song, Hai-rong Dong","doi":"10.1109/ITSC.2019.8917180","DOIUrl":null,"url":null,"abstract":"Train timetable rescheduling (TTR) aims to address the recovery of train operation order in reordering and retiming strategies during disturbances. Considering this problem, this paper introduces a deep reinforcement learning (DRL) approach to minimize the average total delay for all trains along the railway line. Specifically, the detailed train operation in block sections and stations is illustrated to establish a learning environment involving its state sets, action sets, and the reward function. The learning agent is responsible for adjusting running times, dwell times and departure sequences for trains and conflicts are resolved simultaneously. Numerical experiments are performed on an adapted timetable carried out on the Beijing-Shanghai high-speed railway line. The experimental results indicate that the proposed approach reduces the average total delay by 46.38% in real time, compared to the First-Come-First-Served (FCFS) method.","PeriodicalId":6717,"journal":{"name":"2019 IEEE Intelligent Transportation Systems Conference (ITSC)","volume":"17 1","pages":"3469-3474"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":"{\"title\":\"A Deep Reinforcement Learning Approach to High-speed Train Timetable Rescheduling under Disturbances\",\"authors\":\"Lingbin Ning, Yidong Li, Min Zhou, Haifeng Song, Hai-rong Dong\",\"doi\":\"10.1109/ITSC.2019.8917180\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Train timetable rescheduling (TTR) aims to address the recovery of train operation order in reordering and retiming strategies during disturbances. Considering this problem, this paper introduces a deep reinforcement learning (DRL) approach to minimize the average total delay for all trains along the railway line. Specifically, the detailed train operation in block sections and stations is illustrated to establish a learning environment involving its state sets, action sets, and the reward function. The learning agent is responsible for adjusting running times, dwell times and departure sequences for trains and conflicts are resolved simultaneously. Numerical experiments are performed on an adapted timetable carried out on the Beijing-Shanghai high-speed railway line. The experimental results indicate that the proposed approach reduces the average total delay by 46.38% in real time, compared to the First-Come-First-Served (FCFS) method.\",\"PeriodicalId\":6717,\"journal\":{\"name\":\"2019 IEEE Intelligent Transportation Systems Conference (ITSC)\",\"volume\":\"17 1\",\"pages\":\"3469-3474\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"37\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE Intelligent Transportation Systems Conference (ITSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITSC.2019.8917180\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Intelligent Transportation Systems Conference (ITSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC.2019.8917180","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Deep Reinforcement Learning Approach to High-speed Train Timetable Rescheduling under Disturbances
Train timetable rescheduling (TTR) aims to address the recovery of train operation order in reordering and retiming strategies during disturbances. Considering this problem, this paper introduces a deep reinforcement learning (DRL) approach to minimize the average total delay for all trains along the railway line. Specifically, the detailed train operation in block sections and stations is illustrated to establish a learning environment involving its state sets, action sets, and the reward function. The learning agent is responsible for adjusting running times, dwell times and departure sequences for trains and conflicts are resolved simultaneously. Numerical experiments are performed on an adapted timetable carried out on the Beijing-Shanghai high-speed railway line. The experimental results indicate that the proposed approach reduces the average total delay by 46.38% in real time, compared to the First-Come-First-Served (FCFS) method.