Li Zhu , Jia Miao , Cheng Chen , Baicheng Yan , Taiyuan Gong , F.Richard Yu , Tao Tang
{"title":"IRS-guided policy search in model based reinforcement learning for virtually coupled train sets control","authors":"Li Zhu , Jia Miao , Cheng Chen , Baicheng Yan , Taiyuan Gong , F.Richard Yu , Tao Tang","doi":"10.1016/j.cie.2025.111372","DOIUrl":null,"url":null,"abstract":"<div><div>Virtually Coupled Train Set (VCTS) is a railway operation concept that allows shorter train intervals. However, the complex dynamic model of train convoy in urban rail transit systems (URTS) makes it challenging to achieve efficient cooperative controller for VCTS. Existing methods for VCTS control assume accurate train dynamics models are available, which is difficult to achieve in real-world VCTS scenarios. In this paper, we present a framework that employs model-based reinforcement learning (MBRL) to control VCTS. In comparison with classical methods and model-free reinforcement learning (MFRL), MBRL can learn more efficiently with limited data, making it particularly useful in situations where interactions with the real environment are costly or dangerous, such as in VCTS. Guided Policy Search (GPS) is used in the MBRL framework to derive the VCTS control policy while ensuring stability, punctuality, and safe distance protection of trains, due to its remarkable ability to quickly learn from the environment. We construct a deep neural network (DNN) based train convoy model to approximate train convoy dynamics. An iterative random shooting (IRS) based optimization method is applied to generate a set of candidate policies, which are then used to train the GPS to obtain the target policy for VCTS. Our experiments show that the proposed IRS-guided policy search (IRS-GPS) in MBRL can provide effective VCTS cooperative control. Our proposed IRS-GPS in MBRL for VCTS cooperative control ensures that virtually coupled trains operate safely, stably, and on time at short intervals.</div></div>","PeriodicalId":55220,"journal":{"name":"Computers & Industrial Engineering","volume":"208 ","pages":"Article 111372"},"PeriodicalIF":6.5000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Industrial Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360835225005182","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Virtually Coupled Train Set (VCTS) is a railway operation concept that allows shorter train intervals. However, the complex dynamic model of train convoy in urban rail transit systems (URTS) makes it challenging to achieve efficient cooperative controller for VCTS. Existing methods for VCTS control assume accurate train dynamics models are available, which is difficult to achieve in real-world VCTS scenarios. In this paper, we present a framework that employs model-based reinforcement learning (MBRL) to control VCTS. In comparison with classical methods and model-free reinforcement learning (MFRL), MBRL can learn more efficiently with limited data, making it particularly useful in situations where interactions with the real environment are costly or dangerous, such as in VCTS. Guided Policy Search (GPS) is used in the MBRL framework to derive the VCTS control policy while ensuring stability, punctuality, and safe distance protection of trains, due to its remarkable ability to quickly learn from the environment. We construct a deep neural network (DNN) based train convoy model to approximate train convoy dynamics. An iterative random shooting (IRS) based optimization method is applied to generate a set of candidate policies, which are then used to train the GPS to obtain the target policy for VCTS. Our experiments show that the proposed IRS-guided policy search (IRS-GPS) in MBRL can provide effective VCTS cooperative control. Our proposed IRS-GPS in MBRL for VCTS cooperative control ensures that virtually coupled trains operate safely, stably, and on time at short intervals.
期刊介绍:
Computers & Industrial Engineering (CAIE) is dedicated to researchers, educators, and practitioners in industrial engineering and related fields. Pioneering the integration of computers in research, education, and practice, industrial engineering has evolved to make computers and electronic communication integral to its domain. CAIE publishes original contributions focusing on the development of novel computerized methodologies to address industrial engineering problems. It also highlights the applications of these methodologies to issues within the broader industrial engineering and associated communities. The journal actively encourages submissions that push the boundaries of fundamental theories and concepts in industrial engineering techniques.