{"title":"具有混合请求和公平性的灵活公交服务动态调度:启发式指导的多代理强化学习与模仿学习","authors":"Weitiao Wu , Yanchen Zhu , Ronghui Liu","doi":"10.1016/j.trb.2024.103069","DOIUrl":null,"url":null,"abstract":"<div><p>Flexible bus is a class of demand-responsive transit that provides door-to-door service. It is gaining popularity now but also encounters many challenges, such as high dynamism, immediacy requirements, and financial sustainability. Scientific literature designs flexible bus services only for reservation demand, overlooking the potential market for immediate demand that can improve ride pooling and financial sustainability. The increasing availability of historical travel demand data provides opportunities for leveraging future demand prediction in optimizing fleet utilization. This study investigates prediction failure risk-aware dynamic scheduling flexible bus services with hybrid requests allowing for both reservation and immediate demand. Equity in request waiting time for immediate demand is emphasized as a key objective. We model this problem as a multi-objective Markov decision process to jointly optimize vehicle routing, timetable, holding control and passenger assignment. To solve this problem, we develop a novel heuristics-guided multi-agent reinforcement learning (MARL) framework entailing three salient features: 1) incorporating the demand forecasting and prediction error correction modules into the MARL framework; 2) combining the benefits of MARL, local search algorithm, and imitation learning (IL) to improve solution quality; 3) incorporating an improved strategy in action selection with time-related information about spatio-temporal relationships between vehicles and passengers to enhance training efficiency. These enhancements are general methodological contributions to the artificial intelligence and operations research communities. Numerical experiments show that our proposed method is comparable to prevailing benchmark methods both with respect to training stability and solution quality. The benefit of demand prediction is significant even when the prediction is imperfect. Our model and algorithm are applied to a real-world case study in Guangzhou, China. Managerial insights are also provided.</p></div>","PeriodicalId":54418,"journal":{"name":"Transportation Research Part B-Methodological","volume":"190 ","pages":"Article 103069"},"PeriodicalIF":5.8000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dynamic scheduling of flexible bus services with hybrid requests and fairness: Heuristics-guided multi-agent reinforcement learning with imitation learning\",\"authors\":\"Weitiao Wu , Yanchen Zhu , Ronghui Liu\",\"doi\":\"10.1016/j.trb.2024.103069\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Flexible bus is a class of demand-responsive transit that provides door-to-door service. It is gaining popularity now but also encounters many challenges, such as high dynamism, immediacy requirements, and financial sustainability. Scientific literature designs flexible bus services only for reservation demand, overlooking the potential market for immediate demand that can improve ride pooling and financial sustainability. The increasing availability of historical travel demand data provides opportunities for leveraging future demand prediction in optimizing fleet utilization. This study investigates prediction failure risk-aware dynamic scheduling flexible bus services with hybrid requests allowing for both reservation and immediate demand. Equity in request waiting time for immediate demand is emphasized as a key objective. We model this problem as a multi-objective Markov decision process to jointly optimize vehicle routing, timetable, holding control and passenger assignment. To solve this problem, we develop a novel heuristics-guided multi-agent reinforcement learning (MARL) framework entailing three salient features: 1) incorporating the demand forecasting and prediction error correction modules into the MARL framework; 2) combining the benefits of MARL, local search algorithm, and imitation learning (IL) to improve solution quality; 3) incorporating an improved strategy in action selection with time-related information about spatio-temporal relationships between vehicles and passengers to enhance training efficiency. These enhancements are general methodological contributions to the artificial intelligence and operations research communities. Numerical experiments show that our proposed method is comparable to prevailing benchmark methods both with respect to training stability and solution quality. The benefit of demand prediction is significant even when the prediction is imperfect. Our model and algorithm are applied to a real-world case study in Guangzhou, China. Managerial insights are also provided.</p></div>\",\"PeriodicalId\":54418,\"journal\":{\"name\":\"Transportation Research Part B-Methodological\",\"volume\":\"190 \",\"pages\":\"Article 103069\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part B-Methodological\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0191261524001930\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part B-Methodological","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0191261524001930","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
Dynamic scheduling of flexible bus services with hybrid requests and fairness: Heuristics-guided multi-agent reinforcement learning with imitation learning
Flexible bus is a class of demand-responsive transit that provides door-to-door service. It is gaining popularity now but also encounters many challenges, such as high dynamism, immediacy requirements, and financial sustainability. Scientific literature designs flexible bus services only for reservation demand, overlooking the potential market for immediate demand that can improve ride pooling and financial sustainability. The increasing availability of historical travel demand data provides opportunities for leveraging future demand prediction in optimizing fleet utilization. This study investigates prediction failure risk-aware dynamic scheduling flexible bus services with hybrid requests allowing for both reservation and immediate demand. Equity in request waiting time for immediate demand is emphasized as a key objective. We model this problem as a multi-objective Markov decision process to jointly optimize vehicle routing, timetable, holding control and passenger assignment. To solve this problem, we develop a novel heuristics-guided multi-agent reinforcement learning (MARL) framework entailing three salient features: 1) incorporating the demand forecasting and prediction error correction modules into the MARL framework; 2) combining the benefits of MARL, local search algorithm, and imitation learning (IL) to improve solution quality; 3) incorporating an improved strategy in action selection with time-related information about spatio-temporal relationships between vehicles and passengers to enhance training efficiency. These enhancements are general methodological contributions to the artificial intelligence and operations research communities. Numerical experiments show that our proposed method is comparable to prevailing benchmark methods both with respect to training stability and solution quality. The benefit of demand prediction is significant even when the prediction is imperfect. Our model and algorithm are applied to a real-world case study in Guangzhou, China. Managerial insights are also provided.
期刊介绍:
Transportation Research: Part B publishes papers on all methodological aspects of the subject, particularly those that require mathematical analysis. The general theme of the journal is the development and solution of problems that are adequately motivated to deal with important aspects of the design and/or analysis of transportation systems. Areas covered include: traffic flow; design and analysis of transportation networks; control and scheduling; optimization; queuing theory; logistics; supply chains; development and application of statistical, econometric and mathematical models to address transportation problems; cost models; pricing and/or investment; traveler or shipper behavior; cost-benefit methodologies.