{"title":"Multi-agent reinforcement learning with causal communication for ride-sourcing pricing in mixed autonomy mobility","authors":"Ningke Xie , Yong Chen , Wei Tang , Xiqun (Michael) Chen","doi":"10.1016/j.trc.2025.105164","DOIUrl":null,"url":null,"abstract":"<div><div>The burgeoning self-driving technology has provided a solid impetus for the ride-sourcing market and new demand and supply management challenges. Under the context of a long-haul mixed operation of autonomous vehicles and human-driven vehicles, this paper focuses on profit-maximizing pricing for both demand and supply sides, in which the prices are differentiated by service type, time, and location. Diverging from most studies limited to centralized control for small-scale problems, we align with distributed and scalable requirements in practice and tackle the coordination challenge from a causal communication perspective. Based on the spatial supply–demand interdependencies inherent in the ride-sourcing market, operation areas are modeled as collaborative intelligent agents. The pricing problem is formulated as a decentralized partially observable Markov game augmented with neighborhood communication. Then a multi-agent reinforcement learning with causal communication method is developed to jointly optimize pricing policy and communication mechanism through end-to-end learning. The bidirectional communication mechanism is ensured to be effective and succinct by maximizing the causal effect of the communication message. Leveraging theoretical analysis, the proposed method is proven to cope with partial observability and non-stationary environments through collaborative communication. Besides, an agent-based simulator for mixed autonomy mobility is established on a real-world large-scale network, emulating the causal communication process among decentralized areas, as well as the heterogeneity, elasticity, and uncertainty of ride-sourcing demand and supply. Two representative scenarios are designed to demonstrate the dynamic evolutions of mixed autonomy mobility: (a) smaller-sized autonomous vehicles and conservative passenger acceptance (conservative stage), and (b) larger-sized autonomous vehicles and liberal passenger acceptance (liberal stage). The results highlight that incorporating the causal communication mechanism can speed up the learning process and guide informed pricing decisions. Furthermore, the proposed method gains managerial insights into proactively regulating pricing schemes for a smooth transformation into fully autonomous ride-sourcing services.</div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"176 ","pages":"Article 105164"},"PeriodicalIF":7.6000,"publicationDate":"2025-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X25001688","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The burgeoning self-driving technology has provided a solid impetus for the ride-sourcing market and new demand and supply management challenges. Under the context of a long-haul mixed operation of autonomous vehicles and human-driven vehicles, this paper focuses on profit-maximizing pricing for both demand and supply sides, in which the prices are differentiated by service type, time, and location. Diverging from most studies limited to centralized control for small-scale problems, we align with distributed and scalable requirements in practice and tackle the coordination challenge from a causal communication perspective. Based on the spatial supply–demand interdependencies inherent in the ride-sourcing market, operation areas are modeled as collaborative intelligent agents. The pricing problem is formulated as a decentralized partially observable Markov game augmented with neighborhood communication. Then a multi-agent reinforcement learning with causal communication method is developed to jointly optimize pricing policy and communication mechanism through end-to-end learning. The bidirectional communication mechanism is ensured to be effective and succinct by maximizing the causal effect of the communication message. Leveraging theoretical analysis, the proposed method is proven to cope with partial observability and non-stationary environments through collaborative communication. Besides, an agent-based simulator for mixed autonomy mobility is established on a real-world large-scale network, emulating the causal communication process among decentralized areas, as well as the heterogeneity, elasticity, and uncertainty of ride-sourcing demand and supply. Two representative scenarios are designed to demonstrate the dynamic evolutions of mixed autonomy mobility: (a) smaller-sized autonomous vehicles and conservative passenger acceptance (conservative stage), and (b) larger-sized autonomous vehicles and liberal passenger acceptance (liberal stage). The results highlight that incorporating the causal communication mechanism can speed up the learning process and guide informed pricing decisions. Furthermore, the proposed method gains managerial insights into proactively regulating pricing schemes for a smooth transformation into fully autonomous ride-sourcing services.
期刊介绍:
Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.