{"title":"利用 K 原型算法对首末两英里(FMLM)合乘出租车服务的乘客请求进行聚类","authors":"Azimah Mohd , Lay Eng Teoh , Hooi Ling Khoo","doi":"10.1016/j.multra.2024.100132","DOIUrl":null,"url":null,"abstract":"<div><p>Shared mobility services are evolving globally. However, the first-mile and last-mile (FMLM) shared-ride taxi service poses a complex problem due to its large-scale nature and mixed-type variables (numeric and categorical features). As the input size of the problem increases exponentially, the absence of a known polynomial-time algorithm further complicates the finding of an optimal solution. Consequently, exploring potential solutions becomes computationally infeasible for more significant instances. Thus, this paper proposes using the k-prototype algorithm, an unsupervised learning approach, to cluster passengers' requests for FMLM shared-ride taxi service, which can reduce the problem's complexity via feasible clustering. Notably, the k-prototype algorithm is suitable for data sets with both numeric and categorical variables. It demonstrates a promising ability to handle large data sets effectively. As presented in this paper, the FMLM shared-ride taxi service prototypes and their unique characteristics could be optimally identified using the k-prototype algorithm with the Silhouette coefficient (as a performance index). By examining an illustrative case study with ten mixed-type variables of 946 passengers' requests, the results demonstrate the effective clustering of passengers' requests into three distinct prototypes, which can be characterized uniquely based on the temporal factors (pickup time of individual requests) and trip characteristics (including traveled distance, taxi type, as well as pickup and drop-off locations) that are significant in operating a competitive shared-ride taxi service. This paper is anticipated to reveal useful practical implications for the relevant stakeholders, especially the taxi service providers, in managing the FMLM shared-ride taxi services optimally to ensure an efficient and effective operating system.</p></div>","PeriodicalId":100933,"journal":{"name":"Multimodal Transportation","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772586324000133/pdfft?md5=128170411b4f4b7fcb38a6a108487011&pid=1-s2.0-S2772586324000133-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Passengers' requests clustering with k-prototype algorithm for the first-mile and last-mile (FMLM) shared-ride taxi service\",\"authors\":\"Azimah Mohd , Lay Eng Teoh , Hooi Ling Khoo\",\"doi\":\"10.1016/j.multra.2024.100132\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Shared mobility services are evolving globally. However, the first-mile and last-mile (FMLM) shared-ride taxi service poses a complex problem due to its large-scale nature and mixed-type variables (numeric and categorical features). As the input size of the problem increases exponentially, the absence of a known polynomial-time algorithm further complicates the finding of an optimal solution. Consequently, exploring potential solutions becomes computationally infeasible for more significant instances. Thus, this paper proposes using the k-prototype algorithm, an unsupervised learning approach, to cluster passengers' requests for FMLM shared-ride taxi service, which can reduce the problem's complexity via feasible clustering. Notably, the k-prototype algorithm is suitable for data sets with both numeric and categorical variables. It demonstrates a promising ability to handle large data sets effectively. As presented in this paper, the FMLM shared-ride taxi service prototypes and their unique characteristics could be optimally identified using the k-prototype algorithm with the Silhouette coefficient (as a performance index). By examining an illustrative case study with ten mixed-type variables of 946 passengers' requests, the results demonstrate the effective clustering of passengers' requests into three distinct prototypes, which can be characterized uniquely based on the temporal factors (pickup time of individual requests) and trip characteristics (including traveled distance, taxi type, as well as pickup and drop-off locations) that are significant in operating a competitive shared-ride taxi service. This paper is anticipated to reveal useful practical implications for the relevant stakeholders, especially the taxi service providers, in managing the FMLM shared-ride taxi services optimally to ensure an efficient and effective operating system.</p></div>\",\"PeriodicalId\":100933,\"journal\":{\"name\":\"Multimodal Transportation\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2772586324000133/pdfft?md5=128170411b4f4b7fcb38a6a108487011&pid=1-s2.0-S2772586324000133-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Multimodal Transportation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772586324000133\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multimodal Transportation","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772586324000133","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
共享交通服务在全球范围内不断发展。然而,"最初一英里和最后一英里"(FMLM)共享乘车出租车服务因其大规模性质和混合型变量(数字和分类特征)而构成了一个复杂的问题。由于问题的输入规模呈指数增长,缺乏已知的多项式时间算法,使得寻找最优解的工作更加复杂。因此,对于更重要的实例,探索潜在的解决方案在计算上变得不可行。因此,本文提出使用 k 原型算法(一种无监督学习方法)对 FMLM 合乘出租车服务的乘客请求进行聚类,通过可行的聚类降低问题的复杂性。值得注意的是,k 原型算法适用于包含数字变量和分类变量的数据集。它展示了有效处理大型数据集的能力。正如本文所介绍的,使用 k 原型算法和 Silhouette 系数(作为性能指标),可以优化识别 FMLM 共享乘车出租车服务原型及其独特特征。通过对包含 946 个乘客请求的 10 个混合型变量的示例研究进行检验,结果表明乘客的请求可以有效地聚类成三个不同的原型,这些原型可以根据对运营有竞争力的合乘出租车服务具有重要意义的时间因素(单个请求的上车时间)和行程特征(包括行驶距离、出租车类型以及上车和下车地点)来确定。本文预计将为相关利益方(尤其是出租车服务提供商)揭示有用的实际意义,以优化管理 FMLM 合乘出租车服务,确保运营系统的效率和效益。
Passengers' requests clustering with k-prototype algorithm for the first-mile and last-mile (FMLM) shared-ride taxi service
Shared mobility services are evolving globally. However, the first-mile and last-mile (FMLM) shared-ride taxi service poses a complex problem due to its large-scale nature and mixed-type variables (numeric and categorical features). As the input size of the problem increases exponentially, the absence of a known polynomial-time algorithm further complicates the finding of an optimal solution. Consequently, exploring potential solutions becomes computationally infeasible for more significant instances. Thus, this paper proposes using the k-prototype algorithm, an unsupervised learning approach, to cluster passengers' requests for FMLM shared-ride taxi service, which can reduce the problem's complexity via feasible clustering. Notably, the k-prototype algorithm is suitable for data sets with both numeric and categorical variables. It demonstrates a promising ability to handle large data sets effectively. As presented in this paper, the FMLM shared-ride taxi service prototypes and their unique characteristics could be optimally identified using the k-prototype algorithm with the Silhouette coefficient (as a performance index). By examining an illustrative case study with ten mixed-type variables of 946 passengers' requests, the results demonstrate the effective clustering of passengers' requests into three distinct prototypes, which can be characterized uniquely based on the temporal factors (pickup time of individual requests) and trip characteristics (including traveled distance, taxi type, as well as pickup and drop-off locations) that are significant in operating a competitive shared-ride taxi service. This paper is anticipated to reveal useful practical implications for the relevant stakeholders, especially the taxi service providers, in managing the FMLM shared-ride taxi services optimally to ensure an efficient and effective operating system.