Identifying subway commuters travel patterns using traffic smart card data: A topic model

IF 2.6 Q3 TRANSPORTATION
Peng He , Danyong Feng , Yang Yang , Zijia Wang
{"title":"Identifying subway commuters travel patterns using traffic smart card data: A topic model","authors":"Peng He ,&nbsp;Danyong Feng ,&nbsp;Yang Yang ,&nbsp;Zijia Wang","doi":"10.1016/j.jrtpm.2024.100497","DOIUrl":null,"url":null,"abstract":"<div><div>The paper presents a novel approach using Hierarchical Dirichlet Processes (HDP) integrated with K-means clustering to analyze public transit commuting behaviors using smartcard and POI data. The HDP, an unsupervised model, is designed to discern travel activities, however, little is done for this purpose. Our study proposed representing each trip using four features (duration, date, arrival time, and station type classified using POI-data) as inputs to the HDP model, which outputs the identification of specific activities such as home, work, and leisure. A comparison to other methods including trip frequency, activity duration, and Hidden Markov models demonstrates that our approach offers superior fit, as evidenced by lower perplexity and higher similarity metrics. To further refine the classification of commuting behaviors, we applied a two-step clustering algorithm that considers features such as regularity, temporality, and spatiality, resulting in the identification of strong and weak commuting behavior patterns. This classification provides urban planners with insights into the spatiotemporal characteristics of travelers in urban rail transit systems, thereby supporting more effective urban planning.</div></div>","PeriodicalId":51821,"journal":{"name":"Journal of Rail Transport Planning & Management","volume":"34 ","pages":"Article 100497"},"PeriodicalIF":2.6000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Rail Transport Planning & Management","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210970624000672","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0

Abstract

The paper presents a novel approach using Hierarchical Dirichlet Processes (HDP) integrated with K-means clustering to analyze public transit commuting behaviors using smartcard and POI data. The HDP, an unsupervised model, is designed to discern travel activities, however, little is done for this purpose. Our study proposed representing each trip using four features (duration, date, arrival time, and station type classified using POI-data) as inputs to the HDP model, which outputs the identification of specific activities such as home, work, and leisure. A comparison to other methods including trip frequency, activity duration, and Hidden Markov models demonstrates that our approach offers superior fit, as evidenced by lower perplexity and higher similarity metrics. To further refine the classification of commuting behaviors, we applied a two-step clustering algorithm that considers features such as regularity, temporality, and spatiality, resulting in the identification of strong and weak commuting behavior patterns. This classification provides urban planners with insights into the spatiotemporal characteristics of travelers in urban rail transit systems, thereby supporting more effective urban planning.
利用交通智能卡数据识别地铁通勤者的出行模式:一个主题模型
基于智能卡和POI数据,提出了一种基于分层狄利克雷过程(HDP)和K-means聚类的公共交通通勤行为分析方法。HDP是一种无监督模型,旨在识别旅行活动,然而,在这方面做得很少。我们的研究建议使用四个特征(持续时间、日期、到达时间和使用poi数据分类的站点类型)来表示每次旅行,作为HDP模型的输入,该模型输出特定活动(如家庭、工作和休闲)的识别。与其他方法(包括行程频率、活动持续时间和隐马尔可夫模型)的比较表明,我们的方法具有更好的拟合性,这一点可以通过更低的困惑度和更高的相似性指标得到证明。为了进一步完善通勤行为的分类,我们采用了一种考虑规律性、时代性和空间性等特征的两步聚类算法,从而识别出强弱通勤行为模式。这种分类为城市规划者提供了对城市轨道交通系统中旅客时空特征的洞察,从而支持更有效的城市规划。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.10
自引率
8.10%
发文量
41
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信