{"title":"Identifying subway commuters travel patterns using traffic smart card data: A topic model","authors":"Peng He , Danyong Feng , Yang Yang , Zijia Wang","doi":"10.1016/j.jrtpm.2024.100497","DOIUrl":null,"url":null,"abstract":"<div><div>The paper presents a novel approach using Hierarchical Dirichlet Processes (HDP) integrated with K-means clustering to analyze public transit commuting behaviors using smartcard and POI data. The HDP, an unsupervised model, is designed to discern travel activities, however, little is done for this purpose. Our study proposed representing each trip using four features (duration, date, arrival time, and station type classified using POI-data) as inputs to the HDP model, which outputs the identification of specific activities such as home, work, and leisure. A comparison to other methods including trip frequency, activity duration, and Hidden Markov models demonstrates that our approach offers superior fit, as evidenced by lower perplexity and higher similarity metrics. To further refine the classification of commuting behaviors, we applied a two-step clustering algorithm that considers features such as regularity, temporality, and spatiality, resulting in the identification of strong and weak commuting behavior patterns. This classification provides urban planners with insights into the spatiotemporal characteristics of travelers in urban rail transit systems, thereby supporting more effective urban planning.</div></div>","PeriodicalId":51821,"journal":{"name":"Journal of Rail Transport Planning & Management","volume":"34 ","pages":"Article 100497"},"PeriodicalIF":2.6000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Rail Transport Planning & Management","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210970624000672","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0
Abstract
The paper presents a novel approach using Hierarchical Dirichlet Processes (HDP) integrated with K-means clustering to analyze public transit commuting behaviors using smartcard and POI data. The HDP, an unsupervised model, is designed to discern travel activities, however, little is done for this purpose. Our study proposed representing each trip using four features (duration, date, arrival time, and station type classified using POI-data) as inputs to the HDP model, which outputs the identification of specific activities such as home, work, and leisure. A comparison to other methods including trip frequency, activity duration, and Hidden Markov models demonstrates that our approach offers superior fit, as evidenced by lower perplexity and higher similarity metrics. To further refine the classification of commuting behaviors, we applied a two-step clustering algorithm that considers features such as regularity, temporality, and spatiality, resulting in the identification of strong and weak commuting behavior patterns. This classification provides urban planners with insights into the spatiotemporal characteristics of travelers in urban rail transit systems, thereby supporting more effective urban planning.