Station-to-User Transfer Learning: Towards Explainable User Clustering Through Latent Trip Signatures Using Tidal-Regularized Non-Negative Matrix Factorization

Proceedings of the 28th International Conference on Advances in Geographic Information Systems Pub Date : 2020-04-27 DOI:10.1145/3397536.3422250

Liming Zhang, Andreas Zufle, D. Pfoser

{"title":"Station-to-User Transfer Learning: Towards Explainable User Clustering Through Latent Trip Signatures Using Tidal-Regularized Non-Negative Matrix Factorization","authors":"Liming Zhang, Andreas Zufle, D. Pfoser","doi":"10.1145/3397536.3422250","DOIUrl":null,"url":null,"abstract":"Urban areas provide us with a treasure trove of available data capturing almost every aspect of a population's life. This work focuses on mobility data and how it will help improve our understanding of urban mobility patterns. Readily available and sizable farecard data captures trips in a public transportation network. However, such data typically lacks temporal signatures and as such the task of inferring trip semantics, station function, and user clustering is quite challenging. While existing approaches either focus on station-level or user-level signals only, we propose a Station-to-User (S2U) transfer learning framework, which augments user-level learning with shared temporal patterns learned from station-level signals. Our framework is based on a novel, so-called \"Tidal-Regularized Non-negative Matrix Factorization\" method, which incorporates a-priori tidal traffic patterns in generic Non-negative Matrix Factorization. To evaluate our model performance, a user clustering stability test based on the classical Rand Index is introduced as a metric to benchmark different unsupervised learning models. Using this metric, quantitative evaluations on three real-world datasets show that S2U outperforms two baselines methods by 7-21%. We also provide a qualitative analysis of the user clustering and station functions for the Washington D.C. metro and show how S2U can support spatiotemporal urban analytics.","PeriodicalId":233918,"journal":{"name":"Proceedings of the 28th International Conference on Advances in Geographic Information Systems","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 28th International Conference on Advances in Geographic Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3397536.3422250","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Urban areas provide us with a treasure trove of available data capturing almost every aspect of a population's life. This work focuses on mobility data and how it will help improve our understanding of urban mobility patterns. Readily available and sizable farecard data captures trips in a public transportation network. However, such data typically lacks temporal signatures and as such the task of inferring trip semantics, station function, and user clustering is quite challenging. While existing approaches either focus on station-level or user-level signals only, we propose a Station-to-User (S2U) transfer learning framework, which augments user-level learning with shared temporal patterns learned from station-level signals. Our framework is based on a novel, so-called "Tidal-Regularized Non-negative Matrix Factorization" method, which incorporates a-priori tidal traffic patterns in generic Non-negative Matrix Factorization. To evaluate our model performance, a user clustering stability test based on the classical Rand Index is introduced as a metric to benchmark different unsupervised learning models. Using this metric, quantitative evaluations on three real-world datasets show that S2U outperforms two baselines methods by 7-21%. We also provide a qualitative analysis of the user clustering and station functions for the Washington D.C. metro and show how S2U can support spatiotemporal urban analytics.

查看原文本刊更多论文

站到用户迁移学习:利用潮汐正则化非负矩阵分解的潜在行程特征实现可解释的用户聚类

城市地区为我们提供了一个可获得数据的宝库，几乎涵盖了人口生活的各个方面。这项工作的重点是交通数据，以及它将如何帮助我们提高对城市交通模式的理解。随时可用且数量可观的车费卡数据记录了公共交通网络中的行程。然而，这些数据通常缺乏时间签名，因此推断行程语义、站点功能和用户聚类的任务相当具有挑战性。虽然现有方法只关注站级或用户级信号，但我们提出了一个站到用户(S2U)迁移学习框架，该框架通过从站级信号中学习的共享时间模式来增强用户级学习。我们的框架基于一种新颖的，所谓的“潮汐正则化非负矩阵分解”方法，该方法将先验潮汐交通模式纳入一般的非负矩阵分解中。为了评估我们的模型性能，引入了基于经典Rand指数的用户聚类稳定性测试，作为基准测试不同的无监督学习模型。使用该指标，对三个真实数据集的定量评估表明，S2U比两种基线方法的性能高出7-21%。我们还对华盛顿特区地铁的用户集群和车站功能进行了定性分析，并展示了S2U如何支持时空城市分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 28th International Conference on Advances in Geographic Information Systems

自引率

0.00%

发文量