Residence-Workplace Identification and Validation Based on Mobile Phone Data: A Case Study in a Large-Scale Urban Agglomeration in China

Yang Zhou, Quan Yuan, Chao Yang, Tangyi Guo, Xiaoyi Ma, Wenyong Sun, Tianren Yang
{"title":"Residence-Workplace Identification and Validation Based on Mobile Phone Data: A Case Study in a Large-Scale Urban Agglomeration in China","authors":"Yang Zhou, Quan Yuan, Chao Yang, Tangyi Guo, Xiaoyi Ma, Wenyong Sun, Tianren Yang","doi":"10.1177/03611981241270163","DOIUrl":null,"url":null,"abstract":"Residence-workplace identification is a fundamental task in mobile phone data analysis, but it faces certain challenges in sparse data processing and results validation because of the lack of ground-truth labels. Previous studies have generally relied on frequency-based methods for inference and trip-based metrics for validation, posing limitations in reliability and efficiency. This paper aims to fill this gap by developing a systematic approach that ranges from data error categorization and processing, feature relevance examination and parameter optimization, and the development of performance metrics considering both residence and workplace validation. For residence-workplace identification, we use a spatiotemporal closeness criterion to deal with the sparsity of data and develop effective dwelling time to enhance frequency-based methods, using one-month cellular signaling records from nine cities in the Yangtze River Delta urban agglomeration in China. For validation, we propose a residence-workplace pair metric based on the population-adjusted number of users, enabling more efficient evaluation of home and work locations than trip-based metrics. Results show that the mean absolute percentage errors (MAPEs) for the Nanjing and Shanghai cases are 5.04% and 8.46%, respectively. Adopted and verified in the large-scale urban agglomeration, the proposed method would be reliable for extracting residence and workplace from low-resolution mobile phone data and contributing to a more accurate identification of urban commuting patterns.","PeriodicalId":517391,"journal":{"name":"Transportation Research Record: Journal of the Transportation Research Board","volume":"45 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Record: Journal of the Transportation Research Board","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/03611981241270163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Residence-workplace identification is a fundamental task in mobile phone data analysis, but it faces certain challenges in sparse data processing and results validation because of the lack of ground-truth labels. Previous studies have generally relied on frequency-based methods for inference and trip-based metrics for validation, posing limitations in reliability and efficiency. This paper aims to fill this gap by developing a systematic approach that ranges from data error categorization and processing, feature relevance examination and parameter optimization, and the development of performance metrics considering both residence and workplace validation. For residence-workplace identification, we use a spatiotemporal closeness criterion to deal with the sparsity of data and develop effective dwelling time to enhance frequency-based methods, using one-month cellular signaling records from nine cities in the Yangtze River Delta urban agglomeration in China. For validation, we propose a residence-workplace pair metric based on the population-adjusted number of users, enabling more efficient evaluation of home and work locations than trip-based metrics. Results show that the mean absolute percentage errors (MAPEs) for the Nanjing and Shanghai cases are 5.04% and 8.46%, respectively. Adopted and verified in the large-scale urban agglomeration, the proposed method would be reliable for extracting residence and workplace from low-resolution mobile phone data and contributing to a more accurate identification of urban commuting patterns.
基于手机数据的居住地-工作地识别与验证:中国大规模城市群案例研究
居住地-工作地识别是手机数据分析中的一项基本任务,但由于缺乏地面实况标签,它在稀疏数据处理和结果验证方面面临着一定的挑战。以往的研究一般依赖基于频率的方法进行推断,基于行程的指标进行验证,在可靠性和效率方面存在局限性。本文旨在通过开发一种系统方法来填补这一空白,该方法包括数据错误分类和处理、特征相关性检查和参数优化,以及考虑居住地和工作地验证的性能指标开发。在居住地-工作地识别方面,我们使用时空接近性准则来处理数据稀疏性问题,并利用中国长三角城市群九个城市一个月的蜂窝信令记录,开发了有效居住时间来增强基于频率的方法。为了进行验证,我们提出了一种基于人口调整后用户数量的居住地-工作地配对指标,与基于行程的指标相比,它能更有效地评估居住地和工作地。结果表明,南京和上海案例的平均绝对百分比误差(MAPE)分别为 5.04% 和 8.46%。通过在大规模城市群中的应用和验证,所提出的方法可以可靠地从低分辨率手机数据中提取居住地和工作地,有助于更准确地识别城市通勤模式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信