通过手机和公共交通数据检测薄弱的公共交通连接

Thomas Holleczek, Liang Yu, Joseph K. Lee, Oliver Senn, C. Ratti, Patrick Jaillet
{"title":"通过手机和公共交通数据检测薄弱的公共交通连接","authors":"Thomas Holleczek, Liang Yu, Joseph K. Lee, Oliver Senn, C. Ratti, Patrick Jaillet","doi":"10.1145/2640087.2644196","DOIUrl":null,"url":null,"abstract":"Securing public transportation ridership is critical for developing a sustainable urban future. However, many modern and growing cities are facing declines in public transport usage. Existing systems for analyzing and identifying weaknesses in public transport connections face major limitations. In cities, origin-destination (OD) matrices--which measure the flow of people between different geographical regions--are often generated using household surveys, which are time consuming and lack spatial and temporal accuracy. Focus in more recent research has been drawn towards using cellphones to overcome these limitations. In this article, we show that urban mobility patterns and transport mode choices can be derived from cellphone call detail records coupled with public transport data recorded from smart cards. Specifically, we present new methods to determine the spatial and temporal variability of public and private transportation usage and transport mode preferences across the dense, metropolitan city-state of Singapore.\n The cellphone dataset consists of location data of 3.4 million anonymized users of SingTel, Singapore's largest telecommunications company with a market share of 45.3%. The data were recorded during a two-month period from mid-March to mid-May 2011. A call detail record (CDR) includes the location (spatial resolution of 400 m) of the cell tower each cellphone connects to and was created in the case of following network events:\n • a phone call was initiated or received (at the beginning and at the end of the call).\n • a short message was sent or received.\n • the cellphone user accessed the data network (for example, to open a website or retrieve emails).\n By applying a clustering detection algorithm to these call detail records, we detect individual trips and extrapolate the overall mobility of people between the 55 administrative districts of Singapore (taking into account the market share of SingTel and the cellphone penetration of 144 %). The mode share of private transport usage is then derived by computing the difference between the estimated overall mobility and the traces of 4.4 million public transportation smart card users during the same time period.\n We validate out data mining approach using the results from Singapore's Household Interview Travel Survey (HITS): Our results revealed that there are 3.5 million (HITS: 3.5 million) inter-district trips by public transport and 4.3 million (HITS: 4.4 million) inter-district trips by private transport (including taxis). Private transport usage dominates in regions without access to a subway line (see Figure 1). Along with classifying which transportation connections are weak or underserved---where people prefer to take private rather than public transport---the analysis shows that the mode share of public transport increases from 38% in the morning to 44% around mid-day and 52% in the evening.\n The value of deriving such patterns using cellphone call detail records have important implications not only for urban and transportation planning, but also for other domains such as disease control in cities. As humans serve as the primary and secondary vectors of many infectious diseases, understanding from where people arrive and depart and by which transportation modes people are traveling, we have the potential to model how and where diseases might be spreading and from where they might originate.","PeriodicalId":264550,"journal":{"name":"BigDataScience '14","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":"{\"title\":\"Detecting weak public transport connections from cellphone and public transport data\",\"authors\":\"Thomas Holleczek, Liang Yu, Joseph K. Lee, Oliver Senn, C. Ratti, Patrick Jaillet\",\"doi\":\"10.1145/2640087.2644196\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Securing public transportation ridership is critical for developing a sustainable urban future. However, many modern and growing cities are facing declines in public transport usage. Existing systems for analyzing and identifying weaknesses in public transport connections face major limitations. In cities, origin-destination (OD) matrices--which measure the flow of people between different geographical regions--are often generated using household surveys, which are time consuming and lack spatial and temporal accuracy. Focus in more recent research has been drawn towards using cellphones to overcome these limitations. In this article, we show that urban mobility patterns and transport mode choices can be derived from cellphone call detail records coupled with public transport data recorded from smart cards. Specifically, we present new methods to determine the spatial and temporal variability of public and private transportation usage and transport mode preferences across the dense, metropolitan city-state of Singapore.\\n The cellphone dataset consists of location data of 3.4 million anonymized users of SingTel, Singapore's largest telecommunications company with a market share of 45.3%. The data were recorded during a two-month period from mid-March to mid-May 2011. A call detail record (CDR) includes the location (spatial resolution of 400 m) of the cell tower each cellphone connects to and was created in the case of following network events:\\n • a phone call was initiated or received (at the beginning and at the end of the call).\\n • a short message was sent or received.\\n • the cellphone user accessed the data network (for example, to open a website or retrieve emails).\\n By applying a clustering detection algorithm to these call detail records, we detect individual trips and extrapolate the overall mobility of people between the 55 administrative districts of Singapore (taking into account the market share of SingTel and the cellphone penetration of 144 %). The mode share of private transport usage is then derived by computing the difference between the estimated overall mobility and the traces of 4.4 million public transportation smart card users during the same time period.\\n We validate out data mining approach using the results from Singapore's Household Interview Travel Survey (HITS): Our results revealed that there are 3.5 million (HITS: 3.5 million) inter-district trips by public transport and 4.3 million (HITS: 4.4 million) inter-district trips by private transport (including taxis). Private transport usage dominates in regions without access to a subway line (see Figure 1). Along with classifying which transportation connections are weak or underserved---where people prefer to take private rather than public transport---the analysis shows that the mode share of public transport increases from 38% in the morning to 44% around mid-day and 52% in the evening.\\n The value of deriving such patterns using cellphone call detail records have important implications not only for urban and transportation planning, but also for other domains such as disease control in cities. As humans serve as the primary and secondary vectors of many infectious diseases, understanding from where people arrive and depart and by which transportation modes people are traveling, we have the potential to model how and where diseases might be spreading and from where they might originate.\",\"PeriodicalId\":264550,\"journal\":{\"name\":\"BigDataScience '14\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"28\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BigDataScience '14\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2640087.2644196\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BigDataScience '14","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2640087.2644196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 28

摘要

确保公共交通客流量对于发展可持续的城市未来至关重要。然而,许多现代和发展中的城市都面临着公共交通使用量下降的问题。分析和识别公共交通连接弱点的现有系统面临重大限制。在城市中,衡量不同地理区域之间人口流动的始发地-目的地(OD)矩阵通常是通过家庭调查生成的,这既耗时又缺乏时空准确性。最近的研究重点是利用手机来克服这些限制。在本文中,我们展示了城市交通模式和交通方式的选择可以从手机通话详细记录和智能卡记录的公共交通数据中得出。具体来说,我们提出了新的方法来确定新加坡这个人口密集的大都市国家的公共和私人交通工具使用和交通方式偏好的时空变化。该手机数据集包括新加坡电信340万匿名用户的位置数据。新加坡电信是新加坡最大的电信公司,占有45.3%的市场份额。这些数据是在2011年3月中旬至5月中旬的两个月期间记录的。呼叫详细记录(CDR)包括每个手机连接的蜂窝塔的位置(空间分辨率为400米),并在以下网络事件的情况下创建:•发起或接收电话(在呼叫开始和结束时)。•发送或接收短消息。•手机用户访问数据网络(例如,打开一个网站或检索电子邮件)。通过对这些呼叫细节记录应用聚类检测算法,我们检测个人旅行并推断新加坡55个行政区之间的总体人员流动性(考虑到新加坡电信的市场份额和144%的手机普及率)。然后,通过计算估计的整体机动性与440万公共交通智能卡用户在同一时期的轨迹之间的差异,得出私人交通使用的模式份额。我们使用新加坡家庭访谈旅行调查(HITS)的结果验证了我们的数据挖掘方法:我们的结果显示,公共交通有350万(HITS: 350万)次跨区旅行,私人交通(包括出租车)有430万(HITS: 440万)次跨区旅行。在没有地铁线路的地区,私人交通占主导地位(见图1)。除了对交通连接薄弱或服务不足的地区进行分类(人们更喜欢乘坐私人交通而不是公共交通)外,分析还显示,公共交通的模式份额从上午的38%增加到中午的44%,傍晚的52%。利用手机通话详细记录得出这种模式的价值不仅对城市和交通规划有重要意义,而且对城市疾病控制等其他领域也有重要意义。由于人类是许多传染病的主要和次要媒介,了解人们从哪里到达和离开,以及人们乘坐的交通方式,我们就有可能模拟疾病如何和在哪里传播,以及它们可能起源于哪里。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Detecting weak public transport connections from cellphone and public transport data
Securing public transportation ridership is critical for developing a sustainable urban future. However, many modern and growing cities are facing declines in public transport usage. Existing systems for analyzing and identifying weaknesses in public transport connections face major limitations. In cities, origin-destination (OD) matrices--which measure the flow of people between different geographical regions--are often generated using household surveys, which are time consuming and lack spatial and temporal accuracy. Focus in more recent research has been drawn towards using cellphones to overcome these limitations. In this article, we show that urban mobility patterns and transport mode choices can be derived from cellphone call detail records coupled with public transport data recorded from smart cards. Specifically, we present new methods to determine the spatial and temporal variability of public and private transportation usage and transport mode preferences across the dense, metropolitan city-state of Singapore. The cellphone dataset consists of location data of 3.4 million anonymized users of SingTel, Singapore's largest telecommunications company with a market share of 45.3%. The data were recorded during a two-month period from mid-March to mid-May 2011. A call detail record (CDR) includes the location (spatial resolution of 400 m) of the cell tower each cellphone connects to and was created in the case of following network events: • a phone call was initiated or received (at the beginning and at the end of the call). • a short message was sent or received. • the cellphone user accessed the data network (for example, to open a website or retrieve emails). By applying a clustering detection algorithm to these call detail records, we detect individual trips and extrapolate the overall mobility of people between the 55 administrative districts of Singapore (taking into account the market share of SingTel and the cellphone penetration of 144 %). The mode share of private transport usage is then derived by computing the difference between the estimated overall mobility and the traces of 4.4 million public transportation smart card users during the same time period. We validate out data mining approach using the results from Singapore's Household Interview Travel Survey (HITS): Our results revealed that there are 3.5 million (HITS: 3.5 million) inter-district trips by public transport and 4.3 million (HITS: 4.4 million) inter-district trips by private transport (including taxis). Private transport usage dominates in regions without access to a subway line (see Figure 1). Along with classifying which transportation connections are weak or underserved---where people prefer to take private rather than public transport---the analysis shows that the mode share of public transport increases from 38% in the morning to 44% around mid-day and 52% in the evening. The value of deriving such patterns using cellphone call detail records have important implications not only for urban and transportation planning, but also for other domains such as disease control in cities. As humans serve as the primary and secondary vectors of many infectious diseases, understanding from where people arrive and depart and by which transportation modes people are traveling, we have the potential to model how and where diseases might be spreading and from where they might originate.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信