Taxi passenger hotspot prediction using automatic ARIMA model

Mohammad Sabar Jamil, Saiful Akbar
{"title":"Taxi passenger hotspot prediction using automatic ARIMA model","authors":"Mohammad Sabar Jamil, Saiful Akbar","doi":"10.1109/ICSITECH.2017.8257080","DOIUrl":null,"url":null,"abstract":"As a transportation mode, taxis facing low occupancy rate problems at certain time and over-demand at another. This issue is due to imbalance of supply and demand of taxi service. According to some studies the imbalance caused by inefficient taxis distribution. Recently, there are data sources available by utilizing GPS technology that can be used to obtain spatio-temporal information. This spatio-temporal information is valuable to develop intelligent taxi system. There are a lot of previous research related to intelligent taxi system in several research topics, one of which on the analysis hotspot area. This study utilizes Automatic ARIMA Model to perform time-series analysis to predict the passenger's hotspot area based on the spatio-temporal data provided from local taxi firm in Bandung. The challenge of this research is to use the right method in the data-preprocessing phase so the Automatic ARIMA Model can process the spatio-temporal data. Some of the alternatives proposed for preprocessing phase and experimental results show that grid mapping method can be used well in the preprocessing phase. Research results show that the Automatic ARIMA can be used to conduct an analysis of the spatio-temporal data. This is indicated by Mean Absolute Scaled Error (MASE) value 0.8797 for New York City dataset and 0.6338 for Bandung dataset. Cross-validation analysis also showed satisfactory results when the actual demand is quite large. However, if the actual demand is close to zero, the result of the analysis becomes less reliable. This can be understood as a lack in the quality of the data, not in the prediction model.","PeriodicalId":165045,"journal":{"name":"2017 3rd International Conference on Science in Information Technology (ICSITech)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 3rd International Conference on Science in Information Technology (ICSITech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSITECH.2017.8257080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

As a transportation mode, taxis facing low occupancy rate problems at certain time and over-demand at another. This issue is due to imbalance of supply and demand of taxi service. According to some studies the imbalance caused by inefficient taxis distribution. Recently, there are data sources available by utilizing GPS technology that can be used to obtain spatio-temporal information. This spatio-temporal information is valuable to develop intelligent taxi system. There are a lot of previous research related to intelligent taxi system in several research topics, one of which on the analysis hotspot area. This study utilizes Automatic ARIMA Model to perform time-series analysis to predict the passenger's hotspot area based on the spatio-temporal data provided from local taxi firm in Bandung. The challenge of this research is to use the right method in the data-preprocessing phase so the Automatic ARIMA Model can process the spatio-temporal data. Some of the alternatives proposed for preprocessing phase and experimental results show that grid mapping method can be used well in the preprocessing phase. Research results show that the Automatic ARIMA can be used to conduct an analysis of the spatio-temporal data. This is indicated by Mean Absolute Scaled Error (MASE) value 0.8797 for New York City dataset and 0.6338 for Bandung dataset. Cross-validation analysis also showed satisfactory results when the actual demand is quite large. However, if the actual demand is close to zero, the result of the analysis becomes less reliable. This can be understood as a lack in the quality of the data, not in the prediction model.
基于自动ARIMA模型的出租车乘客热点预测
出租车作为一种交通方式,时而面临入住率低,时而又面临需求过剩的问题。这个问题是由于出租车服务的供需不平衡造成的。根据一些研究,低效的出租车分配造成了不平衡。近年来,利用GPS技术获得了可用于获取时空信息的数据源。这些时空信息对智能出租车系统的开发具有重要的参考价值。前人对智能出租车系统的研究有很多,其中一个研究课题是分析热点领域。本研究基于万隆市当地出租车公司提供的时空数据,利用自动ARIMA模型进行时间序列分析,预测乘客的热点区域。如何在数据预处理阶段采用正确的方法,使自动ARIMA模型能够处理时空数据,是本研究的挑战。实验结果表明,网格映射方法可以很好地应用于预处理阶段。研究结果表明,自动ARIMA可用于对时空数据进行分析。纽约市数据集的平均绝对缩放误差(MASE)值为0.8797,万隆数据集的平均绝对缩放误差为0.6338。当实际需求量较大时,交叉验证分析也显示出令人满意的结果。然而,如果实际需求接近于零,分析的结果就变得不那么可靠。这可以理解为数据质量的不足,而不是预测模型的不足。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信