Bus journey simulation to develop public transport predictive algorithms

Thilo Reich , Marcin Budka , David Hulbert
{"title":"Bus journey simulation to develop public transport predictive algorithms","authors":"Thilo Reich ,&nbsp;Marcin Budka ,&nbsp;David Hulbert","doi":"10.1016/j.socl.2021.100029","DOIUrl":null,"url":null,"abstract":"<div><p>Encouraging the use of public transport is essential to combat congestion and pollution in an urban environment. To achieve this, the reliability of arrival time prediction should be improved as this is one area of improvement frequently requested by passengers. The development of accurate predictive algorithms requires good quality data, which is often not available. Here we demonstrate a method to synthesise data using a reference curve approach derived from very limited real world data without reliable ground truth. This approach allows the controlled introduction of artefacts and noise to simulate their impact on prediction accuracy. To illustrate these impacts, a recurrent neural network next-step prediction is used to compare different scenarios in two different UK cities. The results show that a realistic data synthesis is possible, allowing for controlled testing of predictive algorithms. It also highlights the importance of reliable data transmission to gain such data from real world sources. Our main contribution is the demonstration of a synthetic data generator for public transport data, which can be used to compensate for low data quality. We further show that this data generator can be used to develop and enhance predictive algorithms in the context of urban bus networks if high-quality data is limited, by mixing synthetic and real data.</p></div>","PeriodicalId":101169,"journal":{"name":"Soft Computing Letters","volume":"3 ","pages":"Article 100029"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666222121000174/pdfft?md5=ac9f2e119a87da7041d08bee1f99991e&pid=1-s2.0-S2666222121000174-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing Letters","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666222121000174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Encouraging the use of public transport is essential to combat congestion and pollution in an urban environment. To achieve this, the reliability of arrival time prediction should be improved as this is one area of improvement frequently requested by passengers. The development of accurate predictive algorithms requires good quality data, which is often not available. Here we demonstrate a method to synthesise data using a reference curve approach derived from very limited real world data without reliable ground truth. This approach allows the controlled introduction of artefacts and noise to simulate their impact on prediction accuracy. To illustrate these impacts, a recurrent neural network next-step prediction is used to compare different scenarios in two different UK cities. The results show that a realistic data synthesis is possible, allowing for controlled testing of predictive algorithms. It also highlights the importance of reliable data transmission to gain such data from real world sources. Our main contribution is the demonstration of a synthetic data generator for public transport data, which can be used to compensate for low data quality. We further show that this data generator can be used to develop and enhance predictive algorithms in the context of urban bus networks if high-quality data is limited, by mixing synthetic and real data.

公交行程模拟,开发公共交通预测算法
鼓励使用公共交通工具是解决城市环境拥挤和污染问题的关键。为了实现这一目标,应提高到达时间预测的可靠性,因为这是乘客经常要求改进的一个领域。开发准确的预测算法需要高质量的数据,而这些数据通常是无法获得的。在这里,我们展示了一种使用参考曲线方法合成数据的方法,该方法来源于非常有限的真实世界数据,没有可靠的地面真值。这种方法允许可控地引入伪影和噪声来模拟它们对预测精度的影响。为了说明这些影响,使用递归神经网络下一步预测来比较两个不同英国城市的不同情景。结果表明,现实的数据合成是可能的,允许对预测算法进行控制测试。它还强调了可靠的数据传输对于从真实世界来源获得此类数据的重要性。我们的主要贡献是演示了公共交通数据的合成数据生成器,它可以用来弥补数据质量低的问题。我们进一步表明,在高质量数据有限的情况下,通过混合合成数据和真实数据,该数据生成器可用于开发和增强城市公交网络背景下的预测算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信