Assessing regression methods to estimate network-wide bicycle traffic volumes based on crowdsourced GPS and permanent counter data

Emely Richter , Joscha Raudszus , Sven Lißner
{"title":"Assessing regression methods to estimate network-wide bicycle traffic volumes based on crowdsourced GPS and permanent counter data","authors":"Emely Richter ,&nbsp;Joscha Raudszus ,&nbsp;Sven Lißner","doi":"10.1016/j.jcmr.2025.100073","DOIUrl":null,"url":null,"abstract":"<div><div>GPS data offer an up-to-date, available, and easily processable database for bicycle traffic planning. Unlike permanent counters, they generally represent wide parts of the bicycle network. However, GPS data is derivable only from a subset of the cycling population and thus provides a limited overview of existing bicycle traffic volumes in a city at best. For planning or dimensioning of cycling infrastructure the data is only partially sufficient. Values such as the (annual) average daily number of bicycles (ADB/AADB) are more suitable. Using regression methods, GPS data in combination with (permanent) counter data can be utilized to model network-wide ADB. So far however, related studies mostly deal with only few counters in individual cities or metropolitan regions. Due to different modelling approaches and input variables, the results are neither comparable nor transferable. Therefore, no conclusion as to which models are most suited can be drawn. This study investigates the extrapolation of GPS data from a nationwide data set in Germany. First, six different types of regression models are trained based on the data set. Second, the trained models are utilized for network-wide AADB estimation in six municipalities. Thereby, this study provides a framework for comparable error metrics and investigates the suitability of the tested models for (1) estimation at permanent counters and (2) network-wide estimation. The models are divided into three classes: linear, tree-based and neural network models. We used 452 data points from permanent counters across Germany for model training. After assessing the model performances at the counters, they are applied to municipality-wide network sections. Comparing the overall performance, Support Vector Regression currently proves to be the most promising for extrapolating traffic volumes from GPS data to network-wide AADB.</div></div>","PeriodicalId":100771,"journal":{"name":"Journal of Cycling and Micromobility Research","volume":"5 ","pages":"Article 100073"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cycling and Micromobility Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2950105925000178","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

GPS data offer an up-to-date, available, and easily processable database for bicycle traffic planning. Unlike permanent counters, they generally represent wide parts of the bicycle network. However, GPS data is derivable only from a subset of the cycling population and thus provides a limited overview of existing bicycle traffic volumes in a city at best. For planning or dimensioning of cycling infrastructure the data is only partially sufficient. Values such as the (annual) average daily number of bicycles (ADB/AADB) are more suitable. Using regression methods, GPS data in combination with (permanent) counter data can be utilized to model network-wide ADB. So far however, related studies mostly deal with only few counters in individual cities or metropolitan regions. Due to different modelling approaches and input variables, the results are neither comparable nor transferable. Therefore, no conclusion as to which models are most suited can be drawn. This study investigates the extrapolation of GPS data from a nationwide data set in Germany. First, six different types of regression models are trained based on the data set. Second, the trained models are utilized for network-wide AADB estimation in six municipalities. Thereby, this study provides a framework for comparable error metrics and investigates the suitability of the tested models for (1) estimation at permanent counters and (2) network-wide estimation. The models are divided into three classes: linear, tree-based and neural network models. We used 452 data points from permanent counters across Germany for model training. After assessing the model performances at the counters, they are applied to municipality-wide network sections. Comparing the overall performance, Support Vector Regression currently proves to be the most promising for extrapolating traffic volumes from GPS data to network-wide AADB.
评估基于众包GPS和永久计数器数据估算全网自行车交通量的回归方法
GPS数据为自行车交通规划提供了一个最新的、可用的、易于处理的数据库。与永久计数器不同,它们通常代表自行车网络的大部分。然而,GPS数据只能从骑车人口的一个子集中得出,因此最多只能提供一个城市现有自行车交通量的有限概况。对于循环基础设施的规划或量纲化,数据仅是部分充分的。像(年)平均每日自行车数量(ADB/AADB)这样的值更合适。使用回归方法,GPS数据与(永久)计数器数据相结合,可用于建立全网ADB模型。但到目前为止,相关研究大多只涉及个别城市或大都市区的少数柜台。由于不同的建模方法和输入变量,结果既没有可比性也没有可转移性。因此,对于哪种模式最适合,我们无法得出结论。本研究调查了从德国全国数据集的GPS数据的外推。首先,基于数据集训练六种不同类型的回归模型。其次,将训练好的模型用于六个城市的全网AADB估计。因此,本研究为可比较的误差度量提供了一个框架,并调查了测试模型的适用性:(1)永久计数器估计和(2)网络范围估计。模型分为三类:线性模型、树模型和神经网络模型。我们使用了来自德国永久计数器的452个数据点进行模型训练。在评估了模型在柜台上的表现后,它们被应用于全市范围的网络部分。比较整体性能,支持向量回归目前被证明是最有希望从GPS数据推断交通量到全网AADB的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信