{"title":"Reliable crash analysis: Comparing biases and error rates of empirical Bayes before-after analyses to mixed-models","authors":"Matthew A. Albrecht, Razi Hasan","doi":"10.1016/j.aap.2025.107921","DOIUrl":null,"url":null,"abstract":"<div><div>Estimating reliable causal estimates of road safety interventions is challenging, with a number of these challenges addressable through analysis choices. At a minimum, developing reliable crash modification factors (CMFs) needs to address three critical confounding factors, i.e., 1) the regression-to-the-mean (RTM) phenomenon, 2) the effect of traffic volume, and 3) the time trend for the occurrence of crashes. The current preferred crash analysis method is the empirical Bayes (EB) before-after analysis but requires complex bespoke analysis and may not be the best performing method. We compare in a simulation experiment various EB methods to a more straightforward negative binomial generalized linear mixed model (NB-GLMM) with an interaction term between treatment group and time for analysing treatment effects in crash data. Data were simulated using two broad scenarios: 1) an idealized randomized controlled design, and 2) a moderately biased site-selection scenario commonly encountered in road safety crash analyses. Both scenarios varied treatment effects, overdispersion, and sample sizes. The NB-GLMM performed best, maintaining type I error rate and providing least biased estimates across most analyses. Most standard EB methods were too liberal or generally more biased, with the exception of the EB method that incorporated a varying dispersion parameter. Incorporating mixed-effects modelling into the EB procedure improved bias. Overall, we found that using a “standard” NB-GLMM with an interaction term is sufficient for crash analysis, reducing complexity compared to bespoke EB solutions. Chosen methods should also be the least biased and possess the marginal error rates under both ideal and selection-bias conditions. Mixed-effects approaches to analysis of road safety interventions satisfy these criteria outperforming standard or other empirical Bayes approaches tested here.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"212 ","pages":"Article 107921"},"PeriodicalIF":5.7000,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457525000077","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Estimating reliable causal estimates of road safety interventions is challenging, with a number of these challenges addressable through analysis choices. At a minimum, developing reliable crash modification factors (CMFs) needs to address three critical confounding factors, i.e., 1) the regression-to-the-mean (RTM) phenomenon, 2) the effect of traffic volume, and 3) the time trend for the occurrence of crashes. The current preferred crash analysis method is the empirical Bayes (EB) before-after analysis but requires complex bespoke analysis and may not be the best performing method. We compare in a simulation experiment various EB methods to a more straightforward negative binomial generalized linear mixed model (NB-GLMM) with an interaction term between treatment group and time for analysing treatment effects in crash data. Data were simulated using two broad scenarios: 1) an idealized randomized controlled design, and 2) a moderately biased site-selection scenario commonly encountered in road safety crash analyses. Both scenarios varied treatment effects, overdispersion, and sample sizes. The NB-GLMM performed best, maintaining type I error rate and providing least biased estimates across most analyses. Most standard EB methods were too liberal or generally more biased, with the exception of the EB method that incorporated a varying dispersion parameter. Incorporating mixed-effects modelling into the EB procedure improved bias. Overall, we found that using a “standard” NB-GLMM with an interaction term is sufficient for crash analysis, reducing complexity compared to bespoke EB solutions. Chosen methods should also be the least biased and possess the marginal error rates under both ideal and selection-bias conditions. Mixed-effects approaches to analysis of road safety interventions satisfy these criteria outperforming standard or other empirical Bayes approaches tested here.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.