{"title":"Out-of-sample prediction and interpretation for random parameter generalized linear models","authors":"Jonathan S. Wood , Vikash Gayah","doi":"10.1016/j.aap.2025.108147","DOIUrl":null,"url":null,"abstract":"<div><div>Incorporating random parameters (RPs) into generalized linearized models (GLMs) – such as the negative binomial (NB) regression model used to predict crash frequencies – has been shown to improve model fit and better address issues such as unobserved heterogeneity. However, applying models with RPs to make predictions for observations outside the sample used to estimate the model is not straightforward. Recent studies have proposed various methods to incorporate RPs in out-of-sample predictions, but these tend to provide biased estimates or are computationally intensive to apply. This paper applies fundamental statistical theory to leverage properties of the underlying RP distributions incorporated into GLMs to provide more direct and accurate predictions, as well as directly estimate prediction variance for out-of-sample observations. Methods are provided for several common RP distributions – including the normal/Gaussian, lognormal, triangular, uniform, and gamma distributions – combined within log-link GLM framework. Additionally, closed-form equations for elasticities and marginal effects for the random parameters are provided. The proposed methods are tested using crash frequency prediction models developed using data from the Highway Safety Information System (HSIS). The results suggest that the proposed exact method provides more accurate predictions than the computational-intensive simulation-based approximation approaches while also being simple to apply. The method is suitable for the widespread use of RPs in research and in practical applications of GLMs.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"220 ","pages":"Article 108147"},"PeriodicalIF":5.7000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457525002337","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Incorporating random parameters (RPs) into generalized linearized models (GLMs) – such as the negative binomial (NB) regression model used to predict crash frequencies – has been shown to improve model fit and better address issues such as unobserved heterogeneity. However, applying models with RPs to make predictions for observations outside the sample used to estimate the model is not straightforward. Recent studies have proposed various methods to incorporate RPs in out-of-sample predictions, but these tend to provide biased estimates or are computationally intensive to apply. This paper applies fundamental statistical theory to leverage properties of the underlying RP distributions incorporated into GLMs to provide more direct and accurate predictions, as well as directly estimate prediction variance for out-of-sample observations. Methods are provided for several common RP distributions – including the normal/Gaussian, lognormal, triangular, uniform, and gamma distributions – combined within log-link GLM framework. Additionally, closed-form equations for elasticities and marginal effects for the random parameters are provided. The proposed methods are tested using crash frequency prediction models developed using data from the Highway Safety Information System (HSIS). The results suggest that the proposed exact method provides more accurate predictions than the computational-intensive simulation-based approximation approaches while also being simple to apply. The method is suitable for the widespread use of RPs in research and in practical applications of GLMs.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.