Quansheng Yue , Yanyong Guo , Tarek Sayed , Lai Zheng , Hao Lyu , Pan Liu
{"title":"Bayesian hierarchical non-stationary hybrid modeling for threshold estimation in peak over threshold approach","authors":"Quansheng Yue , Yanyong Guo , Tarek Sayed , Lai Zheng , Hao Lyu , Pan Liu","doi":"10.1016/j.aap.2025.108249","DOIUrl":null,"url":null,"abstract":"<div><div>The peak over threshold (POT) approach in extreme value theory is widely used for crash risk estimation, but the reliability is often undermined by the subjective and arbitrary selection of the conflict threshold, which can lead to biased outcomes. This study advances the hybrid modeling method for objective threshold determination by developing a non-stationary framework and comprehensively comparing five distinct model structures. The framework allows the threshold to vary with real-time traffic covariates, while the comparison identifies the optimal distribution for general conflicts. The Bayesian hierarchical structure is used to combine traffic conflicts from different sites, incorporating covariates and site-specific unobserved heterogeneity. Five non-stationary BHHM models, including Normal-GPD, Cauchy-GPD, Logistic-GPD, Gamma-GPD, and Lognormal-GPD models, were developed and compared. Traditional graphical diagnostic and quantile regression approaches were also used for comparison. Traffic conflicts collected from three signalized intersections in the city of Surrey, British Columbia were used for the study. The Bayesian approach is employed to estimate the threshold and other parameters in the non-stationary BHHM models. The results show that the proposed BHHM approach could estimate the threshold parameter objectively. The non-stationary BHHM models capture how the threshold varies dynamically across signal cycles in response to changing traffic status. The Lognormal-GPD model is superior to the other four BHHM models in terms of crash estimation accuracy and model fit. The crash estimates using the threshold determined by the BHHM outperform those estimated based on the graphical diagnostic and quantile regression approaches, indicating the superiority of the proposed threshold determination approach. The findings of this study contribute to enhancing the existing EVT methods for providing a threshold determination approach as well as producing reliable crash estimations.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"223 ","pages":"Article 108249"},"PeriodicalIF":6.2000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457525003379","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
The peak over threshold (POT) approach in extreme value theory is widely used for crash risk estimation, but the reliability is often undermined by the subjective and arbitrary selection of the conflict threshold, which can lead to biased outcomes. This study advances the hybrid modeling method for objective threshold determination by developing a non-stationary framework and comprehensively comparing five distinct model structures. The framework allows the threshold to vary with real-time traffic covariates, while the comparison identifies the optimal distribution for general conflicts. The Bayesian hierarchical structure is used to combine traffic conflicts from different sites, incorporating covariates and site-specific unobserved heterogeneity. Five non-stationary BHHM models, including Normal-GPD, Cauchy-GPD, Logistic-GPD, Gamma-GPD, and Lognormal-GPD models, were developed and compared. Traditional graphical diagnostic and quantile regression approaches were also used for comparison. Traffic conflicts collected from three signalized intersections in the city of Surrey, British Columbia were used for the study. The Bayesian approach is employed to estimate the threshold and other parameters in the non-stationary BHHM models. The results show that the proposed BHHM approach could estimate the threshold parameter objectively. The non-stationary BHHM models capture how the threshold varies dynamically across signal cycles in response to changing traffic status. The Lognormal-GPD model is superior to the other four BHHM models in terms of crash estimation accuracy and model fit. The crash estimates using the threshold determined by the BHHM outperform those estimated based on the graphical diagnostic and quantile regression approaches, indicating the superiority of the proposed threshold determination approach. The findings of this study contribute to enhancing the existing EVT methods for providing a threshold determination approach as well as producing reliable crash estimations.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.