Causal relationship discovery for highway crash analysis using semi-data-driven Bayesian network.

IF 6.2 1区 工程技术 Q1 ERGONOMICS
Accident; analysis and prevention Pub Date : 2025-10-01 Epub Date: 2025-08-07 DOI:10.1016/j.aap.2025.108181
Yifan Wang, Xuesong Wang
{"title":"Causal relationship discovery for highway crash analysis using semi-data-driven Bayesian network.","authors":"Yifan Wang, Xuesong Wang","doi":"10.1016/j.aap.2025.108181","DOIUrl":null,"url":null,"abstract":"<p><p>With the widespread application of advanced machine learning techniques, researchers need a more transparent decision-making process. The data-driven causal relationship discovery techniques often lack interpretability. Therefore, a semi-data-driven Bayesian network structure learning algorithm, the Expert Knowledge Constraint-based (EKC) algorithm, is proposed. By integrating expert knowledge with conditional independence tests, the EKC algorithm constructs a causal Bayesian network with a high level of interpretability. The algorithm was applied to a highway safety scene using crash data collected in 2022 from the HuNing Highway in China. The effects of the Bayesian network on variables were estimated using the Bayesian estimation algorithm, and the most dangerous scenarios were ranked using the variable elimination algorithm. Key findings include: (1) date-related variables do not directly affect crashes; (2) unfavorable temperatures, medium-level traffic volumes, and snowy weather conditions are associated with higher crash probabilities; and (3) the highest crash probability occurs under medium traffic volume, cold temperatures, winter season, cloudy weather, morning hours, and weekdays. The EKC algorithm was compared with the Hill Climbing algorithm, Chow-Liu Trees algorithm, and logistic model, demonstrating significant improvements in interpretability while maintaining good fitting scores. Furthermore, the definition framework of model interpretability in traffic crash analytics was discussed, including causality, trust, heterogeneity, transferability, and stability.</p>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"221 ","pages":"108181"},"PeriodicalIF":6.2000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.aap.2025.108181","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/7 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0

Abstract

With the widespread application of advanced machine learning techniques, researchers need a more transparent decision-making process. The data-driven causal relationship discovery techniques often lack interpretability. Therefore, a semi-data-driven Bayesian network structure learning algorithm, the Expert Knowledge Constraint-based (EKC) algorithm, is proposed. By integrating expert knowledge with conditional independence tests, the EKC algorithm constructs a causal Bayesian network with a high level of interpretability. The algorithm was applied to a highway safety scene using crash data collected in 2022 from the HuNing Highway in China. The effects of the Bayesian network on variables were estimated using the Bayesian estimation algorithm, and the most dangerous scenarios were ranked using the variable elimination algorithm. Key findings include: (1) date-related variables do not directly affect crashes; (2) unfavorable temperatures, medium-level traffic volumes, and snowy weather conditions are associated with higher crash probabilities; and (3) the highest crash probability occurs under medium traffic volume, cold temperatures, winter season, cloudy weather, morning hours, and weekdays. The EKC algorithm was compared with the Hill Climbing algorithm, Chow-Liu Trees algorithm, and logistic model, demonstrating significant improvements in interpretability while maintaining good fitting scores. Furthermore, the definition framework of model interpretability in traffic crash analytics was discussed, including causality, trust, heterogeneity, transferability, and stability.

基于半数据驱动贝叶斯网络的公路碰撞分析因果关系发现。
随着先进机器学习技术的广泛应用,研究人员需要一个更透明的决策过程。数据驱动的因果关系发现技术往往缺乏可解释性。为此,提出了一种半数据驱动的贝叶斯网络结构学习算法——基于专家知识约束(EKC)算法。EKC算法通过将专家知识与条件独立检验相结合,构建了一个具有高可解释性的因果贝叶斯网络。该算法应用于高速公路安全场景,使用了2022年从中国沪宁高速公路收集的碰撞数据。使用贝叶斯估计算法估计贝叶斯网络对变量的影响,并使用变量消除算法对最危险的场景进行排序。主要发现包括:(1)日期相关变量不直接影响崩溃;(2)不利温度、中等交通量和降雪天气条件与较高的碰撞概率相关;(3)中等交通量、低温、冬季、多云天气、早晨时段和工作日发生碰撞的概率最高。将EKC算法与Hill climb算法、Chow-Liu树算法和logistic模型进行比较,结果表明在保持良好拟合分数的同时,EKC算法的可解释性有了显著提高。讨论了交通事故分析中模型可解释性的定义框架,包括因果关系、信任、异质性、可转移性和稳定性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
11.90
自引率
16.90%
发文量
264
审稿时长
48 days
期刊介绍: Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信