在模拟老年司机交通违规严重程度时考虑多尺度建筑环境:可解释的机器学习框架

IF 5.7 1区 工程技术 Q1 ERGONOMICS
Zhiyuan Sun , Zhoumeng Ai , Zehao Wang , Jianyu Wang , Xin Gu , Duo Wang , Huapu Lu , Yanyan Chen
{"title":"在模拟老年司机交通违规严重程度时考虑多尺度建筑环境:可解释的机器学习框架","authors":"Zhiyuan Sun ,&nbsp;Zhoumeng Ai ,&nbsp;Zehao Wang ,&nbsp;Jianyu Wang ,&nbsp;Xin Gu ,&nbsp;Duo Wang ,&nbsp;Huapu Lu ,&nbsp;Yanyan Chen","doi":"10.1016/j.aap.2024.107740","DOIUrl":null,"url":null,"abstract":"<div><p>The causes of traffic violations by elderly drivers are different from those of other age groups. To reduce serious traffic violations that are more likely to cause serious traffic crashes, this study divided the severity of traffic violations into three levels (i.e., slight, ordinary, severe) based on point deduction, and explore the patterns of serious traffic violations (i.e., ordinary, severe) using multi-source data. This paper designed an interpretable machine learning framework, in which four popular machine learning models were enhanced and compared. Specifically, adaptive synthetic sampling method was applied to overcome the effects of imbalanced data and improve the prediction accuracy of minority classes (i.e., ordinary, severe); multi-objective feature selection based on NSGA-II was used to remove the redundant factors to increase the computational efficiency and make the patterns discovered by the explainer more effective; Bayesian hyperparameter optimization aimed to obtain more effective hyperparameters combination with fewer iterations and boost the model adaptability. Results show that the proposed interpretable machine learning framework can significantly improve and distinguish the performance of four popular machine learning models and two post-hoc interpretation methods. It is found that six of the top ten important factors belong to multi-scale built environment attributes. By comparing the results of feature contribution and interaction effects, some findings can be summarized: ordinary and severe traffic violations have some identical influencing factors and interactive effects; have the same influencing factors or the same combinations of influencing factors, but the values of the factors are different; have some unique influencing factors and unique combinations of influencing factors.</p></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"207 ","pages":"Article 107740"},"PeriodicalIF":5.7000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Considering multi-scale built environment in modeling severity of traffic violations by elderly drivers: An interpretable machine learning framework\",\"authors\":\"Zhiyuan Sun ,&nbsp;Zhoumeng Ai ,&nbsp;Zehao Wang ,&nbsp;Jianyu Wang ,&nbsp;Xin Gu ,&nbsp;Duo Wang ,&nbsp;Huapu Lu ,&nbsp;Yanyan Chen\",\"doi\":\"10.1016/j.aap.2024.107740\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The causes of traffic violations by elderly drivers are different from those of other age groups. To reduce serious traffic violations that are more likely to cause serious traffic crashes, this study divided the severity of traffic violations into three levels (i.e., slight, ordinary, severe) based on point deduction, and explore the patterns of serious traffic violations (i.e., ordinary, severe) using multi-source data. This paper designed an interpretable machine learning framework, in which four popular machine learning models were enhanced and compared. Specifically, adaptive synthetic sampling method was applied to overcome the effects of imbalanced data and improve the prediction accuracy of minority classes (i.e., ordinary, severe); multi-objective feature selection based on NSGA-II was used to remove the redundant factors to increase the computational efficiency and make the patterns discovered by the explainer more effective; Bayesian hyperparameter optimization aimed to obtain more effective hyperparameters combination with fewer iterations and boost the model adaptability. Results show that the proposed interpretable machine learning framework can significantly improve and distinguish the performance of four popular machine learning models and two post-hoc interpretation methods. It is found that six of the top ten important factors belong to multi-scale built environment attributes. By comparing the results of feature contribution and interaction effects, some findings can be summarized: ordinary and severe traffic violations have some identical influencing factors and interactive effects; have the same influencing factors or the same combinations of influencing factors, but the values of the factors are different; have some unique influencing factors and unique combinations of influencing factors.</p></div>\",\"PeriodicalId\":6926,\"journal\":{\"name\":\"Accident; analysis and prevention\",\"volume\":\"207 \",\"pages\":\"Article 107740\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2024-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accident; analysis and prevention\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0001457524002859\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ERGONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457524002859","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0

摘要

老年驾驶员交通违法的原因与其他年龄段的驾驶员不同。为了减少更容易引发严重交通事故的严重交通违法行为,本研究根据扣分情况将交通违法行为的严重程度分为三个等级(即轻微、一般、严重),并利用多源数据探索严重交通违法行为(即一般、严重)的规律。本文设计了一个可解释的机器学习框架,在此框架内对四种流行的机器学习模型进行了增强和比较。具体来说,采用自适应合成采样方法克服了不平衡数据的影响,提高了对少数类别(即普通、严重)的预测精度;采用基于 NSGA-II 的多目标特征选择去除冗余因子,提高了计算效率,使解释器发现的模式更加有效;贝叶斯超参数优化旨在以更少的迭代次数获得更有效的超参数组合,提高模型的适应性。结果表明,所提出的可解释机器学习框架能显著改善和区分四种流行的机器学习模型和两种事后解释方法的性能。研究发现,排名前十的重要因素中有六个属于多尺度建筑环境属性。通过比较特征贡献和交互效应的结果,可以总结出一些结论:普通交通违法行为和严重交通违法行为具有一些相同的影响因素和交互效应;具有相同的影响因素或相同的影响因素组合,但因素值不同;具有一些独特的影响因素和独特的影响因素组合。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Considering multi-scale built environment in modeling severity of traffic violations by elderly drivers: An interpretable machine learning framework

The causes of traffic violations by elderly drivers are different from those of other age groups. To reduce serious traffic violations that are more likely to cause serious traffic crashes, this study divided the severity of traffic violations into three levels (i.e., slight, ordinary, severe) based on point deduction, and explore the patterns of serious traffic violations (i.e., ordinary, severe) using multi-source data. This paper designed an interpretable machine learning framework, in which four popular machine learning models were enhanced and compared. Specifically, adaptive synthetic sampling method was applied to overcome the effects of imbalanced data and improve the prediction accuracy of minority classes (i.e., ordinary, severe); multi-objective feature selection based on NSGA-II was used to remove the redundant factors to increase the computational efficiency and make the patterns discovered by the explainer more effective; Bayesian hyperparameter optimization aimed to obtain more effective hyperparameters combination with fewer iterations and boost the model adaptability. Results show that the proposed interpretable machine learning framework can significantly improve and distinguish the performance of four popular machine learning models and two post-hoc interpretation methods. It is found that six of the top ten important factors belong to multi-scale built environment attributes. By comparing the results of feature contribution and interaction effects, some findings can be summarized: ordinary and severe traffic violations have some identical influencing factors and interactive effects; have the same influencing factors or the same combinations of influencing factors, but the values of the factors are different; have some unique influencing factors and unique combinations of influencing factors.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
11.90
自引率
16.90%
发文量
264
审稿时长
48 days
期刊介绍: Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信