对不完整碰撞数据进行可靠估算,以预测驾驶员受伤严重程度

IF 5.7 1区 工程技术 Q1 ERGONOMICS
Xiaowei Gao , Xinke Jiang , Dingyi Zhuang , James Haworth , Shenhao Wang , Ilya Ilyankou , Huanfa Chen
{"title":"对不完整碰撞数据进行可靠估算,以预测驾驶员受伤严重程度","authors":"Xiaowei Gao ,&nbsp;Xinke Jiang ,&nbsp;Dingyi Zhuang ,&nbsp;James Haworth ,&nbsp;Shenhao Wang ,&nbsp;Ilya Ilyankou ,&nbsp;Huanfa Chen","doi":"10.1016/j.aap.2025.108020","DOIUrl":null,"url":null,"abstract":"<div><div>Traffic crash analyses are frequently challenged by incomplete documentation, particularly in standardised multi-party crash full records. Traditional imputation methods like MICE and KNN, while effective for single-category analyses, fail to address the complex interdependencies inherent in standardised crash records where different types of road user are present. This study introduces a novel graph-based imputation framework that integrates an Inexact Match Bipartite-Graph with Contrastive Learning in a Transformer-GNN architecture, providing a unified solution to handle missing data of various crash types in a complete crash record database. Testing on UK traffic crash records (2018–2022) demonstrates the robust performance of the imputation model, achieving imputation accuracy between 99.24% and 94.74% across missing data rates from 10% to 70%. In the downstream task of classifying the severity of the injury, our imputed data set proved to be highly reliable, achieving a Gmean score of 62.19% to identify levels of imbalanced severity, even under severe missing with a missing rate of 70%. Furthermore, explainable SHAP values demonstrated that data imputation preserved the most important contributing factors. These results validate our framework’s effectiveness in maintaining both data integrity and essential relationship structures in standardised crash records, advancing the field of traffic safety analysis through improved imputation methodology.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"216 ","pages":"Article 108020"},"PeriodicalIF":5.7000,"publicationDate":"2025-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reliable imputation of incomplete crash data for predicting driver injury severity\",\"authors\":\"Xiaowei Gao ,&nbsp;Xinke Jiang ,&nbsp;Dingyi Zhuang ,&nbsp;James Haworth ,&nbsp;Shenhao Wang ,&nbsp;Ilya Ilyankou ,&nbsp;Huanfa Chen\",\"doi\":\"10.1016/j.aap.2025.108020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Traffic crash analyses are frequently challenged by incomplete documentation, particularly in standardised multi-party crash full records. Traditional imputation methods like MICE and KNN, while effective for single-category analyses, fail to address the complex interdependencies inherent in standardised crash records where different types of road user are present. This study introduces a novel graph-based imputation framework that integrates an Inexact Match Bipartite-Graph with Contrastive Learning in a Transformer-GNN architecture, providing a unified solution to handle missing data of various crash types in a complete crash record database. Testing on UK traffic crash records (2018–2022) demonstrates the robust performance of the imputation model, achieving imputation accuracy between 99.24% and 94.74% across missing data rates from 10% to 70%. In the downstream task of classifying the severity of the injury, our imputed data set proved to be highly reliable, achieving a Gmean score of 62.19% to identify levels of imbalanced severity, even under severe missing with a missing rate of 70%. Furthermore, explainable SHAP values demonstrated that data imputation preserved the most important contributing factors. These results validate our framework’s effectiveness in maintaining both data integrity and essential relationship structures in standardised crash records, advancing the field of traffic safety analysis through improved imputation methodology.</div></div>\",\"PeriodicalId\":6926,\"journal\":{\"name\":\"Accident; analysis and prevention\",\"volume\":\"216 \",\"pages\":\"Article 108020\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-04-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accident; analysis and prevention\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S000145752500106X\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ERGONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S000145752500106X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0

摘要

交通事故分析经常会受到不完整记录的挑战,特别是在标准化的多方事故完整记录中。MICE 和 KNN 等传统估算方法虽然对单一类别分析有效,但无法解决标准化碰撞记录中存在的不同类型道路使用者之间复杂的相互依存关系。本研究介绍了一种新颖的基于图的估算框架,该框架在 Transformer-GNN 架构中整合了非精确匹配双分部图和对比学习,为处理完整碰撞记录数据库中各种碰撞类型的缺失数据提供了统一的解决方案。对英国交通事故记录(2018-2022 年)的测试证明了该估算模型的强大性能,在数据缺失率为 10% 到 70% 的情况下,估算准确率达到 99.24% 到 94.74%。在对伤害严重程度进行分类的下游任务中,我们的估算数据集被证明是高度可靠的,即使在缺失率为 70% 的严重缺失情况下,也能达到 62.19% 的 Gmean 分数,以识别不平衡的严重程度。此外,可解释的 SHAP 值表明,数据估算保留了最重要的促成因素。这些结果验证了我们的框架在保持标准化碰撞记录中的数据完整性和基本关系结构方面的有效性,并通过改进的归因方法推进了交通安全分析领域的发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Reliable imputation of incomplete crash data for predicting driver injury severity
Traffic crash analyses are frequently challenged by incomplete documentation, particularly in standardised multi-party crash full records. Traditional imputation methods like MICE and KNN, while effective for single-category analyses, fail to address the complex interdependencies inherent in standardised crash records where different types of road user are present. This study introduces a novel graph-based imputation framework that integrates an Inexact Match Bipartite-Graph with Contrastive Learning in a Transformer-GNN architecture, providing a unified solution to handle missing data of various crash types in a complete crash record database. Testing on UK traffic crash records (2018–2022) demonstrates the robust performance of the imputation model, achieving imputation accuracy between 99.24% and 94.74% across missing data rates from 10% to 70%. In the downstream task of classifying the severity of the injury, our imputed data set proved to be highly reliable, achieving a Gmean score of 62.19% to identify levels of imbalanced severity, even under severe missing with a missing rate of 70%. Furthermore, explainable SHAP values demonstrated that data imputation preserved the most important contributing factors. These results validate our framework’s effectiveness in maintaining both data integrity and essential relationship structures in standardised crash records, advancing the field of traffic safety analysis through improved imputation methodology.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
11.90
自引率
16.90%
发文量
264
审稿时长
48 days
期刊介绍: Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信