Analyzing feature importance for older pedestrian crash severity: A comparative study of DNN models, emphasizing road and vehicle types with SHAP interpretation
{"title":"Analyzing feature importance for older pedestrian crash severity: A comparative study of DNN models, emphasizing road and vehicle types with SHAP interpretation","authors":"Rocksana Akter , Susilawati Susilawati , Hamza Zubair , Wai Tong Chor","doi":"10.1016/j.multra.2025.100203","DOIUrl":null,"url":null,"abstract":"<div><div>Recognizing the importance of road safety modeling, the study explores Deep Neural Networks (DNN) with features like hidden layers, batch normalization, Rectified Linear Unit (ReLU) activation, and dropout to predict crash severity, interpreting decisions using SHapley Additive exPlanations (SHAP) for crashes involving older pedestrians. The objective is to understand features influencing crashes involving older pedestrians, including vehicle attributes, road and environmental conditions, and temporal parameters. The analysis focused on 1808 pedestrian crashes involving individuals aged 65 and over at intersections in Victoria, Australia. This dataset comprises 6.14% fatalities, 52.38% serious injuries, and 41.48% incidents with other injuries. The study evaluated three DNN models for crash severity prediction, with the two hidden layers DNN model excelling in precision metrics and achieving a perfect Area Under the Receiver Operating Characteristics curve for fatalities. Compared to XGBoost, the DNN models demonstrated superior performance in predicting severe outcomes. SHAP analysis on the two hidden layers DNN model highlighted key factors influencing crash severity, offering insights into the nuanced relationships between features and predictions. The analysis highlighted the significance of variables like Traffic Control, Vehicle Type, and Movement in predicting fatalities and serious injuries. This study emphasizes the importance of considering Road and Vehicle Types to understand their roles in accident severity and identify interventions to reduce risks. Neglecting these factors may lead to incomplete or biased conclusions about crash outcomes. This research provides valuable insights for improving road safety, highlighting the effectiveness of SHAP force plots, bars, beeswarm plots, and dependency plots in enhancing clarity and understanding of DNN model predictions. These tools help identify the impact of features on crash severity.</div></div>","PeriodicalId":100933,"journal":{"name":"Multimodal Transportation","volume":"4 2","pages":"Article 100203"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multimodal Transportation","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772586325000176","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Recognizing the importance of road safety modeling, the study explores Deep Neural Networks (DNN) with features like hidden layers, batch normalization, Rectified Linear Unit (ReLU) activation, and dropout to predict crash severity, interpreting decisions using SHapley Additive exPlanations (SHAP) for crashes involving older pedestrians. The objective is to understand features influencing crashes involving older pedestrians, including vehicle attributes, road and environmental conditions, and temporal parameters. The analysis focused on 1808 pedestrian crashes involving individuals aged 65 and over at intersections in Victoria, Australia. This dataset comprises 6.14% fatalities, 52.38% serious injuries, and 41.48% incidents with other injuries. The study evaluated three DNN models for crash severity prediction, with the two hidden layers DNN model excelling in precision metrics and achieving a perfect Area Under the Receiver Operating Characteristics curve for fatalities. Compared to XGBoost, the DNN models demonstrated superior performance in predicting severe outcomes. SHAP analysis on the two hidden layers DNN model highlighted key factors influencing crash severity, offering insights into the nuanced relationships between features and predictions. The analysis highlighted the significance of variables like Traffic Control, Vehicle Type, and Movement in predicting fatalities and serious injuries. This study emphasizes the importance of considering Road and Vehicle Types to understand their roles in accident severity and identify interventions to reduce risks. Neglecting these factors may lead to incomplete or biased conclusions about crash outcomes. This research provides valuable insights for improving road safety, highlighting the effectiveness of SHAP force plots, bars, beeswarm plots, and dependency plots in enhancing clarity and understanding of DNN model predictions. These tools help identify the impact of features on crash severity.