A human-in-the-loop ensemble fusion framework for road crash prediction: coping with imbalanced heterogeneous data from the driver-vehicle-environment system
Authors: Dauha Elamrani Abou Elassad, Zouhair Elamrani Abou Elassad, Abdel Majid Ed-Dahbi, Othmane El Meslouhi, Mustapha Kardouchi, Moulay Akhloufi, Nusrat Jahan
Journal: Transportation Letters: The International Journal of Transportation Research, vol. 17, no. 5, pp. 827-843 (published 2025-05-28)
DOI: 10.1080/19427867.2024.2392063
URL: https://www.sciencedirect.com/org/science/article/pii/S1942786724000699
Publication type: Journal Article. Impact factor: 3.3. JCR: Q2 (Transportation). Open access: no.
Citations: 0
Abstract
Road accidents are an inevitable aspect of daily life, and predicting crashes is crucial for minimizing disruptions and advancing intelligent transportation technologies. This study aims to design an ensemble fusion decision system using various base classifiers and a meta-classifier to improve crash prediction efficiency within the driver-vehicle-environment system. We adopted a data-driven strategy to analyze four categories of features (driver demographics, vehicle telemetry, driver inputs, and environmental conditions) collected from a driving simulator. Optimized modeling strategies using AdaBoost, XGBoost, GBM, LightGBM, and CatBoost were implemented. Statistical logit models were also used to assess the likelihood of crashes and the correlations among key variables. Three resampling strategies (SMOTE-TL, SMOTE-ENN, and ADASYN) were employed to address class imbalance. The best performance was achieved with GBM, XGBoost, and AdaBoost as base classifiers, SMOTE-TL for balancing, and CatBoost as the meta-classifier, with 89.78% precision, 95.69% recall, and 92.64% F1-score.
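The three reported metrics are related: the F1-score is the harmonic mean of precision and recall. As a quick consistency check (not part of the authors' pipeline), the short sketch below recomputes F1 from the precision and recall quoted in the abstract. The function name `f1_from_precision_recall` and the variable names are illustrative assumptions, not taken from the paper.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (fractions in [0, 1])."""
    if precision + recall == 0:
        # Both metrics are zero, so F1 is defined as zero here.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values quoted in the abstract for the best configuration, as fractions.
reported_precision = 0.8978
reported_recall = 0.9569
reported_f1 = 0.9264

computed_f1 = f1_from_precision_recall(reported_precision, reported_recall)

# Compare at the two-decimal percentage resolution used in the abstract.
is_consistent = abs(computed_f1 - reported_f1) < 5e-5
print(f"computed F1 = {computed_f1:.4f}, consistent with abstract: {is_consistent}")
```

Under these inputs the recomputed value agrees with the reported F1-score at the precision given in the abstract. Recall exceeds precision, so the best configuration misses few crashes at the cost of somewhat more false alarms.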
About the journal:
Transportation Letters: The International Journal of Transportation Research is a quarterly journal that publishes high-quality peer-reviewed and mini-review papers as well as technical notes and book reviews on the state-of-the-art in transportation research.
The focus of Transportation Letters is on analytical and empirical findings, methodological papers, and theoretical and conceptual insights across all areas of research. Review resource papers that merge descriptions of the state-of-the-art with innovative and new methodological, theoretical, and conceptual insights spanning all areas of transportation research are invited and of particular interest.