A hybrid data mining framework to investigate roadway departure crashes on rural two-lane Highways: Applying Fast and Frugal Tree with Association Rules Mining
Ahmed Hossain , Subasish Das , Xiaoduan Sun , Ahmed Sajid Hasan , Mohammad Jalayer , M.Ashifur Rahman
{"title":"A hybrid data mining framework to investigate roadway departure crashes on rural two-lane Highways: Applying Fast and Frugal Tree with Association Rules Mining","authors":"Ahmed Hossain , Subasish Das , Xiaoduan Sun , Ahmed Sajid Hasan , Mohammad Jalayer , M.Ashifur Rahman","doi":"10.1016/j.aap.2025.108066","DOIUrl":null,"url":null,"abstract":"<div><div>The complexity of factors contributing to roadway departure (RwD) crashes on rural highways necessitate advanced analytical approaches to enhance traffic safety. This study presents a hybrid data mining framework that combines the Fast and Frugal Tree (FFT) and Association Rules Mining (ARM) algorithms to identify the patterns of RwD crashes on rural 2-lane highways in Louisiana state. The research is focused on addressing two critical research questions (RQ), RQ1: Which variable features contribute to the fatal-severe RwD crashes? RQ2: Focusing on the identified top factors contributing to fatal-severe RwD crashes, how co-occurrence of different crash-contributing factors increase the likelihood of RwD crashes? For the analysis, this research team collected crash data from the Louisiana Department of Transportation and Development, encompassing a total of 22,406 unique RwD crashes on rural 2-lane highways. In the first stage (addressing RQ1), the FFT model identified the top variable features contributing to fatal-severe RwD crashes, including no use of seatbelts, alcohol-impaired driver condition, male gender, dark-no-streetlight, older drivers (>64 years), 12 am – 6 am, light truck, and so on. In the second stage (addressing RQ2), based on the factors identified by FFT, ARM explored how these factors interact and associate, revealing intricate drivers’ behavioral patterns related to RwD crashes. This comprehensive analysis uncovers not only the individual impact of these factors but also their combined effects, offering a deeper understanding of the dynamics of RwD crashes. This research contributes valuable insights into evidence-based, data-driven strategies to reduce the frequency and severity of RwD crashes on rural highways, advancing traffic safety initiatives and improving safety on rural 2-lane roadways.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"217 ","pages":"Article 108066"},"PeriodicalIF":5.7000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457525001526","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
The complexity of factors contributing to roadway departure (RwD) crashes on rural highways necessitate advanced analytical approaches to enhance traffic safety. This study presents a hybrid data mining framework that combines the Fast and Frugal Tree (FFT) and Association Rules Mining (ARM) algorithms to identify the patterns of RwD crashes on rural 2-lane highways in Louisiana state. The research is focused on addressing two critical research questions (RQ), RQ1: Which variable features contribute to the fatal-severe RwD crashes? RQ2: Focusing on the identified top factors contributing to fatal-severe RwD crashes, how co-occurrence of different crash-contributing factors increase the likelihood of RwD crashes? For the analysis, this research team collected crash data from the Louisiana Department of Transportation and Development, encompassing a total of 22,406 unique RwD crashes on rural 2-lane highways. In the first stage (addressing RQ1), the FFT model identified the top variable features contributing to fatal-severe RwD crashes, including no use of seatbelts, alcohol-impaired driver condition, male gender, dark-no-streetlight, older drivers (>64 years), 12 am – 6 am, light truck, and so on. In the second stage (addressing RQ2), based on the factors identified by FFT, ARM explored how these factors interact and associate, revealing intricate drivers’ behavioral patterns related to RwD crashes. This comprehensive analysis uncovers not only the individual impact of these factors but also their combined effects, offering a deeper understanding of the dynamics of RwD crashes. This research contributes valuable insights into evidence-based, data-driven strategies to reduce the frequency and severity of RwD crashes on rural highways, advancing traffic safety initiatives and improving safety on rural 2-lane roadways.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.