Development of a City-wide Traffic Accident Prediction Model Using Hybrid Machine-based Learning Approaches
Young Woong Kim, Dong Woo Lee, Ajin Hwang
Crisis and Emergency Management: Theory and Praxis (Journal Article), published 2023-03-31
DOI: 10.14251/jscm.2023.3.57 (https://doi.org/10.14251/jscm.2023.3.57)
Citation count: 0
Abstract
Predicting traffic accidents is challenging because accounting for uncertainty in accident modeling is not trivial. To address this, this article develops a hybrid modeling pipeline that combines unsupervised and supervised learning to predict the hazard level of road sites and to explore the causality of accidents while controlling for unobserved heterogeneity. Traffic accident data for Won-ju province, Korea, from 2020 to 2021 are combined at the road-link level with external factors affecting traffic accidents, such as average travel speed and weather information. Within the pipeline, a clustering technique is adopted to capture unobserved heterogeneity among roads. Because traffic accident data contain a wide variety of categorical and hierarchical features, ensemble methods such as boosting are applied to handle heterogeneity among these features. To explore the relationship between accidents and their determinant factors, model-agnostic methods are adopted to interpret the results of the machine learning models. Because model-agnostic methods generally present their results as images, this study also adds a process that extracts text from those images to overcome compatibility issues with existing road safety management systems.
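The hybrid pipeline described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not the authors' implementation: the data are synthetic, the feature names (speed, rainfall) are hypothetical, k-means stands in for the unspecified clustering technique, scikit-learn's gradient boosting stands in for the boosting method, and permutation importance stands in for the model-agnostic interpretation step. The image-to-text extraction step is not covered.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split


def make_synthetic_links(n_links=400, seed=0):
    """Generate made-up road-link features and labels (not the study's data)."""
    rng = np.random.default_rng(seed)
    speed = rng.uniform(20, 90, n_links)  # hypothetical average travel speed
    rain = rng.uniform(0, 30, n_links)    # hypothetical rainfall amount
    X = np.column_stack([speed, rain])
    # Hypothetical rule: higher speed and more rain raise the hazard score.
    risk = 0.04 * speed + 0.08 * rain + rng.normal(0, 0.5, n_links)
    y = (risk > np.median(risk)).astype(int)  # 1 = hazardous link
    return X, y


def fit_hybrid(X, y, n_clusters=3, seed=0):
    """Cluster the links, feed the cluster id to a boosted model, then interpret it."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=seed
    )
    # Unsupervised step: cluster ids act as a proxy for unobserved heterogeneity.
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit(X_train)
    X_train_aug = np.column_stack([X_train, km.labels_])
    X_test_aug = np.column_stack([X_test, km.predict(X_test)])
    # Supervised step: boosting on the original features plus the cluster id.
    model = GradientBoostingClassifier(random_state=seed).fit(X_train_aug, y_train)
    # Model-agnostic step: permutation importance on held-out links.
    imp = permutation_importance(
        model, X_test_aug, y_test, n_repeats=5, random_state=seed
    )
    return model.score(X_test_aug, y_test), imp.importances_mean


if __name__ == "__main__":
    accuracy, importances = fit_hybrid(*make_synthetic_links())
    print("held-out accuracy:", accuracy)
    print("importance (speed, rain, cluster id):", importances)
```

In this sketch the cluster id is appended as an ordinary numeric column, which tree-based models tolerate; a real pipeline might instead one-hot encode it or fit a separate model per cluster.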