Preventing Traffic Accidents Through Machine Learning Predictive Models

Tarikwa Tesfa Bedane, Beakal Gizachew Assefa, Sudhir Kumar Mohapatra
{"title":"Preventing Traffic Accidents Through Machine Learning Predictive Models","authors":"Tarikwa Tesfa Bedane, Beakal Gizachew Assefa, Sudhir Kumar Mohapatra","doi":"10.1109/ict4da53266.2021.9672249","DOIUrl":null,"url":null,"abstract":"Road Traffic Accidents (RTA) are a serious issue of societies resulting in huge losses at the economic and social levels and responsible for millions of deaths and injuries every year in the world. For instance, in Ethiopia, the number of deaths due to traffic accidents is increasing from one year to another. Addis Ababa is one of the popular and known cities that encounter a high number of RTAs due to the increasing number of vehicles and population. The main objective of this paper is to apply machine learning algorithms to predict the accident severity and identify the major causes of accidents in crowded cities (application of Addis Ababa city). The required data are collected from Addis Ababa city police departments and 12316 records of the accident are used for data analysis. We applied seven machine learning classification algorithms (Logistic Regression, Naive Bayes, Decision Tree, Support Vector Machine, K Nearest Neighbor, Random Forest, and AdaBoost) for predicting accident severity and compared the performance to choose the best model. We applied random undersampling and SMOTE oversampling techniques to handle the class imbalance nature of the dependent features and Principal Component Analysis (PCA) for dimension reduction. The experimental result shows that Random Forest achieved a 93.76% F1 score with SMOTE over-sampled data set and about 18% feature size reduction. Moreover, light condition, driving experience, age band of the driver, type of road lane, and types of junctions are identified as major determinant factors of the accident. According to this study, these are major factors to RTA and need to be considered in the design of infrastructure, regulations and policies to reduce accidents.","PeriodicalId":371663,"journal":{"name":"2021 International Conference on Information and Communication Technology for Development for Africa (ICT4DA)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information and Communication Technology for Development for Africa (ICT4DA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ict4da53266.2021.9672249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Road Traffic Accidents (RTA) are a serious issue of societies resulting in huge losses at the economic and social levels and responsible for millions of deaths and injuries every year in the world. For instance, in Ethiopia, the number of deaths due to traffic accidents is increasing from one year to another. Addis Ababa is one of the popular and known cities that encounter a high number of RTAs due to the increasing number of vehicles and population. The main objective of this paper is to apply machine learning algorithms to predict the accident severity and identify the major causes of accidents in crowded cities (application of Addis Ababa city). The required data are collected from Addis Ababa city police departments and 12316 records of the accident are used for data analysis. We applied seven machine learning classification algorithms (Logistic Regression, Naive Bayes, Decision Tree, Support Vector Machine, K Nearest Neighbor, Random Forest, and AdaBoost) for predicting accident severity and compared the performance to choose the best model. We applied random undersampling and SMOTE oversampling techniques to handle the class imbalance nature of the dependent features and Principal Component Analysis (PCA) for dimension reduction. The experimental result shows that Random Forest achieved a 93.76% F1 score with SMOTE over-sampled data set and about 18% feature size reduction. Moreover, light condition, driving experience, age band of the driver, type of road lane, and types of junctions are identified as major determinant factors of the accident. According to this study, these are major factors to RTA and need to be considered in the design of infrastructure, regulations and policies to reduce accidents.
通过机器学习预测模型预防交通事故
道路交通事故是社会的一个严重问题,在经济和社会层面造成巨大损失,每年在世界上造成数百万人死亡和受伤。例如,在埃塞俄比亚,交通事故造成的死亡人数每年都在增加。由于车辆和人口的增加,亚的斯亚贝巴是一个受欢迎和知名的城市,遇到了大量的rta。本文的主要目标是应用机器学习算法来预测事故严重程度,并确定拥挤城市中事故的主要原因(亚的斯亚贝巴市的应用)。所需数据从亚的斯亚贝巴市警察部门收集,并使用12316事故记录进行数据分析。我们应用了七种机器学习分类算法(逻辑回归、朴素贝叶斯、决策树、支持向量机、K近邻、随机森林和AdaBoost)来预测事故严重程度,并比较了性能以选择最佳模型。我们采用随机欠采样和SMOTE过采样技术来处理相关特征的类不平衡性质,并采用主成分分析(PCA)进行降维。实验结果表明,在SMOTE过采样数据集上,随机森林的F1得分达到了93.76%,特征尺寸缩小了18%左右。此外,光照条件、驾驶经验、驾驶员年龄、车道类型和路口类型被认为是事故的主要决定因素。根据本研究,这些都是影响RTA的主要因素,需要在基础设施、法规和政策的设计中加以考虑,以减少事故。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信