行人肇事逃逸事故综合调查:应用 XGBoost 和二元 Logistic 回归模型

Ahmed Hossain, Xiaoduan Sun, A. S. Hasan, M. Jalayer, Julius Codjoe
{"title":"行人肇事逃逸事故综合调查:应用 XGBoost 和二元 Logistic 回归模型","authors":"Ahmed Hossain, Xiaoduan Sun, A. S. Hasan, M. Jalayer, Julius Codjoe","doi":"10.1177/03611981241262315","DOIUrl":null,"url":null,"abstract":"The present trend in the United States suggests that one in five pedestrian fatalities in motor vehicle crashes involves a hit-and-run, a serious traffic safety concern. The over-representation of pedestrian hit-and-run collisions necessitates a systemic data-driven investigation to uncover the contributing factors that cause fatalities or serious injuries. This study addressed two research questions (RQ), RQ1: What factors contribute to pedestrian hit-and-runs? RQ2: What causes hit-and-run pedestrian fatalities? This study addresses the RQs using the XGBoost algorithm (RQ1) and binary logistic regression model (RQ2) to analyze police-reported pedestrian crashes (2015–2019) in Louisiana. The XGBoost model was used to classify pedestrian hit-and-run crashes (hit-and-run = yes/no) and identified critical factors as predictors of pedestrian hit-and-run crashes including: primary contributing factors (pedestrian action, pedestrian violation, prior movement, pedestrian condition); settings (dark-with-streetlight, posted speed limit of > 55 mph, two-way road with physical separation); pedestrian characteristics (younger and older pedestrians, male gender, presence of dark clothing); and weekend. The binary logistic regression model was further used to identify critical high-risk hit-and-run scenarios resulting in fatal or severe injury of pedestrians. Some of the identified top factors are posted speed limit of 55 mph or higher (OR = 12.74), pedestrian impairment (OR = 4.77), older pedestrians (OR = 2.68), younger pedestrians (OR = 1.79), and dark-no-streetlight conditions (OR = 2.91). Both models showed strong relationships between pedestrian hit-and-run crashes and fatal or severe injuries (e.g., dark-with-streetlight, high-speed settings, older pedestrians, and pedestrian actions). Identifying these critical links can help policymakers, law enforcement agencies, and transportation authorities develop targeted interventions and strategies to address the risk factors.","PeriodicalId":309251,"journal":{"name":"Transportation Research Record: Journal of the Transportation Research Board","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comprehensive Investigation of Pedestrian Hit-and-Run Crashes: Applying XGBoost and Binary Logistic Regression Model\",\"authors\":\"Ahmed Hossain, Xiaoduan Sun, A. S. Hasan, M. Jalayer, Julius Codjoe\",\"doi\":\"10.1177/03611981241262315\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The present trend in the United States suggests that one in five pedestrian fatalities in motor vehicle crashes involves a hit-and-run, a serious traffic safety concern. The over-representation of pedestrian hit-and-run collisions necessitates a systemic data-driven investigation to uncover the contributing factors that cause fatalities or serious injuries. This study addressed two research questions (RQ), RQ1: What factors contribute to pedestrian hit-and-runs? RQ2: What causes hit-and-run pedestrian fatalities? This study addresses the RQs using the XGBoost algorithm (RQ1) and binary logistic regression model (RQ2) to analyze police-reported pedestrian crashes (2015–2019) in Louisiana. The XGBoost model was used to classify pedestrian hit-and-run crashes (hit-and-run = yes/no) and identified critical factors as predictors of pedestrian hit-and-run crashes including: primary contributing factors (pedestrian action, pedestrian violation, prior movement, pedestrian condition); settings (dark-with-streetlight, posted speed limit of > 55 mph, two-way road with physical separation); pedestrian characteristics (younger and older pedestrians, male gender, presence of dark clothing); and weekend. The binary logistic regression model was further used to identify critical high-risk hit-and-run scenarios resulting in fatal or severe injury of pedestrians. Some of the identified top factors are posted speed limit of 55 mph or higher (OR = 12.74), pedestrian impairment (OR = 4.77), older pedestrians (OR = 2.68), younger pedestrians (OR = 1.79), and dark-no-streetlight conditions (OR = 2.91). Both models showed strong relationships between pedestrian hit-and-run crashes and fatal or severe injuries (e.g., dark-with-streetlight, high-speed settings, older pedestrians, and pedestrian actions). Identifying these critical links can help policymakers, law enforcement agencies, and transportation authorities develop targeted interventions and strategies to address the risk factors.\",\"PeriodicalId\":309251,\"journal\":{\"name\":\"Transportation Research Record: Journal of the Transportation Research Board\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Record: Journal of the Transportation Research Board\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/03611981241262315\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Record: Journal of the Transportation Research Board","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/03611981241262315","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

美国目前的趋势表明,在机动车碰撞事故中,每五起行人死亡事故中就有一起涉及肇事逃逸,这是一个严重的交通安全问题。由于行人肇事逃逸的比例过高,有必要进行系统的数据驱动调查,以揭示导致死亡或重伤的诱因。本研究提出了两个研究问题(RQ):RQ1:导致行人肇事逃逸的因素有哪些?问题 2:导致行人肇事逃逸死亡的原因是什么?本研究使用 XGBoost 算法(RQ1)和二元逻辑回归模型(RQ2)来分析路易斯安那州警方报告的行人碰撞事故(2015-2019 年),从而解决上述研究问题。XGBoost 模型用于对行人肇事逃逸事故进行分类(肇事逃逸 = 是/否),并确定了预测行人肇事逃逸事故的关键因素,包括:主要诱因(行人行为、行人违规行为、之前的运动、行人状况);环境(有路灯的黑暗环境、公布的限速大于 55 英里/小时、有物理隔离的双向道路);行人特征(年轻和年长行人、男性、深色衣服的存在);以及周末。二元逻辑回归模型被进一步用于识别导致行人死亡或重伤的关键高风险肇事逃逸情况。确定的一些首要因素包括:张贴的限速为 55 英里/小时或以上(OR = 12.74)、行人受损(OR = 4.77)、年长行人(OR = 2.68)、年轻行人(OR = 1.79)和黑暗无路灯条件(OR = 2.91)。两个模型都显示出行人肇事逃逸事故与致命或严重伤害之间的密切关系(例如,有路灯的黑暗环境、高速环境、年长行人和行人行动)。确定这些关键联系有助于政策制定者、执法机构和交通管理部门制定有针对性的干预措施和策略来应对风险因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Comprehensive Investigation of Pedestrian Hit-and-Run Crashes: Applying XGBoost and Binary Logistic Regression Model
The present trend in the United States suggests that one in five pedestrian fatalities in motor vehicle crashes involves a hit-and-run, a serious traffic safety concern. The over-representation of pedestrian hit-and-run collisions necessitates a systemic data-driven investigation to uncover the contributing factors that cause fatalities or serious injuries. This study addressed two research questions (RQ), RQ1: What factors contribute to pedestrian hit-and-runs? RQ2: What causes hit-and-run pedestrian fatalities? This study addresses the RQs using the XGBoost algorithm (RQ1) and binary logistic regression model (RQ2) to analyze police-reported pedestrian crashes (2015–2019) in Louisiana. The XGBoost model was used to classify pedestrian hit-and-run crashes (hit-and-run = yes/no) and identified critical factors as predictors of pedestrian hit-and-run crashes including: primary contributing factors (pedestrian action, pedestrian violation, prior movement, pedestrian condition); settings (dark-with-streetlight, posted speed limit of > 55 mph, two-way road with physical separation); pedestrian characteristics (younger and older pedestrians, male gender, presence of dark clothing); and weekend. The binary logistic regression model was further used to identify critical high-risk hit-and-run scenarios resulting in fatal or severe injury of pedestrians. Some of the identified top factors are posted speed limit of 55 mph or higher (OR = 12.74), pedestrian impairment (OR = 4.77), older pedestrians (OR = 2.68), younger pedestrians (OR = 1.79), and dark-no-streetlight conditions (OR = 2.91). Both models showed strong relationships between pedestrian hit-and-run crashes and fatal or severe injuries (e.g., dark-with-streetlight, high-speed settings, older pedestrians, and pedestrian actions). Identifying these critical links can help policymakers, law enforcement agencies, and transportation authorities develop targeted interventions and strategies to address the risk factors.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信