Does random slope hierarchical modeling always outperform random intercept counterpart? Accounting for unobserved heterogeneity in a real-time empirical analysis of critical crash occurrence

IF 2.4 3区工程技术 Q3 TRANSPORTATION

Journal of Transportation Safety & Security Pub Date : 2022-03-11 DOI:10.1080/19439962.2022.2048761

Arash Khoda Bakhshi, Mohamed M. Ahmed

{"title":"Does random slope hierarchical modeling always outperform random intercept counterpart? Accounting for unobserved heterogeneity in a real-time empirical analysis of critical crash occurrence","authors":"Arash Khoda Bakhshi, Mohamed M. Ahmed","doi":"10.1080/19439962.2022.2048761","DOIUrl":null,"url":null,"abstract":"Abstract Traffic crashes impose tremendous socio-economic losses on societies. To alleviate these concerns, countless traffic safety researches have shed light on the cognition of observable crash/crash severity contributing factors. Nonetheless, some influential factors might not be observable or measurable, referred to as unobserved heterogeneity, that could be accounted for by structuring random intercepts and slopes in hierarchical models. With this respect, although it is known random slopes can capture more unobserved heterogeneity, most previous studies utilized random intercepts to simplify result interpretations, indicating an inconsistency in the literature considering the hierarchical modeling specification. This study delves into the mentioned confusion within an empirical real-time clustering critical crashes, involving fatal or incapacitating injuries, versus non-critical crashes throughout 402-miles of Interstate-80 in Wyoming. The crash dataset was conflated with real-time traffic-related and environmental contributing factors. Regarding the inclusion of random intercepts and slopes, eleven Logistic regressions were conducted. As a data-dependent matter, results depicted random slopes, compared to random intercepts, do not necessarily enhance models’ out-of-sample predictive performance because they impose much more complexity on the models’ structure. Besides, considering the type of unobserved heterogeneity, if random slopes are required, random intercepts should be accompanied to allow data showing their true patterns.","PeriodicalId":46672,"journal":{"name":"Journal of Transportation Safety & Security","volume":"3 1","pages":"177 - 214"},"PeriodicalIF":2.4000,"publicationDate":"2022-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Transportation Safety & Security","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1080/19439962.2022.2048761","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TRANSPORTATION","Score":null,"Total":0}

引用次数: 10

Abstract

Abstract Traffic crashes impose tremendous socio-economic losses on societies. To alleviate these concerns, countless traffic safety researches have shed light on the cognition of observable crash/crash severity contributing factors. Nonetheless, some influential factors might not be observable or measurable, referred to as unobserved heterogeneity, that could be accounted for by structuring random intercepts and slopes in hierarchical models. With this respect, although it is known random slopes can capture more unobserved heterogeneity, most previous studies utilized random intercepts to simplify result interpretations, indicating an inconsistency in the literature considering the hierarchical modeling specification. This study delves into the mentioned confusion within an empirical real-time clustering critical crashes, involving fatal or incapacitating injuries, versus non-critical crashes throughout 402-miles of Interstate-80 in Wyoming. The crash dataset was conflated with real-time traffic-related and environmental contributing factors. Regarding the inclusion of random intercepts and slopes, eleven Logistic regressions were conducted. As a data-dependent matter, results depicted random slopes, compared to random intercepts, do not necessarily enhance models’ out-of-sample predictive performance because they impose much more complexity on the models’ structure. Besides, considering the type of unobserved heterogeneity, if random slopes are required, random intercepts should be accompanied to allow data showing their true patterns.

查看原文本刊更多论文

随机斜率分层模型总是优于随机截距模型吗?在关键事故发生的实时实证分析中考虑未观察到的异质性

交通事故给社会造成了巨大的社会经济损失。为了减轻这些担忧，无数的交通安全研究揭示了对可观察到的碰撞/碰撞严重程度影响因素的认知。尽管如此，一些影响因素可能无法观察到或测量，称为未观察到的异质性，这可以通过构建分层模型中的随机截距和斜率来解释。在这方面，尽管已知随机斜率可以捕获更多未观察到的异质性，但大多数先前的研究使用随机截距来简化结果解释，这表明考虑到分层建模规范，文献中存在不一致。这项研究深入研究了在怀俄明州80号州际公路402英里范围内，涉及致命或致残伤害的关键事故与非关键事故的经验实时集群中所提到的混乱。碰撞数据集与实时交通相关和环境因素相结合。关于随机截距和斜率的纳入，进行了11次Logistic回归。作为一个数据依赖的问题，与随机截距相比，描述随机斜率的结果不一定能提高模型的样本外预测性能，因为它们对模型的结构施加了更多的复杂性。此外，考虑到未观测到的异质性类型，如果需要随机斜率，则应伴随着随机截距，以使数据显示其真实模式。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Transportation Safety & Security TRANSPORTATION-

CiteScore

6.00

自引率

15.40%

发文量