RUMBoost:梯度提升随机效用模型

IF 7.6 1区 工程技术 Q1 TRANSPORTATION SCIENCE & TECHNOLOGY
Nicolas Salvadé, Tim Hillel
{"title":"RUMBoost:梯度提升随机效用模型","authors":"Nicolas Salvadé,&nbsp;Tim Hillel","doi":"10.1016/j.trc.2024.104897","DOIUrl":null,"url":null,"abstract":"<div><div>This paper introduces the RUMBoost model, a novel discrete choice modelling approach that combines the interpretability and behavioural robustness of Random Utility Models (RUMs) with the generalisation and predictive ability of tree-based ensemble methods. We obtain the full functional form of non-linear utility specifications by replacing each linear parameter in the utility functions of a RUM with an ensemble of gradient boosted regression trees. We introduce additional constraints on the ensembles to ensure three crucial features of the utility specifications: (i) dependency of the utilities of each alternative on only the attributes of that alternative, (ii) monotonicity of marginal utilities, and (iii) an intrinsically interpretable functional form, where the exact response of the model is known throughout the entire input space. Furthermore, we introduce an optimisation-based smoothing technique that replaces the piece-wise constant utility values of alternative attributes with monotonic piece-wise cubic splines to identify non-linear parameters with defined gradient. We demonstrate the potential of RUMBoost compared to various benchmark ML and Random Utility models for revealed and stated preference mode choice data as well as a semi-synthetic example. The results highlight both the great predictive performance and the direct interpretability of our proposed approach, allowing for the identification of complex behaviours associated with different alternatives. The smoothed attribute utility functions allow for the calculation of various behavioural indicators such as the Value of Time (VoT) and marginal utilities. Finally, we demonstrate the flexibility of our methodology by showing how the RUMBoost model can be extended to complex model specifications, including attribute interactions, correlation within alternative error terms (Nested Logit model) and heterogeneity within the population (Mixed Logit model).</div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"170 ","pages":"Article 104897"},"PeriodicalIF":7.6000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RUMBoost: Gradient boosted random utility models\",\"authors\":\"Nicolas Salvadé,&nbsp;Tim Hillel\",\"doi\":\"10.1016/j.trc.2024.104897\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper introduces the RUMBoost model, a novel discrete choice modelling approach that combines the interpretability and behavioural robustness of Random Utility Models (RUMs) with the generalisation and predictive ability of tree-based ensemble methods. We obtain the full functional form of non-linear utility specifications by replacing each linear parameter in the utility functions of a RUM with an ensemble of gradient boosted regression trees. We introduce additional constraints on the ensembles to ensure three crucial features of the utility specifications: (i) dependency of the utilities of each alternative on only the attributes of that alternative, (ii) monotonicity of marginal utilities, and (iii) an intrinsically interpretable functional form, where the exact response of the model is known throughout the entire input space. Furthermore, we introduce an optimisation-based smoothing technique that replaces the piece-wise constant utility values of alternative attributes with monotonic piece-wise cubic splines to identify non-linear parameters with defined gradient. We demonstrate the potential of RUMBoost compared to various benchmark ML and Random Utility models for revealed and stated preference mode choice data as well as a semi-synthetic example. The results highlight both the great predictive performance and the direct interpretability of our proposed approach, allowing for the identification of complex behaviours associated with different alternatives. The smoothed attribute utility functions allow for the calculation of various behavioural indicators such as the Value of Time (VoT) and marginal utilities. Finally, we demonstrate the flexibility of our methodology by showing how the RUMBoost model can be extended to complex model specifications, including attribute interactions, correlation within alternative error terms (Nested Logit model) and heterogeneity within the population (Mixed Logit model).</div></div>\",\"PeriodicalId\":54417,\"journal\":{\"name\":\"Transportation Research Part C-Emerging Technologies\",\"volume\":\"170 \",\"pages\":\"Article 104897\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part C-Emerging Technologies\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0968090X24004182\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X24004182","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

本文介绍了 RUMBoost 模型,这是一种新颖的离散选择建模方法,它将随机效用模型(RUM)的可解释性和行为稳健性与基于树的集合方法的泛化和预测能力相结合。我们用梯度提升回归树集合取代 RUM 效用函数中的每个线性参数,从而获得非线性效用规格的完整函数形式。我们对集合引入了额外的约束条件,以确保效用规范的三个关键特征:(i) 每种选择的效用仅依赖于该选择的属性;(ii) 边际效用的单调性;(iii) 本质上可解释的函数形式,即模型的确切响应在整个输入空间都是已知的。此外,我们还引入了一种基于优化的平滑技术,用单调的片断三次样条来替代替代属性的片断恒定效用值,以识别具有确定梯度的非线性参数。我们展示了 RUMBoost 与各种基准 ML 模型和随机效用模型相比在揭示偏好和陈述偏好模式选择数据以及半合成示例方面的潜力。结果凸显了我们提出的方法的强大预测性能和直接可解释性,从而可以识别与不同替代品相关的复杂行为。平滑属性效用函数允许计算各种行为指标,如时间价值(VoT)和边际效用。最后,我们展示了 RUMBoost 模型如何扩展到复杂的模型规格,包括属性交互、替代误差项内的相关性(嵌套 Logit 模型)和人口内的异质性(混合 Logit 模型),从而证明了我们方法的灵活性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
RUMBoost: Gradient boosted random utility models
This paper introduces the RUMBoost model, a novel discrete choice modelling approach that combines the interpretability and behavioural robustness of Random Utility Models (RUMs) with the generalisation and predictive ability of tree-based ensemble methods. We obtain the full functional form of non-linear utility specifications by replacing each linear parameter in the utility functions of a RUM with an ensemble of gradient boosted regression trees. We introduce additional constraints on the ensembles to ensure three crucial features of the utility specifications: (i) dependency of the utilities of each alternative on only the attributes of that alternative, (ii) monotonicity of marginal utilities, and (iii) an intrinsically interpretable functional form, where the exact response of the model is known throughout the entire input space. Furthermore, we introduce an optimisation-based smoothing technique that replaces the piece-wise constant utility values of alternative attributes with monotonic piece-wise cubic splines to identify non-linear parameters with defined gradient. We demonstrate the potential of RUMBoost compared to various benchmark ML and Random Utility models for revealed and stated preference mode choice data as well as a semi-synthetic example. The results highlight both the great predictive performance and the direct interpretability of our proposed approach, allowing for the identification of complex behaviours associated with different alternatives. The smoothed attribute utility functions allow for the calculation of various behavioural indicators such as the Value of Time (VoT) and marginal utilities. Finally, we demonstrate the flexibility of our methodology by showing how the RUMBoost model can be extended to complex model specifications, including attribute interactions, correlation within alternative error terms (Nested Logit model) and heterogeneity within the population (Mixed Logit model).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
15.80
自引率
12.00%
发文量
332
审稿时长
64 days
期刊介绍: Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信