{"title":"基于机器学习的因果推理,用于评估旅行行为研究中的干预措施:差异框架","authors":"Meng Zhou , Sixian Huang , Wei Tu , Donggen Wang","doi":"10.1016/j.tbs.2024.100852","DOIUrl":null,"url":null,"abstract":"<div><p>Causal inference with the difference-in-differences (DID) framework is popular in identifying causal effects with observational data and has started to be applied in recent travel behaviour studies. Most relevant transportation research adopts the conventional linear parametric DID model, which is known to be inflexible and restrictive. This study applies non-parametric DID estimators facilitated by machine learning (ML) models for causal inference in a variety of data scenarios. Semi-parametric and doubly robust estimators are established and integrated with the ML-based cross-fitting pipeline. Simulation studies and empirical case studies are conducted to showcase the ability of ML-based DID to detect causal effects from both simulated and real-world datasets. Results suggest that the proposed methods outperform conventional DID models in all data scenarios. Light working models are generally preferred over hyperparameter-dependent ones for their comparable performance, lower computational burden, and higher levels of compatibility to real-world empirical analysis. Empirical case studies also demonstrate how the proposed DID method could be applied to evaluate the impacts of various interventions on travel behaviour in different contexts. The present study adds to the existing travel behaviour literature by leveraging machine learning algorithms and non-parametric estimators to the impact evaluation of external interventions on travel characteristics and expanding the application of causal inference approaches in transportation research.</p></div>","PeriodicalId":51534,"journal":{"name":"Travel Behaviour and Society","volume":"37 ","pages":"Article 100852"},"PeriodicalIF":5.1000,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based causal inference for evaluating intervention in travel behaviour research: A difference-in-differences framework\",\"authors\":\"Meng Zhou , Sixian Huang , Wei Tu , Donggen Wang\",\"doi\":\"10.1016/j.tbs.2024.100852\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Causal inference with the difference-in-differences (DID) framework is popular in identifying causal effects with observational data and has started to be applied in recent travel behaviour studies. Most relevant transportation research adopts the conventional linear parametric DID model, which is known to be inflexible and restrictive. This study applies non-parametric DID estimators facilitated by machine learning (ML) models for causal inference in a variety of data scenarios. Semi-parametric and doubly robust estimators are established and integrated with the ML-based cross-fitting pipeline. Simulation studies and empirical case studies are conducted to showcase the ability of ML-based DID to detect causal effects from both simulated and real-world datasets. Results suggest that the proposed methods outperform conventional DID models in all data scenarios. Light working models are generally preferred over hyperparameter-dependent ones for their comparable performance, lower computational burden, and higher levels of compatibility to real-world empirical analysis. Empirical case studies also demonstrate how the proposed DID method could be applied to evaluate the impacts of various interventions on travel behaviour in different contexts. The present study adds to the existing travel behaviour literature by leveraging machine learning algorithms and non-parametric estimators to the impact evaluation of external interventions on travel characteristics and expanding the application of causal inference approaches in transportation research.</p></div>\",\"PeriodicalId\":51534,\"journal\":{\"name\":\"Travel Behaviour and Society\",\"volume\":\"37 \",\"pages\":\"Article 100852\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2024-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Travel Behaviour and Society\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2214367X24001157\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Travel Behaviour and Society","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214367X24001157","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0
摘要
差分(DID)框架的因果推论在利用观察数据确定因果效应方面很受欢迎,并开始应用于近期的出行行为研究中。大多数相关的交通研究都采用传统的线性参数 DID 模型,众所周知,该模型缺乏灵活性且具有限制性。本研究通过机器学习(ML)模型,将非参数 DID 估计器应用于各种数据情景下的因果推断。建立了半参数和双重稳健估计器,并与基于 ML 的交叉拟合管道集成。通过模拟研究和实证案例研究,展示了基于 ML 的 DID 从模拟和真实世界数据集中检测因果效应的能力。结果表明,所提出的方法在所有数据情况下都优于传统的 DID 模型。与超参数依赖模型相比,轻度工作模型通常更受青睐,因为它们具有可比的性能、更低的计算负担以及与现实世界实证分析更高的兼容性。实证案例研究还展示了如何将所提出的 DID 方法用于评估各种干预措施在不同情况下对旅行行为的影响。本研究利用机器学习算法和非参数估计器来评估外部干预措施对出行特征的影响,拓展了因果推理方法在交通研究中的应用,从而为现有的出行行为文献增添了新的内容。
Machine learning-based causal inference for evaluating intervention in travel behaviour research: A difference-in-differences framework
Causal inference with the difference-in-differences (DID) framework is popular in identifying causal effects with observational data and has started to be applied in recent travel behaviour studies. Most relevant transportation research adopts the conventional linear parametric DID model, which is known to be inflexible and restrictive. This study applies non-parametric DID estimators facilitated by machine learning (ML) models for causal inference in a variety of data scenarios. Semi-parametric and doubly robust estimators are established and integrated with the ML-based cross-fitting pipeline. Simulation studies and empirical case studies are conducted to showcase the ability of ML-based DID to detect causal effects from both simulated and real-world datasets. Results suggest that the proposed methods outperform conventional DID models in all data scenarios. Light working models are generally preferred over hyperparameter-dependent ones for their comparable performance, lower computational burden, and higher levels of compatibility to real-world empirical analysis. Empirical case studies also demonstrate how the proposed DID method could be applied to evaluate the impacts of various interventions on travel behaviour in different contexts. The present study adds to the existing travel behaviour literature by leveraging machine learning algorithms and non-parametric estimators to the impact evaluation of external interventions on travel characteristics and expanding the application of causal inference approaches in transportation research.
期刊介绍:
Travel Behaviour and Society is an interdisciplinary journal publishing high-quality original papers which report leading edge research in theories, methodologies and applications concerning transportation issues and challenges which involve the social and spatial dimensions. In particular, it provides a discussion forum for major research in travel behaviour, transportation infrastructure, transportation and environmental issues, mobility and social sustainability, transportation geographic information systems (TGIS), transportation and quality of life, transportation data collection and analysis, etc.