Two-Stage Interrupted Time Series Analysis with Machine Learning: Evaluating the Health Effects of the 2018 Wildfire Smoke Event in San Francisco County as a Case Study.
IF 5 2区 医学Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Arnab K Dey, Yiqun Ma, Gabriel Carrasco-Escobar, Changwoo Han, François Rerolle, Tarik Benmarhnia
{"title":"Two-Stage Interrupted Time Series Analysis with Machine Learning: Evaluating the Health Effects of the 2018 Wildfire Smoke Event in San Francisco County as a Case Study.","authors":"Arnab K Dey, Yiqun Ma, Gabriel Carrasco-Escobar, Changwoo Han, François Rerolle, Tarik Benmarhnia","doi":"10.1093/aje/kwaf147","DOIUrl":null,"url":null,"abstract":"<p><p>Randomized controlled trials (RCTs) are considered a key identification strategy for establishing causal relationships between exposures and outcomes. When evaluating the health impacts of extreme weather events, however, RCTs are generally infeasible due to ethical issues, costs, and the lack of a suitable control group. Quasi-experimental designs capitalizing on the timing of natural experiments, such as Interrupted Time Series (ITS), offer a valuable alternative to estimate causal effects when control groups are not available. This paper explores the application of a two-stage ITS framework that compares traditional autoregressive integrated moving average (ARIMA) models and two machine learning algorithms: Neural Network Autoregressive (NNETAR) and Prophet-Extreme Gradient Boosting (XGBoost). As a case study, we assess the impacts of the 2018 wildfire smoke event on respiratory hospitalizations in San Francisco County, California. We split the data into pre- and post-event periods to train and evaluate the models, perform cross-validation for hyperparameter tuning, and predict hospitalizations under the counterfactual scenario. Data and R code are provided for reproducibility. In the case study, the Prophet-XGBoost shows the best model performance and was used to generate the counterfactual trends. We estimate that the 2018 smoke event resulted in a total of 92 (95% empirical confidence interval: 24, 125) excess respiratory hospitalizations (12.5% of the observed hospitalization count during the event period). Our proposed approach offers a powerful tool for assessing the effects of extreme weather events and can be broadly applied to other epidemiological contexts, such as public health policy evaluation.</p>","PeriodicalId":7472,"journal":{"name":"American journal of epidemiology","volume":" ","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/aje/kwaf147","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Randomized controlled trials (RCTs) are considered a key identification strategy for establishing causal relationships between exposures and outcomes. When evaluating the health impacts of extreme weather events, however, RCTs are generally infeasible due to ethical issues, costs, and the lack of a suitable control group. Quasi-experimental designs capitalizing on the timing of natural experiments, such as Interrupted Time Series (ITS), offer a valuable alternative to estimate causal effects when control groups are not available. This paper explores the application of a two-stage ITS framework that compares traditional autoregressive integrated moving average (ARIMA) models and two machine learning algorithms: Neural Network Autoregressive (NNETAR) and Prophet-Extreme Gradient Boosting (XGBoost). As a case study, we assess the impacts of the 2018 wildfire smoke event on respiratory hospitalizations in San Francisco County, California. We split the data into pre- and post-event periods to train and evaluate the models, perform cross-validation for hyperparameter tuning, and predict hospitalizations under the counterfactual scenario. Data and R code are provided for reproducibility. In the case study, the Prophet-XGBoost shows the best model performance and was used to generate the counterfactual trends. We estimate that the 2018 smoke event resulted in a total of 92 (95% empirical confidence interval: 24, 125) excess respiratory hospitalizations (12.5% of the observed hospitalization count during the event period). Our proposed approach offers a powerful tool for assessing the effects of extreme weather events and can be broadly applied to other epidemiological contexts, such as public health policy evaluation.
期刊介绍:
The American Journal of Epidemiology is the oldest and one of the premier epidemiologic journals devoted to the publication of empirical research findings, opinion pieces, and methodological developments in the field of epidemiologic research.
It is a peer-reviewed journal aimed at both fellow epidemiologists and those who use epidemiologic data, including public health workers and clinicians.