Evaluating the predictive performance of different data sources to forecast overdose deaths at the neighborhood level with machine learning in Rhode Island
John C. Halifax , Bennett Allen , Claire Pratty , Victoria Jent , Alexandra Skinner , Magdalena Cerdá , Brandon D.L. Marshall , Daniel B. Neill , Jennifer Ahern
{"title":"Evaluating the predictive performance of different data sources to forecast overdose deaths at the neighborhood level with machine learning in Rhode Island","authors":"John C. Halifax , Bennett Allen , Claire Pratty , Victoria Jent , Alexandra Skinner , Magdalena Cerdá , Brandon D.L. Marshall , Daniel B. Neill , Jennifer Ahern","doi":"10.1016/j.ypmed.2025.108276","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>To evaluate the predictive performance of different data sources to forecast fatal overdose in Rhode Island neighborhoods, with the goal of providing a template for other jurisdictions interested in predictive analytics to direct overdose prevention resources.</div></div><div><h3>Methods</h3><div>We evaluated seven combinations of data from six administrative data sources (American Community Survey (ACS) five-year estimates, built environment, emergency medical services non-fatal overdose response, prescription drug monitoring program, carceral release, and historical fatal overdose data). Fatal overdoses in Rhode Island census block groups (CBGs) were predicted using two machine learning approaches: linear regressions and random forests embedded in a nested cross-validation design. We evaluated performance using mean squared error and the percentage of statewide overdoses captured by CBGs forecast to be in top percentiles from 2019 to 2021.</div></div><div><h3>Results</h3><div>Linear models trained on ACS data combined with one other data source performed well, and comparably to models trained on all available data. Those including emergency medical service, prescription drug monitoring program, or carceral release data with ACS data achieved a priori goals for percentage of statewide overdoses captured by CBGs prioritized by models on average.</div></div><div><h3>Conclusions</h3><div>Prioritizing neighborhoods for overdose prevention with forecasting is feasible using a simple-to-implement model trained on publicly available ACS data combined with only one other administrative data source in Rhode Island, offering a starting point for other jurisdictions.</div></div>","PeriodicalId":20339,"journal":{"name":"Preventive medicine","volume":"194 ","pages":"Article 108276"},"PeriodicalIF":4.3000,"publicationDate":"2025-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Preventive medicine","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0091743525000593","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives
To evaluate the predictive performance of different data sources to forecast fatal overdose in Rhode Island neighborhoods, with the goal of providing a template for other jurisdictions interested in predictive analytics to direct overdose prevention resources.
Methods
We evaluated seven combinations of data from six administrative data sources (American Community Survey (ACS) five-year estimates, built environment, emergency medical services non-fatal overdose response, prescription drug monitoring program, carceral release, and historical fatal overdose data). Fatal overdoses in Rhode Island census block groups (CBGs) were predicted using two machine learning approaches: linear regressions and random forests embedded in a nested cross-validation design. We evaluated performance using mean squared error and the percentage of statewide overdoses captured by CBGs forecast to be in top percentiles from 2019 to 2021.
Results
Linear models trained on ACS data combined with one other data source performed well, and comparably to models trained on all available data. Those including emergency medical service, prescription drug monitoring program, or carceral release data with ACS data achieved a priori goals for percentage of statewide overdoses captured by CBGs prioritized by models on average.
Conclusions
Prioritizing neighborhoods for overdose prevention with forecasting is feasible using a simple-to-implement model trained on publicly available ACS data combined with only one other administrative data source in Rhode Island, offering a starting point for other jurisdictions.
期刊介绍:
Founded in 1972 by Ernst Wynder, Preventive Medicine is an international scholarly journal that provides prompt publication of original articles on the science and practice of disease prevention, health promotion, and public health policymaking. Preventive Medicine aims to reward innovation. It will favor insightful observational studies, thoughtful explorations of health data, unsuspected new angles for existing hypotheses, robust randomized controlled trials, and impartial systematic reviews. Preventive Medicine''s ultimate goal is to publish research that will have an impact on the work of practitioners of disease prevention and health promotion, as well as of related disciplines.