Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine Learning.

IF 1.9 Q3 HEALTH CARE SCIENCES & SERVICES
MDM Policy and Practice Pub Date : 2024-01-29 eCollection Date: 2024-01-01 DOI:10.1177/23814683231222469
Giovanni S P Malloy, Lisa B Puglisi, Kristofer B Bucklen, Tyler D Harvey, Emily A Wang, Margaret L Brandeau
{"title":"Predicting COVID-19 Outbreaks in Correctional Facilities Using Machine Learning.","authors":"Giovanni S P Malloy, Lisa B Puglisi, Kristofer B Bucklen, Tyler D Harvey, Emily A Wang, Margaret L Brandeau","doi":"10.1177/23814683231222469","DOIUrl":null,"url":null,"abstract":"<p><p><b>Introduction.</b> The risk of infectious disease transmission, including COVID-19, is disproportionately high in correctional facilities due to close living conditions, relatively low levels of vaccination, and reduced access to testing and treatment. While much progress has been made on describing and mitigating COVID-19 and other infectious disease risk in jails and prisons, there are open questions about which data can best predict future outbreaks. <b>Methods.</b> We used facility data and demographic and health data collected from 24 prison facilities in the Pennsylvania Department of Corrections from March 2020 to May 2021 to determine which sources of data best predict a coming COVID-19 outbreak in a prison facility. We used machine learning methods to cluster the prisons into groups based on similar facility-level characteristics, including size, rurality, and demographics of incarcerated people. We developed logistic regression classification models to predict for each cluster, before and after vaccine availability, whether there would be no cases, an outbreak defined as 2 or more cases, or a large outbreak, defined as 10 or more cases in the next 1, 2, and 3 d. We compared these predictions to data on outbreaks that occurred. <b>Results.</b> Facilities were divided into 8 clusters of sizes varying from 1 to 7 facilities per cluster. We trained 60 logistic regressions; 20 had test sets with between 35% and 65% of days with outbreaks detected. Of these, 8 logistic regressions correctly predicted the occurrence of an outbreak more than 55% of the time. The most common predictive feature was incident cases among the incarcerated population from 2 to 32 d prior. Other predictive features included the number of tests administered from 1 to 33 d prior, total population, test positivity rate, and county deaths, hospitalizations, and incident cases. Cumulative cases, vaccination rates, and race, ethnicity, or age statistics for incarcerated populations were generally not predictive. <b>Conclusions.</b> County-level measures of COVID-19, facility population, and test positivity rate appear as potential promising predictors of COVID-19 outbreaks in correctional facilities, suggesting that correctional facilities should monitor community transmission in addition to facility transmission to inform future outbreak response decisions. These efforts should not be limited to COVID-19 but should include any large-scale infectious disease outbreak that may involve institution-community transmission.</p><p><strong>Highlights: </strong>The risk of infectious disease transmission, including COVID-19, is disproportionately high in correctional facilities.We used machine learning methods with data collected from 24 prison facilities in the Pennsylvania Department of Corrections to determine which sources of data best predict a coming COVID-19 outbreak in a prison facility.Key predictors included county-level measures of COVID-19, facility population, and the test positivity rate in a facility.Fortifying correctional facilities with the ability to monitor local community rates of infection (e.g., though improved interagency collaboration and data sharing) along with continued testing of incarcerated people and staff can help correctional facilities better predict-and respond to-future infectious disease outbreaks.</p>","PeriodicalId":36567,"journal":{"name":"MDM Policy and Practice","volume":"9 1","pages":"23814683231222469"},"PeriodicalIF":1.9000,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10826393/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MDM Policy and Practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/23814683231222469","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction. The risk of infectious disease transmission, including COVID-19, is disproportionately high in correctional facilities due to close living conditions, relatively low levels of vaccination, and reduced access to testing and treatment. While much progress has been made on describing and mitigating COVID-19 and other infectious disease risk in jails and prisons, there are open questions about which data can best predict future outbreaks. Methods. We used facility data and demographic and health data collected from 24 prison facilities in the Pennsylvania Department of Corrections from March 2020 to May 2021 to determine which sources of data best predict a coming COVID-19 outbreak in a prison facility. We used machine learning methods to cluster the prisons into groups based on similar facility-level characteristics, including size, rurality, and demographics of incarcerated people. We developed logistic regression classification models to predict for each cluster, before and after vaccine availability, whether there would be no cases, an outbreak defined as 2 or more cases, or a large outbreak, defined as 10 or more cases in the next 1, 2, and 3 d. We compared these predictions to data on outbreaks that occurred. Results. Facilities were divided into 8 clusters of sizes varying from 1 to 7 facilities per cluster. We trained 60 logistic regressions; 20 had test sets with between 35% and 65% of days with outbreaks detected. Of these, 8 logistic regressions correctly predicted the occurrence of an outbreak more than 55% of the time. The most common predictive feature was incident cases among the incarcerated population from 2 to 32 d prior. Other predictive features included the number of tests administered from 1 to 33 d prior, total population, test positivity rate, and county deaths, hospitalizations, and incident cases. Cumulative cases, vaccination rates, and race, ethnicity, or age statistics for incarcerated populations were generally not predictive. Conclusions. County-level measures of COVID-19, facility population, and test positivity rate appear as potential promising predictors of COVID-19 outbreaks in correctional facilities, suggesting that correctional facilities should monitor community transmission in addition to facility transmission to inform future outbreak response decisions. These efforts should not be limited to COVID-19 but should include any large-scale infectious disease outbreak that may involve institution-community transmission.

Highlights: The risk of infectious disease transmission, including COVID-19, is disproportionately high in correctional facilities.We used machine learning methods with data collected from 24 prison facilities in the Pennsylvania Department of Corrections to determine which sources of data best predict a coming COVID-19 outbreak in a prison facility.Key predictors included county-level measures of COVID-19, facility population, and the test positivity rate in a facility.Fortifying correctional facilities with the ability to monitor local community rates of infection (e.g., though improved interagency collaboration and data sharing) along with continued testing of incarcerated people and staff can help correctional facilities better predict-and respond to-future infectious disease outbreaks.

利用机器学习预测惩教机构中 COVID-19 的爆发。
导言。由于生活条件恶劣、疫苗接种率相对较低以及检测和治疗机会减少,包括 COVID-19 在内的传染病传播风险在惩教机构中格外高。尽管在描述和降低 COVID-19 以及其他传染病在监狱和看守所的传播风险方面已经取得了很大进展,但关于哪些数据可以最好地预测未来的疫情爆发,仍有很多问题有待解决。方法。我们使用了从 2020 年 3 月到 2021 年 5 月从宾夕法尼亚州惩教署的 24 所监狱设施收集的设施数据、人口和健康数据,以确定哪些数据源最能预测监狱设施中即将爆发的 COVID-19 疫情。我们使用机器学习方法,根据类似的设施级特征(包括规模、乡村化程度和被监禁者的人口统计学特征)将监狱分组。我们建立了逻辑回归分类模型,以预测每个群组在疫苗供应前后是否会出现无病例、爆发(定义为 2 例或更多病例)或大规模爆发(定义为未来 1 天、2 天和 3 天内出现 10 例或更多病例)。我们将这些预测与已发生的疫情数据进行了比较。结果。医疗机构被分为 8 个群组,每个群组的规模从 1 到 7 个不等。我们对 60 个逻辑回归进行了训练;其中 20 个测试集检测到的疫情爆发天数在 35% 到 65% 之间。其中,8 个逻辑回归在 55% 以上的时间内正确预测了疫情的发生。最常见的预测特征是监禁人群在 2 到 32 天前出现的病例。其他预测特征包括 1 至 33 d 前的检测次数、总人口、检测阳性率以及县级死亡、住院和发病病例。累计病例、疫苗接种率以及被监禁人群的种族、民族或年龄统计数据一般不具有预测性。结论。县级 COVID-19、惩教机构人口和检测阳性率似乎是惩教机构中 COVID-19 爆发的潜在预测因素,这表明惩教机构除了监测惩教机构的传播情况外,还应监测社区的传播情况,以便为未来的疫情应对决策提供信息。这些工作不应仅限于 COVID-19,还应包括任何可能涉及机构-社区传播的大规模传染病疫情:我们使用机器学习方法,利用从宾夕法尼亚州惩教署 24 所监狱设施收集的数据,确定哪些数据源最能预测监狱设施中即将爆发的 COVID-19 疫情。主要预测因素包括 COVID-19 的县级衡量标准、设施人口以及设施中的检测阳性率、加强惩教机构监测当地社区感染率的能力(例如,通过改善机构间合作和数据共享),同时继续对在押人员和工作人员进行检测,可以帮助惩教机构更好地预测和应对未来的传染病爆发。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
MDM Policy and Practice
MDM Policy and Practice Medicine-Health Policy
CiteScore
2.50
自引率
0.00%
发文量
28
审稿时长
15 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信