Machine Learning Potential for Identifying and Forecasting Complex Environmental Drivers of Vibrio vulnificus Infections in the United States.

IF 10.1 1区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Environmental Health Perspectives Pub Date : 2025-01-01 Epub Date: 2025-01-23 DOI:10.1289/EHP15593
Amy Marie Campbell, Jordi Manuel Cabrera-Gumbau, Joaquin Trinanes, Craig Baker-Austin, Jaime Martinez-Urtaza
{"title":"Machine Learning Potential for Identifying and Forecasting Complex Environmental Drivers of <i>Vibrio vulnificus</i> Infections in the United States.","authors":"Amy Marie Campbell, Jordi Manuel Cabrera-Gumbau, Joaquin Trinanes, Craig Baker-Austin, Jaime Martinez-Urtaza","doi":"10.1289/EHP15593","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Environmental change in coastal areas can drive marine bacteria and resulting infections, such as those caused by <i>Vibrio vulnificus</i>, with both foodborne and nonfoodborne exposure routes and high mortality. Although ecological drivers of <i>V. vulnificus</i> in the environment have been well-characterized, fewer models have been able to apply this to human infection risk due to limited surveillance.</p><p><strong>Objectives: </strong>The Cholera and Other <i>Vibrio</i> Illness Surveillance (COVIS) system database has reported <i>V. vulnificus</i> infections in the United States since 1988, offering a unique opportunity to both explore the forecasting capabilities machine learning could provide and to characterize complex environmental drivers of <i>V. vulnificus</i> infections.</p><p><strong>Methods: </strong>Machine learning models, in the form of random forest classification models, were trained and refined using the epidemiological data from 2008 to 2018, six environmental variables (sea surface temperature, salinity, chlorophyll <i>a</i> concentration, sea level, land surface temperature, and runoff rate) and categorical encoders to assess our predictive potential to forecast <i>V. vulnificus</i> infections based on environmental data.</p><p><strong>Results: </strong>The highest-performing model, which used balanced classes, had an Area Under the Curve score of 0.984 and a sensitivity of 0.971, highlighting the potential of machine learning to anticipate areas and periods of <i>V. vulnificus</i> risk. A higher false positive rate was found when the model was applied to real-world imbalanced surveillance data, which is pertinent amid modeled underreporting and misdiagnosis ratios of <i>V. vulnificus</i> infections. Further models were also developed to explore multilevel spatial resolution, finding state-specific models can improve specificity and early warning system potential by exclusively using lagged environmental data.</p><p><strong>Discussion: </strong>The machine learning approach was able to characterize nonlinear and interacting environmental associations driving <i>V. vulnificus</i> infections. This study accentuates the potential of machine learning and robust surveillance for forecasting environmentally associated marine infections, providing future directions for improvements, further application, and operationalization. https://doi.org/10.1289/EHP15593.</p>","PeriodicalId":11862,"journal":{"name":"Environmental Health Perspectives","volume":"133 1","pages":"17006"},"PeriodicalIF":10.1000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11756857/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Health Perspectives","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1289/EHP15593","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/23 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Environmental change in coastal areas can drive marine bacteria and resulting infections, such as those caused by Vibrio vulnificus, with both foodborne and nonfoodborne exposure routes and high mortality. Although ecological drivers of V. vulnificus in the environment have been well-characterized, fewer models have been able to apply this to human infection risk due to limited surveillance.

Objectives: The Cholera and Other Vibrio Illness Surveillance (COVIS) system database has reported V. vulnificus infections in the United States since 1988, offering a unique opportunity to both explore the forecasting capabilities machine learning could provide and to characterize complex environmental drivers of V. vulnificus infections.

Methods: Machine learning models, in the form of random forest classification models, were trained and refined using the epidemiological data from 2008 to 2018, six environmental variables (sea surface temperature, salinity, chlorophyll a concentration, sea level, land surface temperature, and runoff rate) and categorical encoders to assess our predictive potential to forecast V. vulnificus infections based on environmental data.

Results: The highest-performing model, which used balanced classes, had an Area Under the Curve score of 0.984 and a sensitivity of 0.971, highlighting the potential of machine learning to anticipate areas and periods of V. vulnificus risk. A higher false positive rate was found when the model was applied to real-world imbalanced surveillance data, which is pertinent amid modeled underreporting and misdiagnosis ratios of V. vulnificus infections. Further models were also developed to explore multilevel spatial resolution, finding state-specific models can improve specificity and early warning system potential by exclusively using lagged environmental data.

Discussion: The machine learning approach was able to characterize nonlinear and interacting environmental associations driving V. vulnificus infections. This study accentuates the potential of machine learning and robust surveillance for forecasting environmentally associated marine infections, providing future directions for improvements, further application, and operationalization. https://doi.org/10.1289/EHP15593.

在美国识别和预测创伤弧菌感染的复杂环境驱动因素的机器学习潜力。
背景:沿海地区的环境变化可驱动海洋细菌和由此引起的感染,例如由创伤弧菌引起的感染,具有食源性和非食源性暴露途径和高死亡率。虽然环境中创伤弧菌的生态驱动因素已经得到了很好的表征,但由于监测有限,很少有模型能够将其应用于人类感染风险。目的:自1988年以来,霍乱和其他弧菌疾病监测(COVIS)系统数据库报告了美国创伤弧菌感染,为探索机器学习可以提供的预测能力和表征创伤弧菌感染的复杂环境驱动因素提供了独特的机会。方法:利用2008 - 2018年流行病学数据、6个环境变量(海面温度、盐度、叶绿素a浓度、海平面、地表温度和径流率)和分类编码器,以随机森林分类模型的形式训练和改进机器学习模型,评估基于环境数据预测创伤弧菌感染的预测潜力。结果:使用平衡类的表现最好的模型,曲线下面积得分为0.984,灵敏度为0.971,突出了机器学习预测创伤弧菌风险区域和时期的潜力。当模型应用于现实世界的不平衡监测数据时,发现假阳性率较高,这与模型中创伤弧菌感染的低报和误诊率有关。进一步的模型也被开发来探索多层次的空间分辨率,发现特定于状态的模型可以通过只使用滞后的环境数据来提高特异性和早期预警系统的潜力。讨论:机器学习方法能够表征驱动创伤弧菌感染的非线性和相互作用的环境关联。这项研究强调了机器学习和强大监测在预测与环境相关的海洋感染方面的潜力,为改进、进一步应用和操作提供了未来的方向。https://doi.org/10.1289/EHP15593。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Health Perspectives
Environmental Health Perspectives 环境科学-公共卫生、环境卫生与职业卫生
CiteScore
14.40
自引率
2.90%
发文量
388
审稿时长
6 months
期刊介绍: Environmental Health Perspectives (EHP) is a monthly peer-reviewed journal supported by the National Institute of Environmental Health Sciences, part of the National Institutes of Health under the U.S. Department of Health and Human Services. Its mission is to facilitate discussions on the connections between the environment and human health by publishing top-notch research and news. EHP ranks third in Public, Environmental, and Occupational Health, fourth in Toxicology, and fifth in Environmental Sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信