Integrating environmental clustering to enhance epidemic forecasting with machine learning models

Yosra Didi , Ahlam Walha , Ali Wali
{"title":"Integrating environmental clustering to enhance epidemic forecasting with machine learning models","authors":"Yosra Didi ,&nbsp;Ahlam Walha ,&nbsp;Ali Wali","doi":"10.1016/j.ijcce.2025.06.001","DOIUrl":null,"url":null,"abstract":"<div><div>The COVID-19 pandemic underscored the urgent need for more accurate and adaptive forecasting models to support public health decision-making and limit disease spread. However, many existing models overlook the influence of environmental and climatic factors that significantly affect transmission dynamics. This study addresses this gap with a novel forecasting framework that integrates environmental data into predictive modelling. Our key contributions are threefold: (1) we analyse the relationship between environmental variables (temperature, humidity, and air quality) and COVID-19 trends across countries; (2) we propose a two-stage approach combining K-means clustering to group countries based on environmental conditions, followed by region-specific machine learning models using Support Vector Regression (SVR), Prophet, and Long Short-Term Memory (LSTM) networks for both univariate and multivariate time series forecasting; and (3) we demonstrate that LSTM significantly outperforms other models, achieving superior accuracy for 30-day COVID-19 case predictions. Our results highlight the importance of incorporating environmental variables in epidemic modelling and offer a practical tool for more targeted and effective public health responses. This research provides actionable insights that can inform the design of climate-aware forecasting systems for future pandemic preparedness.</div></div>","PeriodicalId":100694,"journal":{"name":"International Journal of Cognitive Computing in Engineering","volume":"6 ","pages":"Pages 628-642"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Computing in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666307425000300","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The COVID-19 pandemic underscored the urgent need for more accurate and adaptive forecasting models to support public health decision-making and limit disease spread. However, many existing models overlook the influence of environmental and climatic factors that significantly affect transmission dynamics. This study addresses this gap with a novel forecasting framework that integrates environmental data into predictive modelling. Our key contributions are threefold: (1) we analyse the relationship between environmental variables (temperature, humidity, and air quality) and COVID-19 trends across countries; (2) we propose a two-stage approach combining K-means clustering to group countries based on environmental conditions, followed by region-specific machine learning models using Support Vector Regression (SVR), Prophet, and Long Short-Term Memory (LSTM) networks for both univariate and multivariate time series forecasting; and (3) we demonstrate that LSTM significantly outperforms other models, achieving superior accuracy for 30-day COVID-19 case predictions. Our results highlight the importance of incorporating environmental variables in epidemic modelling and offer a practical tool for more targeted and effective public health responses. This research provides actionable insights that can inform the design of climate-aware forecasting systems for future pandemic preparedness.
将环境聚类与机器学习模型相结合,增强流行病预测
COVID-19大流行凸显了迫切需要更准确和适应性更强的预测模型,以支持公共卫生决策和限制疾病传播。然而,许多现有模型忽略了环境和气候因素的影响,这些因素对传动动力学有显著影响。本研究用一种新颖的预测框架解决了这一差距,该框架将环境数据整合到预测模型中。我们的主要贡献有三个方面:(1)我们分析了各国环境变量(温度、湿度和空气质量)与COVID-19趋势之间的关系;(2)我们提出了一种结合K-means聚类的两阶段方法,根据环境条件对国家进行分组,然后使用支持向量回归(SVR)、Prophet和长短期记忆(LSTM)网络的区域特定机器学习模型进行单变量和多变量时间序列预测;(3)我们证明LSTM显著优于其他模型,在30天的COVID-19病例预测中取得了卓越的准确性。我们的研究结果强调了将环境变量纳入流行病建模的重要性,并为更有针对性和更有效的公共卫生响应提供了实用工具。这项研究提供了可行的见解,可以为未来大流行防范的气候感知预测系统的设计提供信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
13.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信