Analyzing dengue outbreak patterns using integrated machine learning approaches: A study in Bangladesh.

IF 2.3 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES
Health Informatics Journal Pub Date : 2025-07-01 Epub Date: 2025-09-18 DOI:10.1177/14604582251381159
Tanvir Ahammad, Apurbo Kormokar, Sabina Yasmin, Selina Sharmin
{"title":"Analyzing dengue outbreak patterns using integrated machine learning approaches: A study in Bangladesh.","authors":"Tanvir Ahammad, Apurbo Kormokar, Sabina Yasmin, Selina Sharmin","doi":"10.1177/14604582251381159","DOIUrl":null,"url":null,"abstract":"<p><p>Dengue fever remains a persistent global health threat, particularly in Southeast Asia, the Pacific, and the Americas. This study aims to improve early detection and prediction of dengue outbreaks by addressing the challenges of data scarcity and complex transmission factors through a hybrid machine learning approach. We developed a methodology that integrates clustering and classification techniques to identify and predict seasonal patterns of dengue risk. Using regional data from Bangladesh, clustering was performed to uncover latent patterns, with optimal clusters selected based on low inertia and high silhouette scores. The supervised machine learning models were then trained on labeled data to classify dengue risk levels using key meteorological and demographic characteristics. Clustering analysis revealed well-defined structures within the data, with a silhouette score of 0.774, indicating robust clustering quality. The classification models demonstrated exceptional performance, achieving more than 99% in accuracy, precision, recall, and F1 score metrics. These models effectively identified high-risk periods and regions with strong seasonal trends in dengue incidence. Overall, this study presents a data-driven framework for the early detection of dengue outbreaks, supporting proactive public health strategies, while also contributing to the identification of dengue patterns and serving as a tool for controlling infectious diseases.</p>","PeriodicalId":55069,"journal":{"name":"Health Informatics Journal","volume":"31 3","pages":"14604582251381159"},"PeriodicalIF":2.3000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Informatics Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/14604582251381159","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/18 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Dengue fever remains a persistent global health threat, particularly in Southeast Asia, the Pacific, and the Americas. This study aims to improve early detection and prediction of dengue outbreaks by addressing the challenges of data scarcity and complex transmission factors through a hybrid machine learning approach. We developed a methodology that integrates clustering and classification techniques to identify and predict seasonal patterns of dengue risk. Using regional data from Bangladesh, clustering was performed to uncover latent patterns, with optimal clusters selected based on low inertia and high silhouette scores. The supervised machine learning models were then trained on labeled data to classify dengue risk levels using key meteorological and demographic characteristics. Clustering analysis revealed well-defined structures within the data, with a silhouette score of 0.774, indicating robust clustering quality. The classification models demonstrated exceptional performance, achieving more than 99% in accuracy, precision, recall, and F1 score metrics. These models effectively identified high-risk periods and regions with strong seasonal trends in dengue incidence. Overall, this study presents a data-driven framework for the early detection of dengue outbreaks, supporting proactive public health strategies, while also contributing to the identification of dengue patterns and serving as a tool for controlling infectious diseases.

利用综合机器学习方法分析登革热暴发模式:孟加拉国的一项研究。
登革热仍然是一个持续存在的全球健康威胁,特别是在东南亚、太平洋和美洲。本研究旨在通过混合机器学习方法解决数据稀缺和复杂传播因素的挑战,提高登革热疫情的早期发现和预测。我们开发了一种整合聚类和分类技术的方法,以识别和预测登革热风险的季节性模式。使用来自孟加拉国的区域数据,进行聚类以发现潜在模式,并根据低惯性和高轮廓分数选择最佳聚类。然后对有监督的机器学习模型进行标记数据训练,利用关键气象和人口特征对登革热风险水平进行分类。聚类分析显示数据结构明确,剪影得分为0.774,表明聚类质量良好。分类模型表现出优异的性能,在准确率、精密度、召回率和F1得分指标方面达到99%以上。这些模型有效地确定了登革热发病率具有强烈季节性趋势的高危时期和区域。总体而言,这项研究提出了一个数据驱动的框架,用于早期发现登革热疫情,支持积极主动的公共卫生战略,同时也有助于确定登革热模式并作为控制传染病的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Health Informatics Journal
Health Informatics Journal HEALTH CARE SCIENCES & SERVICES-MEDICAL INFORMATICS
CiteScore
7.80
自引率
6.70%
发文量
80
审稿时长
6 months
期刊介绍: Health Informatics Journal is an international peer-reviewed journal. All papers submitted to Health Informatics Journal are subject to peer review by members of a carefully appointed editorial board. The journal operates a conventional single-blind reviewing policy in which the reviewer’s name is always concealed from the submitting author.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信