Early prediction of the outbreak risk of dengue fever in Ba Ria-Vung Tau province, Vietnam: An analysis based on Google trends and statistical models

IF 8.8 3区 医学 Q1 Medicine
Dang Anh Tuan , Pham Vu Nhat Uyen
{"title":"Early prediction of the outbreak risk of dengue fever in Ba Ria-Vung Tau province, Vietnam: An analysis based on Google trends and statistical models","authors":"Dang Anh Tuan ,&nbsp;Pham Vu Nhat Uyen","doi":"10.1016/j.idm.2025.03.001","DOIUrl":null,"url":null,"abstract":"<div><div>Dengue fever (DF), caused by the Dengue virus through the Aedes mosquito vector, is a dangerous infectious disease with the potential to become a global epidemic. Vietnam, particularly Ba Ria-Vung Tau (BRVT) province, is facing a high risk of DF. This study aims to determine the relationship between the search volume for DF on Google Trends and DF cases in BRVT province, thereby constructing a model to predict the early outbreak risk of DF locally. Using Poisson regression (adjusted by quasi-Poisson), considering the lagged effect of Google Trends Index (GTI) search volume on DF cases, and removing the autocorrelation (AC) of DF cases by using appropriate transformations, seven forecast models were surveyed based on the dataset of DF cases and GTI search volume weekly with the phrase \"sốt xuất huyết\" (dengue fever) in BRVT province from January 2019 to August 2023 (243 weeks). The model selected is the one with the lowest dispersion index. The results show that the correlation coefficient (95% confidence interval) and dispersion index of the 7 models including Basis TSR; Basis TSR + AC: Lag(Residuals,1); Basis TSR + AC: Lag(SXH,1); Basis TSR + AC: Lag(log(SXH+1),1); TSR Lag(GTI,2) + AC: Lag(log(SXH+1),2); TSR Lag(GTI,3) + AC: Lag(log(SXH+1),3); TSR Lag(GTI,0) + AC: Lag(log(SXH+1),1) are 0.71 (0.63–0.76) and 74.2; 0.79 (0.73–0.83) and 48.6; 0.89 (0.87–0.92) and 37.3; 0.98 (0.97–0.99) and 7.2; 0.96 (0.95–0.97) and 14.3; 0.93 (0.91–0.94) and 25.7; 0.98 (0.97–0.99) and 6.8, respectively. Therefore, the final model is the most suitable one selected. Testing the accuracy of the selected model using the ROC curve with the Youden criterion, the AUC (threshold 75%) is 0.982, and the AUC (threshold 95%) is 0.984, indicating the very good predictive ability of the model. In summary, the research results show the potential for applying this model in Vietnam, especially in BRVT, to enhance the effectiveness of epidemic prevention measures and protect public health.</div></div>","PeriodicalId":36831,"journal":{"name":"Infectious Disease Modelling","volume":"10 3","pages":"Pages 743-757"},"PeriodicalIF":8.8000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infectious Disease Modelling","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468042725000120","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Dengue fever (DF), caused by the Dengue virus through the Aedes mosquito vector, is a dangerous infectious disease with the potential to become a global epidemic. Vietnam, particularly Ba Ria-Vung Tau (BRVT) province, is facing a high risk of DF. This study aims to determine the relationship between the search volume for DF on Google Trends and DF cases in BRVT province, thereby constructing a model to predict the early outbreak risk of DF locally. Using Poisson regression (adjusted by quasi-Poisson), considering the lagged effect of Google Trends Index (GTI) search volume on DF cases, and removing the autocorrelation (AC) of DF cases by using appropriate transformations, seven forecast models were surveyed based on the dataset of DF cases and GTI search volume weekly with the phrase "sốt xuất huyết" (dengue fever) in BRVT province from January 2019 to August 2023 (243 weeks). The model selected is the one with the lowest dispersion index. The results show that the correlation coefficient (95% confidence interval) and dispersion index of the 7 models including Basis TSR; Basis TSR + AC: Lag(Residuals,1); Basis TSR + AC: Lag(SXH,1); Basis TSR + AC: Lag(log(SXH+1),1); TSR Lag(GTI,2) + AC: Lag(log(SXH+1),2); TSR Lag(GTI,3) + AC: Lag(log(SXH+1),3); TSR Lag(GTI,0) + AC: Lag(log(SXH+1),1) are 0.71 (0.63–0.76) and 74.2; 0.79 (0.73–0.83) and 48.6; 0.89 (0.87–0.92) and 37.3; 0.98 (0.97–0.99) and 7.2; 0.96 (0.95–0.97) and 14.3; 0.93 (0.91–0.94) and 25.7; 0.98 (0.97–0.99) and 6.8, respectively. Therefore, the final model is the most suitable one selected. Testing the accuracy of the selected model using the ROC curve with the Youden criterion, the AUC (threshold 75%) is 0.982, and the AUC (threshold 95%) is 0.984, indicating the very good predictive ability of the model. In summary, the research results show the potential for applying this model in Vietnam, especially in BRVT, to enhance the effectiveness of epidemic prevention measures and protect public health.
越南巴黎头省登革热暴发风险的早期预测:基于谷歌趋势和统计模型的分析
登革热由登革热病毒通过伊蚊媒介引起,是一种危险的传染病,有可能成为全球流行病。越南,特别是巴Ria-Vung Tau (BRVT)省,正面临着DF的高风险。本研究旨在确定谷歌Trends上的DF搜索量与BRVT省DF病例之间的关系,从而构建一个预测当地DF早期爆发风险的模型。利用泊松回归(拟泊松校正),考虑谷歌趋势指数(GTI)搜索量对登革热病例的滞后效应,并通过适当的变换去除DF病例的自相关(AC),以2019年1月至2023年8月(243周)BRVT省DF病例和GTI每周搜索量数据为基础,以“sốt xuất huyết”(登革热)为关键词,对7个预测模型进行了调查。选择色散指数最低的模型。结果表明,包括Basis TSR在内的7个模型的相关系数(95%置信区间)和离散度指数;基础TSR + AC: Lag(残差,1);基TSR + AC: Lag(SXH,1);基础TSR + AC: Lag(log(SXH+1),1);TSR Lag(GTI,2) + AC: Lag(log(SXH+1),2);TSR Lag(GTI,3) + AC: Lag(log(SXH+1),3);TSR滞后(GTI, 0) + AC:滞后(日志(SXH + 1), 1)是0.71(0.63 - -0.76)和74.2;0.79(0.73-0.83)和48.6;0.89(0.87-0.92)和37.3;0.98(0.97-0.99)和7.2;0.96(0.95 ~ 0.97)和14.3;0.93(0.91-0.94)和25.7;0.98(0.97-0.99)和6.8。因此,最终的模型是选出的最合适的模型。采用约登准则的ROC曲线检验所选模型的准确性,AUC(阈值75%)为0.982,AUC(阈值95%)为0.984,表明模型具有很好的预测能力。综上所述,研究结果表明,在越南,特别是BRVT,应用该模型可以提高防疫措施的有效性,保护公众健康。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Infectious Disease Modelling
Infectious Disease Modelling Mathematics-Applied Mathematics
CiteScore
17.00
自引率
3.40%
发文量
73
审稿时长
17 weeks
期刊介绍: Infectious Disease Modelling is an open access journal that undergoes peer-review. Its main objective is to facilitate research that combines mathematical modelling, retrieval and analysis of infection disease data, and public health decision support. The journal actively encourages original research that improves this interface, as well as review articles that highlight innovative methodologies relevant to data collection, informatics, and policy making in the field of public health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信