Dengue Spread Modeling in the Absence of Sufficient Epidemiological Parameters: Comparison of SARIMA and SVM Time Series Models

J. Co, Jason Allan Tan, Regina Justina Estuar, Kennedy E. Espina
{"title":"Dengue Spread Modeling in the Absence of Sufficient Epidemiological Parameters: Comparison of SARIMA and SVM Time Series Models","authors":"J. Co, Jason Allan Tan, Regina Justina Estuar, Kennedy E. Espina","doi":"10.2139/ssrn.3086161","DOIUrl":null,"url":null,"abstract":"Dengue remains to be a major public health concern in the Philippines, claiming hundreds of lives every year. Given limited data for deriving necessary epidemiological parameters in developing deterministic disease models, forecasting as a means in controlling and anticipating outbreaks remains a challenge. In this study, two time series models, namely Seasonal Autoregressive Integrated Moving Average and Support Vector Machine, were developed without the requirement for prior epidemiological parameters. Performances of the models in predicting dengue incidences in the Western Visayas Region of the Philippines were compared by measuring the Root Mean Square Error and Mean Average Error. Results showed that the models were both effective in forecasting Dengue incidences for epidemiological surveillance as validated by historical data. SARIMA model yielded average RMSE and MAE scores of 16.8187 and 11.4640, respectively. Meanwhile, SVM model achieved scores of 11.8723 and 7.7369, respectively. With the data and setup used, this study showed that SVM outperformed SARIMA in forecasting Dengue incidences. Furthermore, preliminary investigation of one-month lagged climate variables using Random Forest Regressor’s feature ranking yielded rain intensity and value as top possible dengue incidence climate predictors","PeriodicalId":350026,"journal":{"name":"ERN: Human Development in Developing Economies (Topic)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Human Development in Developing Economies (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3086161","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Dengue remains to be a major public health concern in the Philippines, claiming hundreds of lives every year. Given limited data for deriving necessary epidemiological parameters in developing deterministic disease models, forecasting as a means in controlling and anticipating outbreaks remains a challenge. In this study, two time series models, namely Seasonal Autoregressive Integrated Moving Average and Support Vector Machine, were developed without the requirement for prior epidemiological parameters. Performances of the models in predicting dengue incidences in the Western Visayas Region of the Philippines were compared by measuring the Root Mean Square Error and Mean Average Error. Results showed that the models were both effective in forecasting Dengue incidences for epidemiological surveillance as validated by historical data. SARIMA model yielded average RMSE and MAE scores of 16.8187 and 11.4640, respectively. Meanwhile, SVM model achieved scores of 11.8723 and 7.7369, respectively. With the data and setup used, this study showed that SVM outperformed SARIMA in forecasting Dengue incidences. Furthermore, preliminary investigation of one-month lagged climate variables using Random Forest Regressor’s feature ranking yielded rain intensity and value as top possible dengue incidence climate predictors
缺乏足够流行病学参数的登革热传播建模:SARIMA和SVM时间序列模型的比较
登革热仍然是菲律宾的一个主要公共卫生问题,每年夺去数百人的生命。由于在开发确定性疾病模型时获得必要流行病学参数的数据有限,预测作为控制和预测疾病暴发的手段仍然是一项挑战。在本研究中,在不需要预先流行病学参数的情况下,建立了季节自回归综合移动平均和支持向量机两个时间序列模型。通过测量均方根误差和平均误差,比较了模型在预测菲律宾西维萨亚斯地区登革热发病率方面的性能。结果表明,该模型对登革热流行病学监测具有较好的预测效果。SARIMA模型平均RMSE和MAE得分分别为16.8187和11.4640。同时,SVM模型的得分分别为11.8723和7.7369。通过使用数据和设置,本研究表明SVM在预测登革热发病率方面优于SARIMA。此外,对一个月滞后的气候变量进行了初步调查,使用随机森林回归的特征排序,得出了降雨强度和值作为登革热发病率的最可能气候预测因子
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信