Predicting the dynamics of norovirus infection using time series models

A. Kosova, V. Chalapa
{"title":"Predicting the dynamics of norovirus infection using time series models","authors":"A. Kosova, V. Chalapa","doi":"10.52420/2071-5943-2023-22-3-57-63","DOIUrl":null,"url":null,"abstract":"Introduction. Norovirus infection (NI) is the most prevalent cause of acute gastroenteritis and outbreaks in semi-closed settings. Forecasting of NI may improve situational awareness and control measures.The aim of the study is to evaluate accuracy of time-series models for forecasting of norovirus incidence (on Sverdlovsk region dataset).Materials and methods. Simple ARIMA time-series models was chosen to forecast NI incidence via regression on its own lagged values. Dataset including passive surveillance monthly reports for Sverdlovsk region was used. All models were trained on data for 2015−2018 and tested on data for 2019. Models were benchmarked using Akaike information criterion (AIC) and mean absolute percentage error (MAPE).Results and discussion. NI incidence in Sverdlovsk raised in 2015-2018 with strong winter-spring seasonality. The time-series incidence data was stationary. Nine significant models were found and the most accurate model was SARIMA (1,0,0)(0,0,1). Despite its accuracy on 2019 test sample, forecast on COVID-19 pandemic period was failed. It was supposed that including additional regressors (climate and herd immunity data) and choosing of more robust time-series models may improve forecasting accuracy.Conclusion. ARIMA time-series models (especially SARIMA) suitable to forecast future incidence of NI in Sverdlovsk region. Additional investigations in terms of possible regressors and improved model robustness are needed.","PeriodicalId":247511,"journal":{"name":"Ural Medical Journal","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ural Medical Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52420/2071-5943-2023-22-3-57-63","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction. Norovirus infection (NI) is the most prevalent cause of acute gastroenteritis and outbreaks in semi-closed settings. Forecasting of NI may improve situational awareness and control measures.The aim of the study is to evaluate accuracy of time-series models for forecasting of norovirus incidence (on Sverdlovsk region dataset).Materials and methods. Simple ARIMA time-series models was chosen to forecast NI incidence via regression on its own lagged values. Dataset including passive surveillance monthly reports for Sverdlovsk region was used. All models were trained on data for 2015−2018 and tested on data for 2019. Models were benchmarked using Akaike information criterion (AIC) and mean absolute percentage error (MAPE).Results and discussion. NI incidence in Sverdlovsk raised in 2015-2018 with strong winter-spring seasonality. The time-series incidence data was stationary. Nine significant models were found and the most accurate model was SARIMA (1,0,0)(0,0,1). Despite its accuracy on 2019 test sample, forecast on COVID-19 pandemic period was failed. It was supposed that including additional regressors (climate and herd immunity data) and choosing of more robust time-series models may improve forecasting accuracy.Conclusion. ARIMA time-series models (especially SARIMA) suitable to forecast future incidence of NI in Sverdlovsk region. Additional investigations in terms of possible regressors and improved model robustness are needed.
利用时间序列模型预测诺如病毒感染的动态
介绍。诺如病毒感染(NI)是在半封闭环境中引起急性胃肠炎和暴发的最常见原因。NI的预测可以改善态势感知和控制措施。该研究的目的是评估预测诺瓦克病毒发病率的时间序列模型的准确性(在斯维尔德洛夫斯克地区数据集上)。材料和方法。选择简单的ARIMA时间序列模型,通过对自身滞后值的回归来预测NI发病率。数据集包括斯维尔德洛夫斯克地区的被动监测月度报告。所有模型都使用2015 - 2018年的数据进行训练,并使用2019年的数据进行测试。采用赤池信息准则(Akaike information criterion, AIC)和平均绝对百分比误差(mean absolute percentage error, MAPE)对模型进行基准评价。结果和讨论。2015-2018年斯维尔德洛夫斯克的NI发病率有所上升,冬春季季节性较强。时间序列发病率数据是平稳的。共发现9个显著模型,其中最准确的模型为SARIMA(1,0,0)(0,0,1)。尽管对2019年的测试样本预测准确,但对新冠肺炎大流行时期的预测却失败了。假设加入额外的回归因子(气候和群体免疫数据)和选择更稳健的时间序列模型可以提高预测的准确性。适合预测斯维尔德洛夫斯克地区未来NI发病率的ARIMA时间序列模型(特别是SARIMA)。需要在可能的回归量和改进的模型稳健性方面进行额外的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信