{"title":"利用时间序列模型预测诺如病毒感染的动态","authors":"A. Kosova, V. Chalapa","doi":"10.52420/2071-5943-2023-22-3-57-63","DOIUrl":null,"url":null,"abstract":"Introduction. Norovirus infection (NI) is the most prevalent cause of acute gastroenteritis and outbreaks in semi-closed settings. Forecasting of NI may improve situational awareness and control measures.The aim of the study is to evaluate accuracy of time-series models for forecasting of norovirus incidence (on Sverdlovsk region dataset).Materials and methods. Simple ARIMA time-series models was chosen to forecast NI incidence via regression on its own lagged values. Dataset including passive surveillance monthly reports for Sverdlovsk region was used. All models were trained on data for 2015−2018 and tested on data for 2019. Models were benchmarked using Akaike information criterion (AIC) and mean absolute percentage error (MAPE).Results and discussion. NI incidence in Sverdlovsk raised in 2015-2018 with strong winter-spring seasonality. The time-series incidence data was stationary. Nine significant models were found and the most accurate model was SARIMA (1,0,0)(0,0,1). Despite its accuracy on 2019 test sample, forecast on COVID-19 pandemic period was failed. It was supposed that including additional regressors (climate and herd immunity data) and choosing of more robust time-series models may improve forecasting accuracy.Conclusion. ARIMA time-series models (especially SARIMA) suitable to forecast future incidence of NI in Sverdlovsk region. Additional investigations in terms of possible regressors and improved model robustness are needed.","PeriodicalId":247511,"journal":{"name":"Ural Medical Journal","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting the dynamics of norovirus infection using time series models\",\"authors\":\"A. Kosova, V. Chalapa\",\"doi\":\"10.52420/2071-5943-2023-22-3-57-63\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction. Norovirus infection (NI) is the most prevalent cause of acute gastroenteritis and outbreaks in semi-closed settings. Forecasting of NI may improve situational awareness and control measures.The aim of the study is to evaluate accuracy of time-series models for forecasting of norovirus incidence (on Sverdlovsk region dataset).Materials and methods. Simple ARIMA time-series models was chosen to forecast NI incidence via regression on its own lagged values. Dataset including passive surveillance monthly reports for Sverdlovsk region was used. All models were trained on data for 2015−2018 and tested on data for 2019. Models were benchmarked using Akaike information criterion (AIC) and mean absolute percentage error (MAPE).Results and discussion. NI incidence in Sverdlovsk raised in 2015-2018 with strong winter-spring seasonality. The time-series incidence data was stationary. Nine significant models were found and the most accurate model was SARIMA (1,0,0)(0,0,1). Despite its accuracy on 2019 test sample, forecast on COVID-19 pandemic period was failed. It was supposed that including additional regressors (climate and herd immunity data) and choosing of more robust time-series models may improve forecasting accuracy.Conclusion. ARIMA time-series models (especially SARIMA) suitable to forecast future incidence of NI in Sverdlovsk region. Additional investigations in terms of possible regressors and improved model robustness are needed.\",\"PeriodicalId\":247511,\"journal\":{\"name\":\"Ural Medical Journal\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ural Medical Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.52420/2071-5943-2023-22-3-57-63\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ural Medical Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52420/2071-5943-2023-22-3-57-63","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
介绍。诺如病毒感染(NI)是在半封闭环境中引起急性胃肠炎和暴发的最常见原因。NI的预测可以改善态势感知和控制措施。该研究的目的是评估预测诺瓦克病毒发病率的时间序列模型的准确性(在斯维尔德洛夫斯克地区数据集上)。材料和方法。选择简单的ARIMA时间序列模型,通过对自身滞后值的回归来预测NI发病率。数据集包括斯维尔德洛夫斯克地区的被动监测月度报告。所有模型都使用2015 - 2018年的数据进行训练,并使用2019年的数据进行测试。采用赤池信息准则(Akaike information criterion, AIC)和平均绝对百分比误差(mean absolute percentage error, MAPE)对模型进行基准评价。结果和讨论。2015-2018年斯维尔德洛夫斯克的NI发病率有所上升,冬春季季节性较强。时间序列发病率数据是平稳的。共发现9个显著模型,其中最准确的模型为SARIMA(1,0,0)(0,0,1)。尽管对2019年的测试样本预测准确,但对新冠肺炎大流行时期的预测却失败了。假设加入额外的回归因子(气候和群体免疫数据)和选择更稳健的时间序列模型可以提高预测的准确性。适合预测斯维尔德洛夫斯克地区未来NI发病率的ARIMA时间序列模型(特别是SARIMA)。需要在可能的回归量和改进的模型稳健性方面进行额外的研究。
Predicting the dynamics of norovirus infection using time series models
Introduction. Norovirus infection (NI) is the most prevalent cause of acute gastroenteritis and outbreaks in semi-closed settings. Forecasting of NI may improve situational awareness and control measures.The aim of the study is to evaluate accuracy of time-series models for forecasting of norovirus incidence (on Sverdlovsk region dataset).Materials and methods. Simple ARIMA time-series models was chosen to forecast NI incidence via regression on its own lagged values. Dataset including passive surveillance monthly reports for Sverdlovsk region was used. All models were trained on data for 2015−2018 and tested on data for 2019. Models were benchmarked using Akaike information criterion (AIC) and mean absolute percentage error (MAPE).Results and discussion. NI incidence in Sverdlovsk raised in 2015-2018 with strong winter-spring seasonality. The time-series incidence data was stationary. Nine significant models were found and the most accurate model was SARIMA (1,0,0)(0,0,1). Despite its accuracy on 2019 test sample, forecast on COVID-19 pandemic period was failed. It was supposed that including additional regressors (climate and herd immunity data) and choosing of more robust time-series models may improve forecasting accuracy.Conclusion. ARIMA time-series models (especially SARIMA) suitable to forecast future incidence of NI in Sverdlovsk region. Additional investigations in terms of possible regressors and improved model robustness are needed.