Peramalan Jumlah Kasus COVID-19 Menggunakan Joint Learning

M. Nur, F. Amin, Ahmad Yusuf
{"title":"Peramalan Jumlah Kasus COVID-19 Menggunakan Joint Learning","authors":"M. Nur, F. Amin, Ahmad Yusuf","doi":"10.28932/jutisi.v8i1.4469","DOIUrl":null,"url":null,"abstract":"COVID-19 is a dangerous illness because it spreads quickly and easily. Vaccines are already available but the pandemic isn’t likely to end soon. Forecasting is hoped to help handle the pandemic. Deep learning, specially LSTM, has been used to forecast COVID-19 case count in some regions. However, deep learning models generally need a lot of training data while COVID-19 daily data are scarce. However, COVID-19 pandemic happens in many regions. This research aims to use joint learning with data from other regions to improve model performance with fewer data and to use the model to forecast until 9 months since the date of last data taken. Joint learning was done by making models share some parts and training the models together. To overcome the different data scale and pandemic age in the regions, the data was first transformed into discrete SIRD variables and was evaluated using RMSSE. Joint learning failed to improve the model performance in this research. The proposed model performance was signficantly better than ARIMA-SIRD and SIRD model but wasn’t better than normal encoder-decoder LSTM. The models only reached RMSSE below one occasionally. Additionally, it was found that doing joint learning with all regions without selecting them by clustering can make the model performance worse instead. It was also found that RMSSE is too sensitive to a mostly stagnant time-series due to its division by the error of one-step naïve forecast.","PeriodicalId":185279,"journal":{"name":"Jurnal Teknik Informatika dan Sistem Informasi","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Teknik Informatika dan Sistem Informasi","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.28932/jutisi.v8i1.4469","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

COVID-19 is a dangerous illness because it spreads quickly and easily. Vaccines are already available but the pandemic isn’t likely to end soon. Forecasting is hoped to help handle the pandemic. Deep learning, specially LSTM, has been used to forecast COVID-19 case count in some regions. However, deep learning models generally need a lot of training data while COVID-19 daily data are scarce. However, COVID-19 pandemic happens in many regions. This research aims to use joint learning with data from other regions to improve model performance with fewer data and to use the model to forecast until 9 months since the date of last data taken. Joint learning was done by making models share some parts and training the models together. To overcome the different data scale and pandemic age in the regions, the data was first transformed into discrete SIRD variables and was evaluated using RMSSE. Joint learning failed to improve the model performance in this research. The proposed model performance was signficantly better than ARIMA-SIRD and SIRD model but wasn’t better than normal encoder-decoder LSTM. The models only reached RMSSE below one occasionally. Additionally, it was found that doing joint learning with all regions without selecting them by clustering can make the model performance worse instead. It was also found that RMSSE is too sensitive to a mostly stagnant time-series due to its division by the error of one-step naïve forecast.
COVID-19是一种危险的疾病,因为它传播迅速且容易。疫苗已经可用,但大流行不太可能很快结束。预测有望帮助应对大流行。在一些地区,深度学习特别是LSTM已被用于预测COVID-19病例数。然而,深度学习模型通常需要大量的训练数据,而COVID-19的日常数据很少。然而,COVID-19大流行发生在许多地区。本研究旨在通过与其他地区数据的联合学习,在数据较少的情况下提高模型的性能,并使用模型预测到最后一次数据采集日期后的9个月。联合学习是通过让模型共享某些部分,并一起训练模型来完成的。为了克服各地区不同的数据规模和流行年龄,首先将数据转换为离散的SIRD变量,并使用RMSSE进行评估。在本研究中,联合学习并没有提高模型的性能。该模型的性能显著优于ARIMA-SIRD和SIRD模型,但不优于普通编解码器LSTM。模型只是偶尔达到RMSSE低于1。此外,我们还发现,对所有区域进行联合学习而不进行聚类选择反而会使模型的性能变差。我们还发现RMSSE对大部分停滞时间序列过于敏感,因为它被一步naïve预测误差所分割。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信