Peramalan Jumlah Kasus COVID-19 Menggunakan Joint Learning

Jurnal Teknik Informatika dan Sistem Informasi Pub Date : 2022-04-29 DOI:10.28932/jutisi.v8i1.4469

M. Nur, F. Amin, Ahmad Yusuf

{"title":"Peramalan Jumlah Kasus COVID-19 Menggunakan Joint Learning","authors":"M. Nur, F. Amin, Ahmad Yusuf","doi":"10.28932/jutisi.v8i1.4469","DOIUrl":null,"url":null,"abstract":"COVID-19 is a dangerous illness because it spreads quickly and easily. Vaccines are already available but the pandemic isn’t likely to end soon. Forecasting is hoped to help handle the pandemic. Deep learning, specially LSTM, has been used to forecast COVID-19 case count in some regions. However, deep learning models generally need a lot of training data while COVID-19 daily data are scarce. However, COVID-19 pandemic happens in many regions. This research aims to use joint learning with data from other regions to improve model performance with fewer data and to use the model to forecast until 9 months since the date of last data taken. Joint learning was done by making models share some parts and training the models together. To overcome the different data scale and pandemic age in the regions, the data was first transformed into discrete SIRD variables and was evaluated using RMSSE. Joint learning failed to improve the model performance in this research. The proposed model performance was signficantly better than ARIMA-SIRD and SIRD model but wasn’t better than normal encoder-decoder LSTM. The models only reached RMSSE below one occasionally. Additionally, it was found that doing joint learning with all regions without selecting them by clustering can make the model performance worse instead. It was also found that RMSSE is too sensitive to a mostly stagnant time-series due to its division by the error of one-step naïve forecast.","PeriodicalId":185279,"journal":{"name":"Jurnal Teknik Informatika dan Sistem Informasi","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Teknik Informatika dan Sistem Informasi","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.28932/jutisi.v8i1.4469","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

COVID-19 is a dangerous illness because it spreads quickly and easily. Vaccines are already available but the pandemic isn’t likely to end soon. Forecasting is hoped to help handle the pandemic. Deep learning, specially LSTM, has been used to forecast COVID-19 case count in some regions. However, deep learning models generally need a lot of training data while COVID-19 daily data are scarce. However, COVID-19 pandemic happens in many regions. This research aims to use joint learning with data from other regions to improve model performance with fewer data and to use the model to forecast until 9 months since the date of last data taken. Joint learning was done by making models share some parts and training the models together. To overcome the different data scale and pandemic age in the regions, the data was first transformed into discrete SIRD variables and was evaluated using RMSSE. Joint learning failed to improve the model performance in this research. The proposed model performance was signficantly better than ARIMA-SIRD and SIRD model but wasn’t better than normal encoder-decoder LSTM. The models only reached RMSSE below one occasionally. Additionally, it was found that doing joint learning with all regions without selecting them by clustering can make the model performance worse instead. It was also found that RMSSE is too sensitive to a mostly stagnant time-series due to its division by the error of one-step naïve forecast.

查看原文本刊更多论文

COVID-19是一种危险的疾病，因为它传播迅速且容易。疫苗已经可用，但大流行不太可能很快结束。预测有望帮助应对大流行。在一些地区，深度学习特别是LSTM已被用于预测COVID-19病例数。然而，深度学习模型通常需要大量的训练数据，而COVID-19的日常数据很少。然而，COVID-19大流行发生在许多地区。本研究旨在通过与其他地区数据的联合学习，在数据较少的情况下提高模型的性能，并使用模型预测到最后一次数据采集日期后的9个月。联合学习是通过让模型共享某些部分，并一起训练模型来完成的。为了克服各地区不同的数据规模和流行年龄，首先将数据转换为离散的SIRD变量，并使用RMSSE进行评估。在本研究中，联合学习并没有提高模型的性能。该模型的性能显著优于ARIMA-SIRD和SIRD模型，但不优于普通编解码器LSTM。模型只是偶尔达到RMSSE低于1。此外，我们还发现，对所有区域进行联合学习而不进行聚类选择反而会使模型的性能变差。我们还发现RMSSE对大部分停滞时间序列过于敏感，因为它被一步naïve预测误差所分割。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Jurnal Teknik Informatika dan Sistem Informasi

自引率

0.00%

发文量