Empirical Study of Weight Initializations for COVID-19 Predictions in India

M. Narkhede, Shubham S. Mane, P. Bartakke, M. Sutaone
{"title":"Empirical Study of Weight Initializations for COVID-19 Predictions in India","authors":"M. Narkhede, Shubham S. Mane, P. Bartakke, M. Sutaone","doi":"10.1109/NCC52529.2021.9530160","DOIUrl":null,"url":null,"abstract":"The first case of the novel Coronavirus disease (COVID-19) in India was recorded on 30th January 2020 in Kerela and it has spread across all states in India. The prediction of the number of COVID-19 cases is important for government officials to plan various control strategies. This paper presents a weekly prediction of cumulative number of COVID-19 cases in India. A graded lockdown feature, which describes the status of lockdown, is derived and incorporated in the input dataset as one of the features. For prediction, this paper proposes a model which is a stacking of different deep neural networks which have recurrent connections. Vanishing gradients is a common issue with such networks with recurrent connections. Proper weight initialization of the network is one of the solutions to overcome the vanishing gradients problem. Hence, the weight distributions and convergence performance of some state-of-the-art weight initialization techniques have been analyzed in this paper. The proposed model is initialized with the technique which would aid to avoid the vanishing gradients problem and converge faster to a lower loss. This paper also provides a comparison of the proposed model for univariate and multivariate prediction with other prediction models such as statistical model - Auto-Regressive Integrated Moving Average (ARIMA), and deep learning architectures long short term memory (LSTM), bidirectional LSTM (bi-LSTM) and gated recurrent unit (GRU). The results demonstrate that the proposed model gives better prediction results than these models.","PeriodicalId":414087,"journal":{"name":"2021 National Conference on Communications (NCC)","volume":"112 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC52529.2021.9530160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The first case of the novel Coronavirus disease (COVID-19) in India was recorded on 30th January 2020 in Kerela and it has spread across all states in India. The prediction of the number of COVID-19 cases is important for government officials to plan various control strategies. This paper presents a weekly prediction of cumulative number of COVID-19 cases in India. A graded lockdown feature, which describes the status of lockdown, is derived and incorporated in the input dataset as one of the features. For prediction, this paper proposes a model which is a stacking of different deep neural networks which have recurrent connections. Vanishing gradients is a common issue with such networks with recurrent connections. Proper weight initialization of the network is one of the solutions to overcome the vanishing gradients problem. Hence, the weight distributions and convergence performance of some state-of-the-art weight initialization techniques have been analyzed in this paper. The proposed model is initialized with the technique which would aid to avoid the vanishing gradients problem and converge faster to a lower loss. This paper also provides a comparison of the proposed model for univariate and multivariate prediction with other prediction models such as statistical model - Auto-Regressive Integrated Moving Average (ARIMA), and deep learning architectures long short term memory (LSTM), bidirectional LSTM (bi-LSTM) and gated recurrent unit (GRU). The results demonstrate that the proposed model gives better prediction results than these models.
印度COVID-19预测权重初始化的实证研究
印度第一例新型冠状病毒病(COVID-19)于2020年1月30日在克雷拉邦记录在案,并已蔓延到印度所有邦。预测新冠肺炎病例数对政府官员制定各种控制策略非常重要。本文介绍了印度COVID-19累计病例数的每周预测。导出了描述锁定状态的分级锁定特征,并将其作为特征之一合并到输入数据集中。为了进行预测,本文提出了一种由具有循环连接的不同深度神经网络叠加而成的模型。梯度消失是这种具有循环连接的网络的一个常见问题。对网络进行适当的权值初始化是解决梯度消失问题的方法之一。因此,本文分析了一些最先进的权值初始化技术的权值分布和收敛性能。采用该方法对模型进行初始化,避免了梯度消失问题,收敛速度更快,损失更小。本文还将所提出的单变量和多变量预测模型与其他预测模型(如统计模型-自回归集成移动平均(ARIMA))以及深度学习架构长短期记忆(LSTM),双向LSTM (bi-LSTM)和门通循环单元(GRU))进行了比较。结果表明,该模型的预测效果优于上述模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信