Learning from Others: A Data Driven Transfer Learning based Daily New COVID-19 Case Prediction in India using an Ensemble of LSTM-RNNs

Debasrita Chakraborty, Debayan Goswami, Ashish Ghosh, Jonathan H. Chan, Susmita K. Ghosh
{"title":"Learning from Others: A Data Driven Transfer Learning based Daily New COVID-19 Case Prediction in India using an Ensemble of LSTM-RNNs","authors":"Debasrita Chakraborty, Debayan Goswami, Ashish Ghosh, Jonathan H. Chan, Susmita K. Ghosh","doi":"10.1145/3468784.3470769","DOIUrl":null,"url":null,"abstract":"Accurate prediction of the number of COVID-19 infected cases per day is fast becoming a critical necessity globally to mitigate the burden on various health systems. In a densely populated country like India which has currently the second highest number of infections and limited medical support, it is a need for the authorities to know the statistics beforehand to address these issues more effectively. In this article, a data driven transfer learning based model is proposed that takes into account the conditions of different countries which have witnessed the COVID-19 infection. We have taken four countries to be the source domain for transfer learning scenario namely, the United States of America, Spain, Brazil and Bangladesh. We have pre-trained four different LSTM-RNN models with each of the country’s data and have re-trained (fine tuned) each of the models using only a very small portion of Indian data on COVID-19. Predictions of these four models are averaged to get the actual prediction. It is seen that such an ensemble model outperforms all the compared models and accurately predicts even the daily cases. This may be due to the fact that the four LSTM-RNNs used here could successfully take into account the diversities of conditions. As India is a diverse nation with variety of climates, it makes more sense to incorporate such transfer learning techniques.","PeriodicalId":341589,"journal":{"name":"The 12th International Conference on Advances in Information Technology","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 12th International Conference on Advances in Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3468784.3470769","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Accurate prediction of the number of COVID-19 infected cases per day is fast becoming a critical necessity globally to mitigate the burden on various health systems. In a densely populated country like India which has currently the second highest number of infections and limited medical support, it is a need for the authorities to know the statistics beforehand to address these issues more effectively. In this article, a data driven transfer learning based model is proposed that takes into account the conditions of different countries which have witnessed the COVID-19 infection. We have taken four countries to be the source domain for transfer learning scenario namely, the United States of America, Spain, Brazil and Bangladesh. We have pre-trained four different LSTM-RNN models with each of the country’s data and have re-trained (fine tuned) each of the models using only a very small portion of Indian data on COVID-19. Predictions of these four models are averaged to get the actual prediction. It is seen that such an ensemble model outperforms all the compared models and accurately predicts even the daily cases. This may be due to the fact that the four LSTM-RNNs used here could successfully take into account the diversities of conditions. As India is a diverse nation with variety of climates, it makes more sense to incorporate such transfer learning techniques.
向他人学习:使用lstm - rnn集合的基于数据驱动的迁移学习的印度每日新冠肺炎病例预测
准确预测每天COVID-19感染病例的数量正迅速成为全球减轻各种卫生系统负担的关键必要条件。在印度这样一个人口稠密的国家,目前感染人数第二高,医疗支持有限,当局需要事先了解统计数据,以便更有效地解决这些问题。本文提出了一种基于数据驱动的迁移学习模型,该模型考虑了不同国家的COVID-19感染情况。我们选取了四个国家作为迁移学习情景的源域,即美国、西班牙、巴西和孟加拉国。我们使用每个国家的数据预训练了四个不同的LSTM-RNN模型,并仅使用印度关于COVID-19的一小部分数据对每个模型进行了重新训练(微调)。对这四种模型的预测值进行平均,得到实际的预测值。可以看出,这种集成模型优于所有比较的模型,甚至可以准确地预测日常情况。这可能是因为这里使用的四个lstm - rnn可以成功地考虑到条件的多样性。由于印度是一个气候多样的多元化国家,因此采用这种迁移学习技术更有意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信