A pragmatic ensemble learning approach for rainfall prediction

Soumili Ghosh, Mahendra Kumar Gourisaria, Biswajit Sahoo, Himansu Das
{"title":"A pragmatic ensemble learning approach for rainfall prediction","authors":"Soumili Ghosh, Mahendra Kumar Gourisaria, Biswajit Sahoo, Himansu Das","doi":"10.1007/s43926-023-00044-3","DOIUrl":null,"url":null,"abstract":"Abstract Heavy rainfall and precipitation play a massive role in shaping the socio-agricultural landscape of a country. Being one of the key indicators of climate change, natural disasters, and of the general topology of a region, rainfall prediction is a gift of estimation that can be used for multiple beneficial causes. Machine learning has an impressive repertoire in aiding prediction and estimation of rainfall. This paper aims to find the effect of ensemble learning, a subset of machine learning, on a rainfall prediction dataset, to increase the predictability of the models used. The classification models used in this paper were tested once individually, and then with applied ensemble techniques like bagging and boosting, on a rainfall dataset based in Australia. The objective of this paper is to demonstrate a reduction in bias and variance via ensemble learning techniques while also analyzing the increase or decrease in the aforementioned metrics. The study shows an overall reduction in bias by an average of 6% using boosting, and an average reduction in variance by 13.6%. Model performance was observed to become more generalized by lowering the false negative rate by an average of more than 20%. The techniques explored in this paper can be further utilized to improve model performance even further via hyper-parameter tuning.","PeriodicalId":34751,"journal":{"name":"Discover Internet of Things","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Discover Internet of Things","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s43926-023-00044-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Heavy rainfall and precipitation play a massive role in shaping the socio-agricultural landscape of a country. Being one of the key indicators of climate change, natural disasters, and of the general topology of a region, rainfall prediction is a gift of estimation that can be used for multiple beneficial causes. Machine learning has an impressive repertoire in aiding prediction and estimation of rainfall. This paper aims to find the effect of ensemble learning, a subset of machine learning, on a rainfall prediction dataset, to increase the predictability of the models used. The classification models used in this paper were tested once individually, and then with applied ensemble techniques like bagging and boosting, on a rainfall dataset based in Australia. The objective of this paper is to demonstrate a reduction in bias and variance via ensemble learning techniques while also analyzing the increase or decrease in the aforementioned metrics. The study shows an overall reduction in bias by an average of 6% using boosting, and an average reduction in variance by 13.6%. Model performance was observed to become more generalized by lowering the false negative rate by an average of more than 20%. The techniques explored in this paper can be further utilized to improve model performance even further via hyper-parameter tuning.
一种实用的降雨预测集成学习方法
强降雨和降水在塑造一个国家的社会农业景观中起着巨大的作用。作为气候变化、自然灾害和一个地区总体拓扑结构的关键指标之一,降雨预测是一种可以用于多种有益原因的估计礼物。机器学习在帮助预测和估计降雨量方面有着令人印象深刻的能力。本文旨在找到集成学习(机器学习的一个子集)对降雨预测数据集的影响,以提高所使用模型的可预测性。本文中使用的分类模型分别进行了一次测试,然后在澳大利亚的降雨数据集上应用了套袋和增强等综合技术。本文的目的是通过集成学习技术证明偏差和方差的减少,同时也分析上述指标的增加或减少。研究表明,使用增强技术,总体偏差平均减少了6%,方差平均减少了13.6%。通过将假阴性率平均降低20%以上,观察到模型性能变得更加一般化。本文探讨的技术可以进一步利用,通过超参数调优进一步提高模型性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Discover Internet of Things
Discover Internet of Things Internet of Things (IoT)-
CiteScore
7.50
自引率
0.00%
发文量
6
审稿时长
28 days
期刊介绍: Discover Internet of Things is part of the Discover journal series committed to providing a streamlined submission process, rapid review and publication, and a high level of author service at every stage. It is an open access, community-focussed journal publishing research from across all fields relevant to the Internet of Things (IoT), providing cutting-edge and state-of-art research findings to researchers, academicians, students, and engineers. Discover Internet of Things is a broad, open access journal publishing research from across all fields relevant to IoT. Discover Internet of Things covers concepts at the component, hardware, and system level as well as programming, operating systems, software, applications and other technology-oriented research topics. The journal is uniquely interdisciplinary because its scope spans several research communities, ranging from computer systems to communication, optimisation, big data analytics, and application. It is also intended that articles published in Discover Internet of Things may help to support and accelerate Sustainable Development Goal 9: ‘Build resilient infrastructure, promote inclusive and sustainable industrialization and foster innovation’. Discover Internet of Things welcomes all observational, experimental, theoretical, analytical, mathematical modelling, data-driven, and applied approaches that advance the study of all aspects of IoT research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信