Rainfall Prediction using XGB Model with the Australian Dataset

Surendra Reddy Vinta, Rashika Peeriga
{"title":"Rainfall Prediction using XGB Model with the Australian Dataset","authors":"Surendra Reddy Vinta, Rashika Peeriga","doi":"10.4108/ew.5386","DOIUrl":null,"url":null,"abstract":"Rainfall prediction is a critical field of study with several practical uses, including agriculture, water management, and disaster preparedness. In this work, we examine the performance of several machine learning models in forecasting rainfall using a dataset of Australian rainfall observations from Kaggle. Six models are compared: random forest (RF), logistic regression (LogReg), Gaussian Naive Bayes (GNB), k-nearest neighbours (kNN), support vector classifier (SVC), and XGBoost (XGB). Missing value imputation and feature selection were used to preprocess the dataset. To analyse the models, we employed cross-validation and performance indicators such as accuracy, precision, recall, and F1-score. According to our findings, the RF and XGB models fared the best, with accuracy ratings of 87% and 85%, respectively. \nWith accuracy ratings below 70%, the GNB and SVC models performed the poorest. Our findings imply that machine learning algorithms can be useful tools for predicting rainfall, but careful model selection and evaluation are required for reliable results.","PeriodicalId":502230,"journal":{"name":"EAI Endorsed Transactions on Energy Web","volume":"26 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EAI Endorsed Transactions on Energy Web","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/ew.5386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Rainfall prediction is a critical field of study with several practical uses, including agriculture, water management, and disaster preparedness. In this work, we examine the performance of several machine learning models in forecasting rainfall using a dataset of Australian rainfall observations from Kaggle. Six models are compared: random forest (RF), logistic regression (LogReg), Gaussian Naive Bayes (GNB), k-nearest neighbours (kNN), support vector classifier (SVC), and XGBoost (XGB). Missing value imputation and feature selection were used to preprocess the dataset. To analyse the models, we employed cross-validation and performance indicators such as accuracy, precision, recall, and F1-score. According to our findings, the RF and XGB models fared the best, with accuracy ratings of 87% and 85%, respectively. With accuracy ratings below 70%, the GNB and SVC models performed the poorest. Our findings imply that machine learning algorithms can be useful tools for predicting rainfall, but careful model selection and evaluation are required for reliable results.
利用 XGB 模型和澳大利亚数据集进行降雨预测
降雨预测是一个重要的研究领域,具有多种实际用途,包括农业、水资源管理和备灾。在这项工作中,我们利用 Kaggle 提供的澳大利亚降雨观测数据集,检验了几种机器学习模型在降雨预测方面的性能。我们比较了六种模型:随机森林(RF)、逻辑回归(LogReg)、高斯直觉贝叶斯(GNB)、k-近邻(kNN)、支持向量分类器(SVC)和 XGBoost(XGB)。缺失值估算和特征选择用于数据集的预处理。为了分析模型,我们采用了交叉验证和准确率、精确度、召回率和 F1 分数等性能指标。根据我们的研究结果,RF 和 XGB 模型表现最佳,准确率分别为 87% 和 85%。GNB 和 SVC 模型的准确率低于 70%,表现最差。我们的研究结果表明,机器学习算法是预测降雨量的有用工具,但要获得可靠的结果,还需要对模型进行仔细选择和评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信