Rainfall Prediction using XGB Model with the Australian Dataset

EAI Endorsed Transactions on Energy Web, Pub Date: 2024-03-12, DOI: 10.4108/ew.5386

Surendra Reddy Vinta, Rashika Peeriga


Citations: 0

Abstract



Rainfall prediction is an important field of study with practical uses in agriculture, water management, and disaster preparedness. In this work, we examine how well several machine learning models forecast rainfall, using a dataset of Australian rainfall observations from Kaggle. Six models are compared: random forest (RF), logistic regression (LogReg), Gaussian Naive Bayes (GNB), k-nearest neighbours (kNN), support vector classifier (SVC), and XGBoost (XGB). The dataset was preprocessed with missing-value imputation and feature selection. To evaluate the models, we employed cross-validation and performance indicators such as accuracy, precision, recall, and F1-score. According to our findings, the RF and XGB models performed best, with accuracies of 87% and 85%, respectively. The GNB and SVC models performed worst, with accuracies below 70%. Our findings suggest that machine learning algorithms can be useful tools for predicting rainfall, but careful model selection and evaluation are required for reliable results.
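The evaluation described in the abstract (cross-validation scored with accuracy, precision, recall, and F1-score) can be illustrated with a minimal standard-library Python sketch. This is not the authors' code. The function names `classification_metrics` and `k_fold_indices`, the encoding of labels as 1 (rain) and 0 (no rain), the contiguous unshuffled folds, and the toy labels are all assumptions made for illustration; the abstract does not state the number of folds or any implementation details.

```python
from typing import Dict, List, Sequence, Tuple


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall and F1 for binary labels (assumed: 1 = rain, 0 = no rain)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # rain predicted, rain observed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # no rain predicted, none observed
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarm
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # missed rain
    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


def k_fold_indices(n_samples: int, k: int) -> List[Tuple[List[int], List[int]]]:
    """Split range(n_samples) into k contiguous folds; return (train, test) index lists.

    Simplification: no shuffling or stratification is applied here.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must satisfy 2 <= k <= n_samples")
    base, extra = divmod(n_samples, k)
    folds = []
    start = 0
    for i in range(k):
        size = base + (1 if i < extra else 0)  # the first `extra` folds get one more sample
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds


if __name__ == "__main__":
    # Toy labels, invented for illustration only (not taken from the dataset).
    y_true = [1, 0, 1, 1, 0, 0, 1, 0]
    y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
    print(classification_metrics(y_true, y_pred))
    for train_idx, test_idx in k_fold_indices(10, 3):
        print(len(train_idx), test_idx)
```

In a full comparison, each model would be fitted on the training indices of every fold and scored on the held-out indices, with the per-fold metrics averaged. Reporting precision, recall, and F1 alongside accuracy matters whenever one class is much rarer than the other, because accuracy alone can then look high for a model that rarely predicts the rare class.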
