Mohammad Liton Hossain, S. M. Nasif Shams, Saeed Mahmud Ullah
{"title":"孟加拉国达卡风速预测机器学习算法比较研究","authors":"Mohammad Liton Hossain, S. M. Nasif Shams, Saeed Mahmud Ullah","doi":"10.1186/s40807-024-00109-z","DOIUrl":null,"url":null,"abstract":"This study evaluated the performance of multiple models that used machine learning to anticipate wind speed in the city of Dhaka. The NASA Power website provided the data set for this investigation. The models used for prediction included the decision tree regressor, support vector regressor, random forest, linear regression, neural network and polynomial regression. A hold-out check and k-fold cross-validation were used to assess how well these models performed. With the highest R2 scores and lowest RMSEs on both the validation and test sets, the results demonstrated that the polynomial regression model performed the best. With the lowest R2 scores and largest RMSEs on both sets, the decision tree model scored the poorest. High R2 scores and low RMSEs were achieved by the random forest model, which had comparable performance to the polynomial regression model but required a longer computation time. In addition, the neural network model demonstrated commendable predictive accuracy, yielding an R2 score of 0.67 and a low RMSE of 0.57. However, its application is contingent on the availability of substantial computational resources, given its extensive computation time of 457.93 s. The study concludes by highlighting the efficacy of the Polynomial Regression model as the optimal choice for wind speed prediction in Dhaka, offering a balance between superior performance and efficient computation. This insight provides valuable guidance for practitioners and researchers seeking effective models for similar applications.","PeriodicalId":93049,"journal":{"name":"Renewables: wind, water, and solar","volume":"36 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative study of machine learning algorithms for wind speed prediction in Dhaka, Bangladesh\",\"authors\":\"Mohammad Liton Hossain, S. M. Nasif Shams, Saeed Mahmud Ullah\",\"doi\":\"10.1186/s40807-024-00109-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study evaluated the performance of multiple models that used machine learning to anticipate wind speed in the city of Dhaka. The NASA Power website provided the data set for this investigation. The models used for prediction included the decision tree regressor, support vector regressor, random forest, linear regression, neural network and polynomial regression. A hold-out check and k-fold cross-validation were used to assess how well these models performed. With the highest R2 scores and lowest RMSEs on both the validation and test sets, the results demonstrated that the polynomial regression model performed the best. With the lowest R2 scores and largest RMSEs on both sets, the decision tree model scored the poorest. High R2 scores and low RMSEs were achieved by the random forest model, which had comparable performance to the polynomial regression model but required a longer computation time. In addition, the neural network model demonstrated commendable predictive accuracy, yielding an R2 score of 0.67 and a low RMSE of 0.57. However, its application is contingent on the availability of substantial computational resources, given its extensive computation time of 457.93 s. The study concludes by highlighting the efficacy of the Polynomial Regression model as the optimal choice for wind speed prediction in Dhaka, offering a balance between superior performance and efficient computation. This insight provides valuable guidance for practitioners and researchers seeking effective models for similar applications.\",\"PeriodicalId\":93049,\"journal\":{\"name\":\"Renewables: wind, water, and solar\",\"volume\":\"36 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Renewables: wind, water, and solar\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s40807-024-00109-z\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Renewables: wind, water, and solar","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s40807-024-00109-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本研究评估了使用机器学习预测达卡市风速的多个模型的性能。NASA Power 网站为本次调查提供了数据集。用于预测的模型包括决策树回归器、支持向量回归器、随机森林、线性回归、神经网络和多项式回归。我们使用了保留检查和 k 倍交叉验证来评估这些模型的性能。结果表明,多项式回归模型在验证集和测试集上的 R2 分数最高,RMSE 最低。决策树模型在两个集上的 R2 分数最低,有效误差最大,表现最差。随机森林模型获得了较高的 R2 分数和较低的 RMSE,其性能与多项式回归模型相当,但需要较长的计算时间。此外,神经网络模型的预测准确性也值得称赞,其 R2 得分为 0.67,RMSE 为 0.57。不过,由于其计算时间长达 457.93 秒,因此其应用取决于是否有大量的计算资源。研究最后强调,多项式回归模型是达卡风速预测的最佳选择,在卓越性能和高效计算之间取得了平衡。这一见解为为类似应用寻找有效模型的从业人员和研究人员提供了宝贵的指导。
Comparative study of machine learning algorithms for wind speed prediction in Dhaka, Bangladesh
This study evaluated the performance of multiple models that used machine learning to anticipate wind speed in the city of Dhaka. The NASA Power website provided the data set for this investigation. The models used for prediction included the decision tree regressor, support vector regressor, random forest, linear regression, neural network and polynomial regression. A hold-out check and k-fold cross-validation were used to assess how well these models performed. With the highest R2 scores and lowest RMSEs on both the validation and test sets, the results demonstrated that the polynomial regression model performed the best. With the lowest R2 scores and largest RMSEs on both sets, the decision tree model scored the poorest. High R2 scores and low RMSEs were achieved by the random forest model, which had comparable performance to the polynomial regression model but required a longer computation time. In addition, the neural network model demonstrated commendable predictive accuracy, yielding an R2 score of 0.67 and a low RMSE of 0.57. However, its application is contingent on the availability of substantial computational resources, given its extensive computation time of 457.93 s. The study concludes by highlighting the efficacy of the Polynomial Regression model as the optimal choice for wind speed prediction in Dhaka, offering a balance between superior performance and efficient computation. This insight provides valuable guidance for practitioners and researchers seeking effective models for similar applications.