Title: Quasi-Newton optimised Kolmogorov-Arnold Networks for wind farm power prediction
Authors: Auwalu Saleh Mubarak, Zubaida Said Ameen, Sagiru Mati, Ayodele Lasisi, Quadri Noorulhasan Naveed, Rabiu Aliyu Abdulkadir
Journal: Heliyon, volume 10, issue 23, article e40799 (Journal Article; impact factor 3.4; JCR Q1, Multidisciplinary Sciences)
Publication date: 2024-11-30 (eCollection 2024-12-15)
DOI: https://doi.org/10.1016/j.heliyon.2024.e40799
Full-text PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11652856/pdf/
Citations: 0
Abstract
Accurate and effective wind energy forecasting that can be easily incorporated into smart networks is important, because these infrastructures depend on appropriate planning and energy generation predictions. Wind energy production, however, is unstable and unpredictable. Wind energy forecasting has traditionally been performed using statistical models, but with the advent of artificial intelligence (AI), research indicates that AI is more accurate than statistical techniques. In this study, the nominal power of six wind farms in China was predicted using Kolmogorov-Arnold Networks (KAN) and Multilayer Perceptron (MLP) models. KAN, as an alternative to the conventional MLP, can handle the problems of scalability, vanishing gradients, and interpretability associated with MLP. The KAN uses learnable B-splines as activation functions, which enables it to address these issues. We employed Radial Basis Functions (RBF) with Gaussian kernels to approximate the third-order B-spline basis. Whereas most deep learning models are trained with stochastic gradient-based optimization algorithms such as Adaptive Moment Estimation (ADAM) and Stochastic Gradient Descent (SGD), this work employed a quasi-Newton optimization technique, the Limited-memory Broyden-Fletcher-Goldfarb-Shanno (LBFGS) algorithm, which approximates the Hessian matrix to estimate the curvature of the parameter space. In data preprocessing, the Interquartile Range (IQR) technique was used to handle outliers and a clustering-based K-Nearest Neighbor (KNN) imputer to handle missing values. Across the sites, KAN-LBFGS showed superior performance on the evaluation metrics, with site 5 achieving an MSE of 0.0039, an RMSE of 0.0622, an MAE of 0.0352, and a DC of 0.9468. The study highlights the importance of the model's architecture, preprocessing, and optimization techniques.
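To illustrate why a Gaussian kernel can stand in for a third-order B-spline basis function, the following minimal sketch compares the two on a dense grid. It is not the authors' implementation: the function names, the unit knot spacing, and the choice of kernel width (matching the Gaussian's variance to the B-spline's variance of 1/3) are assumptions made here for illustration only.

```python
import math


def cubic_bspline(x: float) -> float:
    """Cardinal cubic (third-order) B-spline centred at 0 with unit knot spacing."""
    a = abs(x)
    if a <= 1.0:
        return 2.0 / 3.0 - a * a + 0.5 * a ** 3
    if a <= 2.0:
        return (2.0 - a) ** 3 / 6.0
    return 0.0  # compact support: zero outside [-2, 2]


# Assumption for this sketch: pick sigma so the Gaussian has the same variance
# as the cubic B-spline (1/3). The paper may choose the width differently.
SIGMA = math.sqrt(1.0 / 3.0)


def gaussian_rbf(x: float, center: float = 0.0, sigma: float = SIGMA) -> float:
    """Unit-area Gaussian kernel evaluated at x."""
    z = (x - center) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def max_abs_gap(n_points: int = 801, half_width: float = 4.0) -> float:
    """Largest pointwise gap between the two basis functions on a uniform grid."""
    step = 2.0 * half_width / (n_points - 1)
    grid = [-half_width + i * step for i in range(n_points)]
    return max(abs(cubic_bspline(x) - gaussian_rbf(x)) for x in grid)


if __name__ == "__main__":
    print(f"max |B-spline - Gaussian| on [-4, 4]: {max_abs_gap():.4f}")
```

Under these assumptions the pointwise gap stays below 0.03 against a peak value of about 0.67, which suggests why a Gaussian RBF is a reasonable and cheaper-to-evaluate substitute. Unlike the B-spline, the Gaussian is smooth everywhere and has no compact support.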
About the journal:
Heliyon is an all-science, open access journal that is part of the Cell Press family. Any paper reporting scientifically accurate and valuable research, which adheres to accepted ethical and scientific publishing standards, will be considered for publication. Our growing team of dedicated section editors, along with our in-house team, handle your paper and manage the publication process end-to-end, giving your research the editorial support it deserves.