P. Matrenin, A. Khalyasmaa, V. Gamaley, S. Eroshenko, N. A. Papkova, D. A. Sekatski, Y. V. Potachits
{"title":"Improving of the Generation Accuracy Forecasting of Photovoltaic Plants Based on k-Means and k-Nearest Neighbors Algorithms","authors":"P. Matrenin, A. Khalyasmaa, V. Gamaley, S. Eroshenko, N. A. Papkova, D. A. Sekatski, Y. V. Potachits","doi":"10.21122/1029-7448-2023-66-4-305-321","DOIUrl":null,"url":null,"abstract":"Renewable energy sources (RES) are seen as a means of the fuel and energy complex carbon footprint reduction but the stochastic nature of generation complicates RES integration with electric power systems. Therefore, it is necessary to develop and improve methods for forecasting of the power plants generation using the energy of the sun, wind and water flows. One of the ways to improve the accuracy of forecast models is a deep analysis of meteorological conditions as the main factor affecting the power generation. In this paper, a method for adapting of forecast models to the meteorological conditions of photovoltaic stations operation based on machine learning algorithms was proposed and studied. In this case, unsupervised learning is first performed using the k-means method to form clusters. For this, it is also proposed to use studied the feature space dimensionality reduction algorithm to visualize and estimate the clustering accuracy. Then, for each cluster, its own machine learning model was trained for generation forecasting and the k-nearest neighbours algorithm was built to attribute the current conditions at the model operation stage to one of the formed clusters. The study was conducted on hourly meteorological data for the period from 1985 to 2021. A feature of the approach is the clustering of weather conditions on hourly rather than daily intervals. As a result, the mean absolute percentage error of forecasting is reduced significantly, depending on the prediction model used. For the best case, the error in forecasting of a photovoltaic plant generation an hour ahead was 9 %.","PeriodicalId":52141,"journal":{"name":"Energetika. Proceedings of CIS Higher Education Institutions and Power Engineering Associations","volume":"25 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energetika. Proceedings of CIS Higher Education Institutions and Power Engineering Associations","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21122/1029-7448-2023-66-4-305-321","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Energy","Score":null,"Total":0}
引用次数: 2
Abstract
Renewable energy sources (RES) are seen as a means of the fuel and energy complex carbon footprint reduction but the stochastic nature of generation complicates RES integration with electric power systems. Therefore, it is necessary to develop and improve methods for forecasting of the power plants generation using the energy of the sun, wind and water flows. One of the ways to improve the accuracy of forecast models is a deep analysis of meteorological conditions as the main factor affecting the power generation. In this paper, a method for adapting of forecast models to the meteorological conditions of photovoltaic stations operation based on machine learning algorithms was proposed and studied. In this case, unsupervised learning is first performed using the k-means method to form clusters. For this, it is also proposed to use studied the feature space dimensionality reduction algorithm to visualize and estimate the clustering accuracy. Then, for each cluster, its own machine learning model was trained for generation forecasting and the k-nearest neighbours algorithm was built to attribute the current conditions at the model operation stage to one of the formed clusters. The study was conducted on hourly meteorological data for the period from 1985 to 2021. A feature of the approach is the clustering of weather conditions on hourly rather than daily intervals. As a result, the mean absolute percentage error of forecasting is reduced significantly, depending on the prediction model used. For the best case, the error in forecasting of a photovoltaic plant generation an hour ahead was 9 %.
期刊介绍:
The most important objectives of the journal are the generalization of scientific and practical achievements in the field of power engineering, increase scientific and practical skills as researchers and industry representatives. Scientific concept publications include the publication of a modern national and international research and achievements in areas such as general energetic, electricity, thermal energy, construction, environmental issues energy, energy economy, etc. The journal publishes the results of basic research and the advanced achievements of practices aimed at improving the efficiency of the functioning of the energy sector, reduction of losses in electricity and heat networks, improving the reliability of electrical protection systems, the stability of the energetic complex, literature reviews on a wide range of energy issues.