{"title":"研究训练时间对基于机器学习的光伏发电预测的影响","authors":"Dávid Markovics, M. J. Mayer","doi":"10.1109/IYCE54153.2022.9857544","DOIUrl":null,"url":null,"abstract":"The built-in capacity and therefore the importance of photovoltaic (PV) power generation is inevitably increasing in the past years resulting that it is less and less possible to turn a blind eye to the challenging problems that come with it. This article focuses on a practical issue regarding numerical weather prediction (NWP) based PV forecasting with machine learning, which is the effect of the training time. Most of the studies propose new high-level sophisticated algorithms which can reduce the errors with a given percentage, however few of them deals with practical questions regarding the training time therefore the operating time needed to reach the accuracy of physical models, and the optimal length of the training dataset is still generally unclear. This study is supposed to provide some suggestions introducing the test method and the results of the simulations that were done on 2-years data including spatial weather forecast and the power production of a Hungarian PV plant. Three fast but powerful ensemble methods, XGBoost, LightGBM and ExtraTrees regression were applied using different train length cases in the range of 2,4,6… 102 weeks, for all the 52 two-week intervals. This simulation allows to investigate not just the previously mentioned aspects, but also the seasonal difference and sensitivity for the forecasting accuracy. The results are shown in both day-ahead and intraday time horizons, where in case of latter, also some measured power values of the close-past has been added to the input variables.","PeriodicalId":248738,"journal":{"name":"2022 8th International Youth Conference on Energy (IYCE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Investigating the effect of training time for machine learning based photovoltaic power forecasting\",\"authors\":\"Dávid Markovics, M. J. Mayer\",\"doi\":\"10.1109/IYCE54153.2022.9857544\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The built-in capacity and therefore the importance of photovoltaic (PV) power generation is inevitably increasing in the past years resulting that it is less and less possible to turn a blind eye to the challenging problems that come with it. This article focuses on a practical issue regarding numerical weather prediction (NWP) based PV forecasting with machine learning, which is the effect of the training time. Most of the studies propose new high-level sophisticated algorithms which can reduce the errors with a given percentage, however few of them deals with practical questions regarding the training time therefore the operating time needed to reach the accuracy of physical models, and the optimal length of the training dataset is still generally unclear. This study is supposed to provide some suggestions introducing the test method and the results of the simulations that were done on 2-years data including spatial weather forecast and the power production of a Hungarian PV plant. Three fast but powerful ensemble methods, XGBoost, LightGBM and ExtraTrees regression were applied using different train length cases in the range of 2,4,6… 102 weeks, for all the 52 two-week intervals. This simulation allows to investigate not just the previously mentioned aspects, but also the seasonal difference and sensitivity for the forecasting accuracy. The results are shown in both day-ahead and intraday time horizons, where in case of latter, also some measured power values of the close-past has been added to the input variables.\",\"PeriodicalId\":248738,\"journal\":{\"name\":\"2022 8th International Youth Conference on Energy (IYCE)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 8th International Youth Conference on Energy (IYCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IYCE54153.2022.9857544\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Youth Conference on Energy (IYCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IYCE54153.2022.9857544","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Investigating the effect of training time for machine learning based photovoltaic power forecasting
The built-in capacity and therefore the importance of photovoltaic (PV) power generation is inevitably increasing in the past years resulting that it is less and less possible to turn a blind eye to the challenging problems that come with it. This article focuses on a practical issue regarding numerical weather prediction (NWP) based PV forecasting with machine learning, which is the effect of the training time. Most of the studies propose new high-level sophisticated algorithms which can reduce the errors with a given percentage, however few of them deals with practical questions regarding the training time therefore the operating time needed to reach the accuracy of physical models, and the optimal length of the training dataset is still generally unclear. This study is supposed to provide some suggestions introducing the test method and the results of the simulations that were done on 2-years data including spatial weather forecast and the power production of a Hungarian PV plant. Three fast but powerful ensemble methods, XGBoost, LightGBM and ExtraTrees regression were applied using different train length cases in the range of 2,4,6… 102 weeks, for all the 52 two-week intervals. This simulation allows to investigate not just the previously mentioned aspects, but also the seasonal difference and sensitivity for the forecasting accuracy. The results are shown in both day-ahead and intraday time horizons, where in case of latter, also some measured power values of the close-past has been added to the input variables.