研究训练时间对基于机器学习的光伏发电预测的影响

Dávid Markovics, M. J. Mayer
{"title":"研究训练时间对基于机器学习的光伏发电预测的影响","authors":"Dávid Markovics, M. J. Mayer","doi":"10.1109/IYCE54153.2022.9857544","DOIUrl":null,"url":null,"abstract":"The built-in capacity and therefore the importance of photovoltaic (PV) power generation is inevitably increasing in the past years resulting that it is less and less possible to turn a blind eye to the challenging problems that come with it. This article focuses on a practical issue regarding numerical weather prediction (NWP) based PV forecasting with machine learning, which is the effect of the training time. Most of the studies propose new high-level sophisticated algorithms which can reduce the errors with a given percentage, however few of them deals with practical questions regarding the training time therefore the operating time needed to reach the accuracy of physical models, and the optimal length of the training dataset is still generally unclear. This study is supposed to provide some suggestions introducing the test method and the results of the simulations that were done on 2-years data including spatial weather forecast and the power production of a Hungarian PV plant. Three fast but powerful ensemble methods, XGBoost, LightGBM and ExtraTrees regression were applied using different train length cases in the range of 2,4,6… 102 weeks, for all the 52 two-week intervals. This simulation allows to investigate not just the previously mentioned aspects, but also the seasonal difference and sensitivity for the forecasting accuracy. The results are shown in both day-ahead and intraday time horizons, where in case of latter, also some measured power values of the close-past has been added to the input variables.","PeriodicalId":248738,"journal":{"name":"2022 8th International Youth Conference on Energy (IYCE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Investigating the effect of training time for machine learning based photovoltaic power forecasting\",\"authors\":\"Dávid Markovics, M. J. Mayer\",\"doi\":\"10.1109/IYCE54153.2022.9857544\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The built-in capacity and therefore the importance of photovoltaic (PV) power generation is inevitably increasing in the past years resulting that it is less and less possible to turn a blind eye to the challenging problems that come with it. This article focuses on a practical issue regarding numerical weather prediction (NWP) based PV forecasting with machine learning, which is the effect of the training time. Most of the studies propose new high-level sophisticated algorithms which can reduce the errors with a given percentage, however few of them deals with practical questions regarding the training time therefore the operating time needed to reach the accuracy of physical models, and the optimal length of the training dataset is still generally unclear. This study is supposed to provide some suggestions introducing the test method and the results of the simulations that were done on 2-years data including spatial weather forecast and the power production of a Hungarian PV plant. Three fast but powerful ensemble methods, XGBoost, LightGBM and ExtraTrees regression were applied using different train length cases in the range of 2,4,6… 102 weeks, for all the 52 two-week intervals. This simulation allows to investigate not just the previously mentioned aspects, but also the seasonal difference and sensitivity for the forecasting accuracy. The results are shown in both day-ahead and intraday time horizons, where in case of latter, also some measured power values of the close-past has been added to the input variables.\",\"PeriodicalId\":248738,\"journal\":{\"name\":\"2022 8th International Youth Conference on Energy (IYCE)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 8th International Youth Conference on Energy (IYCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IYCE54153.2022.9857544\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Youth Conference on Energy (IYCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IYCE54153.2022.9857544","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

在过去的几年里,内置容量和光伏发电的重要性不可避免地增加,导致越来越不可能对随之而来的具有挑战性的问题视而不见。本文主要研究基于数值天气预报(NWP)的PV预测与机器学习的一个实际问题,即训练时间的影响。大多数研究都提出了新的高级复杂算法,可以将误差降低到给定的百分比,但是很少有研究涉及到训练时间的实际问题,因此达到物理模型精度所需的操作时间,以及训练数据集的最佳长度仍然普遍不清楚。本研究通过对匈牙利某光伏电站2年的空间天气预报和发电数据进行模拟,并介绍了测试方法和模拟结果,提出了一些建议。在所有52周的时间间隔中,采用了XGBoost、LightGBM和ExtraTrees三种快速但功能强大的集成方法,在2、4、6…102周的不同训练长度的情况下应用。该模拟不仅可以研究前面提到的方面,还可以研究季节差异和预测精度的敏感性。结果显示在前一天和当天的时间范围,在后者的情况下,也有一些测量功率值的近过去已被添加到输入变量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Investigating the effect of training time for machine learning based photovoltaic power forecasting
The built-in capacity and therefore the importance of photovoltaic (PV) power generation is inevitably increasing in the past years resulting that it is less and less possible to turn a blind eye to the challenging problems that come with it. This article focuses on a practical issue regarding numerical weather prediction (NWP) based PV forecasting with machine learning, which is the effect of the training time. Most of the studies propose new high-level sophisticated algorithms which can reduce the errors with a given percentage, however few of them deals with practical questions regarding the training time therefore the operating time needed to reach the accuracy of physical models, and the optimal length of the training dataset is still generally unclear. This study is supposed to provide some suggestions introducing the test method and the results of the simulations that were done on 2-years data including spatial weather forecast and the power production of a Hungarian PV plant. Three fast but powerful ensemble methods, XGBoost, LightGBM and ExtraTrees regression were applied using different train length cases in the range of 2,4,6… 102 weeks, for all the 52 two-week intervals. This simulation allows to investigate not just the previously mentioned aspects, but also the seasonal difference and sensitivity for the forecasting accuracy. The results are shown in both day-ahead and intraday time horizons, where in case of latter, also some measured power values of the close-past has been added to the input variables.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信