A Short-Term Load Forecasting Method via Model Selection Based on Random Forest

2022 IEEE 8th International Conference on Computer and Communications (ICCC) Pub Date : 2022-12-09 DOI:10.1109/ICCC56324.2022.10065825

Ziyi Li, Jingyi Zhang, Wenpeng Jing, Zhaoming Lu, Wei Zheng, X. Wen

{"title":"A Short-Term Load Forecasting Method via Model Selection Based on Random Forest","authors":"Ziyi Li, Jingyi Zhang, Wenpeng Jing, Zhaoming Lu, Wei Zheng, X. Wen","doi":"10.1109/ICCC56324.2022.10065825","DOIUrl":null,"url":null,"abstract":"Short-term load forecasting(STLF) is an essential module of energy management system, which is of great signifi-cance to the economic dispatch and operation stability in smart grid. There is a large collection of methods developed for STLF, but it is still challenging to provide high precision STLF under different weather conditions which are the main factors affecting power generation load, especially for distributed photovoltaic power generation load. A short-term load forecasting method via model selection based on random forest is proposed in this paper to realize reliable and accurate daily power generation load forecasting under different conditions. We first perform clustering analysis on the raw data through K-means. In particular, we consider both weighted meteorological factors and historical load to improve clustering performance. Secondly, we establish a model pool consisting of state-of-the-art machine learning(ML) models which is selected from four alternative ML models, and each model is the best model for each cluster. Then, we train a random forest based on each set of data and its optimal model label. In the prediction stage, random forest is utilized to directly select an appropriate model from model pool to obtain the final prediction load. The performance of the proposed method is validated on real generation load of practical scenarios. The result indicates the superiority and advantages of the model selection based STLF method compared with the single model methods, and the mean absolute error(MAE), root mean square error(RMSE) and mean absolute percentage error(MAPE) are reduced by 118.5054(KW), 10.43% and 2.08%, respectively.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"169 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10065825","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Short-term load forecasting(STLF) is an essential module of energy management system, which is of great signifi-cance to the economic dispatch and operation stability in smart grid. There is a large collection of methods developed for STLF, but it is still challenging to provide high precision STLF under different weather conditions which are the main factors affecting power generation load, especially for distributed photovoltaic power generation load. A short-term load forecasting method via model selection based on random forest is proposed in this paper to realize reliable and accurate daily power generation load forecasting under different conditions. We first perform clustering analysis on the raw data through K-means. In particular, we consider both weighted meteorological factors and historical load to improve clustering performance. Secondly, we establish a model pool consisting of state-of-the-art machine learning(ML) models which is selected from four alternative ML models, and each model is the best model for each cluster. Then, we train a random forest based on each set of data and its optimal model label. In the prediction stage, random forest is utilized to directly select an appropriate model from model pool to obtain the final prediction load. The performance of the proposed method is validated on real generation load of practical scenarios. The result indicates the superiority and advantages of the model selection based STLF method compared with the single model methods, and the mean absolute error(MAE), root mean square error(RMSE) and mean absolute percentage error(MAPE) are reduced by 118.5054(KW), 10.43% and 2.08%, respectively.

查看原文本刊更多论文

基于随机森林模型选择的短期负荷预测方法

短期负荷预测(STLF)是智能电网能源管理系统的重要模块，对智能电网的经济调度和运行稳定具有重要意义。针对STLF开发的方法很多，但在不同天气条件下提供高精度的STLF仍然是一个挑战，天气条件是影响发电负荷的主要因素，尤其是分布式光伏发电负荷。为了实现不同工况下可靠、准确的日发电负荷预测，提出了一种基于随机森林模型选择的短期负荷预测方法。我们首先通过K-means对原始数据进行聚类分析。特别是，我们考虑了加权气象因素和历史负载来提高聚类性能。其次，我们建立了一个由最先进的机器学习(ML)模型组成的模型池，这些模型从四个备选ML模型中选择，每个模型都是每个集群的最佳模型。然后，我们根据每组数据及其最优模型标签训练一个随机森林。在预测阶段，利用随机森林直接从模型池中选择合适的模型，得到最终的预测负荷。在实际场景的发电负荷上验证了该方法的性能。结果表明，与单模型方法相比，基于模型选择的STLF方法具有优越性和优势，平均绝对误差(MAE)、均方根误差(RMSE)和平均绝对百分比误差(MAPE)分别降低了118.5054(KW)、10.43%和2.08%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 8th International Conference on Computer and Communications (ICCC)

自引率

0.00%

发文量