{"title":"A Short-Term Load Forecasting Method via Model Selection Based on Random Forest","authors":"Ziyi Li, Jingyi Zhang, Wenpeng Jing, Zhaoming Lu, Wei Zheng, X. Wen","doi":"10.1109/ICCC56324.2022.10065825","DOIUrl":null,"url":null,"abstract":"Short-term load forecasting(STLF) is an essential module of energy management system, which is of great signifi-cance to the economic dispatch and operation stability in smart grid. There is a large collection of methods developed for STLF, but it is still challenging to provide high precision STLF under different weather conditions which are the main factors affecting power generation load, especially for distributed photovoltaic power generation load. A short-term load forecasting method via model selection based on random forest is proposed in this paper to realize reliable and accurate daily power generation load forecasting under different conditions. We first perform clustering analysis on the raw data through K-means. In particular, we consider both weighted meteorological factors and historical load to improve clustering performance. Secondly, we establish a model pool consisting of state-of-the-art machine learning(ML) models which is selected from four alternative ML models, and each model is the best model for each cluster. Then, we train a random forest based on each set of data and its optimal model label. In the prediction stage, random forest is utilized to directly select an appropriate model from model pool to obtain the final prediction load. The performance of the proposed method is validated on real generation load of practical scenarios. The result indicates the superiority and advantages of the model selection based STLF method compared with the single model methods, and the mean absolute error(MAE), root mean square error(RMSE) and mean absolute percentage error(MAPE) are reduced by 118.5054(KW), 10.43% and 2.08%, respectively.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"169 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10065825","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Short-term load forecasting(STLF) is an essential module of energy management system, which is of great signifi-cance to the economic dispatch and operation stability in smart grid. There is a large collection of methods developed for STLF, but it is still challenging to provide high precision STLF under different weather conditions which are the main factors affecting power generation load, especially for distributed photovoltaic power generation load. A short-term load forecasting method via model selection based on random forest is proposed in this paper to realize reliable and accurate daily power generation load forecasting under different conditions. We first perform clustering analysis on the raw data through K-means. In particular, we consider both weighted meteorological factors and historical load to improve clustering performance. Secondly, we establish a model pool consisting of state-of-the-art machine learning(ML) models which is selected from four alternative ML models, and each model is the best model for each cluster. Then, we train a random forest based on each set of data and its optimal model label. In the prediction stage, random forest is utilized to directly select an appropriate model from model pool to obtain the final prediction load. The performance of the proposed method is validated on real generation load of practical scenarios. The result indicates the superiority and advantages of the model selection based STLF method compared with the single model methods, and the mean absolute error(MAE), root mean square error(RMSE) and mean absolute percentage error(MAPE) are reduced by 118.5054(KW), 10.43% and 2.08%, respectively.