{"title":"Model selection for long-term load forecasting under uncertainty","authors":"Aditya Thangjam, Sanjita Jaipuria, Pradeep Kumar Dadabada","doi":"10.1108/jm2-09-2023-0211","DOIUrl":null,"url":null,"abstract":"<h3>Purpose</h3>\n<p>The purpose of this study is to propose a systematic model selection procedure for long-term load forecasting (LTLF) for ex-ante and ex-post cases considering uncertainty in exogenous predictors.</p><!--/ Abstract__block -->\n<h3>Design/methodology/approach</h3>\n<p>The different variants of regression models, namely, Polynomial Regression (PR), Generalised Additive Model (GAM), Quantile Polynomial Regression (QPR) and Quantile Spline Regression (QSR), incorporating uncertainty in exogenous predictors like population, Real Gross State Product (RGSP) and Real Per Capita Income (RPCI), temperature and indicators of breakpoints and calendar effects, are considered for LTLF. Initially, the Backward Feature Elimination procedure is used to identify the optimal set of predictors for LTLF. Then, the consistency in model accuracies is evaluated using point and probabilistic forecast error metrics for ex-ante and ex-post cases.</p><!--/ Abstract__block -->\n<h3>Findings</h3>\n<p>From this study, it is found PR model outperformed in ex-ante condition, while QPR model outperformed in ex-post condition. Further, QPR model performed consistently across validation and testing periods. Overall, QPR model excelled in capturing uncertainty in exogenous predictors, thereby reducing over-forecast error and risk of overinvestment.</p><!--/ Abstract__block -->\n<h3>Research limitations/implications</h3>\n<p>These findings can help utilities to align model selection strategies with their risk tolerance.</p><!--/ Abstract__block -->\n<h3>Originality/value</h3>\n<p>To propose the systematic model selection procedure in this study, the consistent performance of PR, GAM, QPR and QSR models are evaluated using point forecast accuracy metrics Mean Absolute Percentage Error, Root Mean Squared Error and probabilistic forecast accuracy metric Pinball Score for ex-ante and ex-post cases considering uncertainty in the considered exogenous predictors such as RGSP, RPCI, population and temperature.</p><!--/ Abstract__block -->","PeriodicalId":16349,"journal":{"name":"Journal of Modelling in Management","volume":null,"pages":null},"PeriodicalIF":1.8000,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Modelling in Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/jm2-09-2023-0211","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MANAGEMENT","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
The purpose of this study is to propose a systematic model selection procedure for long-term load forecasting (LTLF) for ex-ante and ex-post cases considering uncertainty in exogenous predictors.
Design/methodology/approach
The different variants of regression models, namely, Polynomial Regression (PR), Generalised Additive Model (GAM), Quantile Polynomial Regression (QPR) and Quantile Spline Regression (QSR), incorporating uncertainty in exogenous predictors like population, Real Gross State Product (RGSP) and Real Per Capita Income (RPCI), temperature and indicators of breakpoints and calendar effects, are considered for LTLF. Initially, the Backward Feature Elimination procedure is used to identify the optimal set of predictors for LTLF. Then, the consistency in model accuracies is evaluated using point and probabilistic forecast error metrics for ex-ante and ex-post cases.
Findings
From this study, it is found PR model outperformed in ex-ante condition, while QPR model outperformed in ex-post condition. Further, QPR model performed consistently across validation and testing periods. Overall, QPR model excelled in capturing uncertainty in exogenous predictors, thereby reducing over-forecast error and risk of overinvestment.
Research limitations/implications
These findings can help utilities to align model selection strategies with their risk tolerance.
Originality/value
To propose the systematic model selection procedure in this study, the consistent performance of PR, GAM, QPR and QSR models are evaluated using point forecast accuracy metrics Mean Absolute Percentage Error, Root Mean Squared Error and probabilistic forecast accuracy metric Pinball Score for ex-ante and ex-post cases considering uncertainty in the considered exogenous predictors such as RGSP, RPCI, population and temperature.
期刊介绍:
Journal of Modelling in Management (JM2) provides a forum for academics and researchers with a strong interest in business and management modelling. The journal analyses the conceptual antecedents and theoretical underpinnings leading to research modelling processes which derive useful consequences in terms of management science, business and management implementation and applications. JM2 is focused on the utilization of management data, which is amenable to research modelling processes, and welcomes academic papers that not only encompass the whole research process (from conceptualization to managerial implications) but also make explicit the individual links between ''antecedents and modelling'' (how to tackle certain problems) and ''modelling and consequences'' (how to apply the models and draw appropriate conclusions). The journal is particularly interested in innovative methodological and statistical modelling processes and those models that result in clear and justified managerial decisions. JM2 specifically promotes and supports research writing, that engages in an academically rigorous manner, in areas related to research modelling such as: A priori theorizing conceptual models, Artificial intelligence, machine learning, Association rule mining, clustering, feature selection, Business analytics: Descriptive, Predictive, and Prescriptive Analytics, Causal analytics: structural equation modeling, partial least squares modeling, Computable general equilibrium models, Computer-based models, Data mining, data analytics with big data, Decision support systems and business intelligence, Econometric models, Fuzzy logic modeling, Generalized linear models, Multi-attribute decision-making models, Non-linear models, Optimization, Simulation models, Statistical decision models, Statistical inference making and probabilistic modeling, Text mining, web mining, and visual analytics, Uncertainty-based reasoning models.