Radhakrishnan Angamuthu Chinnathambi, Mitch Campion, A. S. Nair, P. Ranganathan
{"title":"Investigation of Price-Feature Selection Algorithms for the Day-Ahead Electricity Markets","authors":"Radhakrishnan Angamuthu Chinnathambi, Mitch Campion, A. S. Nair, P. Ranganathan","doi":"10.1109/EPEC.2018.8598282","DOIUrl":null,"url":null,"abstract":"This paper investigates three types of feature selection techniques such as relative importance using Linear Regression (LR), Multivariate Adaptive Regression Splines (MARS), and Random forest (RF) to reduce the forecasts error for the hourly spot price of the Iberian electricity markets. Two pricing datasets of durations three and six months were used to validate the performance of the model. Three different set of features (17, 4, 2) for three and six months duration were used in this study. These selected features were applied to the two-stage hybrid model such as ARIMA-GLM, ARIMA-SVM, and ARIMA- RF. Finally, three variables (or features) that are commonly matched were selected and tested. Considerable reduction in Mean Absolute Percentage Errors (MAPE) values were observed for both three and six-month datasets.","PeriodicalId":265297,"journal":{"name":"2018 IEEE Electrical Power and Energy Conference (EPEC)","volume":"125 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Electrical Power and Energy Conference (EPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EPEC.2018.8598282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
This paper investigates three types of feature selection techniques such as relative importance using Linear Regression (LR), Multivariate Adaptive Regression Splines (MARS), and Random forest (RF) to reduce the forecasts error for the hourly spot price of the Iberian electricity markets. Two pricing datasets of durations three and six months were used to validate the performance of the model. Three different set of features (17, 4, 2) for three and six months duration were used in this study. These selected features were applied to the two-stage hybrid model such as ARIMA-GLM, ARIMA-SVM, and ARIMA- RF. Finally, three variables (or features) that are commonly matched were selected and tested. Considerable reduction in Mean Absolute Percentage Errors (MAPE) values were observed for both three and six-month datasets.