Kshitij A. Kakade, Kshitish Ghate, Rajat K Jaiswal, R. Jaiswal
{"title":"A Novel Approach to Forecast Crude Oil Prices Using Machine Learning and Technical Indicators","authors":"Kshitij A. Kakade, Kshitish Ghate, Rajat K Jaiswal, R. Jaiswal","doi":"10.12720/jait.14.2.302-310","DOIUrl":null,"url":null,"abstract":"—This study proposes to use a hybrid ensemble learning approach to improve the prediction efficiency of crude oil prices. It combines the Long Short-Term Memory (LSTM) with factors that influence the price of crude oil. The information from fundamental and technical indicators is considered along with statistical model predictions like autoregressive integrated moving average (ARIMA)to make one-step-ahead crude oil price predictions. A Principal Component Analysis (PCA) approach is employed to transform the explanatory variables. This study combines the LSTM with PCA, jointly known as the LP model wherein PCA transforms of the fundamental and technical indicators are used as inputs to improve LSTM predictions. Further, it attempts to improve these predictions by introducing the LSTM+PCA+ARIMA (LPA) model, which uses an ensemble learning approach to utilize the forecast from the ARIMA model, as an additional input. Among LP and LPA models, the LSTM model is used as a benchmark to evaluate the performance of the hybrid models. Based on the result, a significant improvement is seen in the LP model over the chosen window sizes and error metrics. On the other hand, the LPA model performs better across all dimensions with an average improvement of 41% over the LSTM model in terms of forecasting accuracy. Moreover, the equivalence of forecasting accuracy is tested using the Diebold-Mariano and Wilcoxon signed-rank tests","PeriodicalId":36452,"journal":{"name":"Journal of Advances in Information Technology","volume":"1 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12720/jait.14.2.302-310","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
—This study proposes to use a hybrid ensemble learning approach to improve the prediction efficiency of crude oil prices. It combines the Long Short-Term Memory (LSTM) with factors that influence the price of crude oil. The information from fundamental and technical indicators is considered along with statistical model predictions like autoregressive integrated moving average (ARIMA)to make one-step-ahead crude oil price predictions. A Principal Component Analysis (PCA) approach is employed to transform the explanatory variables. This study combines the LSTM with PCA, jointly known as the LP model wherein PCA transforms of the fundamental and technical indicators are used as inputs to improve LSTM predictions. Further, it attempts to improve these predictions by introducing the LSTM+PCA+ARIMA (LPA) model, which uses an ensemble learning approach to utilize the forecast from the ARIMA model, as an additional input. Among LP and LPA models, the LSTM model is used as a benchmark to evaluate the performance of the hybrid models. Based on the result, a significant improvement is seen in the LP model over the chosen window sizes and error metrics. On the other hand, the LPA model performs better across all dimensions with an average improvement of 41% over the LSTM model in terms of forecasting accuracy. Moreover, the equivalence of forecasting accuracy is tested using the Diebold-Mariano and Wilcoxon signed-rank tests