利用前向变量选择进行预测的自回归分布滞后多重时间序列建模

Q3 Economics, Econometrics and Finance

WSEAS Transactions on Business and Economics Pub Date : 2024-04-19 DOI:10.37394/23207.2024.21.84

Achmad Efendi, Yusi Tyroni Mursityo, N. W. Hidajati, Nur Andajani, Zuraidah Zuraidah, S. Handoyo

{"title":"利用前向变量选择进行预测的自回归分布滞后多重时间序列建模","authors":"Achmad Efendi, Yusi Tyroni Mursityo, N. W. Hidajati, Nur Andajani, Zuraidah Zuraidah, S. Handoyo","doi":"10.37394/23207.2024.21.84","DOIUrl":null,"url":null,"abstract":"The conventional time series methods tend to explore the modeling process and statistics tests to find the best model. On the other hand, machine learning methods are concerned with finding it based on the highest performance in the testing data. This research proposes a mixture approach in the development of the ARDL (Autoregressive Distributed Lags) model to predict the Cayenne peppers price. Multiple time series data are formed into a matrix of input-output pairs with various lag numbers of 3, 5, and 7. The dataset is normalized with the Min-max and Z score transformations. The ARDL predictor variables of each lag number and dataset combinations are selected using the forward selection method with a majority vote of four criteria namely the Cp (Cp Mallow), AIC (Akaike Information Criterion), BIC (Bayesian Information Criterion), and adjusted R2 . Each ARDL model is evaluated in the testing data with performance metrics of the RMSE (Root Mean Square Error), MAE (Mean Absolute Error), and R2 . Both AIC and adjusted R2 always form the majority vote in the determining optimal predictor variable of ARDL models in all scenarios. The ARDL predictor variables in each lag number are different but they are the same in the different dataset scenarios. The price of Cayenne pepper yesterday is the predictor variable with the most contribution in all of the 9 ARDL models yielded. The ARDL lag 3 with the original dataset outperforms in the RMSE and MAE metrics while the ARDL lag 3 with the Z score dataset outperforms in the R2 metric.","PeriodicalId":39427,"journal":{"name":"WSEAS Transactions on Business and Economics","volume":" 29","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiple Time Series Modeling of Autoregressive Distributed Lags with Forward Variable Selection for Prediction\",\"authors\":\"Achmad Efendi, Yusi Tyroni Mursityo, N. W. Hidajati, Nur Andajani, Zuraidah Zuraidah, S. Handoyo\",\"doi\":\"10.37394/23207.2024.21.84\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The conventional time series methods tend to explore the modeling process and statistics tests to find the best model. On the other hand, machine learning methods are concerned with finding it based on the highest performance in the testing data. This research proposes a mixture approach in the development of the ARDL (Autoregressive Distributed Lags) model to predict the Cayenne peppers price. Multiple time series data are formed into a matrix of input-output pairs with various lag numbers of 3, 5, and 7. The dataset is normalized with the Min-max and Z score transformations. The ARDL predictor variables of each lag number and dataset combinations are selected using the forward selection method with a majority vote of four criteria namely the Cp (Cp Mallow), AIC (Akaike Information Criterion), BIC (Bayesian Information Criterion), and adjusted R2 . Each ARDL model is evaluated in the testing data with performance metrics of the RMSE (Root Mean Square Error), MAE (Mean Absolute Error), and R2 . Both AIC and adjusted R2 always form the majority vote in the determining optimal predictor variable of ARDL models in all scenarios. The ARDL predictor variables in each lag number are different but they are the same in the different dataset scenarios. The price of Cayenne pepper yesterday is the predictor variable with the most contribution in all of the 9 ARDL models yielded. The ARDL lag 3 with the original dataset outperforms in the RMSE and MAE metrics while the ARDL lag 3 with the Z score dataset outperforms in the R2 metric.\",\"PeriodicalId\":39427,\"journal\":{\"name\":\"WSEAS Transactions on Business and Economics\",\"volume\":\" 29\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"WSEAS Transactions on Business and Economics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.37394/23207.2024.21.84\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Economics, Econometrics and Finance\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"WSEAS Transactions on Business and Economics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37394/23207.2024.21.84","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Economics, Econometrics and Finance","Score":null,"Total":0}

引用次数: 0

摘要

传统的时间序列方法倾向于探索建模过程和统计测试，以找到最佳模型。另一方面，机器学习方法关注的是根据测试数据中的最高性能来寻找最佳模型。本研究在建立 ARDL（自回归分布滞后）模型预测卡宴花椒价格时提出了一种混合方法。多个时间序列数据组成了一个输入输出对矩阵，其滞后数分别为 3、5 和 7。数据集通过最小-最大和 Z 分数变换进行归一化处理。每个滞后期数和数据集组合的 ARDL 预测变量均采用正向选择法，由四个标准（即 Cp（Cp Mallow）、AIC（Akaike 信息准则）、BIC（贝叶斯信息准则）和调整后 R2）的多数票选出。每个 ARDL 模型都通过 RMSE（均方根误差）、MAE（平均绝对误差）和 R2 等性能指标对测试数据进行评估。在所有情况下，AIC 和调整后 R2 在确定 ARDL 模型的最佳预测变量时总是占多数。每个滞后期的 ARDL 预测变量是不同的，但在不同的数据集情况下是相同的。昨天卡宴辣椒的价格是所有 9 个 ARDL 模型中贡献最大的预测变量。使用原始数据集的 ARDL 第 3 滞后模型在 RMSE 和 MAE 指标上优于使用 Z 分数数据集的 ARDL 第 3 滞后模型，而在 R2 指标上优于使用 Z 分数数据集的 ARDL 第 3 滞后模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multiple Time Series Modeling of Autoregressive Distributed Lags with Forward Variable Selection for Prediction

The conventional time series methods tend to explore the modeling process and statistics tests to find the best model. On the other hand, machine learning methods are concerned with finding it based on the highest performance in the testing data. This research proposes a mixture approach in the development of the ARDL (Autoregressive Distributed Lags) model to predict the Cayenne peppers price. Multiple time series data are formed into a matrix of input-output pairs with various lag numbers of 3, 5, and 7. The dataset is normalized with the Min-max and Z score transformations. The ARDL predictor variables of each lag number and dataset combinations are selected using the forward selection method with a majority vote of four criteria namely the Cp (Cp Mallow), AIC (Akaike Information Criterion), BIC (Bayesian Information Criterion), and adjusted R2 . Each ARDL model is evaluated in the testing data with performance metrics of the RMSE (Root Mean Square Error), MAE (Mean Absolute Error), and R2 . Both AIC and adjusted R2 always form the majority vote in the determining optimal predictor variable of ARDL models in all scenarios. The ARDL predictor variables in each lag number are different but they are the same in the different dataset scenarios. The price of Cayenne pepper yesterday is the predictor variable with the most contribution in all of the 9 ARDL models yielded. The ARDL lag 3 with the original dataset outperforms in the RMSE and MAE metrics while the ARDL lag 3 with the Z score dataset outperforms in the R2 metric.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

WSEAS Transactions on Business and Economics Economics, Econometrics and Finance-Economics and Econometrics

CiteScore

1.50

自引率

0.00%

发文量

180

期刊介绍： WSEAS Transactions on Business and Economics publishes original research papers relating to the global economy. We aim to bring important work using any economic approach to a wide international audience and therefore only publish papers of exceptional scientific value that advance our understanding of finances. The research presented must transcend the limits of case studies, while both experimental and theoretical studies are accepted. While its main emphasis is economic, it is a multi-disciplinary journal and therefore its content mirrors the diverse interests and approaches of scholars involved with the international dimensions of business, economics, finance, history, law, marketing, management, political science, and related areas. It also welcomes scholarly contributions from officials with government agencies, international agencies, and non-governmental organizations.