集成递归特征选择与自动机器学习框架的全球小麦价格预测

IF 4.8 Q1 AGRICULTURE, MULTIDISCIPLINARY
Prity Kumari , N. Harshith , Athula Ginige
{"title":"集成递归特征选择与自动机器学习框架的全球小麦价格预测","authors":"Prity Kumari ,&nbsp;N. Harshith ,&nbsp;Athula Ginige","doi":"10.1016/j.jafr.2025.102113","DOIUrl":null,"url":null,"abstract":"<div><div>Wheat price forecasting is challenging due to the interplay between economic trends, environmental variability and unpredictable market conditions, as well as the scarcity of reliable data. This research presents an innovative method for predicting global wheat prices by combining Recursive Feature Elimination with Cross Validation (RFECV) and Bayesian Ridge Regression in an Automated Machine Learning (AutoML) framework. The inclusion of key input variables - Lag 1, Lag 2 and an outlier indicator, marks a significant improvement in capturing regular and extreme market events which have a strong impact on prices. A comprehensive evaluation was conducted using 32 years of monthly wheat price data (1990–2022) from the Federal Reserve Economic Database, comparing 35 machine learning models. . Based on test set performance, top five performing models <em>i.e.</em> Bayesian Ridge, Linear Regression, Least Angle Regression (Lars), Least Angle Regression with Cross-Validation (LarsCV) and Least Absolute Shrinkage and Selection Operator Least Angle Regression with Cross-Validation (Lasso Lars CV), were selected and further assessed across four train-test splits (80:20, 75:25, 70:30 and 65:35). The 80:20 split provided the most stable results, with Bayesian Ridge achieving the lowest RMSE of 12.26, outperforming the other models by 1.39% to 2.61% across all splits. These findings highlight the model’s generalizability and potential application in policy formulation, market regulation and strategic planning for global wheat trade.</div></div>","PeriodicalId":34393,"journal":{"name":"Journal of Agriculture and Food Research","volume":"22 ","pages":"Article 102113"},"PeriodicalIF":4.8000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating recursive feature selection with automated machine learning framework for global wheat price prediction\",\"authors\":\"Prity Kumari ,&nbsp;N. Harshith ,&nbsp;Athula Ginige\",\"doi\":\"10.1016/j.jafr.2025.102113\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Wheat price forecasting is challenging due to the interplay between economic trends, environmental variability and unpredictable market conditions, as well as the scarcity of reliable data. This research presents an innovative method for predicting global wheat prices by combining Recursive Feature Elimination with Cross Validation (RFECV) and Bayesian Ridge Regression in an Automated Machine Learning (AutoML) framework. The inclusion of key input variables - Lag 1, Lag 2 and an outlier indicator, marks a significant improvement in capturing regular and extreme market events which have a strong impact on prices. A comprehensive evaluation was conducted using 32 years of monthly wheat price data (1990–2022) from the Federal Reserve Economic Database, comparing 35 machine learning models. . Based on test set performance, top five performing models <em>i.e.</em> Bayesian Ridge, Linear Regression, Least Angle Regression (Lars), Least Angle Regression with Cross-Validation (LarsCV) and Least Absolute Shrinkage and Selection Operator Least Angle Regression with Cross-Validation (Lasso Lars CV), were selected and further assessed across four train-test splits (80:20, 75:25, 70:30 and 65:35). The 80:20 split provided the most stable results, with Bayesian Ridge achieving the lowest RMSE of 12.26, outperforming the other models by 1.39% to 2.61% across all splits. These findings highlight the model’s generalizability and potential application in policy formulation, market regulation and strategic planning for global wheat trade.</div></div>\",\"PeriodicalId\":34393,\"journal\":{\"name\":\"Journal of Agriculture and Food Research\",\"volume\":\"22 \",\"pages\":\"Article 102113\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Agriculture and Food Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666154325004843\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Agriculture and Food Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666154325004843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

由于经济趋势、环境变化和不可预测的市场条件之间的相互作用,以及可靠数据的缺乏,小麦价格预测具有挑战性。本研究提出了一种在自动机器学习(AutoML)框架下结合递归特征消除与交叉验证(RFECV)和贝叶斯岭回归的预测全球小麦价格的创新方法。包括关键的输入变量-滞后1,滞后2和离群指标,标志着在捕捉对价格有强烈影响的常规和极端市场事件方面取得了重大进展。使用来自美联储经济数据库的32年月度小麦价格数据(1990-2022)进行了全面评估,比较了35个机器学习模型。根据测试集的性能,选择了表现最好的5个模型,即贝叶斯岭、线性回归、最小角度回归(Lars)、最小角度回归与交叉验证(LarsCV)和最小绝对收缩和选择算子最小角度回归与交叉验证(Lasso LarsCV),并在4个列车测试分割(80:20、75:25、70:30和65:35)中进行了进一步评估。80:20分割提供了最稳定的结果,贝叶斯岭实现了最低的RMSE为12.26,在所有分割中优于其他模型1.39%至2.61%。这些发现突出了该模型在全球小麦贸易政策制定、市场监管和战略规划方面的推广应用潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Integrating recursive feature selection with automated machine learning framework for global wheat price prediction
Wheat price forecasting is challenging due to the interplay between economic trends, environmental variability and unpredictable market conditions, as well as the scarcity of reliable data. This research presents an innovative method for predicting global wheat prices by combining Recursive Feature Elimination with Cross Validation (RFECV) and Bayesian Ridge Regression in an Automated Machine Learning (AutoML) framework. The inclusion of key input variables - Lag 1, Lag 2 and an outlier indicator, marks a significant improvement in capturing regular and extreme market events which have a strong impact on prices. A comprehensive evaluation was conducted using 32 years of monthly wheat price data (1990–2022) from the Federal Reserve Economic Database, comparing 35 machine learning models. . Based on test set performance, top five performing models i.e. Bayesian Ridge, Linear Regression, Least Angle Regression (Lars), Least Angle Regression with Cross-Validation (LarsCV) and Least Absolute Shrinkage and Selection Operator Least Angle Regression with Cross-Validation (Lasso Lars CV), were selected and further assessed across four train-test splits (80:20, 75:25, 70:30 and 65:35). The 80:20 split provided the most stable results, with Bayesian Ridge achieving the lowest RMSE of 12.26, outperforming the other models by 1.39% to 2.61% across all splits. These findings highlight the model’s generalizability and potential application in policy formulation, market regulation and strategic planning for global wheat trade.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.40
自引率
2.60%
发文量
193
审稿时长
69 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信