Forecasting Potato Production in Major South Asian Countries: a Comparative Study of Machine Learning and Time Series Models

IF 2.3 3区 农林科学 Q1 AGRONOMY
Pradeep Mishra, Abdullah Mohammad Ghazi Al khatib, Bayan Mohamad Alshaib, Binita Kuamri, Shiwani Tiwari, Aditya Pratap Singh, Shikha Yadav, Divya Sharma, Prity Kuamri
{"title":"Forecasting Potato Production in Major South Asian Countries: a Comparative Study of Machine Learning and Time Series Models","authors":"Pradeep Mishra, Abdullah Mohammad Ghazi Al khatib, Bayan Mohamad Alshaib, Binita Kuamri, Shiwani Tiwari, Aditya Pratap Singh, Shikha Yadav, Divya Sharma, Prity Kuamri","doi":"10.1007/s11540-023-09683-z","DOIUrl":null,"url":null,"abstract":"<p>This study analyzed and forecasted potato production in eight major South Asian countries from 1961 to 2028 using advanced time series and machine learning approaches. Annual potato production data was modelled with autoregressive integrated moving average (ARIMA), state space, and extreme gradient boosting (XGBoost) models. The models were trained on 1961–2009 data and evaluated on a 2010–2021 validation set. On the training set, XGBoost showed the best performance. However, on the validation set, ARIMA and state space models significantly outperformed XGBoost, indicating issues with overfitting. The ARIMA models produced the lowest forecast errors for Afghanistan, Bangladesh, China, and Myanmar. Meanwhile, state space models were optimal for India, Nepal, Pakistan, and Sri Lanka, demonstrating that no one approach was uniformly best. The top performing models forecast potato production up to 2028, Afghanistan’s production is expected to remain stable at around 860–862 thousand metric tons. Bangladesh’s output is forecasted to stay constant at 9887 thousand metric tons. In contrast, China is predicted to see a steady increase from 94,625 to 96,193 thousand metric tons. India’s production is anticipated to grow significantly from 54,704 to 62,396 thousand metric tons. Conversely, Myanmar’s production is projected to decline from 460 to 426 thousand metric tons. Nepal’s output is expected to steadily increase from 3395 to 4011 thousand metric tons. Pakistan’s production is forecasted to rise substantially from 5573 to 7045 thousand metric tons. Lastly, Sri Lanka’s potato production is projected to experience a modest increase from 77 to 84 thousand metric tons. These forecasts reflect the different levels of potato demand, consumption, and trade in each country, as well as the effects of climate change, pests, and diseases on potato yields. The rigorous comparative application of advanced time series and machine learning techniques provides valuable data-driven insights into future South Asian potato supply. The forecasts can assist food security planning and agricultural policymaking in the region.</p>","PeriodicalId":20378,"journal":{"name":"Potato Research","volume":"17 1","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Potato Research","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s11540-023-09683-z","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0

Abstract

This study analyzed and forecasted potato production in eight major South Asian countries from 1961 to 2028 using advanced time series and machine learning approaches. Annual potato production data was modelled with autoregressive integrated moving average (ARIMA), state space, and extreme gradient boosting (XGBoost) models. The models were trained on 1961–2009 data and evaluated on a 2010–2021 validation set. On the training set, XGBoost showed the best performance. However, on the validation set, ARIMA and state space models significantly outperformed XGBoost, indicating issues with overfitting. The ARIMA models produced the lowest forecast errors for Afghanistan, Bangladesh, China, and Myanmar. Meanwhile, state space models were optimal for India, Nepal, Pakistan, and Sri Lanka, demonstrating that no one approach was uniformly best. The top performing models forecast potato production up to 2028, Afghanistan’s production is expected to remain stable at around 860–862 thousand metric tons. Bangladesh’s output is forecasted to stay constant at 9887 thousand metric tons. In contrast, China is predicted to see a steady increase from 94,625 to 96,193 thousand metric tons. India’s production is anticipated to grow significantly from 54,704 to 62,396 thousand metric tons. Conversely, Myanmar’s production is projected to decline from 460 to 426 thousand metric tons. Nepal’s output is expected to steadily increase from 3395 to 4011 thousand metric tons. Pakistan’s production is forecasted to rise substantially from 5573 to 7045 thousand metric tons. Lastly, Sri Lanka’s potato production is projected to experience a modest increase from 77 to 84 thousand metric tons. These forecasts reflect the different levels of potato demand, consumption, and trade in each country, as well as the effects of climate change, pests, and diseases on potato yields. The rigorous comparative application of advanced time series and machine learning techniques provides valuable data-driven insights into future South Asian potato supply. The forecasts can assist food security planning and agricultural policymaking in the region.

Abstract Image

南亚主要国家马铃薯产量预测:机器学习与时间序列模型比较研究
本研究采用先进的时间序列和机器学习方法,对八个主要南亚国家1961年至2028年的马铃薯产量进行了分析和预测。马铃薯年产量数据采用自回归综合移动平均(ARIMA)、状态空间和极端梯度提升(XGBoost)模型进行建模。这些模型在 1961-2009 年的数据上进行了训练,并在 2010-2021 年的验证集上进行了评估。在训练集上,XGBoost 表现最佳。然而,在验证集上,ARIMA 和状态空间模型的表现明显优于 XGBoost,这表明存在过度拟合的问题。在阿富汗、孟加拉国、中国和缅甸,ARIMA 模型产生的预测误差最小。与此同时,状态空间模型对印度、尼泊尔、巴基斯坦和斯里兰卡的预报效果最佳,这表明没有哪种方法是一致最佳的。表现最好的模型预测了到2028年的马铃薯产量,预计阿富汗的产量将稳定在86-86.2万吨左右。孟加拉国的产量预计将保持在988.7万吨。相比之下,中国的产量预计将从 9462.5 万吨稳步增长到 9619.3 万吨。印度的产量预计将从 5470.4 万吨大幅增长到 6239.6 万吨。相反,缅甸的产量预计将从 460 千吨下降到 426 千吨。尼泊尔的产量预计将从 3395 千吨稳步增长到 4011 千吨。巴基斯坦的产量预计将从557.3万吨大幅增至704.5万吨。最后,斯里兰卡的马铃薯产量预计将从7.7万吨略增至8.4万吨。这些预测反映了各国不同的马铃薯需求、消费和贸易水平,以及气候变化、病虫害对马铃薯产量的影响。先进的时间序列和机器学习技术的严格比较应用,为未来南亚马铃薯的供应提供了宝贵的数据驱动的见解。这些预测有助于该地区的粮食安全规划和农业决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Potato Research
Potato Research AGRONOMY-
CiteScore
5.50
自引率
6.90%
发文量
66
审稿时长
>12 weeks
期刊介绍: Potato Research, the journal of the European Association for Potato Research (EAPR), promotes the exchange of information on all aspects of this fast-evolving global industry. It offers the latest developments in innovative research to scientists active in potato research. The journal includes authoritative coverage of new scientific developments, publishing original research and review papers on such topics as: Molecular sciences; Breeding; Physiology; Pathology; Nematology; Virology; Agronomy; Engineering and Utilization.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信