Machine learning ensembles, neural network, hybrid and sparse regression approaches for weather based rainfed cotton yield forecast

IF 3 3区地球科学 Q2 BIOPHYSICS

International Journal of Biometeorology Pub Date : 2024-04-27 DOI:10.1007/s00484-024-02661-1

Girish R Kashyap, Shankarappa Sridhara, Konapura Nagaraja Manoj, Pradeep Gopakkali, Bappa Das, Prakash Kumar Jha, P. V. Vara Prasad

{"title":"Machine learning ensembles, neural network, hybrid and sparse regression approaches for weather based rainfed cotton yield forecast","authors":"Girish R Kashyap, Shankarappa Sridhara, Konapura Nagaraja Manoj, Pradeep Gopakkali, Bappa Das, Prakash Kumar Jha, P. V. Vara Prasad","doi":"10.1007/s00484-024-02661-1","DOIUrl":null,"url":null,"abstract":"<div><p>Cotton is a major economic crop predominantly cultivated under rainfed situations. The accurate prediction of cotton yield invariably helps farmers, industries, and policy makers. The final cotton yield is mostly determined by the weather patterns that prevail during the crop growing phase. Crop yield prediction with greater accuracy is possible due to the development of innovative technologies which analyses the bigdata with its high-performance computing abilities. Machine learning technologies can make yield prediction reasonable and faster and with greater flexibility than process based complex crop simulation models. The present study demonstrates the usability of ML algorithms for yield forecasting and facilitates the comparison of different models. The cotton yield was simulated by employing the weekly weather indices as inputs and the model performance was assessed by nRMSE, MAPE and EF values. Results show that stacked generalised ensemble model and artificial neural networks predicted the cotton yield with lower nRMSE, MAPE and higher efficiency compared to other models. Variable importance studies in LASSO and ENET model found minimum temperature and relative humidity as the main determinates of cotton yield in all districts. The models were ranked based these performance metrics in the order of Stacked generalised ensemble > ANN > PCA ANN > SMLR ANN > LASSO> ENET > SVM > PCA SMLR > SMLR SVM > SMLR. This study shows that stacked generalised ensembling and ANN method can be used for reliable yield forecasting at district or county level and helps stakeholders in timely decision-making.</p></div>","PeriodicalId":588,"journal":{"name":"International Journal of Biometeorology","volume":"68 6","pages":"1179 - 1197"},"PeriodicalIF":3.0000,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biometeorology","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1007/s00484-024-02661-1","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOPHYSICS","Score":null,"Total":0}

引用次数: 0

Abstract

Cotton is a major economic crop predominantly cultivated under rainfed situations. The accurate prediction of cotton yield invariably helps farmers, industries, and policy makers. The final cotton yield is mostly determined by the weather patterns that prevail during the crop growing phase. Crop yield prediction with greater accuracy is possible due to the development of innovative technologies which analyses the bigdata with its high-performance computing abilities. Machine learning technologies can make yield prediction reasonable and faster and with greater flexibility than process based complex crop simulation models. The present study demonstrates the usability of ML algorithms for yield forecasting and facilitates the comparison of different models. The cotton yield was simulated by employing the weekly weather indices as inputs and the model performance was assessed by nRMSE, MAPE and EF values. Results show that stacked generalised ensemble model and artificial neural networks predicted the cotton yield with lower nRMSE, MAPE and higher efficiency compared to other models. Variable importance studies in LASSO and ENET model found minimum temperature and relative humidity as the main determinates of cotton yield in all districts. The models were ranked based these performance metrics in the order of Stacked generalised ensemble > ANN > PCA ANN > SMLR ANN > LASSO> ENET > SVM > PCA SMLR > SMLR SVM > SMLR. This study shows that stacked generalised ensembling and ANN method can be used for reliable yield forecasting at district or county level and helps stakeholders in timely decision-making.

Abstract Image

查看原文本刊更多论文

基于天气的雨浇棉花产量预测的机器学习组合、神经网络、混合和稀疏回归方法

棉花是一种主要靠雨水灌溉的经济作物。准确预测棉花产量对农民、工业和政策制定者都有很大帮助。棉花的最终产量主要取决于作物生长阶段的天气模式。由于创新技术的发展，利用高性能计算能力对大数据进行分析，可以更准确地预测作物产量。与基于过程的复杂作物模拟模型相比，机器学习技术能使产量预测更合理、更快速、更灵活。本研究展示了机器学习算法在产量预测中的可用性，并有助于对不同模型进行比较。采用每周天气指数作为输入对棉花产量进行了模拟，并通过 nRMSE、MAPE 和 EF 值对模型性能进行了评估。结果表明，与其他模型相比，叠加广义集合模型和人工神经网络预测棉花产量的 nRMSE、MAPE 更低，效率更高。LASSO 和 ENET 模型的变量重要性研究发现，最低温度和相对湿度是所有地区棉花产量的主要决定因素。根据这些性能指标对模型进行了排序，依次为堆叠广义集合模型、ANN 模型、PCA ANN 模型、SMLR ANN 模型、LASSO 模型、ENET 模型、SVM 模型、PCA SMLR 模型、SMLR SVM 模型、SMLR 模型。这项研究表明，堆叠广义集合和 ANN 方法可用于在地区或县一级进行可靠的产量预测，并有助于利益相关者及时做出决策。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Biometeorology 地学-环境科学

CiteScore

6.40

自引率

9.40%

发文量

183

审稿时长

1 months

期刊介绍： The Journal publishes original research papers, review articles and short communications on studies examining the interactions between living organisms and factors of the natural and artificial atmospheric environment. Living organisms extend from single cell organisms, to plants and animals, including humans. The atmospheric environment includes climate and weather, electromagnetic radiation, and chemical and biological pollutants. The journal embraces basic and applied research and practical aspects such as living conditions, agriculture, forestry, and health. The journal is published for the International Society of Biometeorology, and most membership categories include a subscription to the Journal.