{"title":"Field scale wheat yield prediction using ensemble machine learning techniques","authors":"","doi":"10.1016/j.atech.2024.100543","DOIUrl":null,"url":null,"abstract":"<div><p>Wheat is crucial for global food security and plays significant role in achieving United Nations Sustainable Development Goal 2 (Zero Hunger). In India, wheat accounts for 33.5 % of total cereal production. Accurate and cost effective yield predictions are essential for maintaining food security. Wheat yield forecasting is influenced by various factors, such as genotype and environmental conditions. Among these, the effect of morpho-physiological traits in the field is important for predicting yield but hasn't been studied much using ensemble machine learning methods. This study aims to bridge this gap by evaluating 29 morpho-physiological traits to predict site specific wheat yield. Big data framework was used to develop and refine several ensemble machine learning models based on field trial datasets. The developed models are optimized to reduce errors and prevent overfitting and underfitting, boosting their predictive precision. Each model's efficiency was evaluated using performance metrics such as mean absolute error, mean absolute deviation, root mean square error, R<sup>2</sup>, and overall accuracy. The ensemble model, which combines random forest (RF) and artificial neural networks (ANN), demonstrated better performance by achieving a mean absolute percentage error of 4.65 %, and R<sup>2</sup> value of 98.48 % and 98.18 % accuracy on test data.Our results demonstrate that ensemble models combining RF with support vector regression (SVR) outperformed individual models such as RF, SVR, ANN, multivariate adaptive regression splines (MARS). These findings are a promising step towards future research focused on creating more advanced ensemble methods with finely tuned hyperparameters for improving the accuracy of large-scale wheat yield prediction.</p></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":null,"pages":null},"PeriodicalIF":6.3000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772375524001485/pdfft?md5=b0b8243a386829dc9bc102a99e8823f0&pid=1-s2.0-S2772375524001485-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375524001485","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Wheat is crucial for global food security and plays significant role in achieving United Nations Sustainable Development Goal 2 (Zero Hunger). In India, wheat accounts for 33.5 % of total cereal production. Accurate and cost effective yield predictions are essential for maintaining food security. Wheat yield forecasting is influenced by various factors, such as genotype and environmental conditions. Among these, the effect of morpho-physiological traits in the field is important for predicting yield but hasn't been studied much using ensemble machine learning methods. This study aims to bridge this gap by evaluating 29 morpho-physiological traits to predict site specific wheat yield. Big data framework was used to develop and refine several ensemble machine learning models based on field trial datasets. The developed models are optimized to reduce errors and prevent overfitting and underfitting, boosting their predictive precision. Each model's efficiency was evaluated using performance metrics such as mean absolute error, mean absolute deviation, root mean square error, R2, and overall accuracy. The ensemble model, which combines random forest (RF) and artificial neural networks (ANN), demonstrated better performance by achieving a mean absolute percentage error of 4.65 %, and R2 value of 98.48 % and 98.18 % accuracy on test data.Our results demonstrate that ensemble models combining RF with support vector regression (SVR) outperformed individual models such as RF, SVR, ANN, multivariate adaptive regression splines (MARS). These findings are a promising step towards future research focused on creating more advanced ensemble methods with finely tuned hyperparameters for improving the accuracy of large-scale wheat yield prediction.