Estimating Cotton Yield in the Brazilian Cerrado Using Linear Regression Models from MODIS Vegetation Index Time Series

Daniel A. B. de Siqueira, Carlos M. P. Vaz, Flávio S. da Silva, Ednaldo J. Ferreira, E. A. Speranza, Júlio C. Franchini, Rafael Galbieri, Jean-Louis Bélot, M. de Souza, Fabiano J. Perina, Sérgio das Chagas
{"title":"Estimating Cotton Yield in the Brazilian Cerrado Using Linear Regression Models from MODIS Vegetation Index Time Series","authors":"Daniel A. B. de Siqueira, Carlos M. P. Vaz, Flávio S. da Silva, Ednaldo J. Ferreira, E. A. Speranza, Júlio C. Franchini, Rafael Galbieri, Jean-Louis Bélot, M. de Souza, Fabiano J. Perina, Sérgio das Chagas","doi":"10.3390/agriengineering6020054","DOIUrl":null,"url":null,"abstract":"Satellite remote sensing data expedite crop yield estimation, offering valuable insights for farmers’ decision making. Recent forecasting methods, particularly those utilizing machine learning algorithms like Random Forest and Artificial Neural Networks, show promise. However, challenges such as validation performances, large volume of data, and the inherent complexity and inexplicability of these models hinder their widespread adoption. This paper presents a simpler approach, employing linear regression models fitted from vegetation indices (VIs) extracted from MODIS sensor data on the Terra and Aqua satellites. The aim is to forecast cotton yields in key areas of the Brazilian Cerrado. Using data from 281 commercial production plots, models were trained (167 plots) and tested (114 plots), relating seed cotton yield to nine commonly used VIs averaged over 15-day intervals. Among the evaluated VIs, Enhanced Vegetation Index (EVI) and Triangular Vegetation Index (TVI) exhibited the lowest root mean square errors (RMSE) and the highest determination coefficients (R2). Optimal periods for in-season yield prediction fell between 90 and 105 to 135 and 150 days after sowing (DAS), corresponding to key phenological phases such as boll development, open boll, and fiber maturation, with the lowest RMSE of about 750 kg ha−1 and R2 of 0.70. The best forecasts for early crop stages were provided by models at the peaks (maximum value of the VI time series) for EVI and TVI, which occurred around 80–90 DAS. The proposed approach makes the yield predictability more inferable along the crop time series just by providing sowing dates, contour maps, and their respective VIs.","PeriodicalId":505370,"journal":{"name":"AgriEngineering","volume":"41 11‐12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AgriEngineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/agriengineering6020054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Satellite remote sensing data expedite crop yield estimation, offering valuable insights for farmers’ decision making. Recent forecasting methods, particularly those utilizing machine learning algorithms like Random Forest and Artificial Neural Networks, show promise. However, challenges such as validation performances, large volume of data, and the inherent complexity and inexplicability of these models hinder their widespread adoption. This paper presents a simpler approach, employing linear regression models fitted from vegetation indices (VIs) extracted from MODIS sensor data on the Terra and Aqua satellites. The aim is to forecast cotton yields in key areas of the Brazilian Cerrado. Using data from 281 commercial production plots, models were trained (167 plots) and tested (114 plots), relating seed cotton yield to nine commonly used VIs averaged over 15-day intervals. Among the evaluated VIs, Enhanced Vegetation Index (EVI) and Triangular Vegetation Index (TVI) exhibited the lowest root mean square errors (RMSE) and the highest determination coefficients (R2). Optimal periods for in-season yield prediction fell between 90 and 105 to 135 and 150 days after sowing (DAS), corresponding to key phenological phases such as boll development, open boll, and fiber maturation, with the lowest RMSE of about 750 kg ha−1 and R2 of 0.70. The best forecasts for early crop stages were provided by models at the peaks (maximum value of the VI time series) for EVI and TVI, which occurred around 80–90 DAS. The proposed approach makes the yield predictability more inferable along the crop time series just by providing sowing dates, contour maps, and their respective VIs.
利用 MODIS 植被指数时间序列的线性回归模型估算巴西塞拉多地区的棉花产量
卫星遥感数据加快了作物产量估算的速度,为农民决策提供了宝贵的见解。最近的预测方法,特别是那些利用随机森林和人工神经网络等机器学习算法的方法,显示出了良好的前景。然而,验证性能、大量数据以及这些模型固有的复杂性和不可解释性等挑战阻碍了它们的广泛应用。本文介绍了一种更简单的方法,即利用从 Terra 和 Aqua 卫星上的 MODIS 传感器数据中提取的植被指数(VI)拟合的线性回归模型。目的是预测巴西 Cerrado 主要地区的棉花产量。利用来自 281 块商业生产地块的数据,对模型进行了训练(167 块地块)和测试(114 块地块),将籽棉产量与九种常用的 VIs(平均 15 天间隔)联系起来。在评估的植被指数中,增强植被指数(EVI)和三角植被指数(TVI)的均方根误差(RMSE)最小,判定系数(R2)最高。播种后 90 天、105 天至 135 天和 150 天(DAS)是当季产量预测的最佳时期,与棉铃发育、棉铃开放和纤维成熟等关键物候期相对应,RMSE 最低,约为 750 千克/公顷,R2 为 0.70。在 EVI 和 TVI 的峰值(VI 时间序列的最大值)时,模型对作物早期阶段的预测效果最佳,这些峰值出现在 80-90 DAS 左右。所提出的方法只需提供播种日期、等值线图及其各自的植被指数,就能沿着作物时间序列推断出产量的可预测性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信