Daniel A. B. de Siqueira, Carlos M. P. Vaz, Flávio S. da Silva, Ednaldo J. Ferreira, E. A. Speranza, Júlio C. Franchini, Rafael Galbieri, Jean-Louis Bélot, M. de Souza, Fabiano J. Perina, Sérgio das Chagas
{"title":"利用 MODIS 植被指数时间序列的线性回归模型估算巴西塞拉多地区的棉花产量","authors":"Daniel A. B. de Siqueira, Carlos M. P. Vaz, Flávio S. da Silva, Ednaldo J. Ferreira, E. A. Speranza, Júlio C. Franchini, Rafael Galbieri, Jean-Louis Bélot, M. de Souza, Fabiano J. Perina, Sérgio das Chagas","doi":"10.3390/agriengineering6020054","DOIUrl":null,"url":null,"abstract":"Satellite remote sensing data expedite crop yield estimation, offering valuable insights for farmers’ decision making. Recent forecasting methods, particularly those utilizing machine learning algorithms like Random Forest and Artificial Neural Networks, show promise. However, challenges such as validation performances, large volume of data, and the inherent complexity and inexplicability of these models hinder their widespread adoption. This paper presents a simpler approach, employing linear regression models fitted from vegetation indices (VIs) extracted from MODIS sensor data on the Terra and Aqua satellites. The aim is to forecast cotton yields in key areas of the Brazilian Cerrado. Using data from 281 commercial production plots, models were trained (167 plots) and tested (114 plots), relating seed cotton yield to nine commonly used VIs averaged over 15-day intervals. Among the evaluated VIs, Enhanced Vegetation Index (EVI) and Triangular Vegetation Index (TVI) exhibited the lowest root mean square errors (RMSE) and the highest determination coefficients (R2). Optimal periods for in-season yield prediction fell between 90 and 105 to 135 and 150 days after sowing (DAS), corresponding to key phenological phases such as boll development, open boll, and fiber maturation, with the lowest RMSE of about 750 kg ha−1 and R2 of 0.70. The best forecasts for early crop stages were provided by models at the peaks (maximum value of the VI time series) for EVI and TVI, which occurred around 80–90 DAS. The proposed approach makes the yield predictability more inferable along the crop time series just by providing sowing dates, contour maps, and their respective VIs.","PeriodicalId":505370,"journal":{"name":"AgriEngineering","volume":"41 11‐12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Estimating Cotton Yield in the Brazilian Cerrado Using Linear Regression Models from MODIS Vegetation Index Time Series\",\"authors\":\"Daniel A. B. de Siqueira, Carlos M. P. Vaz, Flávio S. da Silva, Ednaldo J. Ferreira, E. A. Speranza, Júlio C. Franchini, Rafael Galbieri, Jean-Louis Bélot, M. de Souza, Fabiano J. Perina, Sérgio das Chagas\",\"doi\":\"10.3390/agriengineering6020054\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Satellite remote sensing data expedite crop yield estimation, offering valuable insights for farmers’ decision making. Recent forecasting methods, particularly those utilizing machine learning algorithms like Random Forest and Artificial Neural Networks, show promise. However, challenges such as validation performances, large volume of data, and the inherent complexity and inexplicability of these models hinder their widespread adoption. This paper presents a simpler approach, employing linear regression models fitted from vegetation indices (VIs) extracted from MODIS sensor data on the Terra and Aqua satellites. The aim is to forecast cotton yields in key areas of the Brazilian Cerrado. Using data from 281 commercial production plots, models were trained (167 plots) and tested (114 plots), relating seed cotton yield to nine commonly used VIs averaged over 15-day intervals. Among the evaluated VIs, Enhanced Vegetation Index (EVI) and Triangular Vegetation Index (TVI) exhibited the lowest root mean square errors (RMSE) and the highest determination coefficients (R2). Optimal periods for in-season yield prediction fell between 90 and 105 to 135 and 150 days after sowing (DAS), corresponding to key phenological phases such as boll development, open boll, and fiber maturation, with the lowest RMSE of about 750 kg ha−1 and R2 of 0.70. The best forecasts for early crop stages were provided by models at the peaks (maximum value of the VI time series) for EVI and TVI, which occurred around 80–90 DAS. The proposed approach makes the yield predictability more inferable along the crop time series just by providing sowing dates, contour maps, and their respective VIs.\",\"PeriodicalId\":505370,\"journal\":{\"name\":\"AgriEngineering\",\"volume\":\"41 11‐12\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AgriEngineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/agriengineering6020054\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AgriEngineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/agriengineering6020054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
卫星遥感数据加快了作物产量估算的速度,为农民决策提供了宝贵的见解。最近的预测方法,特别是那些利用随机森林和人工神经网络等机器学习算法的方法,显示出了良好的前景。然而,验证性能、大量数据以及这些模型固有的复杂性和不可解释性等挑战阻碍了它们的广泛应用。本文介绍了一种更简单的方法,即利用从 Terra 和 Aqua 卫星上的 MODIS 传感器数据中提取的植被指数(VI)拟合的线性回归模型。目的是预测巴西 Cerrado 主要地区的棉花产量。利用来自 281 块商业生产地块的数据,对模型进行了训练(167 块地块)和测试(114 块地块),将籽棉产量与九种常用的 VIs(平均 15 天间隔)联系起来。在评估的植被指数中,增强植被指数(EVI)和三角植被指数(TVI)的均方根误差(RMSE)最小,判定系数(R2)最高。播种后 90 天、105 天至 135 天和 150 天(DAS)是当季产量预测的最佳时期,与棉铃发育、棉铃开放和纤维成熟等关键物候期相对应,RMSE 最低,约为 750 千克/公顷,R2 为 0.70。在 EVI 和 TVI 的峰值(VI 时间序列的最大值)时,模型对作物早期阶段的预测效果最佳,这些峰值出现在 80-90 DAS 左右。所提出的方法只需提供播种日期、等值线图及其各自的植被指数,就能沿着作物时间序列推断出产量的可预测性。
Estimating Cotton Yield in the Brazilian Cerrado Using Linear Regression Models from MODIS Vegetation Index Time Series
Satellite remote sensing data expedite crop yield estimation, offering valuable insights for farmers’ decision making. Recent forecasting methods, particularly those utilizing machine learning algorithms like Random Forest and Artificial Neural Networks, show promise. However, challenges such as validation performances, large volume of data, and the inherent complexity and inexplicability of these models hinder their widespread adoption. This paper presents a simpler approach, employing linear regression models fitted from vegetation indices (VIs) extracted from MODIS sensor data on the Terra and Aqua satellites. The aim is to forecast cotton yields in key areas of the Brazilian Cerrado. Using data from 281 commercial production plots, models were trained (167 plots) and tested (114 plots), relating seed cotton yield to nine commonly used VIs averaged over 15-day intervals. Among the evaluated VIs, Enhanced Vegetation Index (EVI) and Triangular Vegetation Index (TVI) exhibited the lowest root mean square errors (RMSE) and the highest determination coefficients (R2). Optimal periods for in-season yield prediction fell between 90 and 105 to 135 and 150 days after sowing (DAS), corresponding to key phenological phases such as boll development, open boll, and fiber maturation, with the lowest RMSE of about 750 kg ha−1 and R2 of 0.70. The best forecasts for early crop stages were provided by models at the peaks (maximum value of the VI time series) for EVI and TVI, which occurred around 80–90 DAS. The proposed approach makes the yield predictability more inferable along the crop time series just by providing sowing dates, contour maps, and their respective VIs.