基于集成算法建模的夏小麦叶面积指数递归特征消除——以埃塞俄比亚中部高地为例

Q2 Environmental Science
Dereje Biru , Berhan Gessesse , Gebeyehu Abebe
{"title":"基于集成算法建模的夏小麦叶面积指数递归特征消除——以埃塞俄比亚中部高地为例","authors":"Dereje Biru ,&nbsp;Berhan Gessesse ,&nbsp;Gebeyehu Abebe","doi":"10.1016/j.envc.2025.101113","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate and nondestructive monitoring of the wheat leaf area index (LAI) is important for effective agricultural management and production forecasting. However, building a high-performance predictive model faces challenges in selecting suitable machine learning algorithms and identifying important variables. This study explored the use of ensemble algorithm-based recursive feature elimination (RFE) for summer wheat LAI estimation using the Google Earth Engine (GEE) cloud computing platform. Remote sensing datasets, including Sentinel-1/2 and digital elevation models, encompassing spectral bands, vegetation indices, texture metrics, and topographic variables, were used. The preprocessing stage involved creating 136 independent variables in the GEE, whereas the LAI data were collected from 84 systematically selected samples using the ACCUPAR LP-80 Ceptometer. Further processing included feature combination, min–max normalization, extraction of the 136 independent variables to the LAI data, and data partitioning for training and testing. The RFE algorithm was applied using the random forest (RF) and gradient tree boost (GTB) algorithms within the GEE to predict the summer wheat LAI at the Lole State Farm. Model performance validation analysis was evaluated via R-squared (R<sup>2</sup>), root mean squared error (RMSE), mean squared error (MSE), and mean absolute error (MAE) statistical models. The results indicated that 49 significant variables were selected for the RFE-RF model, whereas 29 were chosen for the RFE-GTB model. The GTB model outperformed the RF model, achieving R<sup>2</sup> values of 0.968 for training and 0.88 for validation, whereas the R<sup>2</sup> values of the RF model were 0.961 for training and 0.856 for validation. The GTB model also exhibited superior accuracy, with lower RMSE, MSE, and MAE values. Additionally, a predicted LAI map for summer wheat was generated, ranging from 0.22-2.12 for the random forest model and from 0.24-2.23 for the gradient tree boost model. Overall, this study demonstrated the improvement of the learning algorithm by identifying important variables, evaluating its performance in predicting wheat LAI, and generating a map of the predicted LAI. The results offer valuable insights for the nondestructive and rapid acquisition of summer wheat LAI by employing an ensemble algorithm-based RFE and utilizing Earth observation data in the GEE.</div></div>","PeriodicalId":34794,"journal":{"name":"Environmental Challenges","volume":"19 ","pages":"Article 101113"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Recursive feature elimination for summer wheat leaf area index using ensemble algorithm-based modeling: The case of central Highland of Ethiopia\",\"authors\":\"Dereje Biru ,&nbsp;Berhan Gessesse ,&nbsp;Gebeyehu Abebe\",\"doi\":\"10.1016/j.envc.2025.101113\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate and nondestructive monitoring of the wheat leaf area index (LAI) is important for effective agricultural management and production forecasting. However, building a high-performance predictive model faces challenges in selecting suitable machine learning algorithms and identifying important variables. This study explored the use of ensemble algorithm-based recursive feature elimination (RFE) for summer wheat LAI estimation using the Google Earth Engine (GEE) cloud computing platform. Remote sensing datasets, including Sentinel-1/2 and digital elevation models, encompassing spectral bands, vegetation indices, texture metrics, and topographic variables, were used. The preprocessing stage involved creating 136 independent variables in the GEE, whereas the LAI data were collected from 84 systematically selected samples using the ACCUPAR LP-80 Ceptometer. Further processing included feature combination, min–max normalization, extraction of the 136 independent variables to the LAI data, and data partitioning for training and testing. The RFE algorithm was applied using the random forest (RF) and gradient tree boost (GTB) algorithms within the GEE to predict the summer wheat LAI at the Lole State Farm. Model performance validation analysis was evaluated via R-squared (R<sup>2</sup>), root mean squared error (RMSE), mean squared error (MSE), and mean absolute error (MAE) statistical models. The results indicated that 49 significant variables were selected for the RFE-RF model, whereas 29 were chosen for the RFE-GTB model. The GTB model outperformed the RF model, achieving R<sup>2</sup> values of 0.968 for training and 0.88 for validation, whereas the R<sup>2</sup> values of the RF model were 0.961 for training and 0.856 for validation. The GTB model also exhibited superior accuracy, with lower RMSE, MSE, and MAE values. Additionally, a predicted LAI map for summer wheat was generated, ranging from 0.22-2.12 for the random forest model and from 0.24-2.23 for the gradient tree boost model. Overall, this study demonstrated the improvement of the learning algorithm by identifying important variables, evaluating its performance in predicting wheat LAI, and generating a map of the predicted LAI. The results offer valuable insights for the nondestructive and rapid acquisition of summer wheat LAI by employing an ensemble algorithm-based RFE and utilizing Earth observation data in the GEE.</div></div>\",\"PeriodicalId\":34794,\"journal\":{\"name\":\"Environmental Challenges\",\"volume\":\"19 \",\"pages\":\"Article 101113\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-03-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Challenges\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2667010025000332\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Environmental Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Challenges","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667010025000332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Environmental Science","Score":null,"Total":0}
引用次数: 0

摘要

准确、无损地监测小麦叶面积指数对有效的农业管理和生产预测具有重要意义。然而,构建高性能的预测模型面临着选择合适的机器学习算法和识别重要变量的挑战。本研究在谷歌Earth Engine (GEE)云计算平台上,探索了基于集成算法的递归特征消去(RFE)在夏小麦LAI估计中的应用。遥感数据集包括Sentinel-1/2和数字高程模型,包括光谱带、植被指数、纹理指标和地形变量。预处理阶段包括在GEE中创建136个自变量,而LAI数据是使用ACCUPAR LP-80 Ceptometer从84个系统选择的样本中收集的。进一步的处理包括特征组合、最小-最大归一化、提取136个自变量到LAI数据,并对数据进行划分进行训练和测试。采用随机森林(RF)算法和梯度树增强(GTB)算法对Lole State Farm的夏小麦LAI进行预测。通过r平方(R2)、均方根误差(RMSE)、均方误差(MSE)和平均绝对误差(MAE)统计模型对模型性能进行验证分析。结果表明,RFE-RF模型选取了49个显著变量,而RFE-GTB模型选取了29个显著变量。GTB模型优于RF模型,训练R2为0.968,验证R2为0.88,而RF模型的训练R2为0.961,验证R2为0.856。GTB模型也具有较低的RMSE、MSE和MAE值,具有较好的准确性。此外,还生成了夏季小麦的LAI预测图,随机森林模型的预测范围为0.22-2.12,梯度树boost模型的预测范围为0.24-2.23。总体而言,本研究通过识别重要变量,评估其预测小麦LAI的性能,并生成预测LAI的地图,证明了学习算法的改进。研究结果为采用基于集成算法的RFE和利用地球观测数据在GEE中无损快速获取夏季小麦LAI提供了有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Recursive feature elimination for summer wheat leaf area index using ensemble algorithm-based modeling: The case of central Highland of Ethiopia
Accurate and nondestructive monitoring of the wheat leaf area index (LAI) is important for effective agricultural management and production forecasting. However, building a high-performance predictive model faces challenges in selecting suitable machine learning algorithms and identifying important variables. This study explored the use of ensemble algorithm-based recursive feature elimination (RFE) for summer wheat LAI estimation using the Google Earth Engine (GEE) cloud computing platform. Remote sensing datasets, including Sentinel-1/2 and digital elevation models, encompassing spectral bands, vegetation indices, texture metrics, and topographic variables, were used. The preprocessing stage involved creating 136 independent variables in the GEE, whereas the LAI data were collected from 84 systematically selected samples using the ACCUPAR LP-80 Ceptometer. Further processing included feature combination, min–max normalization, extraction of the 136 independent variables to the LAI data, and data partitioning for training and testing. The RFE algorithm was applied using the random forest (RF) and gradient tree boost (GTB) algorithms within the GEE to predict the summer wheat LAI at the Lole State Farm. Model performance validation analysis was evaluated via R-squared (R2), root mean squared error (RMSE), mean squared error (MSE), and mean absolute error (MAE) statistical models. The results indicated that 49 significant variables were selected for the RFE-RF model, whereas 29 were chosen for the RFE-GTB model. The GTB model outperformed the RF model, achieving R2 values of 0.968 for training and 0.88 for validation, whereas the R2 values of the RF model were 0.961 for training and 0.856 for validation. The GTB model also exhibited superior accuracy, with lower RMSE, MSE, and MAE values. Additionally, a predicted LAI map for summer wheat was generated, ranging from 0.22-2.12 for the random forest model and from 0.24-2.23 for the gradient tree boost model. Overall, this study demonstrated the improvement of the learning algorithm by identifying important variables, evaluating its performance in predicting wheat LAI, and generating a map of the predicted LAI. The results offer valuable insights for the nondestructive and rapid acquisition of summer wheat LAI by employing an ensemble algorithm-based RFE and utilizing Earth observation data in the GEE.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Environmental Challenges
Environmental Challenges Environmental Science-Environmental Engineering
CiteScore
8.00
自引率
0.00%
发文量
249
审稿时长
8 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信