Preprocessing and regression approaches alter the spectral estimation accuracy of plant phosphorus content—A three-level meta-analysis

IF 8.9 1区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY
Tianli Wang , Yi Zhang , Fei Li , Ning Cao
{"title":"Preprocessing and regression approaches alter the spectral estimation accuracy of plant phosphorus content—A three-level meta-analysis","authors":"Tianli Wang ,&nbsp;Yi Zhang ,&nbsp;Fei Li ,&nbsp;Ning Cao","doi":"10.1016/j.compag.2025.110205","DOIUrl":null,"url":null,"abstract":"<div><div>Remote sensing technology and machine learning methods are being scaled up globally to predict nutrient content based on spectral data. However, there is a lack of rigorous comparison of co-benefit delivery across different factors, which leads to unstable accuracy of the final model owing to insufficient analysis of the factors influencing the prediction model. In particular, for nutrients (e.g. phosphorus), visual symptoms are not obvious or have a certain lag. Therefore, a Three-Level Meta-Analysis model was proposed in this study to extract and analyse a large number of studies, delving into the analysis of various influencing factors and filling the current knowledge gap. Through global synthesis, a Three-Level Meta-Analysis was applied to seven validated datasets of field observations from multispectral remote sensing, including 32 effect sizes, and 46 datasets of field observations from hyperspectral remote sensing, including 630 effect sizes. We thoroughly explored the heterogeneity of a Three-Level Meta-Analysis using the new machine learning method Meta-Forest, while also using Meta-Cart to explore the interaction effects between moderating variables. Through a comprehensive analysis of the literature published over the past 25 years, we determined the importance of matching preprocessing and regression methods for predicting plant phosphorus spectral responses. The combination of pretreatment and regression methods is particularly important for regional-scale phosphorus concentration prediction. Baseline calibration is effective in removing background noise at the regional scale; however, it cannot solve the problem of redundancy between hyperspectral data. It is necessary to combine a regression method that can effectively deal with redundancy between data to improve the accuracy of the model. Nonlinear non-parametric regression can better deal with the complex nonlinear relationship between phosphorus concentration and spectral data and can resist the influence of the quantity and quality of the data itself and the heterogeneity of the study area; therefore, it has excellent prediction ability. The type of spectrometer is crucial for predicting regional phosphorus concentrations using multispectral data, especially when collecting data using drones. This study provides guidance for fully utilising spectral data and establishing a fast, efficient, and non-destructive prediction model for plant P concentrations, revealing the optimal selection of data preprocessing and regression methods.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"234 ","pages":"Article 110205"},"PeriodicalIF":8.9000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925003114","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Remote sensing technology and machine learning methods are being scaled up globally to predict nutrient content based on spectral data. However, there is a lack of rigorous comparison of co-benefit delivery across different factors, which leads to unstable accuracy of the final model owing to insufficient analysis of the factors influencing the prediction model. In particular, for nutrients (e.g. phosphorus), visual symptoms are not obvious or have a certain lag. Therefore, a Three-Level Meta-Analysis model was proposed in this study to extract and analyse a large number of studies, delving into the analysis of various influencing factors and filling the current knowledge gap. Through global synthesis, a Three-Level Meta-Analysis was applied to seven validated datasets of field observations from multispectral remote sensing, including 32 effect sizes, and 46 datasets of field observations from hyperspectral remote sensing, including 630 effect sizes. We thoroughly explored the heterogeneity of a Three-Level Meta-Analysis using the new machine learning method Meta-Forest, while also using Meta-Cart to explore the interaction effects between moderating variables. Through a comprehensive analysis of the literature published over the past 25 years, we determined the importance of matching preprocessing and regression methods for predicting plant phosphorus spectral responses. The combination of pretreatment and regression methods is particularly important for regional-scale phosphorus concentration prediction. Baseline calibration is effective in removing background noise at the regional scale; however, it cannot solve the problem of redundancy between hyperspectral data. It is necessary to combine a regression method that can effectively deal with redundancy between data to improve the accuracy of the model. Nonlinear non-parametric regression can better deal with the complex nonlinear relationship between phosphorus concentration and spectral data and can resist the influence of the quantity and quality of the data itself and the heterogeneity of the study area; therefore, it has excellent prediction ability. The type of spectrometer is crucial for predicting regional phosphorus concentrations using multispectral data, especially when collecting data using drones. This study provides guidance for fully utilising spectral data and establishing a fast, efficient, and non-destructive prediction model for plant P concentrations, revealing the optimal selection of data preprocessing and regression methods.
预处理和回归方法改变了植物磷含量光谱估计的精度——一个三水平元分析
遥感技术和机器学习方法正在全球范围内推广,以根据光谱数据预测营养成分。然而,由于缺乏对不同因素之间的协同效益交付进行严格的比较,导致由于对影响预测模型的因素分析不足,最终模型的准确性不稳定。特别是对于营养物(如磷),视觉症状不明显或有一定的滞后性。因此,本研究提出了一个三层次元分析模型,提取和分析大量的研究,深入分析各种影响因素,填补目前的知识空白。通过全局综合,对7个经验证的多光谱遥感野外观测数据集(32个效应量)和46个经验证的高光谱遥感野外观测数据集(630个效应量)进行三水平meta分析。我们使用新的机器学习方法Meta-Forest深入探讨了三层元分析的异质性,同时也使用Meta-Cart来探索调节变量之间的相互作用效应。通过对过去25年发表的文献的综合分析,我们确定了预处理和回归匹配方法在预测植物磷光谱响应中的重要性。预处理与回归相结合的方法对区域尺度的磷浓度预测尤为重要。基线校正能有效去除区域尺度的背景噪音;但是,它不能解决高光谱数据之间的冗余问题。需要结合一种能够有效处理数据间冗余的回归方法来提高模型的准确性。非线性非参数回归能够较好地处理磷浓度与光谱数据之间复杂的非线性关系,能够抵抗数据本身的数量和质量以及研究区域异质性的影响;因此,它具有出色的预测能力。光谱仪的类型对于使用多光谱数据预测区域磷浓度至关重要,特别是在使用无人机收集数据时。本研究为充分利用光谱数据,建立快速、高效、无损的植物磷浓度预测模型,揭示数据预处理和回归方法的最佳选择提供指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers and Electronics in Agriculture
Computers and Electronics in Agriculture 工程技术-计算机:跨学科应用
CiteScore
15.30
自引率
14.50%
发文量
800
审稿时长
62 days
期刊介绍: Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信