Sebastian Vogel , Mandy Gebbers , Ingmar Schröter , Wolfgang Schwanghart , Eric Bönecke , Jörg Rühlmann , Eckart Kramer , Robin Gebbers
{"title":"对原位土壤pH传感器数据的独立校准:空间和时间接近的相关性,样本量和数据传播对校准模型性能的影响","authors":"Sebastian Vogel , Mandy Gebbers , Ingmar Schröter , Wolfgang Schwanghart , Eric Bönecke , Jörg Rühlmann , Eckart Kramer , Robin Gebbers","doi":"10.1016/j.geoderma.2025.117261","DOIUrl":null,"url":null,"abstract":"<div><div>Proximal soil sensing data usually require calibration to extract meaningful information, e.g. for precision agriculture. For optimal application, calibration modeling is typically based on reference data collected from field-wise sampling. However, this approach is neither time- nor cost-effective and hinders the adoption of proximal soil sensing in practical agriculture. Thus, minimizing calibration sampling is essential.</div><div>For the case of pH mapping in precision agriculture, we have investigated whether calibration efforts can be reduced by reference sampling, which is less dependent on location and timing. This study utilized 612 sensor and standard lab pH measurements from 62 fields of two farms in Northeast Germany. We assess the effects of spatial and temporal proximity, sample size and data spread on the performance of univariate linear regression models for pH sensor data calibration. Moreover, the performance of site-independent calibration was analyzed for five domain levels: (i) regional, (ii) farm, (iii) spatial neighborhood (NH), (iv) temporal NH and (v) field. This multi-level approach aims at assessing the generalizability of calibration models.</div><div>Partial correlation analysis revealed that sample size is the most crucial factor for optimizing model performance. Field-wise calibration yielded a 34 % lower mean RMSE than regional, farm, or neighborhood models but required the most reference samples. In contrast, neighborhood models reduced sample size by approximately 74 %, farm models by 87 % and the regional model by 86 %, while achieving similar RMSEs. Thus, farm or regional models can significantly reduce sampling effort and costs in practical precision agriculture. The spread of the pH value was the next important factor influencing model performance. The pH range in the sensor measurements selected for calibration should be larger than 1 unit to obtain good quality calibration models. Finally, spatial and temporal dispersion had the least effect on calibration performance.</div></div>","PeriodicalId":12511,"journal":{"name":"Geoderma","volume":"456 ","pages":"Article 117261"},"PeriodicalIF":5.6000,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards site-independent calibration of in situ soil pH sensor data: Relevance of spatial and temporal proximity, sample size and data spread for calibration model performance\",\"authors\":\"Sebastian Vogel , Mandy Gebbers , Ingmar Schröter , Wolfgang Schwanghart , Eric Bönecke , Jörg Rühlmann , Eckart Kramer , Robin Gebbers\",\"doi\":\"10.1016/j.geoderma.2025.117261\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Proximal soil sensing data usually require calibration to extract meaningful information, e.g. for precision agriculture. For optimal application, calibration modeling is typically based on reference data collected from field-wise sampling. However, this approach is neither time- nor cost-effective and hinders the adoption of proximal soil sensing in practical agriculture. Thus, minimizing calibration sampling is essential.</div><div>For the case of pH mapping in precision agriculture, we have investigated whether calibration efforts can be reduced by reference sampling, which is less dependent on location and timing. This study utilized 612 sensor and standard lab pH measurements from 62 fields of two farms in Northeast Germany. We assess the effects of spatial and temporal proximity, sample size and data spread on the performance of univariate linear regression models for pH sensor data calibration. Moreover, the performance of site-independent calibration was analyzed for five domain levels: (i) regional, (ii) farm, (iii) spatial neighborhood (NH), (iv) temporal NH and (v) field. This multi-level approach aims at assessing the generalizability of calibration models.</div><div>Partial correlation analysis revealed that sample size is the most crucial factor for optimizing model performance. Field-wise calibration yielded a 34 % lower mean RMSE than regional, farm, or neighborhood models but required the most reference samples. In contrast, neighborhood models reduced sample size by approximately 74 %, farm models by 87 % and the regional model by 86 %, while achieving similar RMSEs. Thus, farm or regional models can significantly reduce sampling effort and costs in practical precision agriculture. The spread of the pH value was the next important factor influencing model performance. The pH range in the sensor measurements selected for calibration should be larger than 1 unit to obtain good quality calibration models. Finally, spatial and temporal dispersion had the least effect on calibration performance.</div></div>\",\"PeriodicalId\":12511,\"journal\":{\"name\":\"Geoderma\",\"volume\":\"456 \",\"pages\":\"Article 117261\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2025-03-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geoderma\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0016706125000990\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SOIL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoderma","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016706125000990","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
Towards site-independent calibration of in situ soil pH sensor data: Relevance of spatial and temporal proximity, sample size and data spread for calibration model performance
Proximal soil sensing data usually require calibration to extract meaningful information, e.g. for precision agriculture. For optimal application, calibration modeling is typically based on reference data collected from field-wise sampling. However, this approach is neither time- nor cost-effective and hinders the adoption of proximal soil sensing in practical agriculture. Thus, minimizing calibration sampling is essential.
For the case of pH mapping in precision agriculture, we have investigated whether calibration efforts can be reduced by reference sampling, which is less dependent on location and timing. This study utilized 612 sensor and standard lab pH measurements from 62 fields of two farms in Northeast Germany. We assess the effects of spatial and temporal proximity, sample size and data spread on the performance of univariate linear regression models for pH sensor data calibration. Moreover, the performance of site-independent calibration was analyzed for five domain levels: (i) regional, (ii) farm, (iii) spatial neighborhood (NH), (iv) temporal NH and (v) field. This multi-level approach aims at assessing the generalizability of calibration models.
Partial correlation analysis revealed that sample size is the most crucial factor for optimizing model performance. Field-wise calibration yielded a 34 % lower mean RMSE than regional, farm, or neighborhood models but required the most reference samples. In contrast, neighborhood models reduced sample size by approximately 74 %, farm models by 87 % and the regional model by 86 %, while achieving similar RMSEs. Thus, farm or regional models can significantly reduce sampling effort and costs in practical precision agriculture. The spread of the pH value was the next important factor influencing model performance. The pH range in the sensor measurements selected for calibration should be larger than 1 unit to obtain good quality calibration models. Finally, spatial and temporal dispersion had the least effect on calibration performance.
期刊介绍:
Geoderma - the global journal of soil science - welcomes authors, readers and soil research from all parts of the world, encourages worldwide soil studies, and embraces all aspects of soil science and its associated pedagogy. The journal particularly welcomes interdisciplinary work focusing on dynamic soil processes and functions across space and time.