在高空间密度数据中过滤异常值以提高地图可靠性的方法

IF 2.6 3区农林科学 Q1 Agricultural and Biological Sciences

Scientia Agricola Pub Date : 2022-01-01 DOI:10.1590/1678-992X-2020-0178

Leonardo Felipe Maldaner, J. Molin, M. Spekken

{"title":"在高空间密度数据中过滤异常值以提高地图可靠性的方法","authors":"Leonardo Felipe Maldaner, J. Molin, M. Spekken","doi":"10.1590/1678-992X-2020-0178","DOIUrl":null,"url":null,"abstract":"ABSTRACT The considerable volume of data generated by sensors in the field presents systematic errors; thus, it is extremely important to exclude these errors to ensure mapping quality. The objective of this research was to develop and test a methodology to identify and exclude outliers in high-density spatial data sets, determine whether the developed filter process could help decrease the nugget effect and improve the spatial variability characterization of high sampling data. We created a filter composed of a global, anisotropic, and an anisotropic local analysis of data, which considered the respective neighborhood values. For that purpose, we used the median to classify a given spatial point into the data set as the main statistical parameter and took into account its neighbors within a radius. The filter was tested using raw data sets of corn yield, soil electrical conductivity (ECa), and the sensor vegetation index (SVI) in sugarcane. The results showed an improvement in accuracy of spatial variability within the data sets. The methodology reduced RMSE by 85 %, 97 %, and 79 % in corn yield, soil ECa, and SVI respectively, compared to interpolation errors of raw data sets. The filter excluded the local outliers, which considerably reduced the nugget effects, reducing estimation error of the interpolated data. The methodology proposed in this work had a better performance in removing outlier data when compared to two other methodologies from the literature.","PeriodicalId":49559,"journal":{"name":"Scientia Agricola","volume":"16 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Methodology to filter out outliers in high spatial density data to improve maps reliability\",\"authors\":\"Leonardo Felipe Maldaner, J. Molin, M. Spekken\",\"doi\":\"10.1590/1678-992X-2020-0178\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT The considerable volume of data generated by sensors in the field presents systematic errors; thus, it is extremely important to exclude these errors to ensure mapping quality. The objective of this research was to develop and test a methodology to identify and exclude outliers in high-density spatial data sets, determine whether the developed filter process could help decrease the nugget effect and improve the spatial variability characterization of high sampling data. We created a filter composed of a global, anisotropic, and an anisotropic local analysis of data, which considered the respective neighborhood values. For that purpose, we used the median to classify a given spatial point into the data set as the main statistical parameter and took into account its neighbors within a radius. The filter was tested using raw data sets of corn yield, soil electrical conductivity (ECa), and the sensor vegetation index (SVI) in sugarcane. The results showed an improvement in accuracy of spatial variability within the data sets. The methodology reduced RMSE by 85 %, 97 %, and 79 % in corn yield, soil ECa, and SVI respectively, compared to interpolation errors of raw data sets. The filter excluded the local outliers, which considerably reduced the nugget effects, reducing estimation error of the interpolated data. The methodology proposed in this work had a better performance in removing outlier data when compared to two other methodologies from the literature.\",\"PeriodicalId\":49559,\"journal\":{\"name\":\"Scientia Agricola\",\"volume\":\"16 1\",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientia Agricola\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1590/1678-992X-2020-0178\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Agricultural and Biological Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientia Agricola","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1590/1678-992X-2020-0178","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}

引用次数: 8

摘要

现场传感器产生的大量数据存在系统误差;因此，排除这些误差以保证映射质量是极其重要的。本研究的目的是开发和测试一种方法来识别和排除高密度空间数据集中的异常值，确定所开发的滤波过程是否有助于降低块金效应并改善高采样数据的空间变异性特征。我们创建了一个由全局、各向异性和各向异性局部数据分析组成的过滤器，它们考虑了各自的邻域值。为此，我们使用中位数将给定的空间点作为主要统计参数分类到数据集中，并考虑其半径内的邻居。利用玉米产量、土壤电导率(ECa)和甘蔗传感器植被指数(SVI)的原始数据集对该滤波器进行了测试。结果表明，数据集内空间变异性的准确性有所提高。与原始数据集的插值误差相比，该方法将玉米产量、土壤ECa和SVI的RMSE分别降低了85%、97%和79%。该滤波器排除了局部异常点，大大降低了块金效应，降低了插值数据的估计误差。与文献中的其他两种方法相比，本研究中提出的方法在去除异常数据方面具有更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Methodology to filter out outliers in high spatial density data to improve maps reliability

ABSTRACT The considerable volume of data generated by sensors in the field presents systematic errors; thus, it is extremely important to exclude these errors to ensure mapping quality. The objective of this research was to develop and test a methodology to identify and exclude outliers in high-density spatial data sets, determine whether the developed filter process could help decrease the nugget effect and improve the spatial variability characterization of high sampling data. We created a filter composed of a global, anisotropic, and an anisotropic local analysis of data, which considered the respective neighborhood values. For that purpose, we used the median to classify a given spatial point into the data set as the main statistical parameter and took into account its neighbors within a radius. The filter was tested using raw data sets of corn yield, soil electrical conductivity (ECa), and the sensor vegetation index (SVI) in sugarcane. The results showed an improvement in accuracy of spatial variability within the data sets. The methodology reduced RMSE by 85 %, 97 %, and 79 % in corn yield, soil ECa, and SVI respectively, compared to interpolation errors of raw data sets. The filter excluded the local outliers, which considerably reduced the nugget effects, reducing estimation error of the interpolated data. The methodology proposed in this work had a better performance in removing outlier data when compared to two other methodologies from the literature.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Scientia Agricola 农林科学-农业综合

CiteScore

5.10

自引率

3.80%

发文量

审稿时长

18-36 weeks

期刊介绍： Scientia Agricola is a journal of the University of São Paulo edited at the Luiz de Queiroz campus in Piracicaba, a city in São Paulo state, southeastern Brazil. Scientia Agricola publishes original articles which contribute to the advancement of the agricultural, environmental and biological sciences.