{"title":"Explorers vs. followers: A behavioural approach to spatial bias correction in species distribution modelling","authors":"Emy Guilbault , Panu Somervuo , Ian Renner","doi":"10.1016/j.ecolmodel.2025.111311","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, the increase in data availability through citizen science data collection has raised questions about the quality of this data. Species distribution models can be severely impacted by non-random spatial distributions of records. Multiple methods exist to correct for spatial bias and most of them imply that the sampling is uneven in space and determined by the observers’ choices of where to search for observations. Most methods for addressing sampling biases in opportunistic datasets assume that each observer behaves uniformly, which in practice may not be the case. We focus our study on a widely-used correction method, chosen for its adaptable framework, and assess its effectiveness in mitigating biases from a group of observers with varying behaviours. This method includes a covariate in the model as a bias proxy and corrects for this bias by setting this covariate equal to a constant upon prediction. We differentiate two observer behaviours: exploring and following. Under this paradigm, explorers select destinations far away from the current set of observed points, while followers choose destinations at or near one of the observed points. As such, it is worth investigating whether the current approaches to correcting for observer bias hold under varying observer behaviours, or whether a data-driven approach based on modelled observer behaviour may lead to better predictions. To do so, we developed a new software platform, obsimulator, to simulate patterns of points driven by observer behaviour. We established a correction method based on a bias incorporation approach using k-nearest neighbours. We found that including a bias covariate and setting it to a constant for prediction yields the best results and the strength of the correction differs between cohorts of observers. Additionally, the optimal number of neighbouring points and smoothing parameters depends on the ratio of explorers versus followers in the observers’ cohort.</div></div>","PeriodicalId":51043,"journal":{"name":"Ecological Modelling","volume":"510 ","pages":"Article 111311"},"PeriodicalIF":3.2000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Modelling","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0304380025002972","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, the increase in data availability through citizen science data collection has raised questions about the quality of this data. Species distribution models can be severely impacted by non-random spatial distributions of records. Multiple methods exist to correct for spatial bias and most of them imply that the sampling is uneven in space and determined by the observers’ choices of where to search for observations. Most methods for addressing sampling biases in opportunistic datasets assume that each observer behaves uniformly, which in practice may not be the case. We focus our study on a widely-used correction method, chosen for its adaptable framework, and assess its effectiveness in mitigating biases from a group of observers with varying behaviours. This method includes a covariate in the model as a bias proxy and corrects for this bias by setting this covariate equal to a constant upon prediction. We differentiate two observer behaviours: exploring and following. Under this paradigm, explorers select destinations far away from the current set of observed points, while followers choose destinations at or near one of the observed points. As such, it is worth investigating whether the current approaches to correcting for observer bias hold under varying observer behaviours, or whether a data-driven approach based on modelled observer behaviour may lead to better predictions. To do so, we developed a new software platform, obsimulator, to simulate patterns of points driven by observer behaviour. We established a correction method based on a bias incorporation approach using k-nearest neighbours. We found that including a bias covariate and setting it to a constant for prediction yields the best results and the strength of the correction differs between cohorts of observers. Additionally, the optimal number of neighbouring points and smoothing parameters depends on the ratio of explorers versus followers in the observers’ cohort.
期刊介绍:
The journal is concerned with the use of mathematical models and systems analysis for the description of ecological processes and for the sustainable management of resources. Human activity and well-being are dependent on and integrated with the functioning of ecosystems and the services they provide. We aim to understand these basic ecosystem functions using mathematical and conceptual modelling, systems analysis, thermodynamics, computer simulations, and ecological theory. This leads to a preference for process-based models embedded in theory with explicit causative agents as opposed to strictly statistical or correlative descriptions. These modelling methods can be applied to a wide spectrum of issues ranging from basic ecology to human ecology to socio-ecological systems. The journal welcomes research articles, short communications, review articles, letters to the editor, book reviews, and other communications. The journal also supports the activities of the [International Society of Ecological Modelling (ISEM)](http://www.isemna.org/).