Volunteers Sample Where Endangered Bumble Bees Occur: Model-Based Triage of Preferential Sampling in Multi-Species or Integrated Distribution Models
John D. J. Clare, Benjamin Zuckerberg, Laura A. Nunes, James Strange, Rich Hatfield, Claudio Gratton
Diversity and Distributions, volume 31, issue 5. Published 2025-05-13. DOI: 10.1111/ddi.70034
https://onlinelibrary.wiley.com/doi/10.1111/ddi.70034
Abstract
Aim
Many broad-scale ecological inventory and monitoring efforts collect multi-species (or otherwise multivariate) data under unstructured study designs. Unstructured designs are vulnerable to preferential sampling, where residual covariance between locations selected for sampling and the response variable of interest may render predictions strongly biased.
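The bias described above can be illustrated with a minimal simulation sketch. It is not the authors' code: the function names, intercepts, and the `shared_effect` parameter are assumptions chosen purely for illustration. A single unmodelled site-level variable `w` raises occurrence probability and, when `shared_effect` is non-zero, also raises the probability that a site is sampled.

```python
import math
import random


def logistic(x):
    """Inverse-logit link."""
    return 1.0 / (1.0 + math.exp(-x))


def simulate(n_sites=20000, shared_effect=1.5, seed=1):
    """Return (true mean occupancy, naive mean occupancy at sampled sites).

    All parameter values are hypothetical. shared_effect controls how
    strongly the latent variable w drives the choice of sampling sites;
    shared_effect = 0 corresponds to non-preferential sampling.
    """
    rng = random.Random(seed)
    occupied_all = []
    occupied_sampled = []
    for _ in range(n_sites):
        w = rng.gauss(0.0, 1.0)                # unmodelled latent variable
        z = rng.random() < logistic(-1.0 + w)  # true occurrence at the site
        sampled = rng.random() < logistic(-2.0 + shared_effect * w)
        occupied_all.append(z)
        if sampled:
            occupied_sampled.append(z)
    true_mean = sum(occupied_all) / len(occupied_all)
    naive_mean = sum(occupied_sampled) / len(occupied_sampled)
    return true_mean, naive_mean


if __name__ == "__main__":
    for effect in (0.0, 1.5):
        true_mean, naive_mean = simulate(shared_effect=effect)
        print(f"shared_effect={effect}: true={true_mean:.3f}, naive={naive_mean:.3f}")
```

With `shared_effect=0.0` the naive estimate from sampled sites should sit close to the true mean, whereas a positive `shared_effect` should push the naive estimate upward, because sampled sites are disproportionately those where the species is more likely to occur.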
Innovation
We extend previous work addressing preferential sampling in spatial single-species distribution models to a multivariate context. Using spatially structured latent variables to approximate residual covariance between species occurrence probabilities and sampling inclusion probabilities, we present ways to account for sampling that may be preferential to varying degrees across multiple species, across multiple datastreams for a single species (the analogous case), or both. We use simulation to explore our proposed model and present an application that delineates the distributions of 13 bumble bee species across Wisconsin, USA, and evaluates evidence for preferential sampling within 3 citizen science datastreams.
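One way to write down a shared-latent-variable structure of this kind is sketched below. The notation is an assumption made for illustration and is not taken from the paper: $i$ indexes sites, $s$ species, and $k$ datastreams; $z_{is}$ is occurrence, $r_{ik}$ indicates whether datastream $k$ sampled site $i$; $\mathbf{x}_i$ and $\mathbf{v}_i$ are covariates; and $\boldsymbol{\eta}_i$ is a vector of spatially structured latent variables.

```latex
\begin{aligned}
z_{is} &\sim \mathrm{Bernoulli}(\psi_{is}), &
\operatorname{logit}(\psi_{is}) &= \mathbf{x}_i^\top \boldsymbol{\beta}_s + \boldsymbol{\lambda}_s^\top \boldsymbol{\eta}_i, \\
r_{ik} &\sim \mathrm{Bernoulli}(\pi_{ik}), &
\operatorname{logit}(\pi_{ik}) &= \mathbf{v}_i^\top \boldsymbol{\alpha}_k + \boldsymbol{\delta}_k^\top \boldsymbol{\eta}_i .
\end{aligned}
```

Under this sketch, if the elements of $\boldsymbol{\eta}_i$ are independent with unit variance at a site, the residual covariance on the link scale between species $s$ and datastream $k$ is $\boldsymbol{\lambda}_s^\top \boldsymbol{\delta}_k$. Setting $\boldsymbol{\delta}_k = \mathbf{0}$ would correspond to a datastream that is not preferential, and differing loadings $\boldsymbol{\lambda}_s$ would let sampling be preferential to varying degrees across species.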
Main Conclusions
Simulation results suggest that our proposed model improves out-of-sample predictions of species occurrence or richness, reducing their bias, when the sampling design is preferential and the residual covariance between sampling and species occurrence exhibits spatial structure compatible with model assumptions. Empirically, volunteers appeared to sample preferentially with respect to bumble bee distributions, being more likely to sample in locations where the federally listed Bombus affinis was more likely to occur. Our approach gives practitioners a means to triage preferential sampling within increasingly popular multi-species or integrated distribution models and can be modified slightly to accommodate a variety of other response variables.
About the journal:
Diversity and Distributions is a journal of conservation biogeography. We publish papers that deal with the application of biogeographical principles, theories, and analyses (being those concerned with the distributional dynamics of taxa and assemblages) to problems concerning the conservation of biodiversity. We no longer consider papers the sole aim of which is to describe or analyze patterns of biodiversity or to elucidate processes that generate biodiversity.