Considerations for fitting occupancy models to data from eBird and similar volunteer-collected data

Wesley M Hochachka, Viviana Ruiz Gutierrez, Alison Johnston
{"title":"Considerations for fitting occupancy models to data from eBird and similar volunteer-collected data","authors":"Wesley M Hochachka, Viviana Ruiz Gutierrez, Alison Johnston","doi":"10.1093/ornithology/ukad035","DOIUrl":null,"url":null,"abstract":"An occupancy model makes use of data that are structured as sets of repeated visits to each of many sites, in order estimate the actual probability of occupancy (i.e., proportion of occupied sites) after correcting for imperfect detection using the information contained in the sets of repeated observations. We explore the conditions under which preexisting, volunteer-collected data from the citizen science project eBird can be used for fitting occupancy models. Because the majority of eBird’s data are not collected in the form of repeated observations at individual locations, we explore two ways in which the single-visit records could be used in occupancy models. First, we assess the potential for space-for-time substitution: aggregating single-visit records from different locations within a region into pseudo-repeat visits. On average, eBird’s observers did not make their observations at locations that were representative of the habitat in the surrounding area, which would lead to biased estimates of occupancy probabilities when using space-for-time substitution. Thus, the use of space-for-time substitution is not always appropriate. Second, we explored the utility of including data from single-visit records to supplement sets of repeated-visit data. In a simulation study we found that inclusion of single-visit records increased the precision of occupancy estimates, but only when detection probabilities are high. When detection probability was low, the addition of single-visit records exacerbated biases in estimates of occupancy probability. We conclude that subsets of data from eBird, and likely from similar projects, can be used for occupancy modeling either using space-for-time substation or supplementing repeated-visit data with data from single-visit records. The appropriateness of either alternative will depend on the goals of a study and on the probabilities of detection and occupancy of the species of interest.","PeriodicalId":501265,"journal":{"name":"The Auk","volume":"9 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Auk","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/ornithology/ukad035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

An occupancy model makes use of data that are structured as sets of repeated visits to each of many sites, in order estimate the actual probability of occupancy (i.e., proportion of occupied sites) after correcting for imperfect detection using the information contained in the sets of repeated observations. We explore the conditions under which preexisting, volunteer-collected data from the citizen science project eBird can be used for fitting occupancy models. Because the majority of eBird’s data are not collected in the form of repeated observations at individual locations, we explore two ways in which the single-visit records could be used in occupancy models. First, we assess the potential for space-for-time substitution: aggregating single-visit records from different locations within a region into pseudo-repeat visits. On average, eBird’s observers did not make their observations at locations that were representative of the habitat in the surrounding area, which would lead to biased estimates of occupancy probabilities when using space-for-time substitution. Thus, the use of space-for-time substitution is not always appropriate. Second, we explored the utility of including data from single-visit records to supplement sets of repeated-visit data. In a simulation study we found that inclusion of single-visit records increased the precision of occupancy estimates, but only when detection probabilities are high. When detection probability was low, the addition of single-visit records exacerbated biases in estimates of occupancy probability. We conclude that subsets of data from eBird, and likely from similar projects, can be used for occupancy modeling either using space-for-time substation or supplementing repeated-visit data with data from single-visit records. The appropriateness of either alternative will depend on the goals of a study and on the probabilities of detection and occupancy of the species of interest.
将入住率模型拟合到eBird数据和类似志愿者收集的数据的考虑
占用模型使用数据,这些数据被结构为对许多站点中的每个站点的重复访问集,以便在使用重复观察集中包含的信息纠正不完善的检测后估计占用的实际概率(即占用站点的比例)。我们探索了公民科学项目eBird中预先存在的志愿者收集的数据可以用于拟合占用模型的条件。由于大多数eBird的数据不是以在单个地点重复观察的形式收集的,因此我们探索了两种将单次访问记录用于占用模型的方法。首先,我们评估了空间-时间替代的潜力:将来自一个区域内不同位置的单次访问记录聚合为伪重复访问。平均而言,eBird的观测者并没有在周围地区具有代表性的栖息地进行观测,这将导致在使用空间-时间替代时对占用概率的估计有偏差。因此,使用空间替换时间的方法并不总是合适的。其次,我们探索了将单次访问记录中的数据用于补充重复访问数据集的效用。在模拟研究中,我们发现单次访问记录的包含增加了入住率估计的精度,但只有在检测概率很高的情况下。当检测概率较低时,单次就诊记录的增加加剧了入住概率估计的偏差。我们得出的结论是,来自eBird的数据子集,以及可能来自类似项目的数据子集,可以用于占用建模,或者使用时空变电站,或者用单次访问记录的数据补充重复访问数据。任何一种选择的适当性将取决于研究的目标以及感兴趣的物种被发现和占有的概率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信