Impact of obscured data on species distribution models.

Impact factor 5.2 · Zone 1 (Environmental Science and Ecology) · JCR Q1 (Biodiversity Conservation)
Kyo Soung Koo, Ko-Huan Lee, Dawon Lee, Yikweon Jang
Conservation Biology, e70050. Journal Article. Published 2025-05-28. DOI: 10.1111/cobi.70050 (https://doi.org/10.1111/cobi.70050)

Abstract

The lack of knowledge about geographic distribution and environmental preference can hinder conservation efforts for rare and threatened species. Open-source databases provide an opportunity to address these knowledge gaps through the geographic information they hold on species worldwide. However, to protect rare and endangered species, open-source databases often assign locations that do not match the original locations, which introduces inaccuracies in occurrence records (e.g., the "obscured" function in iNaturalist replaces the original location with a random location in a 0.2 × 0.2° cell). We tested the efficacy of iNaturalist's obscured function in concealing geographic information and the function's impact on the species distribution modeling of 3 endangered species in South Korea: gold-spotted pond frogs (Pelophylax chosenicus), Reeves' turtles (Mauremys reevesii), and Mongolian racerunners (Eremias argus). We collected occurrence data (original data) for these 3 species and uploaded the data to iNaturalist. We then compared location, elevation, and habitat area in the original data set with the same measures in the obscured data set. To investigate the differences in species distribution, we ran species distribution models with both data sets. We also assessed awareness of the obscured function in peer-reviewed articles that used occurrence records from iNaturalist. The locations assigned by the obscured function significantly altered the geographic information of the species, including elevational range, habitat type, and environmental variables relevant to species distribution. Potential distributions estimated using locations assigned under the obscured function were different from those estimated using the original data. Only 4 out of 170 peer-reviewed articles acknowledged the presence of obscured data in iNaturalist, suggesting that most researchers are unaware of this issue. The locations assigned by the obscured function can cause serious problems in species distribution modeling and thus may negatively affect conservation of endangered species. We encourage researchers to thoroughly vet data obtained from open-source databases and urge database platforms to make it clear when data have been obscured.
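To give a sense of scale for the obscuring step described in the abstract, the following is a minimal Python sketch of replacing a point with a random location inside a 0.2 × 0.2° cell, then measuring how far the point moved. It is an illustration under stated assumptions, not iNaturalist's implementation: the function names `obscure` and `haversine_km` are hypothetical, cells are assumed to be aligned to multiples of 0.2°, and the example coordinate is an arbitrary point in South Korea rather than a record from the study.

```python
import math
import random

CELL_DEG = 0.2  # cell size quoted in the abstract


def obscure(lat, lon, rng, cell=CELL_DEG):
    """Return a random point inside the grid cell that contains (lat, lon).

    Assumption: cells are aligned to multiples of `cell` degrees.
    """
    lat0 = math.floor(lat / cell) * cell  # southern edge of the cell
    lon0 = math.floor(lon / cell) * cell  # western edge of the cell
    return lat0 + rng.random() * cell, lon0 + rng.random() * cell


def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    """Great-circle distance in kilometres between two points in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))


rng = random.Random(0)  # fixed seed so the sketch is repeatable
true_lat, true_lon = 37.5665, 126.9780  # illustrative point only, not study data
obs_lat, obs_lon = obscure(true_lat, true_lon, rng)
shift_km = haversine_km(true_lat, true_lon, obs_lat, obs_lon)
print(f"obscured to ({obs_lat:.4f}, {obs_lon:.4f}), moved {shift_km:.1f} km")
```

At this latitude a 0.2° cell spans roughly 22 km north to south and about 18 km east to west, so under these assumptions a record can move by up to about 28 km. A shift of that size is enough to cross elevation bands or habitat boundaries, which is the kind of alteration the abstract reports.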

Source journal: Conservation Biology (Environmental Sciences)
CiteScore: 12.70
Self-citation rate: 3.20%
Publication volume: 175
Review time: 2 months
About the journal: Conservation Biology welcomes submissions that address the science and practice of conserving Earth's biological diversity. We encourage submissions that emphasize issues germane to any of Earth's ecosystems or geographic regions and that apply diverse approaches to analyses and problem solving. Nevertheless, manuscripts with relevance to conservation that transcend the particular ecosystem, species, or situation described will be prioritized for publication.