需要多种阈值选择方法来二元化物种分布模型预测

IF 4.6 2区 环境科学与生态学 Q1 BIODIVERSITY CONSERVATION
Marjon Hellegers, Arjen van Hinsberg, Jonathan Lenoir, Jürgen Dengler, Mark A. J. Huijbregts, Aafke M. Schipper
{"title":"需要多种阈值选择方法来二元化物种分布模型预测","authors":"Marjon Hellegers,&nbsp;Arjen van Hinsberg,&nbsp;Jonathan Lenoir,&nbsp;Jürgen Dengler,&nbsp;Mark A. J. Huijbregts,&nbsp;Aafke M. Schipper","doi":"10.1111/ddi.70019","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Aim</h3>\n \n <p>Probabilities of occurrence predicted by species distribution models (SDMs) are routinely binarised based on a threshold that maximises the true skill statistic. Recently, the true skill statistic is criticised for favouring overprediction when species' prevalence is low. We aim to assess the effect of three alternative threshold-selection methods on predicted range sizes and changes therein under climate change across a large number of species in Europe.</p>\n </section>\n \n <section>\n \n <h3> Location</h3>\n \n <p>Europe.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We fitted SDMs for 1677 vascular plant species, using species observations from 431,179 vegetation plots as response variables and climate, soil and topographic variables as predictors. We used the SDMs to quantify current and future range sizes of each species under mild and severe climate change scenarios, each combined with two dispersal assumptions (no and unlimited dispersal) and using four threshold-selection methods: maximising true skill statistic (TSS), minimising the difference between sensitivity and specificity (DSS), maximising Matthew's correlation coefficient (MCC) and maximising F-measure (F). Further, we assessed prediction errors for each threshold-selection method in relation to species prevalence.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>We found that SDM outputs binarised with TSS resulted in the largest predicted ranges and the smallest future range contractions. For current climate conditions, median range sizes were 1.3, 3.6 and 9.2 times larger when binarised with TSS than with DSS, MCC and F, respectively. Under severe climate change without dispersal, median range size declines were 56%, 61%, 79% and 88% with TSS, DSS, MCC and F, respectively. Binarisation based on TSS tended to result in the highest overprediction rates and lowest underprediction rates, while this was the opposite with F.</p>\n </section>\n \n <section>\n \n <h3> Main Conclusions</h3>\n \n <p>We recommend choosing the threshold-selection method based on the goals and scope of the study, while carefully considering the trade-offs between overprediction and underprediction. Alternatively, we recommend using multiple threshold-selection methods to quantify the uncertainty in the binarisation.</p>\n </section>\n </div>","PeriodicalId":51018,"journal":{"name":"Diversity and Distributions","volume":"31 4","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ddi.70019","citationCount":"0","resultStr":"{\"title\":\"Multiple Threshold-Selection Methods Are Needed to Binarise Species Distribution Model Predictions\",\"authors\":\"Marjon Hellegers,&nbsp;Arjen van Hinsberg,&nbsp;Jonathan Lenoir,&nbsp;Jürgen Dengler,&nbsp;Mark A. J. Huijbregts,&nbsp;Aafke M. Schipper\",\"doi\":\"10.1111/ddi.70019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Aim</h3>\\n \\n <p>Probabilities of occurrence predicted by species distribution models (SDMs) are routinely binarised based on a threshold that maximises the true skill statistic. Recently, the true skill statistic is criticised for favouring overprediction when species' prevalence is low. We aim to assess the effect of three alternative threshold-selection methods on predicted range sizes and changes therein under climate change across a large number of species in Europe.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Location</h3>\\n \\n <p>Europe.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>We fitted SDMs for 1677 vascular plant species, using species observations from 431,179 vegetation plots as response variables and climate, soil and topographic variables as predictors. We used the SDMs to quantify current and future range sizes of each species under mild and severe climate change scenarios, each combined with two dispersal assumptions (no and unlimited dispersal) and using four threshold-selection methods: maximising true skill statistic (TSS), minimising the difference between sensitivity and specificity (DSS), maximising Matthew's correlation coefficient (MCC) and maximising F-measure (F). Further, we assessed prediction errors for each threshold-selection method in relation to species prevalence.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>We found that SDM outputs binarised with TSS resulted in the largest predicted ranges and the smallest future range contractions. For current climate conditions, median range sizes were 1.3, 3.6 and 9.2 times larger when binarised with TSS than with DSS, MCC and F, respectively. Under severe climate change without dispersal, median range size declines were 56%, 61%, 79% and 88% with TSS, DSS, MCC and F, respectively. Binarisation based on TSS tended to result in the highest overprediction rates and lowest underprediction rates, while this was the opposite with F.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Main Conclusions</h3>\\n \\n <p>We recommend choosing the threshold-selection method based on the goals and scope of the study, while carefully considering the trade-offs between overprediction and underprediction. Alternatively, we recommend using multiple threshold-selection methods to quantify the uncertainty in the binarisation.</p>\\n </section>\\n </div>\",\"PeriodicalId\":51018,\"journal\":{\"name\":\"Diversity and Distributions\",\"volume\":\"31 4\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ddi.70019\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Diversity and Distributions\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/ddi.70019\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIODIVERSITY CONSERVATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diversity and Distributions","FirstCategoryId":"93","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/ddi.70019","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIODIVERSITY CONSERVATION","Score":null,"Total":0}
引用次数: 0

摘要

目的 物种分布模型(SDMs)预测的出现概率通常会根据最大化真实技能统计量的阈值进行二值化。最近,真实技能统计被批评为在物种流行率较低时倾向于过高预测。我们的目的是评估三种可供选择的阈值选择方法对欧洲大量物种在气候变化下的预测分布区大小及其变化的影响。 地点:欧洲。 方法 我们以 431 179 块植被地块的物种观测数据为响应变量,以气候、土壤和地形变量为预测变量,为 1677 种维管束植物拟合了 SDM。我们使用 SDMs 量化了温和与严重气候变化情景下每个物种当前和未来的分布范围,每种情景都结合了两种扩散假设(无扩散和无限扩散),并使用了四种阈值选择方法:最大化真实技能统计量(TSS)、最小化灵敏度与特异性之间的差异(DSS)、最大化马修相关系数(MCC)和最大化 F 值(F)。此外,我们还评估了每种阈值选择方法与物种流行率相关的预测误差。 结果 我们发现,使用 TSS 进行二值化的 SDM 输出结果可得出最大的预测范围和最小的未来范围收缩。在目前的气候条件下,使用 TSS 进行二值化的中值范围大小分别是使用 DSS、MCC 和 F 的 1.3、3.6 和 9.2 倍。在无扩散的严重气候变化条件下,使用 TSS、DSS、MCC 和 F 进行二值化处理时,中位分布区大小的下降幅度分别为 56%、61%、79% 和 88%。主要结论 我们建议根据研究目标和范围选择阈值选择方法,同时仔细考虑预测过度和预测不足之间的权衡。或者,我们建议使用多种阈值选择方法来量化二值化的不确定性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Multiple Threshold-Selection Methods Are Needed to Binarise Species Distribution Model Predictions

Multiple Threshold-Selection Methods Are Needed to Binarise Species Distribution Model Predictions

Aim

Probabilities of occurrence predicted by species distribution models (SDMs) are routinely binarised based on a threshold that maximises the true skill statistic. Recently, the true skill statistic is criticised for favouring overprediction when species' prevalence is low. We aim to assess the effect of three alternative threshold-selection methods on predicted range sizes and changes therein under climate change across a large number of species in Europe.

Location

Europe.

Methods

We fitted SDMs for 1677 vascular plant species, using species observations from 431,179 vegetation plots as response variables and climate, soil and topographic variables as predictors. We used the SDMs to quantify current and future range sizes of each species under mild and severe climate change scenarios, each combined with two dispersal assumptions (no and unlimited dispersal) and using four threshold-selection methods: maximising true skill statistic (TSS), minimising the difference between sensitivity and specificity (DSS), maximising Matthew's correlation coefficient (MCC) and maximising F-measure (F). Further, we assessed prediction errors for each threshold-selection method in relation to species prevalence.

Results

We found that SDM outputs binarised with TSS resulted in the largest predicted ranges and the smallest future range contractions. For current climate conditions, median range sizes were 1.3, 3.6 and 9.2 times larger when binarised with TSS than with DSS, MCC and F, respectively. Under severe climate change without dispersal, median range size declines were 56%, 61%, 79% and 88% with TSS, DSS, MCC and F, respectively. Binarisation based on TSS tended to result in the highest overprediction rates and lowest underprediction rates, while this was the opposite with F.

Main Conclusions

We recommend choosing the threshold-selection method based on the goals and scope of the study, while carefully considering the trade-offs between overprediction and underprediction. Alternatively, we recommend using multiple threshold-selection methods to quantify the uncertainty in the binarisation.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Diversity and Distributions
Diversity and Distributions 环境科学-生态学
CiteScore
8.90
自引率
4.30%
发文量
195
审稿时长
8-16 weeks
期刊介绍: Diversity and Distributions is a journal of conservation biogeography. We publish papers that deal with the application of biogeographical principles, theories, and analyses (being those concerned with the distributional dynamics of taxa and assemblages) to problems concerning the conservation of biodiversity. We no longer consider papers the sole aim of which is to describe or analyze patterns of biodiversity or to elucidate processes that generate biodiversity.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信