Marjon Hellegers, Arjen van Hinsberg, Jonathan Lenoir, Jürgen Dengler, Mark A. J. Huijbregts, Aafke M. Schipper
{"title":"需要多种阈值选择方法来二元化物种分布模型预测","authors":"Marjon Hellegers, Arjen van Hinsberg, Jonathan Lenoir, Jürgen Dengler, Mark A. J. Huijbregts, Aafke M. Schipper","doi":"10.1111/ddi.70019","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Aim</h3>\n \n <p>Probabilities of occurrence predicted by species distribution models (SDMs) are routinely binarised based on a threshold that maximises the true skill statistic. Recently, the true skill statistic is criticised for favouring overprediction when species' prevalence is low. We aim to assess the effect of three alternative threshold-selection methods on predicted range sizes and changes therein under climate change across a large number of species in Europe.</p>\n </section>\n \n <section>\n \n <h3> Location</h3>\n \n <p>Europe.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We fitted SDMs for 1677 vascular plant species, using species observations from 431,179 vegetation plots as response variables and climate, soil and topographic variables as predictors. We used the SDMs to quantify current and future range sizes of each species under mild and severe climate change scenarios, each combined with two dispersal assumptions (no and unlimited dispersal) and using four threshold-selection methods: maximising true skill statistic (TSS), minimising the difference between sensitivity and specificity (DSS), maximising Matthew's correlation coefficient (MCC) and maximising F-measure (F). Further, we assessed prediction errors for each threshold-selection method in relation to species prevalence.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>We found that SDM outputs binarised with TSS resulted in the largest predicted ranges and the smallest future range contractions. For current climate conditions, median range sizes were 1.3, 3.6 and 9.2 times larger when binarised with TSS than with DSS, MCC and F, respectively. Under severe climate change without dispersal, median range size declines were 56%, 61%, 79% and 88% with TSS, DSS, MCC and F, respectively. Binarisation based on TSS tended to result in the highest overprediction rates and lowest underprediction rates, while this was the opposite with F.</p>\n </section>\n \n <section>\n \n <h3> Main Conclusions</h3>\n \n <p>We recommend choosing the threshold-selection method based on the goals and scope of the study, while carefully considering the trade-offs between overprediction and underprediction. Alternatively, we recommend using multiple threshold-selection methods to quantify the uncertainty in the binarisation.</p>\n </section>\n </div>","PeriodicalId":51018,"journal":{"name":"Diversity and Distributions","volume":"31 4","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ddi.70019","citationCount":"0","resultStr":"{\"title\":\"Multiple Threshold-Selection Methods Are Needed to Binarise Species Distribution Model Predictions\",\"authors\":\"Marjon Hellegers, Arjen van Hinsberg, Jonathan Lenoir, Jürgen Dengler, Mark A. J. Huijbregts, Aafke M. Schipper\",\"doi\":\"10.1111/ddi.70019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Aim</h3>\\n \\n <p>Probabilities of occurrence predicted by species distribution models (SDMs) are routinely binarised based on a threshold that maximises the true skill statistic. Recently, the true skill statistic is criticised for favouring overprediction when species' prevalence is low. We aim to assess the effect of three alternative threshold-selection methods on predicted range sizes and changes therein under climate change across a large number of species in Europe.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Location</h3>\\n \\n <p>Europe.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>We fitted SDMs for 1677 vascular plant species, using species observations from 431,179 vegetation plots as response variables and climate, soil and topographic variables as predictors. We used the SDMs to quantify current and future range sizes of each species under mild and severe climate change scenarios, each combined with two dispersal assumptions (no and unlimited dispersal) and using four threshold-selection methods: maximising true skill statistic (TSS), minimising the difference between sensitivity and specificity (DSS), maximising Matthew's correlation coefficient (MCC) and maximising F-measure (F). Further, we assessed prediction errors for each threshold-selection method in relation to species prevalence.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>We found that SDM outputs binarised with TSS resulted in the largest predicted ranges and the smallest future range contractions. For current climate conditions, median range sizes were 1.3, 3.6 and 9.2 times larger when binarised with TSS than with DSS, MCC and F, respectively. Under severe climate change without dispersal, median range size declines were 56%, 61%, 79% and 88% with TSS, DSS, MCC and F, respectively. Binarisation based on TSS tended to result in the highest overprediction rates and lowest underprediction rates, while this was the opposite with F.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Main Conclusions</h3>\\n \\n <p>We recommend choosing the threshold-selection method based on the goals and scope of the study, while carefully considering the trade-offs between overprediction and underprediction. Alternatively, we recommend using multiple threshold-selection methods to quantify the uncertainty in the binarisation.</p>\\n </section>\\n </div>\",\"PeriodicalId\":51018,\"journal\":{\"name\":\"Diversity and Distributions\",\"volume\":\"31 4\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ddi.70019\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Diversity and Distributions\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/ddi.70019\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIODIVERSITY CONSERVATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diversity and Distributions","FirstCategoryId":"93","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/ddi.70019","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIODIVERSITY CONSERVATION","Score":null,"Total":0}
Multiple Threshold-Selection Methods Are Needed to Binarise Species Distribution Model Predictions
Aim
Probabilities of occurrence predicted by species distribution models (SDMs) are routinely binarised based on a threshold that maximises the true skill statistic. Recently, the true skill statistic is criticised for favouring overprediction when species' prevalence is low. We aim to assess the effect of three alternative threshold-selection methods on predicted range sizes and changes therein under climate change across a large number of species in Europe.
Location
Europe.
Methods
We fitted SDMs for 1677 vascular plant species, using species observations from 431,179 vegetation plots as response variables and climate, soil and topographic variables as predictors. We used the SDMs to quantify current and future range sizes of each species under mild and severe climate change scenarios, each combined with two dispersal assumptions (no and unlimited dispersal) and using four threshold-selection methods: maximising true skill statistic (TSS), minimising the difference between sensitivity and specificity (DSS), maximising Matthew's correlation coefficient (MCC) and maximising F-measure (F). Further, we assessed prediction errors for each threshold-selection method in relation to species prevalence.
Results
We found that SDM outputs binarised with TSS resulted in the largest predicted ranges and the smallest future range contractions. For current climate conditions, median range sizes were 1.3, 3.6 and 9.2 times larger when binarised with TSS than with DSS, MCC and F, respectively. Under severe climate change without dispersal, median range size declines were 56%, 61%, 79% and 88% with TSS, DSS, MCC and F, respectively. Binarisation based on TSS tended to result in the highest overprediction rates and lowest underprediction rates, while this was the opposite with F.
Main Conclusions
We recommend choosing the threshold-selection method based on the goals and scope of the study, while carefully considering the trade-offs between overprediction and underprediction. Alternatively, we recommend using multiple threshold-selection methods to quantify the uncertainty in the binarisation.
期刊介绍:
Diversity and Distributions is a journal of conservation biogeography. We publish papers that deal with the application of biogeographical principles, theories, and analyses (being those concerned with the distributional dynamics of taxa and assemblages) to problems concerning the conservation of biodiversity. We no longer consider papers the sole aim of which is to describe or analyze patterns of biodiversity or to elucidate processes that generate biodiversity.