{"title":"Applicability Area: A novel utility-based approach for evaluating predictive models, beyond discrimination.","authors":"Star Liu, Shixiong Wei, Harold P Lehmann","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Translating prediction models into practice and supporting clinicians' decision-making demand demonstration of clinical value. Existing approaches to evaluating machine learning models emphasize discriminatory power, which is only a part of the medical decision problem. We propose the Applicability Area (ApAr), a decision-analytic utility-based approach to evaluating predictive models that communicate the range of prior probability and test cutoffs for which the model has positive utility; larger ApArs suggest a broader potential use of the model. We assess ApAr with simulated datasets and with three published medical datasets. ApAr adds value beyond the typical area under the receiver operating characteristic curve (AUROC) metric analysis. As an example, in the diabetes dataset, the top model by ApAr was ranked as the 23<sup>rd</sup> best model by AUROC. Decision makers looking to adopt and implement models can leverage ApArs to assess if the local range of priors and utilities is within the respective ApArs.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2023 ","pages":"494-503"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10785877/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AMIA ... Annual Symposium proceedings. AMIA Symposium","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Translating prediction models into practice and supporting clinicians' decision-making demand demonstration of clinical value. Existing approaches to evaluating machine learning models emphasize discriminatory power, which is only a part of the medical decision problem. We propose the Applicability Area (ApAr), a decision-analytic utility-based approach to evaluating predictive models that communicate the range of prior probability and test cutoffs for which the model has positive utility; larger ApArs suggest a broader potential use of the model. We assess ApAr with simulated datasets and with three published medical datasets. ApAr adds value beyond the typical area under the receiver operating characteristic curve (AUROC) metric analysis. As an example, in the diabetes dataset, the top model by ApAr was ranked as the 23rd best model by AUROC. Decision makers looking to adopt and implement models can leverage ApArs to assess if the local range of priors and utilities is within the respective ApArs.
将预测模型转化为实践并支持临床医生的决策需要证明其临床价值。评估机器学习模型的现有方法强调判别能力,而这只是医疗决策问题的一部分。我们提出了适用性区域(Applicability Area,ApAr),这是一种基于决策分析效用的预测模型评估方法,它传达了模型具有积极效用的先验概率和测试临界值的范围;ApAr 越大,表明模型的潜在用途越广。我们利用模拟数据集和三个已发表的医学数据集对 ApAr 进行了评估。ApAr 带来的价值超出了典型的接收者工作特征曲线下面积(AUROC)度量分析。例如,在糖尿病数据集中,ApAr 的最佳模型在 AUROC 中排名第 23 位。希望采用和实施模型的决策者可以利用 ApAr 来评估本地先验和效用范围是否在各自的 ApAr 范围内。