%diag_test: a generic SAS macro for evaluating diagnostic accuracy measures for multiple diagnostic tests.

IF 3.3 3区医学 Q2 MEDICAL INFORMATICS

BMC Medical Informatics and Decision Making Pub Date : 2025-01-13 DOI:10.1186/s12911-024-02808-5

Jacques K Muthusi, Peter W Young, Frankline O Mboya, Samuel M Mwalili

{"title":"%diag_test: a generic SAS macro for evaluating diagnostic accuracy measures for multiple diagnostic tests.","authors":"Jacques K Muthusi, Peter W Young, Frankline O Mboya, Samuel M Mwalili","doi":"10.1186/s12911-024-02808-5","DOIUrl":null,"url":null,"abstract":"Background: Measures of diagnostic test accuracy provide evidence of how well a test correctly identifies or rules-out disease. Commonly used diagnostic accuracy measures (DAMs) include sensitivity and specificity, predictive values, likelihood ratios, area under the receiver operator characteristic curve (AUROC), area under precision-recall curves (AUPRC), diagnostic effectiveness (accuracy), disease prevalence, and diagnostic odds ratio (DOR) etc. Most available analysis tools perform accuracy testing for a single diagnostic test using summarized data. We developed a SAS macro for evaluating multiple diagnostic tests using individual-level data that creates a 2 × 2 summary table, AUROC and AUPRC as part of output.Methods: The SAS macro presented here is automated to reduce analysis time and transcription errors. It is simple to use as the user only needs to specify the input dataset, \"standard\" and \"test\" variables and threshold values. It creates a publication-quality output in Microsoft Word and Excel showing more than 15 different accuracy measures together with overlaid AUROC and AUPRC graphics to help the researcher in making decisions to adopt or reject diagnostic tests. Further, it provides for additional variance estimation methods other than the normal distribution approximation.Results: We tested the macro for quality control purposes by reproducing results from published work on evaluation of multiple types of dried blood spots (DBS) as an alternative for human immunodeficiency virus (HIV) viral load (VL) monitoring in resource-limited settings compared to plasma, the gold-standard. Plasma viral load reagents are costly, and blood must be prepared in a reference laboratory setting by a qualified technician. On the other hand, DBS are easy to prepare without these restrictions. This study evaluated the suitability of DBS from venous, microcapillary and direct spotting DBS, hence multiple diagnostic tests which were compared to plasma specimen. We also used the macro to reproduce results of published work on evaluating performance of multiple classification machine learning algorithms for predicting coronary artery disease.Conclusion: The SAS macro presented here is a powerful analytic tool for analyzing data from multiple diagnostic tests. The SAS programmer can modify the source code to include other diagnostic measures and variance estimation methods. By automating analysis, the macro adds value by saving analysis time, reducing transcription errors, and producing publication-quality outputs.","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"21"},"PeriodicalIF":3.3000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11730795/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-024-02808-5","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Measures of diagnostic test accuracy provide evidence of how well a test correctly identifies or rules-out disease. Commonly used diagnostic accuracy measures (DAMs) include sensitivity and specificity, predictive values, likelihood ratios, area under the receiver operator characteristic curve (AUROC), area under precision-recall curves (AUPRC), diagnostic effectiveness (accuracy), disease prevalence, and diagnostic odds ratio (DOR) etc. Most available analysis tools perform accuracy testing for a single diagnostic test using summarized data. We developed a SAS macro for evaluating multiple diagnostic tests using individual-level data that creates a 2 × 2 summary table, AUROC and AUPRC as part of output.

Methods: The SAS macro presented here is automated to reduce analysis time and transcription errors. It is simple to use as the user only needs to specify the input dataset, "standard" and "test" variables and threshold values. It creates a publication-quality output in Microsoft Word and Excel showing more than 15 different accuracy measures together with overlaid AUROC and AUPRC graphics to help the researcher in making decisions to adopt or reject diagnostic tests. Further, it provides for additional variance estimation methods other than the normal distribution approximation.

Results: We tested the macro for quality control purposes by reproducing results from published work on evaluation of multiple types of dried blood spots (DBS) as an alternative for human immunodeficiency virus (HIV) viral load (VL) monitoring in resource-limited settings compared to plasma, the gold-standard. Plasma viral load reagents are costly, and blood must be prepared in a reference laboratory setting by a qualified technician. On the other hand, DBS are easy to prepare without these restrictions. This study evaluated the suitability of DBS from venous, microcapillary and direct spotting DBS, hence multiple diagnostic tests which were compared to plasma specimen. We also used the macro to reproduce results of published work on evaluating performance of multiple classification machine learning algorithms for predicting coronary artery disease.

Conclusion: The SAS macro presented here is a powerful analytic tool for analyzing data from multiple diagnostic tests. The SAS programmer can modify the source code to include other diagnostic measures and variance estimation methods. By automating analysis, the macro adds value by saving analysis time, reducing transcription errors, and producing publication-quality outputs.

查看原文本刊更多论文

%diag_test：用于评估多个诊断测试的诊断准确性度量的通用SAS宏。

背景：诊断测试准确性的测量提供了测试如何正确识别或排除疾病的证据。常用的诊断准确度指标包括敏感性和特异性、预测值、似然比、受试者操作特征曲线下面积（AUROC）、精确召回曲线下面积（AUPRC）、诊断有效性（准确性）、患病率、诊断优势比（DOR）等。大多数可用的分析工具使用汇总数据对单个诊断测试执行准确性测试。我们开发了一个SAS宏，用于使用个人层面的数据评估多个诊断测试，该数据创建了一个2 × 2汇总表，AUROC和AUPRC作为输出的一部分。方法：本文介绍的SAS宏是自动化的，以减少分析时间和转录错误。它很容易使用，因为用户只需要指定输入数据集，“标准”和“测试”变量和阈值。它在Microsoft Word和Excel中创建出版质量的输出，显示超过15种不同的准确度测量，以及覆盖的AUROC和AUPRC图形，以帮助研究人员做出采用或拒绝诊断测试的决定。此外，它还提供了除正态分布近似之外的其他方差估计方法。结果：我们通过复制已发表的评估多种类型的干血斑（DBS）作为人类免疫缺陷病毒（HIV）病毒载量（VL）监测的替代方法在资源有限的环境下与血浆（金标准）相比的结果，对宏观进行了质量控制测试。血浆病毒载量试剂价格昂贵，血液必须由合格的技术人员在参考实验室环境中制备。另一方面，没有这些限制，星展银行很容易准备。本研究评估了静脉、微血管和直接点状DBS的适用性，从而将多种诊断试验与血浆标本进行了比较。我们还使用宏来重现已发表的关于评估用于预测冠状动脉疾病的多种分类机器学习算法性能的工作结果。结论：本文介绍的SAS宏是一个功能强大的分析工具，用于分析来自多个诊断测试的数据。SAS程序员可以修改源代码，以包含其他诊断度量和方差估计方法。通过自动化分析，宏通过节省分析时间、减少转录错误和产生出版质量的输出来增加价值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Informatics and Decision Making 医学-医学：信息

CiteScore

7.20

自引率

5.70%

发文量

297

审稿时长

1 months

期刊介绍： BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.