Using Aggregated Proficiency Testing Results to Identify Systematic Error.

IF 1.9 Q3 MEDICAL LABORATORY TECHNOLOGY

Journal of Applied Laboratory Medicine Pub Date : 2025-09-15 DOI:10.1093/jalm/jfaf126

Uzay Kırbıyık, J Rex Astles

{"title":"Using Aggregated Proficiency Testing Results to Identify Systematic Error.","authors":"Uzay Kırbıyık, J Rex Astles","doi":"10.1093/jalm/jfaf126","DOIUrl":null,"url":null,"abstract":"Background: Proficiency testing (PT) should identify systematic errors, which are likely to recur. The Clinical Laboratory Improvement Amendments of 1988 (CLIA) acceptance limits (ALs) include 3 standard deviations (3SD) and concentration limits. We investigated the ability of PT to detect systematic error, especially as affected by the different AL types.Methods: We removed any ungradable, duplicate, and irregular scores from CLIA laboratory PT data from 2008 to 2018. We calculated the overall miss rate, unsatisfactory event rate, i.e., score <80 (4 of 5 correct), and event rates for score 100 to 0. We used paired t-tests and the Wilcoxon signed-rank test to compare miss rates and unsatisfactory event rates between short- and long-term PT participants. We used the binomial distribution to estimate the expected event scores under the assumption that all misses were independent (random). We compared observed event scores with their expected values as a ratio.Results: Forty thousand five hundred ninety-six laboratories produced 15 140 128 event scores for 75 analytes. The distribution of event scores was skewed toward multiple event misses (score 0-60) compared to the predicted distribution. Miss rates and unsatisfactory rates were significantly higher for short-term laboratories. Plotting the log ratio of observed vs expected rates for event scores showed that the degree of systematic effect was substantial. The magnitude of the effect was less for 3SD ALs.Conclusions: In an event, PT misses are often dependent. All ALs detected systematic error. Expressing systematic error using PT data could help to identify and remediate analytical issues.","PeriodicalId":46361,"journal":{"name":"Journal of Applied Laboratory Medicine","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Laboratory Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jalm/jfaf126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Proficiency testing (PT) should identify systematic errors, which are likely to recur. The Clinical Laboratory Improvement Amendments of 1988 (CLIA) acceptance limits (ALs) include 3 standard deviations (3SD) and concentration limits. We investigated the ability of PT to detect systematic error, especially as affected by the different AL types.

Methods: We removed any ungradable, duplicate, and irregular scores from CLIA laboratory PT data from 2008 to 2018. We calculated the overall miss rate, unsatisfactory event rate, i.e., score <80 (4 of 5 correct), and event rates for score 100 to 0. We used paired t-tests and the Wilcoxon signed-rank test to compare miss rates and unsatisfactory event rates between short- and long-term PT participants. We used the binomial distribution to estimate the expected event scores under the assumption that all misses were independent (random). We compared observed event scores with their expected values as a ratio.

Results: Forty thousand five hundred ninety-six laboratories produced 15 140 128 event scores for 75 analytes. The distribution of event scores was skewed toward multiple event misses (score 0-60) compared to the predicted distribution. Miss rates and unsatisfactory rates were significantly higher for short-term laboratories. Plotting the log ratio of observed vs expected rates for event scores showed that the degree of systematic effect was substantial. The magnitude of the effect was less for 3SD ALs.

Conclusions: In an event, PT misses are often dependent. All ALs detected systematic error. Expressing systematic error using PT data could help to identify and remediate analytical issues.

查看原文本刊更多论文

使用综合能力测试结果识别系统错误。

背景：熟练程度测试（PT）应该识别可能再次发生的系统性错误。1988年临床实验室改进修订（CLIA）可接受限度（ALs）包括3个标准差（3SD）和浓度限值。我们研究了PT检测系统误差的能力，特别是在不同AL类型的影响下。方法：从2008年至2018年的CLIA实验室PT数据中剔除所有不可评分、重复和不规则的分数。我们计算了总体漏检率，不满意事件率，即得分。结果：4万596个实验室对75个分析物产生了15 140 128个事件得分。与预测分布相比，事件得分的分布倾向于多次事件失误（得分0-60）。短期实验室的漏检率和不合格率明显较高。绘制事件得分的观察率与预期率的对数比表明，系统效应的程度是实质性的。3SD ALs的影响程度较小。结论：在某种情况下，PT漏诊通常是依赖的。所有人工智能检测到系统错误。使用PT数据表达系统错误有助于识别和纠正分析问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Applied Laboratory Medicine MEDICAL LABORATORY TECHNOLOGY-

CiteScore

3.70

自引率

5.00%

发文量

137