Ka Keung Chan , Suhhyun Kim , Patrick C. Mathias , Jane Dickerson , Hsuan-Chieh Liao
{"title":"Improving laboratory stewardship through benchmarking: A focus on thyroid function tests ordering","authors":"Ka Keung Chan , Suhhyun Kim , Patrick C. Mathias , Jane Dickerson , Hsuan-Chieh Liao","doi":"10.1016/j.clinbiochem.2025.110952","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>Thyroid function tests, particularly thyroid-stimulating hormone (TSH) and free thyroxine (fT4), are among the most frequently ordered laboratory tests. This study evaluates longitudinal trends in thyroid testing utilization, assesses the impact of TSH-first reflex testing, and examines specialty-specific differences in test ordering patterns under the PLUGS (Patient-Centered Laboratory Utilization Guidance Services) benchmarking guidelines.</div></div><div><h3>Methods</h3><div>We analyzed TSH and fT4 test volumes from the year 2009 to 2023. A TSH reflex algorithm, which automatically orders fT4 following an abnormal TSH, was implemented in 2016. The metrics were evaluated using the TSH to fT4 ratio (TSH/fT4) and the percentage of fT4 tests associated with abnormal TSH. Specialty-specific differences were assessed by categorizing providers’ specialty, comparing benchmark patterns and diagnostic codes.</div></div><div><h3>Results</h3><div>Both TSH and fT4 testing volume increased significantly from 2009 to 2023. After the implementation of TSH-first reflex algorithm, TSH/fT4 stabilized (∼3.7) and the percentage of fT4 tests associated with abnormal TSH increased significantly from 26 % to 39 %. Providers from primary care demonstrated stronger adherence to the reflex testing, whereas endocrinology and oncology required more tailored approaches.</div></div><div><h3>Conclusions</h3><div>This first longitudinal analysis of thyroid testing under the PLUGS benchmark guidelines highlights improved efficiency following reflex test implementation, though specialty-specific differences persist. Consensus-based benchmarks and laboratory stewardship initiatives further enhance test utilization efficiency, optimize workflow, and ensure appropriate thyroid function testing across diverse clinical settings.</div></div>","PeriodicalId":10172,"journal":{"name":"Clinical biochemistry","volume":"138 ","pages":"Article 110952"},"PeriodicalIF":2.1000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical biochemistry","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009912025000815","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives
Thyroid function tests, particularly thyroid-stimulating hormone (TSH) and free thyroxine (fT4), are among the most frequently ordered laboratory tests. This study evaluates longitudinal trends in thyroid testing utilization, assesses the impact of TSH-first reflex testing, and examines specialty-specific differences in test ordering patterns under the PLUGS (Patient-Centered Laboratory Utilization Guidance Services) benchmarking guidelines.
Methods
We analyzed TSH and fT4 test volumes from the year 2009 to 2023. A TSH reflex algorithm, which automatically orders fT4 following an abnormal TSH, was implemented in 2016. The metrics were evaluated using the TSH to fT4 ratio (TSH/fT4) and the percentage of fT4 tests associated with abnormal TSH. Specialty-specific differences were assessed by categorizing providers’ specialty, comparing benchmark patterns and diagnostic codes.
Results
Both TSH and fT4 testing volume increased significantly from 2009 to 2023. After the implementation of TSH-first reflex algorithm, TSH/fT4 stabilized (∼3.7) and the percentage of fT4 tests associated with abnormal TSH increased significantly from 26 % to 39 %. Providers from primary care demonstrated stronger adherence to the reflex testing, whereas endocrinology and oncology required more tailored approaches.
Conclusions
This first longitudinal analysis of thyroid testing under the PLUGS benchmark guidelines highlights improved efficiency following reflex test implementation, though specialty-specific differences persist. Consensus-based benchmarks and laboratory stewardship initiatives further enhance test utilization efficiency, optimize workflow, and ensure appropriate thyroid function testing across diverse clinical settings.
期刊介绍:
Clinical Biochemistry publishes articles relating to clinical chemistry, molecular biology and genetics, therapeutic drug monitoring and toxicology, laboratory immunology and laboratory medicine in general, with the focus on analytical and clinical investigation of laboratory tests in humans used for diagnosis, prognosis, treatment and therapy, and monitoring of disease.