Assessment of the Diagnostic Performance of a Commercially Available Artificial Intelligence Algorithm for Risk Stratification of Thyroid Nodules on Ultrasound.

IF 5.8 1区 医学 Q1 ENDOCRINOLOGY & METABOLISM
Thyroid Pub Date : 2024-11-01 Epub Date: 2024-10-15 DOI:10.1089/thy.2024.0410
Jeffrey Ashton, Samantha Morrison, Alaattin Erkanli, Benjamin Wildman-Tobriner
{"title":"Assessment of the Diagnostic Performance of a Commercially Available Artificial Intelligence Algorithm for Risk Stratification of Thyroid Nodules on Ultrasound.","authors":"Jeffrey Ashton, Samantha Morrison, Alaattin Erkanli, Benjamin Wildman-Tobriner","doi":"10.1089/thy.2024.0410","DOIUrl":null,"url":null,"abstract":"<p><p><b><i>Background:</i></b> Thyroid nodules are challenging to accurately characterize on ultrasound (US), though the emergence of risk stratification systems and more recently artificial intelligence (AI) algorithms has improved nodule classification. The purpose of this study was to evaluate the performance of a recent Food and Drug Administration (FDA)-cleared AI tool for detection of malignancy in thyroid nodules on US. <b><i>Methods:</i></b> One year of consecutive thyroid US with ≥1 nodule from Duke University Hospital and its affiliate community hospital (649 nodules from 347 patients) were retrospectively evaluated. Included nodules had ground truth diagnoses by surgical pathology, fine needle aspiration (FNA), or three-year follow-up US showing stability. An FDA-cleared AI tool (Koios DS Thyroid) analyzed each nodule to generate (i) American College of Radiology Thyroid Imaging Reporting and Data System (ACR TI-RADS) descriptors, scores, and follow-up recommendations and (ii) an AI-adapter score to further adjust risk assessments and recommendations. Four groups were then compared: (i) Koios with AI-adapter, (ii) Koios without AI-adapter, (iii) clinical radiology report, and (iv) radiology report combined with AI-adapter. Performance of the final recommendations (FNA or no FNA) was determined based on ground truth, and comparison between the four groups was made using sensitivity, specificity, and receiver-operating-curve analysis. <b><i>Results:</i></b> Of 649 nodules, 32 were malignant and 617 were benign. Performance of Koios with AI-adapter enabled was similar to radiologists (area under the curve [AUC] 0.70 for both, [CI 0.60-0.81] and [0.60-0.79], respectively). Koios with AI-adapter had improved specificity compared to radiologists (0.63 [CI: 0.59-0.67] versus 0.43 [CI: 0.38-0.48]) but decreased sensitivity (0.69 [CI: 0.50-0.83) versus 0.81 [CI: 0.61, 0.92]). Highest performance was seen when the radiology interpretation was combined with the AI-adapter (AUC 0.76, [CI: 0.67-0.85]). Combined with the AI-adapter, radiologist specificity improved from 0.43 ([CI: 0.38-0.48]) to 0.53 ([CI: 0.49-0.58]) (McNemar's test <i>p</i> < 0.001), resulting in 17% fewer FNA recommendations, with unchanged sensitivity (0.81, <i>p</i> = 1). <b><i>Conclusion:</i></b> Koios DS demonstrated standalone performance similar to radiologists, though with lower sensitivity and higher specificity. Performance was best when radiologist interpretations were combined with the AI-adapter component, with improved specificity and reduced unnecessary FNA recommendations.</p>","PeriodicalId":23016,"journal":{"name":"Thyroid","volume":" ","pages":"1379-1388"},"PeriodicalIF":5.8000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Thyroid","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1089/thy.2024.0410","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/15 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Thyroid nodules are challenging to accurately characterize on ultrasound (US), though the emergence of risk stratification systems and more recently artificial intelligence (AI) algorithms has improved nodule classification. The purpose of this study was to evaluate the performance of a recent Food and Drug Administration (FDA)-cleared AI tool for detection of malignancy in thyroid nodules on US. Methods: One year of consecutive thyroid US with ≥1 nodule from Duke University Hospital and its affiliate community hospital (649 nodules from 347 patients) were retrospectively evaluated. Included nodules had ground truth diagnoses by surgical pathology, fine needle aspiration (FNA), or three-year follow-up US showing stability. An FDA-cleared AI tool (Koios DS Thyroid) analyzed each nodule to generate (i) American College of Radiology Thyroid Imaging Reporting and Data System (ACR TI-RADS) descriptors, scores, and follow-up recommendations and (ii) an AI-adapter score to further adjust risk assessments and recommendations. Four groups were then compared: (i) Koios with AI-adapter, (ii) Koios without AI-adapter, (iii) clinical radiology report, and (iv) radiology report combined with AI-adapter. Performance of the final recommendations (FNA or no FNA) was determined based on ground truth, and comparison between the four groups was made using sensitivity, specificity, and receiver-operating-curve analysis. Results: Of 649 nodules, 32 were malignant and 617 were benign. Performance of Koios with AI-adapter enabled was similar to radiologists (area under the curve [AUC] 0.70 for both, [CI 0.60-0.81] and [0.60-0.79], respectively). Koios with AI-adapter had improved specificity compared to radiologists (0.63 [CI: 0.59-0.67] versus 0.43 [CI: 0.38-0.48]) but decreased sensitivity (0.69 [CI: 0.50-0.83) versus 0.81 [CI: 0.61, 0.92]). Highest performance was seen when the radiology interpretation was combined with the AI-adapter (AUC 0.76, [CI: 0.67-0.85]). Combined with the AI-adapter, radiologist specificity improved from 0.43 ([CI: 0.38-0.48]) to 0.53 ([CI: 0.49-0.58]) (McNemar's test p < 0.001), resulting in 17% fewer FNA recommendations, with unchanged sensitivity (0.81, p = 1). Conclusion: Koios DS demonstrated standalone performance similar to radiologists, though with lower sensitivity and higher specificity. Performance was best when radiologist interpretations were combined with the AI-adapter component, with improved specificity and reduced unnecessary FNA recommendations.

评估市售人工智能算法对超声检查甲状腺结节进行风险分层的诊断性能。
背景:虽然风险分层系统和最近出现的人工智能(AI)算法改善了甲状腺结节的分类,但在超声(US)上对甲状腺结节进行准确定性具有挑战性。本研究的目的是评估最近通过美国食品药品管理局(FDA)认证的人工智能工具在超声检测甲状腺结节恶性程度方面的性能。方法:对杜克大学医院及其附属社区医院连续一年甲状腺 US ≥1 个结节(347 名患者的 649 个结节)进行回顾性评估。所纳入的结节均通过手术病理学、细针穿刺术(FNA)或三年随访甲状腺 US 显示稳定的基本诊断。经 FDA 认证的人工智能工具(Koios DS 甲状腺)对每个结节进行分析,以生成 (i) 美国放射学会甲状腺成像报告和数据系统(ACR TI-RADS)描述符、评分和随访建议,以及 (ii) 人工智能适配器评分,以进一步调整风险评估和建议。然后对四组进行了比较:(i) 带有 AI-adapter 的 Koios,(ii) 不带 AI-adapter 的 Koios,(iii) 临床放射学报告,(iv) 结合 AI-adapter 的放射学报告。根据基本事实确定最终建议(FNA 或无 FNA)的性能,并使用灵敏度、特异性和接收器-操作曲线分析对四组进行比较。结果:在 649 个结节中,32 个为恶性,617 个为良性。启用人工智能适配器的Koios与放射科医生的表现相似(两者的曲线下面积[AUC]分别为0.70,[CI 0.60-0.81]和[0.60-0.79])。与放射科医生相比,使用人工智能适配器的 Koios 的特异性更高(0.63 [CI: 0.59-0.67] 对 0.43 [CI: 0.38-0.48]),但灵敏度却有所下降(0.69 [CI: 0.50-0.83] 对 0.81 [CI: 0.61, 0.92])。当放射学解释与人工智能适配器相结合时,性能最高(AUC 0.76,[CI:0.67-0.85])。结合人工智能适配器,放射科医生的特异性从 0.43([CI:0.38-0.48])提高到 0.53([CI:0.49-0.58])(McNemar's 检验 p < 0.001),从而减少了 17% 的 FNA 建议,灵敏度保持不变(0.81,p = 1)。结论Koios DS 的独立性能与放射科医生相似,但灵敏度较低,特异性较高。当放射医师的解释与人工智能适配器组件相结合时,性能最佳,特异性提高,不必要的 FNA 建议减少。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Thyroid
Thyroid 医学-内分泌学与代谢
CiteScore
12.30
自引率
6.10%
发文量
195
审稿时长
6 months
期刊介绍: This authoritative journal program, including the monthly flagship journal Thyroid, Clinical Thyroidology® (monthly), and VideoEndocrinology™ (quarterly), delivers in-depth coverage on topics from clinical application and primary care, to the latest advances in diagnostic imaging and surgical techniques and technologies, designed to optimize patient care and outcomes. Thyroid is the leading, peer-reviewed resource for original articles, patient-focused reports, and translational research on thyroid cancer and all thyroid related diseases. The Journal delivers the latest findings on topics from primary care to clinical application, and is the exclusive source for the authoritative and updated American Thyroid Association (ATA) Guidelines for Managing Thyroid Disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信