Histological proven AI performance in the UKLS CT lung cancer screening study: Potential for workload reduction

IF 7.6 1区 医学 Q1 ONCOLOGY
Harriet L. Lancaster , Beibei Jiang , Michael P.A. Davies , Jan Willem C. Gratama , Mario Silva , Jaeyoun Yi , Marjolein A. Heuvelmans , Geertruida H. de Bock , Anand Devaraj , John K. Field , Matthijs Oudkerk
{"title":"Histological proven AI performance in the UKLS CT lung cancer screening study: Potential for workload reduction","authors":"Harriet L. Lancaster ,&nbsp;Beibei Jiang ,&nbsp;Michael P.A. Davies ,&nbsp;Jan Willem C. Gratama ,&nbsp;Mario Silva ,&nbsp;Jaeyoun Yi ,&nbsp;Marjolein A. Heuvelmans ,&nbsp;Geertruida H. de Bock ,&nbsp;Anand Devaraj ,&nbsp;John K. Field ,&nbsp;Matthijs Oudkerk","doi":"10.1016/j.ejca.2025.115324","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>Artificial intelligence (AI) could reduce lung cancer screening computer tomography (CT)-reading workload if used as a first-reader, ruling-out negative CT-scans at baseline. Evidence is lacking to support AI performance when compared to gold-standard lung cancer outcomes. This study validated the performance of a commercially available AI software in the UK lung cancer screening (UKLS) trial dataset, with comparison to human reads and histological lung cancer outcomes, and estimated CT-reading workload reduction.</div></div><div><h3>Methods</h3><div>1252 UKLS-baseline-CT-scans were evaluated independently by AI and human readers. AI performance was evaluated on two-levels. Firstly, AI classification and individual reads were compared to a EU reference standard (based on NELSON2.0-European Position Statement) determined by a European expert panel blinded from individual results. A positive misclassification was defined as a nodule positive read ≥ 100mm<sup>3</sup> and no/&lt;100mm<sup>3</sup> nodules in the expert read; A negative misclassification was defined as a nodule negative read, whereas an indeterminate or positive finding in the expert read. Secondly, AI nodule classification was compared to gold-standard histological lung cancer outcomes. CT-reading workload reduction was calculated from AI negative CT-scans when AI was used as first-reader.</div></div><div><h3>Results</h3><div>Expert panel reference standard reported 815 (65 %) negative and 437 (35 %) indeterminate/positive CT-scans in the dataset of 1252 UKLS-participants. Compared to the reference standard, AI resulted in less misclassification than human reads, NPV 92·0 %(90·2 %-95·3 %). On comparison to gold-standard, AI detected all 31 baseline-round lung cancers, but classified one as negative due to the 100mm<sup>3</sup> threshold, NPV 99·8 %(99·0 %-99·9 %). Estimated maximum CT-reading workload reduction was 79 %.</div></div><div><h3>Conclusion</h3><div>Implementing AI as first-reader to rule-out negative CT-scans, shows considerable potential to reduce CT-reading workload and does not lead to missed lung cancers.</div></div>","PeriodicalId":11980,"journal":{"name":"European Journal of Cancer","volume":"220 ","pages":"Article 115324"},"PeriodicalIF":7.6000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Cancer","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0959804925001054","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose

Artificial intelligence (AI) could reduce lung cancer screening computer tomography (CT)-reading workload if used as a first-reader, ruling-out negative CT-scans at baseline. Evidence is lacking to support AI performance when compared to gold-standard lung cancer outcomes. This study validated the performance of a commercially available AI software in the UK lung cancer screening (UKLS) trial dataset, with comparison to human reads and histological lung cancer outcomes, and estimated CT-reading workload reduction.

Methods

1252 UKLS-baseline-CT-scans were evaluated independently by AI and human readers. AI performance was evaluated on two-levels. Firstly, AI classification and individual reads were compared to a EU reference standard (based on NELSON2.0-European Position Statement) determined by a European expert panel blinded from individual results. A positive misclassification was defined as a nodule positive read ≥ 100mm3 and no/<100mm3 nodules in the expert read; A negative misclassification was defined as a nodule negative read, whereas an indeterminate or positive finding in the expert read. Secondly, AI nodule classification was compared to gold-standard histological lung cancer outcomes. CT-reading workload reduction was calculated from AI negative CT-scans when AI was used as first-reader.

Results

Expert panel reference standard reported 815 (65 %) negative and 437 (35 %) indeterminate/positive CT-scans in the dataset of 1252 UKLS-participants. Compared to the reference standard, AI resulted in less misclassification than human reads, NPV 92·0 %(90·2 %-95·3 %). On comparison to gold-standard, AI detected all 31 baseline-round lung cancers, but classified one as negative due to the 100mm3 threshold, NPV 99·8 %(99·0 %-99·9 %). Estimated maximum CT-reading workload reduction was 79 %.

Conclusion

Implementing AI as first-reader to rule-out negative CT-scans, shows considerable potential to reduce CT-reading workload and does not lead to missed lung cancers.
组织学证实的AI在UKLS CT肺癌筛查研究中的表现:减少工作量的潜力
目的:人工智能(AI)可以减少肺癌筛查计算机断层扫描(CT)阅读工作量,如果用作第一阅读器,排除基线CT扫描阴性的可能性。与金标准肺癌结果相比,缺乏支持人工智能表现的证据。本研究验证了商用人工智能软件在英国肺癌筛查(UKLS)试验数据集中的性能,与人类阅读和组织学肺癌结果进行了比较,并估计了ct阅读工作量的减少。方法1252例ukls基线ct扫描由人工智能和人类读者独立评估。人工智能的表现分为两个等级。首先,将人工智能分类和个人读数与欧盟参考标准(基于nelson2.0 -欧洲立场声明)进行比较,该标准由欧洲专家小组确定,不考虑个人结果。阳性误分定义为结节阳性读数≥ 100mm3,专家读数无/<;100mm3结节;阴性错误分类被定义为结节阴性阅读,而专家阅读的不确定或阳性发现。其次,将AI结节分类与金标准肺癌组织学结果进行比较。当使用AI作为第一阅读者时,通过AI阴性ct扫描计算ct阅读工作量减少。结果专家小组参考标准在1252名ukls参与者的数据集中报告了815例(65 %)阴性ct扫描和437例(35 %)不确定/阳性ct扫描。与参考标准相比,人工智能的误分类率低于人类,NPV为92.0 %(92.0 %- 95.3 %)。与金标准相比,AI检测到所有31例基线轮肺癌,但由于100mm3阈值,NPV为99.8 %(99.0 %- 99.9 %),因此将1例诊断为阴性。估计最大的ct阅读工作量减少为79 %。结论采用人工智能作为第一阅读器来排除阴性ct扫描,显示出相当大的潜力,可以减少ct阅读工作量,并且不会导致漏诊肺癌。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
European Journal of Cancer
European Journal of Cancer 医学-肿瘤学
CiteScore
11.50
自引率
4.80%
发文量
953
审稿时长
23 days
期刊介绍: The European Journal of Cancer (EJC) serves as a comprehensive platform integrating preclinical, digital, translational, and clinical research across the spectrum of cancer. From epidemiology, carcinogenesis, and biology to groundbreaking innovations in cancer treatment and patient care, the journal covers a wide array of topics. We publish original research, reviews, previews, editorial comments, and correspondence, fostering dialogue and advancement in the fight against cancer. Join us in our mission to drive progress and improve outcomes in cancer research and patient care.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信