Deep learning model based on primary tumor to predict lymph node status in clinical stage IA lung adenocarcinoma: a multicenter study

IF 7.6 Q1 ONCOLOGY
Li Zhang , Hailin Li , Shaohong Zhao , Xuemin Tao , Meng Li , Shouxin Yang , Lina Zhou , Mengwen Liu , Xue Zhang , Di Dong , Jie Tian , Ning Wu
{"title":"Deep learning model based on primary tumor to predict lymph node status in clinical stage IA lung adenocarcinoma: a multicenter study","authors":"Li Zhang ,&nbsp;Hailin Li ,&nbsp;Shaohong Zhao ,&nbsp;Xuemin Tao ,&nbsp;Meng Li ,&nbsp;Shouxin Yang ,&nbsp;Lina Zhou ,&nbsp;Mengwen Liu ,&nbsp;Xue Zhang ,&nbsp;Di Dong ,&nbsp;Jie Tian ,&nbsp;Ning Wu","doi":"10.1016/j.jncc.2024.01.005","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>To develop a deep learning model to predict lymph node (LN) status in clinical stage IA lung adenocarcinoma patients.</p></div><div><h3>Methods</h3><p>This diagnostic study included 1,009 patients with pathologically confirmed clinical stage T1N0M0 lung adenocarcinoma from two independent datasets (699 from Cancer Hospital of Chinese Academy of Medical Sciences and 310 from PLA General Hospital) between January 2005 and December 2019. The Cancer Hospital dataset was randomly split into a training cohort (559 patients) and a validation cohort (140 patients) to train and tune a deep learning model based on a deep residual network (ResNet). The PLA Hospital dataset was used as a testing cohort to evaluate the generalization ability of the model. Thoracic radiologists manually segmented tumors and interpreted high-resolution computed tomography (HRCT) features for the model. The predictive performance was assessed by area under the curves (AUCs), accuracy, precision, recall, and F1 score. Subgroup analysis was performed to evaluate the potential bias of the study population.</p></div><div><h3>Results</h3><p>A total of 1,009 patients were included in this study; 409 (40.5%) were male and 600 (59.5%) were female. The median age was 57.0 years (inter-quartile range, IQR: 50.0–64.0). The deep learning model achieved AUCs of 0.906 (95% CI: 0.873–0.938) and 0.893 (95% CI: 0.857–0.930) for predicting pN0 disease in the testing cohort and a non-pure ground glass nodule (non-pGGN) testing cohort, respectively. No significant difference was detected between the testing cohort and the non-pGGN testing cohort (<em>P</em> = 0.622). The precisions of this model for predicting pN0 disease were 0.979 (95% CI: 0.963–0.995) and 0.983 (95% CI: 0.967–0.998) in the testing cohort and the non-pGGN testing cohort, respectively. The deep learning model achieved AUCs of 0.848 (95% CI: 0.798–0.898) and 0.831 (95% CI: 0.776–0.887) for predicting pN2 disease in the testing cohort and the non-pGGN testing cohort, respectively. No significant difference was detected between the testing cohort and the non-pGGN testing cohort (<em>P</em> = 0.657). The recalls of this model for predicting pN2 disease were 0.903 (95% CI: 0.870–0.936) and 0.931 (95% CI: 0.901–0.961) in the testing cohort and the non-pGGN testing cohort, respectively.</p></div><div><h3>Conclusions</h3><p>The superior performance of the deep learning model will help to target the extension of lymph node dissection and reduce the ineffective lymph node dissection in early-stage lung adenocarcinoma patients.</p></div>","PeriodicalId":73987,"journal":{"name":"Journal of the National Cancer Center","volume":"4 3","pages":"Pages 233-240"},"PeriodicalIF":7.6000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266700542400005X/pdfft?md5=07e3c41e80da73446c9241c22399a44e&pid=1-s2.0-S266700542400005X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the National Cancer Center","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266700542400005X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective

To develop a deep learning model to predict lymph node (LN) status in clinical stage IA lung adenocarcinoma patients.

Methods

This diagnostic study included 1,009 patients with pathologically confirmed clinical stage T1N0M0 lung adenocarcinoma from two independent datasets (699 from Cancer Hospital of Chinese Academy of Medical Sciences and 310 from PLA General Hospital) between January 2005 and December 2019. The Cancer Hospital dataset was randomly split into a training cohort (559 patients) and a validation cohort (140 patients) to train and tune a deep learning model based on a deep residual network (ResNet). The PLA Hospital dataset was used as a testing cohort to evaluate the generalization ability of the model. Thoracic radiologists manually segmented tumors and interpreted high-resolution computed tomography (HRCT) features for the model. The predictive performance was assessed by area under the curves (AUCs), accuracy, precision, recall, and F1 score. Subgroup analysis was performed to evaluate the potential bias of the study population.

Results

A total of 1,009 patients were included in this study; 409 (40.5%) were male and 600 (59.5%) were female. The median age was 57.0 years (inter-quartile range, IQR: 50.0–64.0). The deep learning model achieved AUCs of 0.906 (95% CI: 0.873–0.938) and 0.893 (95% CI: 0.857–0.930) for predicting pN0 disease in the testing cohort and a non-pure ground glass nodule (non-pGGN) testing cohort, respectively. No significant difference was detected between the testing cohort and the non-pGGN testing cohort (P = 0.622). The precisions of this model for predicting pN0 disease were 0.979 (95% CI: 0.963–0.995) and 0.983 (95% CI: 0.967–0.998) in the testing cohort and the non-pGGN testing cohort, respectively. The deep learning model achieved AUCs of 0.848 (95% CI: 0.798–0.898) and 0.831 (95% CI: 0.776–0.887) for predicting pN2 disease in the testing cohort and the non-pGGN testing cohort, respectively. No significant difference was detected between the testing cohort and the non-pGGN testing cohort (P = 0.657). The recalls of this model for predicting pN2 disease were 0.903 (95% CI: 0.870–0.936) and 0.931 (95% CI: 0.901–0.961) in the testing cohort and the non-pGGN testing cohort, respectively.

Conclusions

The superior performance of the deep learning model will help to target the extension of lymph node dissection and reduce the ineffective lymph node dissection in early-stage lung adenocarcinoma patients.

基于原发肿瘤的深度学习模型预测临床IA期肺腺癌的淋巴结状态:一项多中心研究
方法这项诊断研究纳入了2005年1月至2019年12月期间两个独立数据集(699例来自中国医学科学院肿瘤医院,310例来自中国人民解放军总医院)中1009例经病理确诊的临床分期为T1N0M0的肺腺癌患者。肿瘤医院数据集被随机分成训练队列(559 名患者)和验证队列(140 名患者),用于训练和调整基于深度残差网络(ResNet)的深度学习模型。解放军医院数据集被用作测试队列,以评估模型的泛化能力。胸部放射科医生为模型手动分割肿瘤并解释高分辨率计算机断层扫描(HRCT)特征。预测性能通过曲线下面积(AUC)、准确度、精确度、召回率和 F1 分数进行评估。本研究共纳入 1,009 名患者,其中男性 409 人(40.5%),女性 600 人(59.5%)。中位年龄为 57.0 岁(四分位数间距,IQR:50.0-64.0)。在测试队列和非纯磨碎玻璃结节(non-GGN)测试队列中,深度学习模型预测 pN0 疾病的 AUC 分别为 0.906(95% CI:0.873-0.938)和 0.893(95% CI:0.857-0.930)。检测队列与非纯磨玻璃结节检测队列之间未发现明显差异(P = 0.622)。该模型预测 pN0 疾病的精确度在测试队列和非纯纯 GGN 测试队列中分别为 0.979(95% CI:0.963-0.995)和 0.983(95% CI:0.967-0.998)。深度学习模型在测试队列和非GGN测试队列中预测pN2疾病的AUC分别为0.848(95% CI:0.798-0.898)和0.831(95% CI:0.776-0.887)。检测队列和非检测队列之间未发现明显差异(P = 0.657)。该模型预测 pN2 疾病的召回率在测试队列和非 GGGN 测试队列中分别为 0.903(95% CI:0.870-0.936)和 0.931(95% CI:0.901-0.961)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
14.20
自引率
0.00%
发文量
0
审稿时长
70 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信