Noninvasive Staging of Lymph Node Status in Breast Cancer Using Machine Learning: External Validation and Further Model Development.

IF 3.3 Q2 ONCOLOGY

JMIR Cancer Pub Date : 2023-11-20 DOI:10.2196/46474

Malin Hjärtström, Looket Dihge, Pär-Ola Bendahl, Ida Skarping, Julia Ellbrant, Mattias Ohlsson, Lisa Rydén

{"title":"Noninvasive Staging of Lymph Node Status in Breast Cancer Using Machine Learning: External Validation and Further Model Development.","authors":"Malin Hjärtström, Looket Dihge, Pär-Ola Bendahl, Ida Skarping, Julia Ellbrant, Mattias Ohlsson, Lisa Rydén","doi":"10.2196/46474","DOIUrl":null,"url":null,"abstract":"Background: Most patients diagnosed with breast cancer present with a node-negative disease. Sentinel lymph node biopsy (SLNB) is routinely used for axillary staging, leaving patients with healthy axillary lymph nodes without therapeutic effects but at risk of morbidities from the intervention. Numerous studies have developed nodal status prediction models for noninvasive axillary staging using postoperative data or imaging features that are not part of the diagnostic workup. Lymphovascular invasion (LVI) is a top-ranked predictor of nodal metastasis; however, its preoperative assessment is challenging.Objective: This paper aimed to externally validate a multilayer perceptron (MLP) model for noninvasive lymph node staging (NILS) in a large population-based cohort (n=18,633) and develop a new MLP in the same cohort. Data were extracted from the Swedish National Quality Register for Breast Cancer (NKBC, 2014-2017), comprising only routinely and preoperatively available documented clinicopathological variables. A secondary aim was to develop and validate an LVI MLP for imputation of missing LVI status to increase the preoperative feasibility of the original NILS model.Methods: Three nonoverlapping cohorts were used for model development and validation. A total of 4 MLPs for nodal status and 1 LVI MLP were developed using 11 to 12 routinely available predictors. Three nodal status models were used to account for the different availabilities of LVI status in the cohorts and external validation in NKBC. The fourth nodal status model was developed for 80% (14,906/18,663) of NKBC cases and validated in the remaining 20% (3727/18,663). Three alternatives for imputation of LVI status were compared. The discriminatory capacity was evaluated using the validation area under the receiver operating characteristics curve (AUC) in 3 of the nodal status models. The clinical feasibility of the models was evaluated using calibration and decision curve analyses.Results: External validation of the original NILS model was performed in NKBC (AUC 0.699, 95% CI 0.690-0.708) with good calibration and the potential of sparing 16% of patients with node-negative disease from SLNB. The LVI model was externally validated (AUC 0.747, 95% CI 0.694-0.799) with good calibration but did not improve the discriminatory performance of the nodal status models. A new nodal status model was developed in NKBC without information on LVI (AUC 0.709, 95% CI: 0.688-0.729), with excellent calibration in the holdout internal validation cohort, resulting in the potential omission of 24% of patients from unnecessary SLNBs.Conclusions: The NILS model was externally validated in NKBC, where the imputation of LVI status did not improve the model's discriminatory performance. A new nodal status model demonstrated the feasibility of using register data comprising only the variables available in the preoperative setting for NILS using machine learning. Future steps include ongoing preoperative validation of the NILS model and extending the model with, for example, mammography images.","PeriodicalId":45538,"journal":{"name":"JMIR Cancer","volume":"9 ","pages":"e46474"},"PeriodicalIF":3.3000,"publicationDate":"2023-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Cancer","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/46474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Most patients diagnosed with breast cancer present with a node-negative disease. Sentinel lymph node biopsy (SLNB) is routinely used for axillary staging, leaving patients with healthy axillary lymph nodes without therapeutic effects but at risk of morbidities from the intervention. Numerous studies have developed nodal status prediction models for noninvasive axillary staging using postoperative data or imaging features that are not part of the diagnostic workup. Lymphovascular invasion (LVI) is a top-ranked predictor of nodal metastasis; however, its preoperative assessment is challenging.

Objective: This paper aimed to externally validate a multilayer perceptron (MLP) model for noninvasive lymph node staging (NILS) in a large population-based cohort (n=18,633) and develop a new MLP in the same cohort. Data were extracted from the Swedish National Quality Register for Breast Cancer (NKBC, 2014-2017), comprising only routinely and preoperatively available documented clinicopathological variables. A secondary aim was to develop and validate an LVI MLP for imputation of missing LVI status to increase the preoperative feasibility of the original NILS model.

Methods: Three nonoverlapping cohorts were used for model development and validation. A total of 4 MLPs for nodal status and 1 LVI MLP were developed using 11 to 12 routinely available predictors. Three nodal status models were used to account for the different availabilities of LVI status in the cohorts and external validation in NKBC. The fourth nodal status model was developed for 80% (14,906/18,663) of NKBC cases and validated in the remaining 20% (3727/18,663). Three alternatives for imputation of LVI status were compared. The discriminatory capacity was evaluated using the validation area under the receiver operating characteristics curve (AUC) in 3 of the nodal status models. The clinical feasibility of the models was evaluated using calibration and decision curve analyses.

Results: External validation of the original NILS model was performed in NKBC (AUC 0.699, 95% CI 0.690-0.708) with good calibration and the potential of sparing 16% of patients with node-negative disease from SLNB. The LVI model was externally validated (AUC 0.747, 95% CI 0.694-0.799) with good calibration but did not improve the discriminatory performance of the nodal status models. A new nodal status model was developed in NKBC without information on LVI (AUC 0.709, 95% CI: 0.688-0.729), with excellent calibration in the holdout internal validation cohort, resulting in the potential omission of 24% of patients from unnecessary SLNBs.

Conclusions: The NILS model was externally validated in NKBC, where the imputation of LVI status did not improve the model's discriminatory performance. A new nodal status model demonstrated the feasibility of using register data comprising only the variables available in the preoperative setting for NILS using machine learning. Future steps include ongoing preoperative validation of the NILS model and extending the model with, for example, mammography images.

查看原文本刊更多论文

使用机器学习的乳腺癌淋巴结状态的无创分期:外部验证和进一步的模型开发。

背景:大多数诊断为乳腺癌的患者表现为淋巴结阴性疾病。前哨淋巴结活检(SLNB)通常用于腋窝分期，使健康腋窝淋巴结的患者没有治疗效果，但有干预引起发病的风险。许多研究已经开发了无创腋窝分期的淋巴结状态预测模型，这些模型使用的是术后数据或非诊断性检查的影像学特征。淋巴血管侵袭(LVI)是淋巴结转移的首选预测因子;然而，其术前评估是具有挑战性的。目的:本文旨在外部验证基于大型人群队列(n=18,633)的多层感知器(MLP)模型用于无创淋巴结分期(NILS)，并在同一队列中开发新的MLP。数据来自瑞典国家乳腺癌质量登记(NKBC, 2014-2017)，仅包括常规和术前可用的临床病理变量。第二个目标是开发和验证LVI MLP，用于缺失LVI状态的imputation，以增加原始NILS模型的术前可行性。方法:采用三个不重叠的队列进行模型开发和验证。使用11 - 12个常规可用的预测因子，共开发了4个节点状态MLP和1个LVI MLP。使用三种节点状态模型来解释NKBC队列中LVI状态的不同可用性和外部验证。第四种节点状态模型用于80%(14,906/18,663)的NKBC病例，并在其余20%(3727/18,663)中得到验证。比较了三种LVI状态的计算方法。在3个节点状态模型中，使用接受者工作特征曲线(AUC)下的验证面积来评估区分能力。采用校正和决策曲线分析评估模型的临床可行性。结果:原始NILS模型在NKBC中进行了外部验证(AUC 0.699, 95% CI 0.690-0.708)，校准良好，有可能使16%的淋巴结阴性疾病患者免于SLNB。LVI模型经外部验证(AUC为0.747,95% CI为0.694-0.799)，校正效果良好，但并未改善节点状态模型的判别性能。在没有LVI信息的NKBC中建立了一个新的节点状态模型(AUC 0.709, 95% CI: 0.688-0.729)，在holdout内部验证队列中具有出色的校准，导致24%的患者可能从不必要的slnb中遗漏。结论:NILS模型在NKBC中得到了外部验证，LVI状态的输入并没有提高模型的区分性能。一个新的节点状态模型证明了使用寄存器数据的可行性，该数据仅包含使用机器学习的NILS术前设置中可用的变量。未来的步骤包括正在进行的NILS模型的术前验证，并扩展模型，例如乳房x线摄影图像。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊