Explainable machine learning versus known nomogram for predicting non-sentinel lymph node metastases in breast cancer patients: A comparative study

IF 7 2区医学 Q1 BIOLOGY

Computers in biology and medicine Pub Date : 2024-11-16 DOI:10.1016/j.compbiomed.2024.109412

Asieh Sadat Fattahi , Maryam Hoseini , Toktam Dehghani , Raheleh Ghouchan Nezhad Noor Nia , Zeinab Naseri , Amirali Ebrahimzadeh , Ali Mahri , Saeid Eslami

{"title":"Explainable machine learning versus known nomogram for predicting non-sentinel lymph node metastases in breast cancer patients: A comparative study","authors":"Asieh Sadat Fattahi , Maryam Hoseini , Toktam Dehghani , Raheleh Ghouchan Nezhad Noor Nia , Zeinab Naseri , Amirali Ebrahimzadeh , Ali Mahri , Saeid Eslami","doi":"10.1016/j.compbiomed.2024.109412","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>Axillary lymph node dissection (ALND) is the standard of care for breast cancer patients with positive sentinel lymph nodes (SLN), which are the first lymph nodes that drain the breast. However, many patients with positive SLNs may not have additional positive nodes, making the prediction of non-sentinel lymph node (NSLN) metastasis challenging. Reliable prognostic tools are essential for accurately assessing NSLN metastasis. The Memorial Sloan Kettering Cancer Center (MSKCC) nomogram has demonstrated effectiveness in this context, but it requires further evaluation within the Iranian breast cancer population. While ALND remains the gold standard, its unnecessary application in patients without evidence of additional positive nodes raises concerns due to potential complications such as lymphedema, nerve injury, and shoulder joint dysfunction. Furthermore, integrating Artificial Intelligence (AI) and Machine Learning (ML) techniques presents an opportunity to enhance the precision of NSLN metastasis predictions.</div></div><div><h3>Method</h3><div>This study conducts an extensive comparative analysis between the MSKCC nomogram and various ML models to predict NSLN metastasis, utilizing a dataset of Iranian breast cancer patients. Employing eXplainable Artificial Intelligence (XAI) methodologies, we analyzed 16 clinical features across a cohort of 183 patients. Our methodology includes rigorous statistical evaluations and the training and validation of ML models to assess the precision and robustness of these models compared to the MSKCC nomogram.</div></div><div><h3>Results</h3><div>Our analysis revealed that the Random Forest (RF) model outperformed the MSKCC nomogram, achieving an accuracy of 72.2 % and an AUC of 0.77, compared to the nomogram's AUC of 0.73. Logistic Regression (LR) also demonstrated competitive performance with an accuracy of 65 % and an AUC of 0.73. The RF model exhibited high sensitivity (75 %) and precision (73 %), effectively identifying critical predictors of NSLN metastasis, including the presence of ductal carcinoma in situ (DCIS) and tumor characteristics such as type and grade. Explainable AI techniques, particularly SHAP values, provided insights into feature importance, enhancing model interpretability.</div></div><div><h3>Conclusion</h3><div>Our study offers a comprehensive comparison between ML models and the MSKCC nomogram for predicting NSLN metastasis among Iranian breast cancer patients. These findings contribute valuable insights to the discourse on personalized treatment approaches, emphasizing the need for tailored prognostic tools across diverse populations. The implications of this research extend to clinical decision-making, potentially improving the accuracy and efficacy of breast cancer management within the Iranian healthcare landscape.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"184 ","pages":"Article 109412"},"PeriodicalIF":7.0000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482524014975","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction

Axillary lymph node dissection (ALND) is the standard of care for breast cancer patients with positive sentinel lymph nodes (SLN), which are the first lymph nodes that drain the breast. However, many patients with positive SLNs may not have additional positive nodes, making the prediction of non-sentinel lymph node (NSLN) metastasis challenging. Reliable prognostic tools are essential for accurately assessing NSLN metastasis. The Memorial Sloan Kettering Cancer Center (MSKCC) nomogram has demonstrated effectiveness in this context, but it requires further evaluation within the Iranian breast cancer population. While ALND remains the gold standard, its unnecessary application in patients without evidence of additional positive nodes raises concerns due to potential complications such as lymphedema, nerve injury, and shoulder joint dysfunction. Furthermore, integrating Artificial Intelligence (AI) and Machine Learning (ML) techniques presents an opportunity to enhance the precision of NSLN metastasis predictions.

Method

This study conducts an extensive comparative analysis between the MSKCC nomogram and various ML models to predict NSLN metastasis, utilizing a dataset of Iranian breast cancer patients. Employing eXplainable Artificial Intelligence (XAI) methodologies, we analyzed 16 clinical features across a cohort of 183 patients. Our methodology includes rigorous statistical evaluations and the training and validation of ML models to assess the precision and robustness of these models compared to the MSKCC nomogram.

Results

Our analysis revealed that the Random Forest (RF) model outperformed the MSKCC nomogram, achieving an accuracy of 72.2 % and an AUC of 0.77, compared to the nomogram's AUC of 0.73. Logistic Regression (LR) also demonstrated competitive performance with an accuracy of 65 % and an AUC of 0.73. The RF model exhibited high sensitivity (75 %) and precision (73 %), effectively identifying critical predictors of NSLN metastasis, including the presence of ductal carcinoma in situ (DCIS) and tumor characteristics such as type and grade. Explainable AI techniques, particularly SHAP values, provided insights into feature importance, enhancing model interpretability.

Conclusion

Our study offers a comprehensive comparison between ML models and the MSKCC nomogram for predicting NSLN metastasis among Iranian breast cancer patients. These findings contribute valuable insights to the discourse on personalized treatment approaches, emphasizing the need for tailored prognostic tools across diverse populations. The implications of this research extend to clinical decision-making, potentially improving the accuracy and efficacy of breast cancer management within the Iranian healthcare landscape.

查看原文本刊更多论文

预测乳腺癌患者非前哨淋巴结转移的可解释机器学习与已知提名图：比较研究

简介：腋窝淋巴结清扫术（ALND）是前哨淋巴结（SLN）阳性乳腺癌患者的标准治疗方法。然而，许多前哨淋巴结（SLN）阳性患者可能没有其他阳性淋巴结，因此预测非前哨淋巴结（NSLN）转移具有挑战性。可靠的预后工具对于准确评估非前哨淋巴结转移至关重要。纪念斯隆-凯特琳癌症中心（MSKCC）的提名图在这方面已显示出有效性，但还需要在伊朗乳腺癌人群中进行进一步评估。虽然 ALND 仍是黄金标准，但在没有证据表明有其他阳性结节的患者中不必要地应用 ALND 会引发潜在的并发症，如淋巴水肿、神经损伤和肩关节功能障碍，这一点令人担忧。此外，整合人工智能（AI）和机器学习（ML）技术为提高 NSLN 转移预测的精确度提供了机会：本研究利用伊朗乳腺癌患者的数据集，对 MSKCC 提名图和各种 ML 模型进行了广泛的比较分析，以预测 NSLN 转移。我们采用易用人工智能（XAI）方法，分析了 183 例患者的 16 个临床特征。我们的方法包括严格的统计评估以及 ML 模型的训练和验证，以评估这些模型与 MSKCC 提名图相比的精确性和稳健性：我们的分析表明，随机森林（RF）模型优于 MSKCC 直方图，准确率达到 72.2%，AUC 为 0.77，而直方图的 AUC 为 0.73。逻辑回归（LR）的准确率为 65%，AUC 为 0.73，也表现出了很强的竞争力。RF 模型的灵敏度（75%）和精确度（73%）都很高，能有效识别 NSLN 转移的关键预测因素，包括是否存在导管原位癌（DCIS）以及肿瘤类型和分级等肿瘤特征。可解释的人工智能技术，尤其是SHAP值，提供了对特征重要性的见解，增强了模型的可解释性：我们的研究对伊朗乳腺癌患者NSLN转移预测的ML模型和MSKCC提名图进行了全面比较。这些发现为个性化治疗方法的讨论提供了有价值的见解，强调了不同人群对定制预后工具的需求。这项研究的意义延伸到临床决策，有可能提高伊朗医疗保健领域乳腺癌管理的准确性和有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers in biology and medicine 工程技术-工程：生物医学

CiteScore

11.70

自引率

10.40%

发文量

1086

审稿时长

74 days

期刊介绍： Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.