Explainable machine learning versus known nomogram for predicting non-sentinel lymph node metastases in breast cancer patients: A comparative study
Authors: Asieh Sadat Fattahi, Maryam Hoseini, Toktam Dehghani, Raheleh Ghouchan Nezhad Noor Nia, Zeinab Naseri, Amirali Ebrahimzadeh, Ali Mahri, Saeid Eslami
Journal: Computers in Biology and Medicine, volume 184, Article 109412 (impact factor 7.0, JCR Q1, Biology)
Published: 2024-11-16
DOI: 10.1016/j.compbiomed.2024.109412
URL: https://www.sciencedirect.com/science/article/pii/S0010482524014975
Citations: 0
Abstract
Introduction
Axillary lymph node dissection (ALND) is the standard of care for breast cancer patients with positive sentinel lymph nodes (SLN), which are the first lymph nodes that drain the breast. However, many patients with positive SLNs may not have additional positive nodes, making the prediction of non-sentinel lymph node (NSLN) metastasis challenging. Reliable prognostic tools are essential for accurately assessing NSLN metastasis. The Memorial Sloan Kettering Cancer Center (MSKCC) nomogram has demonstrated effectiveness in this context, but it requires further evaluation within the Iranian breast cancer population. While ALND remains the gold standard, its unnecessary application in patients without evidence of additional positive nodes raises concerns due to potential complications such as lymphedema, nerve injury, and shoulder joint dysfunction. Furthermore, integrating Artificial Intelligence (AI) and Machine Learning (ML) techniques presents an opportunity to enhance the precision of NSLN metastasis predictions.
Method
This study compares the MSKCC nomogram with several ML models for predicting NSLN metastasis, using a dataset of Iranian breast cancer patients. We analyzed 16 clinical features across a cohort of 183 patients and applied eXplainable Artificial Intelligence (XAI) methodologies to interpret the models. Our methodology includes statistical evaluation and the training and validation of ML models to assess their precision and robustness relative to the MSKCC nomogram.
Results
Our analysis revealed that the Random Forest (RF) model outperformed the MSKCC nomogram, achieving an accuracy of 72.2 % and an AUC of 0.77, compared to the nomogram's AUC of 0.73. Logistic Regression (LR) also demonstrated competitive performance with an accuracy of 65 % and an AUC of 0.73. The RF model exhibited high sensitivity (75 %) and precision (73 %), effectively identifying critical predictors of NSLN metastasis, including the presence of ductal carcinoma in situ (DCIS) and tumor characteristics such as type and grade. Explainable AI techniques, particularly SHAP values, provided insights into feature importance, enhancing model interpretability.
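For readers less familiar with the metrics reported above, the following is a minimal sketch of how accuracy, sensitivity, precision, and AUC are conventionally computed for a binary classifier. It is not the authors' code: the function names are hypothetical, the 0.5 decision threshold is an assumption, and the toy labels and scores are made up for illustration and are not taken from the study's data.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, tn, fn) for binary labels, where 1 means NSLN-positive."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, tn, fn


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity (recall on positives), and precision."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": tp / (tp + fn) if tp + fn else float("nan"),
        "precision": tp / (tp + fp) if tp + fp else float("nan"),
    }


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else (0.5 if p == n else 0.0) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up toy example (NOT study data): 4 positive and 4 negative patients.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]   # hypothetical model outputs
y_pred = [1 if s >= 0.5 else 0 for s in scores]      # assumed 0.5 threshold

print(classification_metrics(y_true, y_pred))
print(roc_auc(y_true, scores))
```

Accuracy, sensitivity, and precision depend on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds. This is why a model and a nomogram can share the same AUC while differing in accuracy at a given cut-off.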
Conclusion
Our study offers a comprehensive comparison between ML models and the MSKCC nomogram for predicting NSLN metastasis among Iranian breast cancer patients. These findings contribute valuable insights to the discourse on personalized treatment approaches, emphasizing the need for tailored prognostic tools across diverse populations. The implications of this research extend to clinical decision-making, potentially improving the accuracy and efficacy of breast cancer management within the Iranian healthcare landscape.
About the journal:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.