Comparative Analysis of Nomogram and Machine Learning Models for Predicting Axillary Lymph Node Metastasis in Early-Stage Breast Cancer: A Study on Clinically and Ultrasound-Negative Axillary Cases Across Two Centers
Meiying Yan , Dilin He , Yu Sun , Long Huang , Linli Cai , Chen Wang , Jincao Yao , Xiangyang Li , Hongping Song , Chen Yang
{"title":"Comparative Analysis of Nomogram and Machine Learning Models for Predicting Axillary Lymph Node Metastasis in Early-Stage Breast Cancer: A Study on Clinically and Ultrasound-Negative Axillary Cases Across Two Centers","authors":"Meiying Yan , Dilin He , Yu Sun , Long Huang , Linli Cai , Chen Wang , Jincao Yao , Xiangyang Li , Hongping Song , Chen Yang","doi":"10.1016/j.ultrasmedbio.2024.11.003","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Early and accurate prediction of axillary lymph node metastasis (ALNM) is crucial in determining appropriate treatment strategies for patients with early-stage breast cancer. The aim of this study was to evaluate the efficacy of radiomic features extracted from ultrasound (US) images combined with machine learning (ML) methods in predicting ALNM to improve diagnostic accuracy and patient prognosis.</div></div><div><h3>Methods</h3><div>In this retrospective study, data of 282 early-stage breast cancer patients from two centers were analyzed. We considered clinicopathological characteristics, conventional US features, contrast-enhanced ultrasound (CEUS) characteristics, and radiomics features. Radiomics features were extracted from US images, and using least absolute shrinkage and selection operator (LASSO) regression, 12 key features were selected to compute a Radiomics score (Rad-score). A nomogram was developed based on these features, alongside five ML models: Logistic Regression (LR), Naive Bayes (NB), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Extreme Gradient Boosting (XGBoost). Model performance was evaluated using metrics such as the area under the curve (AUC), accuracy (ACC), sensitivity (SEN), specificity (SPE), negative predictive value (NPV), and positive predictive value (PPV).</div></div><div><h3>Results</h3><div>Both the nomogram and ML models, including the Rad-score combined with histologic type, significantly predicted ALNM. Among all models, the XGBoost model showed the best performance with an AUC of 0.810 and an accuracy of 84.1% in the external test set, surpassing the nomogram and other ML models. SHapley Additive exPlanations (SHAP) analysis further provided insights into the influence of individual radiomics features on ALNM prediction.</div></div><div><h3>Conclusions</h3><div>While the nomogram provides a useful traditional statistical approach, integrating radiomics features with ML, particularly the XGBoost model enhanced by SHAP interpretability, offers superior predictive accuracy for ALNM in early-stage breast cancer patients.</div></div>","PeriodicalId":49399,"journal":{"name":"Ultrasound in Medicine and Biology","volume":"51 3","pages":"Pages 463-474"},"PeriodicalIF":2.4000,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ultrasound in Medicine and Biology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0301562924004289","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
Early and accurate prediction of axillary lymph node metastasis (ALNM) is crucial in determining appropriate treatment strategies for patients with early-stage breast cancer. The aim of this study was to evaluate the efficacy of radiomic features extracted from ultrasound (US) images combined with machine learning (ML) methods in predicting ALNM to improve diagnostic accuracy and patient prognosis.
Methods
In this retrospective study, data of 282 early-stage breast cancer patients from two centers were analyzed. We considered clinicopathological characteristics, conventional US features, contrast-enhanced ultrasound (CEUS) characteristics, and radiomics features. Radiomics features were extracted from US images, and using least absolute shrinkage and selection operator (LASSO) regression, 12 key features were selected to compute a Radiomics score (Rad-score). A nomogram was developed based on these features, alongside five ML models: Logistic Regression (LR), Naive Bayes (NB), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Extreme Gradient Boosting (XGBoost). Model performance was evaluated using metrics such as the area under the curve (AUC), accuracy (ACC), sensitivity (SEN), specificity (SPE), negative predictive value (NPV), and positive predictive value (PPV).
Results
Both the nomogram and ML models, including the Rad-score combined with histologic type, significantly predicted ALNM. Among all models, the XGBoost model showed the best performance with an AUC of 0.810 and an accuracy of 84.1% in the external test set, surpassing the nomogram and other ML models. SHapley Additive exPlanations (SHAP) analysis further provided insights into the influence of individual radiomics features on ALNM prediction.
Conclusions
While the nomogram provides a useful traditional statistical approach, integrating radiomics features with ML, particularly the XGBoost model enhanced by SHAP interpretability, offers superior predictive accuracy for ALNM in early-stage breast cancer patients.
期刊介绍:
Ultrasound in Medicine and Biology is the official journal of the World Federation for Ultrasound in Medicine and Biology. The journal publishes original contributions that demonstrate a novel application of an existing ultrasound technology in clinical diagnostic, interventional and therapeutic applications, new and improved clinical techniques, the physics, engineering and technology of ultrasound in medicine and biology, and the interactions between ultrasound and biological systems, including bioeffects. Papers that simply utilize standard diagnostic ultrasound as a measuring tool will be considered out of scope. Extended critical reviews of subjects of contemporary interest in the field are also published, in addition to occasional editorial articles, clinical and technical notes, book reviews, letters to the editor and a calendar of forthcoming meetings. It is the aim of the journal fully to meet the information and publication requirements of the clinicians, scientists, engineers and other professionals who constitute the biomedical ultrasonic community.