{"title":"结合病理和临床数据预测乳腺癌腋窝淋巴结转移的机器学习模型的发展:一项双中心研究","authors":"Long Wang, Fanli Qu, Ping Wen, Yu Luo, Huan Zhang, Shanqi Li, Xuedong Yin, Yulan Zhao, Xiaohua Zeng","doi":"10.1002/cnr2.70302","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Accurately assessing the status of axillary lymph nodes (ALNs) is essential for devising optimal surgical plans and making informed treatment decisions in breast cancer (BC) patients.</p>\n </section>\n \n <section>\n \n <h3> Aims</h3>\n \n <p>This study aims to develop an innovative nomogram based on pathomics to preoperatively predict ALN metastasis (ALNM) in BC.</p>\n </section>\n \n <section>\n \n <h3> Methods and Results</h3>\n \n <p>Our study performed a retrospective analysis on digital hematoxylin and eosin (H&E)-stained images obtained from 407 patients across two institutions who were allocated into a training cohort (TC; <i>n</i> = 203), an internal validation cohort (IVC; <i>n</i> = 136), and an external validation cohort (EVC; <i>n</i> = 68). Initially, the Mann–Whitney <i>U</i>-test and Spearman's rank correlation coefficient were utilized for feature selection, employing the least absolute shrinkage and selection operator (LASSO) regression for further refinement. For the evaluation of the predictive value of ALNM and other clinicopathological factors, we deployed both univariate (ULR) and multivariate (MLR) logistic regression analyses. Among the six machine learning (ML) algorithms, logistic regression, which demonstrated the highest area under the curve (AUC) value, was employed to establish the final nomogram model. The nomogram reliability and stability were assessed by analyzing the AUC of the receiver operating characteristic (ROC) curve, decision curve analysis (DCA), and calibration plots. MLR analysis demonstrated estrogen receptor (ER), human epidermal growth factor receptor 2 (HER2), tumor size, and pathomics features as independent ALNM predictors. The nomogram demonstrated that the AUC in the IVC (0.783) surpassed that of the Path-score model (0.698) (DeLong test, <i>p</i> = 0.008558). Similarly, in the EVC, the nomogram surpassed the clinical model regarding AUC (0.738 vs. 0.574; DeLong test, <i>p</i> = 0.00494). Additionally, DCA analysis indicated a net clinical benefit associated with the nomogram.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>Our study demonstrates the effectiveness of pathomics features in predicting ALNM in BC patients. Furthermore, the pathomics-based nomogram offers a valuable tool for personalized treatment planning in this patient population.</p>\n </section>\n </div>","PeriodicalId":9440,"journal":{"name":"Cancer reports","volume":"8 9","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cnr2.70302","citationCount":"0","resultStr":"{\"title\":\"Development of a Machine Learning Model Integrating Pathomics and Clinical Data to Predict Axillary Lymph Node Metastasis in Breast Cancer: A Two-Center Study\",\"authors\":\"Long Wang, Fanli Qu, Ping Wen, Yu Luo, Huan Zhang, Shanqi Li, Xuedong Yin, Yulan Zhao, Xiaohua Zeng\",\"doi\":\"10.1002/cnr2.70302\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background</h3>\\n \\n <p>Accurately assessing the status of axillary lymph nodes (ALNs) is essential for devising optimal surgical plans and making informed treatment decisions in breast cancer (BC) patients.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Aims</h3>\\n \\n <p>This study aims to develop an innovative nomogram based on pathomics to preoperatively predict ALN metastasis (ALNM) in BC.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods and Results</h3>\\n \\n <p>Our study performed a retrospective analysis on digital hematoxylin and eosin (H&E)-stained images obtained from 407 patients across two institutions who were allocated into a training cohort (TC; <i>n</i> = 203), an internal validation cohort (IVC; <i>n</i> = 136), and an external validation cohort (EVC; <i>n</i> = 68). Initially, the Mann–Whitney <i>U</i>-test and Spearman's rank correlation coefficient were utilized for feature selection, employing the least absolute shrinkage and selection operator (LASSO) regression for further refinement. For the evaluation of the predictive value of ALNM and other clinicopathological factors, we deployed both univariate (ULR) and multivariate (MLR) logistic regression analyses. Among the six machine learning (ML) algorithms, logistic regression, which demonstrated the highest area under the curve (AUC) value, was employed to establish the final nomogram model. The nomogram reliability and stability were assessed by analyzing the AUC of the receiver operating characteristic (ROC) curve, decision curve analysis (DCA), and calibration plots. MLR analysis demonstrated estrogen receptor (ER), human epidermal growth factor receptor 2 (HER2), tumor size, and pathomics features as independent ALNM predictors. The nomogram demonstrated that the AUC in the IVC (0.783) surpassed that of the Path-score model (0.698) (DeLong test, <i>p</i> = 0.008558). Similarly, in the EVC, the nomogram surpassed the clinical model regarding AUC (0.738 vs. 0.574; DeLong test, <i>p</i> = 0.00494). Additionally, DCA analysis indicated a net clinical benefit associated with the nomogram.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusion</h3>\\n \\n <p>Our study demonstrates the effectiveness of pathomics features in predicting ALNM in BC patients. Furthermore, the pathomics-based nomogram offers a valuable tool for personalized treatment planning in this patient population.</p>\\n </section>\\n </div>\",\"PeriodicalId\":9440,\"journal\":{\"name\":\"Cancer reports\",\"volume\":\"8 9\",\"pages\":\"\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2025-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cnr2.70302\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cancer reports\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cnr2.70302\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer reports","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cnr2.70302","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
背景准确评估腋窝淋巴结(aln)的状态对于制定最佳的手术计划和做出明智的治疗决定对于乳腺癌(BC)患者至关重要。目的本研究旨在建立一种创新的基于病理的nomographic,用于预测BC的ALN转移(ALNM)。方法和结果本研究对来自两个机构的407名患者的数字苏木精和伊红(H&;E)染色图像进行了回顾性分析,这些患者被分配到培训队列(TC, n = 203),内部验证队列(IVC, n = 136)和外部验证队列(EVC, n = 68)。最初,使用Mann-Whitney u检验和Spearman等级相关系数进行特征选择,采用最小绝对收缩和选择算子(LASSO)回归进行进一步细化。为了评估ALNM和其他临床病理因素的预测价值,我们采用了单变量(ULR)和多变量(MLR)逻辑回归分析。在六种机器学习(ML)算法中,采用曲线下面积(AUC)值最高的逻辑回归来建立最终的nomogram模型。通过分析受试者工作特征曲线(ROC)、决策曲线分析(DCA)和标定图的AUC来评估nomogram信度和稳定性。MLR分析显示雌激素受体(ER)、人表皮生长因子受体2 (HER2)、肿瘤大小和病理特征是ALNM的独立预测因子。nomogram显示,IVC模型的AUC(0.783)超过了Path-score模型的AUC (0.698) (DeLong检验,p = 0.008558)。同样,在EVC中,nomogram AUC优于临床模型(0.738 vs. 0.574; DeLong检验,p = 0.00494)。此外,DCA分析表明与nomogram相关的净临床获益。结论我们的研究证实了病理特征在预测BC患者ALNM中的有效性。此外,基于病理的nomographic为这一患者群体的个性化治疗计划提供了一个有价值的工具。
Development of a Machine Learning Model Integrating Pathomics and Clinical Data to Predict Axillary Lymph Node Metastasis in Breast Cancer: A Two-Center Study
Background
Accurately assessing the status of axillary lymph nodes (ALNs) is essential for devising optimal surgical plans and making informed treatment decisions in breast cancer (BC) patients.
Aims
This study aims to develop an innovative nomogram based on pathomics to preoperatively predict ALN metastasis (ALNM) in BC.
Methods and Results
Our study performed a retrospective analysis on digital hematoxylin and eosin (H&E)-stained images obtained from 407 patients across two institutions who were allocated into a training cohort (TC; n = 203), an internal validation cohort (IVC; n = 136), and an external validation cohort (EVC; n = 68). Initially, the Mann–Whitney U-test and Spearman's rank correlation coefficient were utilized for feature selection, employing the least absolute shrinkage and selection operator (LASSO) regression for further refinement. For the evaluation of the predictive value of ALNM and other clinicopathological factors, we deployed both univariate (ULR) and multivariate (MLR) logistic regression analyses. Among the six machine learning (ML) algorithms, logistic regression, which demonstrated the highest area under the curve (AUC) value, was employed to establish the final nomogram model. The nomogram reliability and stability were assessed by analyzing the AUC of the receiver operating characteristic (ROC) curve, decision curve analysis (DCA), and calibration plots. MLR analysis demonstrated estrogen receptor (ER), human epidermal growth factor receptor 2 (HER2), tumor size, and pathomics features as independent ALNM predictors. The nomogram demonstrated that the AUC in the IVC (0.783) surpassed that of the Path-score model (0.698) (DeLong test, p = 0.008558). Similarly, in the EVC, the nomogram surpassed the clinical model regarding AUC (0.738 vs. 0.574; DeLong test, p = 0.00494). Additionally, DCA analysis indicated a net clinical benefit associated with the nomogram.
Conclusion
Our study demonstrates the effectiveness of pathomics features in predicting ALNM in BC patients. Furthermore, the pathomics-based nomogram offers a valuable tool for personalized treatment planning in this patient population.