{"title":"Prediction of lymph node metastasis in cervical cancer patients using AdaBoost machine learning model: analysis of risk factors.","authors":"Nan Li, Erxuan Peng, Fenghua Liu","doi":"10.62347/UMKG8609","DOIUrl":null,"url":null,"abstract":"<p><p>This study focuses on the development and evaluation of machine learning models, particularly the Adaptive Boosting (AdaBoost) algorithm, for predicting lymph node metastasis (LNM) in cervical cancer (CC) patients. The findings show that AdaBoost outperformed traditional statistical methods and other machine learning models, including Random Forest, Support Vector Machine (SVM), and Least Absolute Shrinkage and Selection Operator (LASSO) regression, in predicting LNM. The areas under the curve (AUCs) for the training and validation sets were 0.882 and 0.857, respectively, indicating high prediction efficiency. Multivariate logistic regression identified key independent risk factors for LNM, including FIGO staging, squamous cell carcinoma antigen (SCC-Ag), white blood cell count (WBC), neutrophil count (NEUT), hemoglobin (HGB) level, and prealbumin (PAB) level. These factors are significant in predicting LNM and emphasize their importance in clinical decision-making. AdaBoost's ability to predict LNM preoperatively, without invasive procedures such as lymph node dissection, can reduce treatment risks and improve patient outcomes. While other models, such as XGBoost, showed a marginally higher AUC in training, AdaBoost's performance in validation was comparable (P=0.18). Inflammatory and nutritional markers, such as WBC, NEUT, HGB, and PAB, were significant predictors and provide valuable insights into tumor progression. Despite the study's retrospective nature, the integration of larger, multi-center datasets, and multi-modal imaging could further enhance the model's accuracy and generalizability. This high-performance AdaBoost model offers clinical potential for refining personalized treatment strategies for CC patients.</p>","PeriodicalId":7437,"journal":{"name":"American journal of cancer research","volume":"15 3","pages":"1158-1173"},"PeriodicalIF":3.6000,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11982712/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.62347/UMKG8609","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
This study focuses on the development and evaluation of machine learning models, particularly the Adaptive Boosting (AdaBoost) algorithm, for predicting lymph node metastasis (LNM) in cervical cancer (CC) patients. The findings show that AdaBoost outperformed traditional statistical methods and other machine learning models, including Random Forest, Support Vector Machine (SVM), and Least Absolute Shrinkage and Selection Operator (LASSO) regression, in predicting LNM. The areas under the curve (AUCs) for the training and validation sets were 0.882 and 0.857, respectively, indicating high prediction efficiency. Multivariate logistic regression identified key independent risk factors for LNM, including FIGO staging, squamous cell carcinoma antigen (SCC-Ag), white blood cell count (WBC), neutrophil count (NEUT), hemoglobin (HGB) level, and prealbumin (PAB) level. These factors are significant in predicting LNM and emphasize their importance in clinical decision-making. AdaBoost's ability to predict LNM preoperatively, without invasive procedures such as lymph node dissection, can reduce treatment risks and improve patient outcomes. While other models, such as XGBoost, showed a marginally higher AUC in training, AdaBoost's performance in validation was comparable (P=0.18). Inflammatory and nutritional markers, such as WBC, NEUT, HGB, and PAB, were significant predictors and provide valuable insights into tumor progression. Despite the study's retrospective nature, the integration of larger, multi-center datasets, and multi-modal imaging could further enhance the model's accuracy and generalizability. This high-performance AdaBoost model offers clinical potential for refining personalized treatment strategies for CC patients.
期刊介绍:
The American Journal of Cancer Research (AJCR) (ISSN 2156-6976), is an independent open access, online only journal to facilitate rapid dissemination of novel discoveries in basic science and treatment of cancer. It was founded by a group of scientists for cancer research and clinical academic oncologists from around the world, who are devoted to the promotion and advancement of our understanding of the cancer and its treatment. The scope of AJCR is intended to encompass that of multi-disciplinary researchers from any scientific discipline where the primary focus of the research is to increase and integrate knowledge about etiology and molecular mechanisms of carcinogenesis with the ultimate aim of advancing the cure and prevention of this increasingly devastating disease. To achieve these aims AJCR will publish review articles, original articles and new techniques in cancer research and therapy. It will also publish hypothesis, case reports and letter to the editor. Unlike most other open access online journals, AJCR will keep most of the traditional features of paper print that we are all familiar with, such as continuous volume, issue numbers, as well as continuous page numbers to retain our comfortable familiarity towards an academic journal.