{"title":"Development of a prognostic tool using machine learning to identify high-risk mucoepidermoid carcinoma patients across diverse anatomical sites","authors":"Yun Lei, Wan-Shan Li","doi":"10.1016/j.jormas.2025.102490","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Mucoepidermoid carcinoma (MEC), typically arising in the salivary glands, has been documented in various anatomical locations. This study aimed to develop machine learning models for efficient prognosis prediction in patients with MEC across different anatomical sites and to create a novel tool for identifying high-risk patients in clinical settings.</div></div><div><h3>Methods</h3><div>A retrospective cohort study involving 6280 patients with MEC from diverse anatomical sites was conducted. COX regression analysis identified prognostic variables. Five machine learning models—Support Vector Machine (SVM), Random Forest (RF), Generalized Additive Model (GAM), Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM)—were constructed. Model performance was assessed using accuracy, recall, precision, and the area under the curve (AUC) of receiver operating characteristic (ROC) curves. However, the external validation was not performed. The best-performing model was developed into a prognostic tool.</div></div><div><h3>Results</h3><div>The GAM outperformed the other models, and a web-based tool was developed for identifying patients with varying prognostic risks across MEC cases from different anatomical locations, supporting clinical decision-making, personalizing therapeutic strategies based on clinical characteristics, ensuring tailored treatment schedules and optimizing resource allocation. Extension, age, pathological grade, clinical stage, and N-stage were the most significant variables in the final model.</div></div><div><h3>Conclusion</h3><div>This study established an online tool utilizing GAM to aid in identifying high-risk patients with MEC and enhancing prognostic accuracy in clinical practice. The anatomical site did not significantly influence survival outcomes, offering valuable insights into the shared pathogenesis of MEC.</div></div>","PeriodicalId":55993,"journal":{"name":"Journal of Stomatology Oral and Maxillofacial Surgery","volume":"126 5","pages":"Article 102490"},"PeriodicalIF":2.0000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Stomatology Oral and Maxillofacial Surgery","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468785525002769","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
Mucoepidermoid carcinoma (MEC), typically arising in the salivary glands, has been documented in various anatomical locations. This study aimed to develop machine learning models for efficient prognosis prediction in patients with MEC across different anatomical sites and to create a novel tool for identifying high-risk patients in clinical settings.
Methods
A retrospective cohort study involving 6280 patients with MEC from diverse anatomical sites was conducted. COX regression analysis identified prognostic variables. Five machine learning models—Support Vector Machine (SVM), Random Forest (RF), Generalized Additive Model (GAM), Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM)—were constructed. Model performance was assessed using accuracy, recall, precision, and the area under the curve (AUC) of receiver operating characteristic (ROC) curves. However, the external validation was not performed. The best-performing model was developed into a prognostic tool.
Results
The GAM outperformed the other models, and a web-based tool was developed for identifying patients with varying prognostic risks across MEC cases from different anatomical locations, supporting clinical decision-making, personalizing therapeutic strategies based on clinical characteristics, ensuring tailored treatment schedules and optimizing resource allocation. Extension, age, pathological grade, clinical stage, and N-stage were the most significant variables in the final model.
Conclusion
This study established an online tool utilizing GAM to aid in identifying high-risk patients with MEC and enhancing prognostic accuracy in clinical practice. The anatomical site did not significantly influence survival outcomes, offering valuable insights into the shared pathogenesis of MEC.