Machine Learning for Predicting Malignant Transformation in Actinic Cheilitis: A Prognostic Support System Based on Demographic and Clinical Descriptors.

IF 2.3 3区 医学 Q1 DENTISTRY, ORAL SURGERY & MEDICINE
Journal of Oral Pathology & Medicine Pub Date : 2026-05-01 Epub Date: 2026-01-19 DOI:10.1111/jop.70113
Ivan José Correia-Neto, Alex Franco da Costa, Anna Luíza Damaceno Araújo, Cristina Saldivia-Siracusa, Raísa Sales de Sá, Thiago Martini Pereira, Pablo Agustin Vargas, Alan Roger Santos-Silva, Luiz Paulo Kowalski, Matheus Cardoso Moraes, Marcio Ajudarte Lopes
{"title":"Machine Learning for Predicting Malignant Transformation in Actinic Cheilitis: A Prognostic Support System Based on Demographic and Clinical Descriptors.","authors":"Ivan José Correia-Neto, Alex Franco da Costa, Anna Luíza Damaceno Araújo, Cristina Saldivia-Siracusa, Raísa Sales de Sá, Thiago Martini Pereira, Pablo Agustin Vargas, Alan Roger Santos-Silva, Luiz Paulo Kowalski, Matheus Cardoso Moraes, Marcio Ajudarte Lopes","doi":"10.1111/jop.70113","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study aimed to develop and evaluate Machine Learning models to predict the malignant transformation (MT) in patients with actinic cheilitis (AC).</p><p><strong>Methods: </strong>Three hundred forty patients diagnosed with AC (322 in the no MT group, and 18 in the MT group) were carefully documented. The study used the Adaptive Synthetic Sampling to adaptively balance the dataset (322 in the no MT group and 319 in the MT group). Four supervised Machine Learning classifiers (Random Forest, Xtreme Gradient Boosting, Multilayer Perceptron, and Support Vector Machine) were trained and tested using 5-fold cross-validation to correlate inputs (clinical descriptors and demographic data) to outputs (MT). SHAP values were used to identify the most influential predictors of MT.</p><p><strong>Results: </strong>The Xtreme Gradient Boosting model stood out, achieving 96.72% accuracy, 96.87% sensitivity, 96.57% specificity, 96.61% precision, 96.73% of F1-Score, and 0.9498 AUC. Multilayer Perceptron showed the best sensitivity (98.44%), and Random Forest presented comparable results. In contrast, Support Vector Machine underperformed, with higher values of false negatives and false positives. Across models, ulceration, multifocality, and long-standing lesions were the strongest predictors of MT, while small, asymptomatic, or solitary lesions were associated with lower risk.</p><p><strong>Conclusion: </strong>The results revealed promising performance metrics for Xtreme Gradient Boosting and Multilayer Perceptron suggesting their potential value as tools in a support system for monitoring AC. Additionally, synthetic data proved constructive in training, enhancing the models' robustness and predictive capabilities.</p>","PeriodicalId":16588,"journal":{"name":"Journal of Oral Pathology & Medicine","volume":" ","pages":"574-582"},"PeriodicalIF":2.3000,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Oral Pathology & Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/jop.70113","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/19 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: This study aimed to develop and evaluate Machine Learning models to predict the malignant transformation (MT) in patients with actinic cheilitis (AC).

Methods: Three hundred forty patients diagnosed with AC (322 in the no MT group, and 18 in the MT group) were carefully documented. The study used the Adaptive Synthetic Sampling to adaptively balance the dataset (322 in the no MT group and 319 in the MT group). Four supervised Machine Learning classifiers (Random Forest, Xtreme Gradient Boosting, Multilayer Perceptron, and Support Vector Machine) were trained and tested using 5-fold cross-validation to correlate inputs (clinical descriptors and demographic data) to outputs (MT). SHAP values were used to identify the most influential predictors of MT.

Results: The Xtreme Gradient Boosting model stood out, achieving 96.72% accuracy, 96.87% sensitivity, 96.57% specificity, 96.61% precision, 96.73% of F1-Score, and 0.9498 AUC. Multilayer Perceptron showed the best sensitivity (98.44%), and Random Forest presented comparable results. In contrast, Support Vector Machine underperformed, with higher values of false negatives and false positives. Across models, ulceration, multifocality, and long-standing lesions were the strongest predictors of MT, while small, asymptomatic, or solitary lesions were associated with lower risk.

Conclusion: The results revealed promising performance metrics for Xtreme Gradient Boosting and Multilayer Perceptron suggesting their potential value as tools in a support system for monitoring AC. Additionally, synthetic data proved constructive in training, enhancing the models' robustness and predictive capabilities.

机器学习预测光化性唇炎的恶性转化:基于人口统计学和临床描述符的预后支持系统。
目的:本研究旨在建立和评估机器学习模型来预测光化性唇腭裂(AC)患者的恶性转化(MT)。方法:对340例确诊为AC的患者(322例为未行MT组,18例为MT组)进行详细记录。该研究使用自适应合成采样来自适应平衡数据集(无MT组322个,MT组319个)。四个监督机器学习分类器(随机森林,Xtreme梯度增强,多层感知器和支持向量机)使用5倍交叉验证进行训练和测试,以将输入(临床描述符和人口统计数据)与输出(MT)相关联。结果:Xtreme Gradient Boosting模型准确率为96.72%,灵敏度为96.87%,特异度为96.57%,精密度为96.61%,F1-Score为96.73%,AUC为0.9498。多层感知器的灵敏度最高(98.44%),与随机森林的结果相当。相比之下,支持向量机表现不佳,假阴性和假阳性的值更高。在所有模型中,溃疡、多灶性和长期病变是MT的最强预测因子,而小的、无症状的或孤立的病变与较低的风险相关。结论:结果揭示了Xtreme梯度增强和多层感知器的良好性能指标,表明它们作为支持系统监测AC的工具的潜在价值。此外,合成数据在训练中被证明具有建设性,增强了模型的鲁棒性和预测能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.90
自引率
6.10%
发文量
121
审稿时长
4-8 weeks
期刊介绍: The aim of the Journal of Oral Pathology & Medicine is to publish manuscripts of high scientific quality representing original clinical, diagnostic or experimental work in oral pathology and oral medicine. Papers advancing the science or practice of these disciplines will be welcomed, especially those which bring new knowledge and observations from the application of techniques within the spheres of light and electron microscopy, tissue and organ culture, immunology, histochemistry and immunocytochemistry, microbiology, genetics and biochemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书