Yunyang Deng, Joakim Dillner, Nicholas Baltzer, Laila Sara Arroyo Mühr, Roxana Merino Martinez, Alexander Ploner, Jiayao Lei, Mark Clements
{"title":"Prediction models for high-grade cervical lesions or worse using machine learning.","authors":"Yunyang Deng, Joakim Dillner, Nicholas Baltzer, Laila Sara Arroyo Mühr, Roxana Merino Martinez, Alexander Ploner, Jiayao Lei, Mark Clements","doi":"10.1016/j.eclinm.2026.103819","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>This study aimed to improve cervical screening efficiency by developing and validating machine-learning models for predicting high-grade cervical lesions or worse (HCL) risk.</p><p><strong>Methods: </strong>From Swedish nationwide registers, we included 474,072 women invited to cervical screening in 2016 (split into 80% training and 20% test sets) and 370,105 women invited in 2017 for validation. All women underwent index cytology and/or human papillomavirus (HPV) testing within the recommended interval after age 29. Predictors included screening results (cytology and/or HPV testing), other HPV-related factors, and demographic factors (including age). Four random forest models were trained via 5-fold cross-validation with different predictors: Model 1 (M1) (all predictors), M2 (cytology, HPV testing, age), M3 (HPV testing, other HPV-related factors, and demographic factors), and M4 (HPV testing and age). We computed area under the curves (AUCs) and created plots to depict positive predictive value (PPV) by the number of women intervened.</p><p><strong>Findings: </strong>In training and test sets, 1-, 3-, and 5-year HCL incidence proportions were 0.25%, 0.68%, and 1.05%, respectively. Cross-validated AUCs were 0.83-0.96 (M1), 0.83-0.96 (M2), 0.91-0.94 (M3), and 0.91-0.93 (M4), depending on the prediction intervals. Similar AUCs were found in the test set. Additionally, the AUCs in the validation set were 0.85-0.95 (M1), 0.85-0.95 (M2), 0.91-0.94 (M3), and 0.92-0.93 (M4). Across all intervals, M1 consistently demonstrated the highest PPV, followed by M2, M3, and M4. For each model, PPVs were lowest for 1-year predictions but comparable at 3 and 5 years.</p><p><strong>Interpretation: </strong>The models demonstrated strong predictive performance. Evaluating PPVs over the number of invited women provides the potential for risk-stratified screening and clinical utility.</p><p><strong>Funding: </strong>Vetenskapsrådet, FORTE, Karolinska Institutet, Horizon 2020, and Cancerfonden.</p>","PeriodicalId":11393,"journal":{"name":"EClinicalMedicine","volume":"93 ","pages":"103819"},"PeriodicalIF":10.0000,"publicationDate":"2026-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12972733/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EClinicalMedicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.eclinm.2026.103819","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/3/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: This study aimed to improve cervical screening efficiency by developing and validating machine-learning models for predicting high-grade cervical lesions or worse (HCL) risk.
Methods: From Swedish nationwide registers, we included 474,072 women invited to cervical screening in 2016 (split into 80% training and 20% test sets) and 370,105 women invited in 2017 for validation. All women underwent index cytology and/or human papillomavirus (HPV) testing within the recommended interval after age 29. Predictors included screening results (cytology and/or HPV testing), other HPV-related factors, and demographic factors (including age). Four random forest models were trained via 5-fold cross-validation with different predictors: Model 1 (M1) (all predictors), M2 (cytology, HPV testing, age), M3 (HPV testing, other HPV-related factors, and demographic factors), and M4 (HPV testing and age). We computed area under the curves (AUCs) and created plots to depict positive predictive value (PPV) by the number of women intervened.
Findings: In training and test sets, 1-, 3-, and 5-year HCL incidence proportions were 0.25%, 0.68%, and 1.05%, respectively. Cross-validated AUCs were 0.83-0.96 (M1), 0.83-0.96 (M2), 0.91-0.94 (M3), and 0.91-0.93 (M4), depending on the prediction intervals. Similar AUCs were found in the test set. Additionally, the AUCs in the validation set were 0.85-0.95 (M1), 0.85-0.95 (M2), 0.91-0.94 (M3), and 0.92-0.93 (M4). Across all intervals, M1 consistently demonstrated the highest PPV, followed by M2, M3, and M4. For each model, PPVs were lowest for 1-year predictions but comparable at 3 and 5 years.
Interpretation: The models demonstrated strong predictive performance. Evaluating PPVs over the number of invited women provides the potential for risk-stratified screening and clinical utility.
Funding: Vetenskapsrådet, FORTE, Karolinska Institutet, Horizon 2020, and Cancerfonden.
期刊介绍:
eClinicalMedicine is a gold open-access clinical journal designed to support frontline health professionals in addressing the complex and rapid health transitions affecting societies globally. The journal aims to assist practitioners in overcoming healthcare challenges across diverse communities, spanning diagnosis, treatment, prevention, and health promotion. Integrating disciplines from various specialties and life stages, it seeks to enhance health systems as fundamental institutions within societies. With a forward-thinking approach, eClinicalMedicine aims to redefine the future of healthcare.