A Random Survival Forest Model for Predicting Residual and Recurrent High-Grade Cervical Intraepithelial Neoplasia in Premenopausal Women.

IF 2.5 4区 医学 Q2 OBSTETRICS & GYNECOLOGY
International Journal of Women's Health Pub Date : 2024-10-30 eCollection Date: 2024-01-01 DOI:10.2147/IJWH.S485515
Furui Zhai, Shanshan Mu, Yinghui Song, Min Zhang, Cui Zhang, Ze Lv
{"title":"A Random Survival Forest Model for Predicting Residual and Recurrent High-Grade Cervical Intraepithelial Neoplasia in Premenopausal Women.","authors":"Furui Zhai, Shanshan Mu, Yinghui Song, Min Zhang, Cui Zhang, Ze Lv","doi":"10.2147/IJWH.S485515","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Loop electrosurgical excision procedure (LEEP) for high-grade cervical intraepithelial neoplasia (CIN) carries significant risks of recurrence and persistence. This study compares the efficacy of a random survival forest (RSF) model with that of a conventional Cox regression model for predicting residual and recurrent high-grade CIN in premenopausal women after LEEP.</p><p><strong>Methods: </strong>Data from 458 premenopausal women treated for CIN2/3 at our hospital between 2016 and 2020 were analyzed. The RSF model incorporated demographic, pathological, and treatment-related variables. Feature selection utilizing LASSO and three other algorithms was performed to enhance the RSF model, which was further compared to a Cox regression model. Model performance was assessed using area under the curve (AUC), out-of-bag (OOB) error rates, and SHAP values to interpret predictor importance.</p><p><strong>Results: </strong>The RSF model showed superior performance compared to the Cox regression model, with AUC values of 0.767-0.901 and peak predictive performance at 36 months post-LEEP. In contrast, the highest AUC achieved by Cox regression was 0.880. The RSF model also exhibited relatively lower OOB error rates, indicating better generalizability. Moreover, SHAP value analysis identified margin status and CIN severity as the most prominent predictors that directly affected risk predictions. Lastly, an online tool providing real-time predictions in clinical settings was successfully implemented using the RSF model.</p><p><strong>Conclusion: </strong>The RSF model outperformed the traditional Cox regression model in predicting residual and recurrent high-grade CIN risks post-LEEP. This model may be a more accurate clinical tool that facilitates improved personalized care and early interventions in gynecological oncology.</p>","PeriodicalId":14356,"journal":{"name":"International Journal of Women's Health","volume":"16 ","pages":"1775-1787"},"PeriodicalIF":2.5000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11531712/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Women's Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/IJWH.S485515","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: Loop electrosurgical excision procedure (LEEP) for high-grade cervical intraepithelial neoplasia (CIN) carries significant risks of recurrence and persistence. This study compares the efficacy of a random survival forest (RSF) model with that of a conventional Cox regression model for predicting residual and recurrent high-grade CIN in premenopausal women after LEEP.

Methods: Data from 458 premenopausal women treated for CIN2/3 at our hospital between 2016 and 2020 were analyzed. The RSF model incorporated demographic, pathological, and treatment-related variables. Feature selection utilizing LASSO and three other algorithms was performed to enhance the RSF model, which was further compared to a Cox regression model. Model performance was assessed using area under the curve (AUC), out-of-bag (OOB) error rates, and SHAP values to interpret predictor importance.

Results: The RSF model showed superior performance compared to the Cox regression model, with AUC values of 0.767-0.901 and peak predictive performance at 36 months post-LEEP. In contrast, the highest AUC achieved by Cox regression was 0.880. The RSF model also exhibited relatively lower OOB error rates, indicating better generalizability. Moreover, SHAP value analysis identified margin status and CIN severity as the most prominent predictors that directly affected risk predictions. Lastly, an online tool providing real-time predictions in clinical settings was successfully implemented using the RSF model.

Conclusion: The RSF model outperformed the traditional Cox regression model in predicting residual and recurrent high-grade CIN risks post-LEEP. This model may be a more accurate clinical tool that facilitates improved personalized care and early interventions in gynecological oncology.

预测绝经前妇女残留和复发高级别宫颈上皮内瘤变的随机生存森林模型
目的:环状电切术(LEEP)治疗高级别宫颈上皮内瘤变(CIN)有很大的复发和持续存在的风险。本研究比较了随机生存森林(RSF)模型与传统 Cox 回归模型在预测 LEEP 术后绝经前妇女高级别 CIN 的残留和复发方面的效果:分析了2016年至2020年间在我院接受CIN2/3治疗的458名绝经前妇女的数据。RSF模型纳入了人口统计学、病理学和治疗相关变量。利用 LASSO 和其他三种算法进行特征选择,以增强 RSF 模型,并进一步与 Cox 回归模型进行比较。使用曲线下面积(AUC)、袋外错误率(OOB)和 SHAP 值评估模型性能,以解释预测因子的重要性:结果:与 Cox 回归模型相比,RSF 模型显示出更优越的性能,其 AUC 值为 0.767-0.901,在LEEP 后 36 个月达到峰值预测性能。相比之下,Cox 回归模型的最高 AUC 值为 0.880。RSF 模型的 OOB 误差率也相对较低,表明其具有更好的普适性。此外,SHAP 值分析发现,边缘状态和 CIN 严重程度是直接影响风险预测的最主要预测因素。最后,使用 RSF 模型成功实现了在临床环境中提供实时预测的在线工具:RSF模型在预测LEEP术后残留和复发高级别CIN风险方面优于传统的Cox回归模型。该模型可能是一种更准确的临床工具,有助于改善妇科肿瘤的个性化治疗和早期干预。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Women's Health
International Journal of Women's Health OBSTETRICS & GYNECOLOGY-
CiteScore
3.70
自引率
0.00%
发文量
194
审稿时长
16 weeks
期刊介绍: International Journal of Women''s Health is an international, peer-reviewed, open access, online journal. Publishing original research, reports, editorials, reviews and commentaries on all aspects of women''s healthcare including gynecology, obstetrics, and breast cancer. Subject areas include: Chronic conditions including cancers of various organs specific and not specific to women Migraine, headaches, arthritis, osteoporosis Endocrine and autoimmune syndromes - asthma, multiple sclerosis, lupus, diabetes Sexual and reproductive health including fertility patterns and emerging technologies to address infertility Infectious disease with chronic sequelae including HIV/AIDS, HPV, PID, and other STDs Psychological and psychosocial conditions - depression across the life span, substance abuse, domestic violence Health maintenance among aging females - factors affecting the quality of life including physical, social and mental issues Avenues for health promotion and disease prevention across the life span Male vs female incidence comparisons for conditions that affect both genders.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信