Furui Zhai, Shanshan Mu, Yinghui Song, Min Zhang, Cui Zhang, Ze Lv
{"title":"预测绝经前妇女残留和复发高级别宫颈上皮内瘤变的随机生存森林模型","authors":"Furui Zhai, Shanshan Mu, Yinghui Song, Min Zhang, Cui Zhang, Ze Lv","doi":"10.2147/IJWH.S485515","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Loop electrosurgical excision procedure (LEEP) for high-grade cervical intraepithelial neoplasia (CIN) carries significant risks of recurrence and persistence. This study compares the efficacy of a random survival forest (RSF) model with that of a conventional Cox regression model for predicting residual and recurrent high-grade CIN in premenopausal women after LEEP.</p><p><strong>Methods: </strong>Data from 458 premenopausal women treated for CIN2/3 at our hospital between 2016 and 2020 were analyzed. The RSF model incorporated demographic, pathological, and treatment-related variables. Feature selection utilizing LASSO and three other algorithms was performed to enhance the RSF model, which was further compared to a Cox regression model. Model performance was assessed using area under the curve (AUC), out-of-bag (OOB) error rates, and SHAP values to interpret predictor importance.</p><p><strong>Results: </strong>The RSF model showed superior performance compared to the Cox regression model, with AUC values of 0.767-0.901 and peak predictive performance at 36 months post-LEEP. In contrast, the highest AUC achieved by Cox regression was 0.880. The RSF model also exhibited relatively lower OOB error rates, indicating better generalizability. Moreover, SHAP value analysis identified margin status and CIN severity as the most prominent predictors that directly affected risk predictions. Lastly, an online tool providing real-time predictions in clinical settings was successfully implemented using the RSF model.</p><p><strong>Conclusion: </strong>The RSF model outperformed the traditional Cox regression model in predicting residual and recurrent high-grade CIN risks post-LEEP. This model may be a more accurate clinical tool that facilitates improved personalized care and early interventions in gynecological oncology.</p>","PeriodicalId":14356,"journal":{"name":"International Journal of Women's Health","volume":"16 ","pages":"1775-1787"},"PeriodicalIF":2.5000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11531712/pdf/","citationCount":"0","resultStr":"{\"title\":\"A Random Survival Forest Model for Predicting Residual and Recurrent High-Grade Cervical Intraepithelial Neoplasia in Premenopausal Women.\",\"authors\":\"Furui Zhai, Shanshan Mu, Yinghui Song, Min Zhang, Cui Zhang, Ze Lv\",\"doi\":\"10.2147/IJWH.S485515\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Loop electrosurgical excision procedure (LEEP) for high-grade cervical intraepithelial neoplasia (CIN) carries significant risks of recurrence and persistence. This study compares the efficacy of a random survival forest (RSF) model with that of a conventional Cox regression model for predicting residual and recurrent high-grade CIN in premenopausal women after LEEP.</p><p><strong>Methods: </strong>Data from 458 premenopausal women treated for CIN2/3 at our hospital between 2016 and 2020 were analyzed. The RSF model incorporated demographic, pathological, and treatment-related variables. Feature selection utilizing LASSO and three other algorithms was performed to enhance the RSF model, which was further compared to a Cox regression model. Model performance was assessed using area under the curve (AUC), out-of-bag (OOB) error rates, and SHAP values to interpret predictor importance.</p><p><strong>Results: </strong>The RSF model showed superior performance compared to the Cox regression model, with AUC values of 0.767-0.901 and peak predictive performance at 36 months post-LEEP. In contrast, the highest AUC achieved by Cox regression was 0.880. The RSF model also exhibited relatively lower OOB error rates, indicating better generalizability. Moreover, SHAP value analysis identified margin status and CIN severity as the most prominent predictors that directly affected risk predictions. Lastly, an online tool providing real-time predictions in clinical settings was successfully implemented using the RSF model.</p><p><strong>Conclusion: </strong>The RSF model outperformed the traditional Cox regression model in predicting residual and recurrent high-grade CIN risks post-LEEP. This model may be a more accurate clinical tool that facilitates improved personalized care and early interventions in gynecological oncology.</p>\",\"PeriodicalId\":14356,\"journal\":{\"name\":\"International Journal of Women's Health\",\"volume\":\"16 \",\"pages\":\"1775-1787\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11531712/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Women's Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2147/IJWH.S485515\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"OBSTETRICS & GYNECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Women's Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/IJWH.S485515","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
A Random Survival Forest Model for Predicting Residual and Recurrent High-Grade Cervical Intraepithelial Neoplasia in Premenopausal Women.
Purpose: Loop electrosurgical excision procedure (LEEP) for high-grade cervical intraepithelial neoplasia (CIN) carries significant risks of recurrence and persistence. This study compares the efficacy of a random survival forest (RSF) model with that of a conventional Cox regression model for predicting residual and recurrent high-grade CIN in premenopausal women after LEEP.
Methods: Data from 458 premenopausal women treated for CIN2/3 at our hospital between 2016 and 2020 were analyzed. The RSF model incorporated demographic, pathological, and treatment-related variables. Feature selection utilizing LASSO and three other algorithms was performed to enhance the RSF model, which was further compared to a Cox regression model. Model performance was assessed using area under the curve (AUC), out-of-bag (OOB) error rates, and SHAP values to interpret predictor importance.
Results: The RSF model showed superior performance compared to the Cox regression model, with AUC values of 0.767-0.901 and peak predictive performance at 36 months post-LEEP. In contrast, the highest AUC achieved by Cox regression was 0.880. The RSF model also exhibited relatively lower OOB error rates, indicating better generalizability. Moreover, SHAP value analysis identified margin status and CIN severity as the most prominent predictors that directly affected risk predictions. Lastly, an online tool providing real-time predictions in clinical settings was successfully implemented using the RSF model.
Conclusion: The RSF model outperformed the traditional Cox regression model in predicting residual and recurrent high-grade CIN risks post-LEEP. This model may be a more accurate clinical tool that facilitates improved personalized care and early interventions in gynecological oncology.
期刊介绍:
International Journal of Women''s Health is an international, peer-reviewed, open access, online journal. Publishing original research, reports, editorials, reviews and commentaries on all aspects of women''s healthcare including gynecology, obstetrics, and breast cancer. Subject areas include: Chronic conditions including cancers of various organs specific and not specific to women Migraine, headaches, arthritis, osteoporosis Endocrine and autoimmune syndromes - asthma, multiple sclerosis, lupus, diabetes Sexual and reproductive health including fertility patterns and emerging technologies to address infertility Infectious disease with chronic sequelae including HIV/AIDS, HPV, PID, and other STDs Psychological and psychosocial conditions - depression across the life span, substance abuse, domestic violence Health maintenance among aging females - factors affecting the quality of life including physical, social and mental issues Avenues for health promotion and disease prevention across the life span Male vs female incidence comparisons for conditions that affect both genders.