Revisiting the Endoscopic Third Ventriculostomy Success Score using machine learning: can we do better?

IF 2.1 3区医学 Q3 CLINICAL NEUROLOGY

Journal of neurosurgery. Pediatrics Pub Date : 2024-12-06 Print Date: 2025-03-01 DOI:10.3171/2024.9.PEDS24146

Syed M Adil, Andreas Seas, Daniel P Sexton, Pranav I Warman, Benjamin D Wissel, Kennedy L Carpenter, Lacey Carter, Brad J Kolls, Anthony T Fuller, Shivanand P Lad, Timothy W Dunn, Herbert Fuchs, Matthew Vestal, Gerald A Grant

{"title":"Revisiting the Endoscopic Third Ventriculostomy Success Score using machine learning: can we do better?","authors":"Syed M Adil, Andreas Seas, Daniel P Sexton, Pranav I Warman, Benjamin D Wissel, Kennedy L Carpenter, Lacey Carter, Brad J Kolls, Anthony T Fuller, Shivanand P Lad, Timothy W Dunn, Herbert Fuchs, Matthew Vestal, Gerald A Grant","doi":"10.3171/2024.9.PEDS24146","DOIUrl":null,"url":null,"abstract":"Objective: The Endoscopic Third Ventriculostomy Success Score (ETVSS) is a useful decision-making heuristic when considering the probability of surgical success, defined traditionally as no repeat cerebrospinal fluid diversion surgery needed within 6 months. Nonetheless, the performance of the logistic regression (LR) model in the original 2009 study was modest, with an area under the receiver operating characteristic curve (AUROC) of 0.68. The authors sought to use a larger dataset to develop more accurate machine learning (ML) models to predict endoscopic third ventriculostomy (ETV) success and also to perform the largest validation of the ETVSS to date.Methods: The authors queried the MarketScan national database for the years 2005-2022 to identify patients < 18 years of age who underwent first-time ETV and subsequently had at least 6 months of continuous enrollment in the database. The authors collected data on predictors matching the original ETVSS: age, etiology of hydrocephalus, and history of any previous shunt placement. Next, they used 6 ML algorithms-LR, support vector classifier, random forest, k-nearest neighbors, Extreme Gradient Boosted Regression (XGBoost), and naive Bayes-to develop predictive models. Finally, the authors used nested cross-validation to assess the models' comparative performances on unseen data.Results: The authors identified 2047 patients who met inclusion criteria, and 1261 (61.6%) underwent successful ETV. The performances of most ML models were similar to that of the original ETVSS, which had an AUROC of 0.693 on the validation set and 0.661 (95% CI 0.600-0.722) on the test set. The authors' new LR model performed comparably with AUROCs of 0.693 on both the validation and test sets, with 95% CI 0.633-0.754 on the test set. Among the more complex ML algorithms, XGBoost performed best, with AUROCs of 0.683 and 0.672 (95% CI 0.609-0.734) on the validation and test sets, respectively.Conclusions: This is the largest external validation of the ETVSS, and it confirms modest performance. More sophisticated ML algorithms do not meaningfully improve predictive performance compared to ETVSS; this underscores the need for higher utility, novelty, and dimensionality of input data rather than changes in modeling strategies.","PeriodicalId":16549,"journal":{"name":"Journal of neurosurgery. Pediatrics","volume":" ","pages":"246-254"},"PeriodicalIF":2.1000,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of neurosurgery. Pediatrics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3171/2024.9.PEDS24146","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/1 0:00:00","PubModel":"Print","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: The Endoscopic Third Ventriculostomy Success Score (ETVSS) is a useful decision-making heuristic when considering the probability of surgical success, defined traditionally as no repeat cerebrospinal fluid diversion surgery needed within 6 months. Nonetheless, the performance of the logistic regression (LR) model in the original 2009 study was modest, with an area under the receiver operating characteristic curve (AUROC) of 0.68. The authors sought to use a larger dataset to develop more accurate machine learning (ML) models to predict endoscopic third ventriculostomy (ETV) success and also to perform the largest validation of the ETVSS to date.

Methods: The authors queried the MarketScan national database for the years 2005-2022 to identify patients < 18 years of age who underwent first-time ETV and subsequently had at least 6 months of continuous enrollment in the database. The authors collected data on predictors matching the original ETVSS: age, etiology of hydrocephalus, and history of any previous shunt placement. Next, they used 6 ML algorithms-LR, support vector classifier, random forest, k-nearest neighbors, Extreme Gradient Boosted Regression (XGBoost), and naive Bayes-to develop predictive models. Finally, the authors used nested cross-validation to assess the models' comparative performances on unseen data.

Results: The authors identified 2047 patients who met inclusion criteria, and 1261 (61.6%) underwent successful ETV. The performances of most ML models were similar to that of the original ETVSS, which had an AUROC of 0.693 on the validation set and 0.661 (95% CI 0.600-0.722) on the test set. The authors' new LR model performed comparably with AUROCs of 0.693 on both the validation and test sets, with 95% CI 0.633-0.754 on the test set. Among the more complex ML algorithms, XGBoost performed best, with AUROCs of 0.683 and 0.672 (95% CI 0.609-0.734) on the validation and test sets, respectively.

Conclusions: This is the largest external validation of the ETVSS, and it confirms modest performance. More sophisticated ML algorithms do not meaningfully improve predictive performance compared to ETVSS; this underscores the need for higher utility, novelty, and dimensionality of input data rather than changes in modeling strategies.

查看原文本刊更多论文

用机器学习重新审视第三脑室内窥镜造瘘成功评分：我们能做得更好吗？

目的：内镜下第三脑室造瘘成功评分（ETVSS）是考虑手术成功概率的有用决策启发式指标，传统定义为6个月内不需要再次进行脑脊液转移手术。尽管如此，在2009年的原始研究中，logistic回归（LR）模型的表现并不理想，受试者工作特征曲线下面积（AUROC）为0.68。作者试图使用更大的数据集来开发更准确的机器学习（ML）模型，以预测内窥镜第三脑室造口术（ETV）的成功，并对ETVSS进行迄今为止最大的验证。方法：作者查询了2005-2022年MarketScan国家数据库，以确定首次接受ETV且随后在数据库中连续登记至少6个月的< 18岁患者。作者收集了与原始ETVSS相匹配的预测因素的数据：年龄、脑积水的病因和以前任何分流器放置的历史。接下来，他们使用6种ML算法——lr、支持向量分类器、随机森林、k近邻、极端梯度增强回归（XGBoost）和朴素贝叶斯——来开发预测模型。最后，作者使用嵌套交叉验证来评估模型在未见数据上的比较性能。结果：作者确定了2047例符合纳入标准的患者，其中1261例（61.6%）成功接受了ETV。大多数ML模型的性能与原始ETVSS相似，验证集的AUROC为0.693，测试集的AUROC为0.661 （95% CI 0.600-0.722）。作者的新LR模型在验证集和测试集上的auroc均为0.693，测试集上的95% CI为0.633-0.754。在较复杂的ML算法中，XGBoost表现最好，在验证集和测试集上的auroc分别为0.683和0.672 （95% CI 0.609-0.734）。结论：这是对ETVSS进行的最大规模的外部验证，它证实了适度的性能。与ETVSS相比，更复杂的ML算法并没有显著提高预测性能；这强调了对更高的实用性、新颖性和输入数据维度的需求，而不是对建模策略的更改。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊