Revisiting the Endoscopic Third Ventriculostomy Success Score using machine learning: can we do better?

IF 2.1 3区 医学 Q3 CLINICAL NEUROLOGY
Syed M Adil, Andreas Seas, Daniel P Sexton, Pranav I Warman, Benjamin D Wissel, Kennedy L Carpenter, Lacey Carter, Brad J Kolls, Anthony T Fuller, Shivanand P Lad, Timothy W Dunn, Herbert Fuchs, Matthew Vestal, Gerald A Grant
{"title":"Revisiting the Endoscopic Third Ventriculostomy Success Score using machine learning: can we do better?","authors":"Syed M Adil, Andreas Seas, Daniel P Sexton, Pranav I Warman, Benjamin D Wissel, Kennedy L Carpenter, Lacey Carter, Brad J Kolls, Anthony T Fuller, Shivanand P Lad, Timothy W Dunn, Herbert Fuchs, Matthew Vestal, Gerald A Grant","doi":"10.3171/2024.9.PEDS24146","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>The Endoscopic Third Ventriculostomy Success Score (ETVSS) is a useful decision-making heuristic when considering the probability of surgical success, defined traditionally as no repeat cerebrospinal fluid diversion surgery needed within 6 months. Nonetheless, the performance of the logistic regression (LR) model in the original 2009 study was modest, with an area under the receiver operating characteristic curve (AUROC) of 0.68. The authors sought to use a larger dataset to develop more accurate machine learning (ML) models to predict endoscopic third ventriculostomy (ETV) success and also to perform the largest validation of the ETVSS to date.</p><p><strong>Methods: </strong>The authors queried the MarketScan national database for the years 2005-2022 to identify patients < 18 years of age who underwent first-time ETV and subsequently had at least 6 months of continuous enrollment in the database. The authors collected data on predictors matching the original ETVSS: age, etiology of hydrocephalus, and history of any previous shunt placement. Next, they used 6 ML algorithms-LR, support vector classifier, random forest, k-nearest neighbors, Extreme Gradient Boosted Regression (XGBoost), and naive Bayes-to develop predictive models. Finally, the authors used nested cross-validation to assess the models' comparative performances on unseen data.</p><p><strong>Results: </strong>The authors identified 2047 patients who met inclusion criteria, and 1261 (61.6%) underwent successful ETV. The performances of most ML models were similar to that of the original ETVSS, which had an AUROC of 0.693 on the validation set and 0.661 (95% CI 0.600-0.722) on the test set. The authors' new LR model performed comparably with AUROCs of 0.693 on both the validation and test sets, with 95% CI 0.633-0.754 on the test set. Among the more complex ML algorithms, XGBoost performed best, with AUROCs of 0.683 and 0.672 (95% CI 0.609-0.734) on the validation and test sets, respectively.</p><p><strong>Conclusions: </strong>This is the largest external validation of the ETVSS, and it confirms modest performance. More sophisticated ML algorithms do not meaningfully improve predictive performance compared to ETVSS; this underscores the need for higher utility, novelty, and dimensionality of input data rather than changes in modeling strategies.</p>","PeriodicalId":16549,"journal":{"name":"Journal of neurosurgery. Pediatrics","volume":" ","pages":"1-9"},"PeriodicalIF":2.1000,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of neurosurgery. Pediatrics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3171/2024.9.PEDS24146","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: The Endoscopic Third Ventriculostomy Success Score (ETVSS) is a useful decision-making heuristic when considering the probability of surgical success, defined traditionally as no repeat cerebrospinal fluid diversion surgery needed within 6 months. Nonetheless, the performance of the logistic regression (LR) model in the original 2009 study was modest, with an area under the receiver operating characteristic curve (AUROC) of 0.68. The authors sought to use a larger dataset to develop more accurate machine learning (ML) models to predict endoscopic third ventriculostomy (ETV) success and also to perform the largest validation of the ETVSS to date.

Methods: The authors queried the MarketScan national database for the years 2005-2022 to identify patients < 18 years of age who underwent first-time ETV and subsequently had at least 6 months of continuous enrollment in the database. The authors collected data on predictors matching the original ETVSS: age, etiology of hydrocephalus, and history of any previous shunt placement. Next, they used 6 ML algorithms-LR, support vector classifier, random forest, k-nearest neighbors, Extreme Gradient Boosted Regression (XGBoost), and naive Bayes-to develop predictive models. Finally, the authors used nested cross-validation to assess the models' comparative performances on unseen data.

Results: The authors identified 2047 patients who met inclusion criteria, and 1261 (61.6%) underwent successful ETV. The performances of most ML models were similar to that of the original ETVSS, which had an AUROC of 0.693 on the validation set and 0.661 (95% CI 0.600-0.722) on the test set. The authors' new LR model performed comparably with AUROCs of 0.693 on both the validation and test sets, with 95% CI 0.633-0.754 on the test set. Among the more complex ML algorithms, XGBoost performed best, with AUROCs of 0.683 and 0.672 (95% CI 0.609-0.734) on the validation and test sets, respectively.

Conclusions: This is the largest external validation of the ETVSS, and it confirms modest performance. More sophisticated ML algorithms do not meaningfully improve predictive performance compared to ETVSS; this underscores the need for higher utility, novelty, and dimensionality of input data rather than changes in modeling strategies.

用机器学习重新审视第三脑室内窥镜造瘘成功评分:我们能做得更好吗?
目的:内镜下第三脑室造瘘成功评分(ETVSS)是考虑手术成功概率的有用决策启发式指标,传统定义为6个月内不需要再次进行脑脊液转移手术。尽管如此,在2009年的原始研究中,logistic回归(LR)模型的表现并不理想,受试者工作特征曲线下面积(AUROC)为0.68。作者试图使用更大的数据集来开发更准确的机器学习(ML)模型,以预测内窥镜第三脑室造口术(ETV)的成功,并对ETVSS进行迄今为止最大的验证。方法:作者查询了2005-2022年MarketScan国家数据库,以确定首次接受ETV且随后在数据库中连续登记至少6个月的< 18岁患者。作者收集了与原始ETVSS相匹配的预测因素的数据:年龄、脑积水的病因和以前任何分流器放置的历史。接下来,他们使用6种ML算法——lr、支持向量分类器、随机森林、k近邻、极端梯度增强回归(XGBoost)和朴素贝叶斯——来开发预测模型。最后,作者使用嵌套交叉验证来评估模型在未见数据上的比较性能。结果:作者确定了2047例符合纳入标准的患者,其中1261例(61.6%)成功接受了ETV。大多数ML模型的性能与原始ETVSS相似,验证集的AUROC为0.693,测试集的AUROC为0.661 (95% CI 0.600-0.722)。作者的新LR模型在验证集和测试集上的auroc均为0.693,测试集上的95% CI为0.633-0.754。在较复杂的ML算法中,XGBoost表现最好,在验证集和测试集上的auroc分别为0.683和0.672 (95% CI 0.609-0.734)。结论:这是对ETVSS进行的最大规模的外部验证,它证实了适度的性能。与ETVSS相比,更复杂的ML算法并没有显著提高预测性能;这强调了对更高的实用性、新颖性和输入数据维度的需求,而不是对建模策略的更改。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of neurosurgery. Pediatrics
Journal of neurosurgery. Pediatrics 医学-临床神经学
CiteScore
3.40
自引率
10.50%
发文量
307
审稿时长
2 months
期刊介绍: Information not localiced
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信