基于Shapley加性解释的监督聚类的脑动脉瘤表型驱动风险分层:一种预测破裂的新方法。

IF 3 2区 医学 Q2 CLINICAL NEUROLOGY
Shrinit Babel, Syed R H Peeran
{"title":"基于Shapley加性解释的监督聚类的脑动脉瘤表型驱动风险分层:一种预测破裂的新方法。","authors":"Shrinit Babel, Syed R H Peeran","doi":"10.3171/2025.4.FOCUS241024","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>The aim of this study was to address the limitations of traditional aneurysm risk scoring systems and computational fluid dynamics (CFD) analyses by applying a supervised clustering framework to identify distinct aneurysm phenotypes and improve rupture risk prediction.</p><p><strong>Methods: </strong>Geometric and morphological data for 103 cerebral aneurysms were obtained from the AneuriskWeb dataset. To segment the cerebral aneurysm data into information-dense clusters that relate to aneurysm rupture risk, the authors trained an Extreme Gradient Boosting model for Shapley Additive Explanations (SHAP)-based feature attribution followed by nonlinear dimensionality reduction. Hierarchical Density-based Spatial Clustering of Applications with Noise (HDBSCAN) was then used on the SHAP-transformed feature space to identify clusters that were, subsequently, interpreted directly using rule-based machine learning and indirectly with phenotype visualization.</p><p><strong>Results: </strong>The initial SHAP analysis identified the parent vessel diameter, neck vessel angle, and the cross-sectional area along the centerline of the sac as the most significant predictors of rupture risk. Clustering revealed three distinct aneurysm phenotypes with a high degree of separation (Silhouette score = 0.915). Cluster α, characterized by parent vessel diameters > 3.08 mm and elongated geometries, was a low-risk phenotype with a 4.16% rupture rate. Cluster β only included ruptured aneurysms, with vessel diameters ≤ 1.65 mm and nonspherical structures. Cluster γ represented a mixed-risk aneurysm phenotype (rupture rate of 45.45%), with intermedial vessel diameters (range 1.65-3.08 mm); acute neck angles (< 90°) increased the rupture rate within this cluster.</p><p><strong>Conclusions: </strong>The supervised clustering identified distinct cerebral aneurysm phenotypes, balancing granularity with interpretability in CFD data analysis. Future studies should build on these phenotype-driven insights with temporal analyses and larger datasets for validation, as well as an end-to-end framework to enhance scalability.</p>","PeriodicalId":19187,"journal":{"name":"Neurosurgical focus","volume":"59 1","pages":"E3"},"PeriodicalIF":3.0000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Phenotype-driven risk stratification of cerebral aneurysms using Shapley Additive Explanations-based supervised clustering: a novel approach to rupture prediction.\",\"authors\":\"Shrinit Babel, Syed R H Peeran\",\"doi\":\"10.3171/2025.4.FOCUS241024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>The aim of this study was to address the limitations of traditional aneurysm risk scoring systems and computational fluid dynamics (CFD) analyses by applying a supervised clustering framework to identify distinct aneurysm phenotypes and improve rupture risk prediction.</p><p><strong>Methods: </strong>Geometric and morphological data for 103 cerebral aneurysms were obtained from the AneuriskWeb dataset. To segment the cerebral aneurysm data into information-dense clusters that relate to aneurysm rupture risk, the authors trained an Extreme Gradient Boosting model for Shapley Additive Explanations (SHAP)-based feature attribution followed by nonlinear dimensionality reduction. Hierarchical Density-based Spatial Clustering of Applications with Noise (HDBSCAN) was then used on the SHAP-transformed feature space to identify clusters that were, subsequently, interpreted directly using rule-based machine learning and indirectly with phenotype visualization.</p><p><strong>Results: </strong>The initial SHAP analysis identified the parent vessel diameter, neck vessel angle, and the cross-sectional area along the centerline of the sac as the most significant predictors of rupture risk. Clustering revealed three distinct aneurysm phenotypes with a high degree of separation (Silhouette score = 0.915). Cluster α, characterized by parent vessel diameters > 3.08 mm and elongated geometries, was a low-risk phenotype with a 4.16% rupture rate. Cluster β only included ruptured aneurysms, with vessel diameters ≤ 1.65 mm and nonspherical structures. Cluster γ represented a mixed-risk aneurysm phenotype (rupture rate of 45.45%), with intermedial vessel diameters (range 1.65-3.08 mm); acute neck angles (< 90°) increased the rupture rate within this cluster.</p><p><strong>Conclusions: </strong>The supervised clustering identified distinct cerebral aneurysm phenotypes, balancing granularity with interpretability in CFD data analysis. Future studies should build on these phenotype-driven insights with temporal analyses and larger datasets for validation, as well as an end-to-end framework to enhance scalability.</p>\",\"PeriodicalId\":19187,\"journal\":{\"name\":\"Neurosurgical focus\",\"volume\":\"59 1\",\"pages\":\"E3\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurosurgical focus\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3171/2025.4.FOCUS241024\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurosurgical focus","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3171/2025.4.FOCUS241024","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的:本研究的目的是通过应用监督聚类框架来识别不同的动脉瘤表型并改进破裂风险预测,从而解决传统动脉瘤风险评分系统和计算流体动力学(CFD)分析的局限性。方法:从动脉瘤网络数据库中获取103例脑动脉瘤的几何和形态学数据。为了将脑动脉瘤数据分割成与动脉瘤破裂风险相关的信息密集簇,作者训练了一个基于Shapley加性解释(SHAP)的特征归因的极端梯度增强模型,然后进行了非线性降维。然后在shap转换的特征空间上使用基于层次密度的带噪声应用空间聚类(HDBSCAN)来识别随后直接使用基于规则的机器学习和间接使用表型可视化进行解释的聚类。结果:最初的SHAP分析确定了母血管直径、颈血管角度和沿囊中心线的横截面积是最重要的破裂风险预测因子。聚类显示三种不同的动脉瘤表型,且分离程度高(Silhouette评分= 0.915)。簇α的特征是母血管直径为> 3.08 mm,几何形状拉长,为低风险表型,破裂率为4.16%。簇β仅包括血管直径≤1.65 mm且非球形结构的破裂动脉瘤。簇γ代表混合风险动脉瘤表型(破裂率为45.45%),中间血管直径(范围为1.65-3.08 mm);急性颈角(< 90°)增加了该簇的破裂率。结论:监督聚类识别出不同的脑动脉瘤表型,平衡了CFD数据分析的粒度和可解释性。未来的研究应该建立在这些表型驱动的见解上,通过时间分析和更大的数据集进行验证,以及端到端框架来增强可扩展性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Phenotype-driven risk stratification of cerebral aneurysms using Shapley Additive Explanations-based supervised clustering: a novel approach to rupture prediction.

Objective: The aim of this study was to address the limitations of traditional aneurysm risk scoring systems and computational fluid dynamics (CFD) analyses by applying a supervised clustering framework to identify distinct aneurysm phenotypes and improve rupture risk prediction.

Methods: Geometric and morphological data for 103 cerebral aneurysms were obtained from the AneuriskWeb dataset. To segment the cerebral aneurysm data into information-dense clusters that relate to aneurysm rupture risk, the authors trained an Extreme Gradient Boosting model for Shapley Additive Explanations (SHAP)-based feature attribution followed by nonlinear dimensionality reduction. Hierarchical Density-based Spatial Clustering of Applications with Noise (HDBSCAN) was then used on the SHAP-transformed feature space to identify clusters that were, subsequently, interpreted directly using rule-based machine learning and indirectly with phenotype visualization.

Results: The initial SHAP analysis identified the parent vessel diameter, neck vessel angle, and the cross-sectional area along the centerline of the sac as the most significant predictors of rupture risk. Clustering revealed three distinct aneurysm phenotypes with a high degree of separation (Silhouette score = 0.915). Cluster α, characterized by parent vessel diameters > 3.08 mm and elongated geometries, was a low-risk phenotype with a 4.16% rupture rate. Cluster β only included ruptured aneurysms, with vessel diameters ≤ 1.65 mm and nonspherical structures. Cluster γ represented a mixed-risk aneurysm phenotype (rupture rate of 45.45%), with intermedial vessel diameters (range 1.65-3.08 mm); acute neck angles (< 90°) increased the rupture rate within this cluster.

Conclusions: The supervised clustering identified distinct cerebral aneurysm phenotypes, balancing granularity with interpretability in CFD data analysis. Future studies should build on these phenotype-driven insights with temporal analyses and larger datasets for validation, as well as an end-to-end framework to enhance scalability.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Neurosurgical focus
Neurosurgical focus CLINICAL NEUROLOGY-SURGERY
CiteScore
6.30
自引率
0.00%
发文量
261
审稿时长
3 months
期刊介绍: Information not localized
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信