Technical and clinical validation of a novel deep learning-based white matter hyperintensity segmentation tool

IF 2.8 Q3 CLINICAL NEUROLOGY

Cerebral circulation - cognition and behavior Pub Date : 2025-01-01 DOI:10.1016/j.cccb.2025.100393

Benno Gesierich , Lukas Pirpamer , Dominik S Meier , Michael Amann , Minne N Cerfontaine , Frank-Erik de Leeuw , Pauline Maillard , Sue Moy , Karl G. Helmer , Michael Kühne , Leo H Bonati , Julie W Rutten , Saskia A.J. Lesnik Oberstein , Marco Duering , Alzheimer’s Disease Neuroimaging Initiative

{"title":"Technical and clinical validation of a novel deep learning-based white matter hyperintensity segmentation tool","authors":"Benno Gesierich , Lukas Pirpamer , Dominik S Meier , Michael Amann , Minne N Cerfontaine , Frank-Erik de Leeuw , Pauline Maillard , Sue Moy , Karl G. Helmer , Michael Kühne , Leo H Bonati , Julie W Rutten , Saskia A.J. Lesnik Oberstein , Marco Duering , Alzheimer’s Disease Neuroimaging Initiative","doi":"10.1016/j.cccb.2025.100393","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>White matter hyperintensities (WMH) on MRI are a hallmark of cerebral small vessel disease. Although numerous WMH segmentation tools exist, each presents relevant limitations that can impact their usability. This research aimed to develop, validate, and disseminate a novel WMH segmentation algorithm to address these limitations.</div></div><div><h3>Methods</h3><div>Using an intentionally heterogeneous dataset, we trained models based on the MD-GRU and nnU-Net deep learning algorithms. The new models were benchmarked in both technical and clinical validation against current state-of-the-art algorithms, utilizing datasets that were not included in the training data. For technical validation in patients, we assessed bias and precision against reference masks, scan-rescan repeatability and inter-scanner reproducibility in data from the MarkVCID consortium. Segmentation performance on 2D data was evaluated using the SWISS-AF dataset. For clinical validation, we determined percent volume change over a two-year follow-up in the DiViNAS study and calculated statistical power to detect treatment effects.</div></div><div><h3>Results</h3><div>The newly trained algorithms outperformed the benchmarking algorithms, demonstrating better agreement with reference volumes, as well as less bias and higher precision in the repeatability and reproducibility experiments. The nnU-Net algorithm exhibited the highest statistical power for detecting treatment effects, requiring a 41 % smaller sample size than the best-performing benchmarking algorithm.</div></div><div><h3>Conclusion</h3><div>We developed and systematically validated two novel WMH segmentation algorithms, which demonstrated excellent generalization capabilities. The comprehensive, user-friendly processing pipelines are publicly available as prebuilt software containers and can be applied to a wide range of datasets without re-training or modifications.</div></div>","PeriodicalId":72549,"journal":{"name":"Cerebral circulation - cognition and behavior","volume":"9 ","pages":"Article 100393"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cerebral circulation - cognition and behavior","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666245025000170","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction

White matter hyperintensities (WMH) on MRI are a hallmark of cerebral small vessel disease. Although numerous WMH segmentation tools exist, each presents relevant limitations that can impact their usability. This research aimed to develop, validate, and disseminate a novel WMH segmentation algorithm to address these limitations.

Methods

Using an intentionally heterogeneous dataset, we trained models based on the MD-GRU and nnU-Net deep learning algorithms. The new models were benchmarked in both technical and clinical validation against current state-of-the-art algorithms, utilizing datasets that were not included in the training data. For technical validation in patients, we assessed bias and precision against reference masks, scan-rescan repeatability and inter-scanner reproducibility in data from the MarkVCID consortium. Segmentation performance on 2D data was evaluated using the SWISS-AF dataset. For clinical validation, we determined percent volume change over a two-year follow-up in the DiViNAS study and calculated statistical power to detect treatment effects.

Results

The newly trained algorithms outperformed the benchmarking algorithms, demonstrating better agreement with reference volumes, as well as less bias and higher precision in the repeatability and reproducibility experiments. The nnU-Net algorithm exhibited the highest statistical power for detecting treatment effects, requiring a 41 % smaller sample size than the best-performing benchmarking algorithm.

Conclusion

We developed and systematically validated two novel WMH segmentation algorithms, which demonstrated excellent generalization capabilities. The comprehensive, user-friendly processing pipelines are publicly available as prebuilt software containers and can be applied to a wide range of datasets without re-training or modifications.

查看原文本刊更多论文

一种新的基于深度学习的白质高强度分割工具的技术和临床验证

MRI上的白质高信号（WMH）是脑血管疾病的标志。尽管存在许多WMH细分工具，但每个工具都存在影响其可用性的相关限制。本研究旨在开发、验证和传播一种新的WMH分割算法来解决这些限制。方法采用异构数据集，基于MD-GRU和nnU-Net深度学习算法对模型进行训练。新模型利用未包含在训练数据中的数据集，针对当前最先进的算法在技术和临床验证中进行基准测试。为了在患者中进行技术验证，我们评估了参考口罩的偏倚和精度，扫描-扫描可重复性和扫描间可重复性来自MarkVCID联盟数据。使用SWISS-AF数据集评估二维数据的分割性能。为了临床验证，我们在DiViNAS研究中确定了两年随访期间体积变化的百分比，并计算了检测治疗效果的统计能力。结果新训练的算法优于基准算法，在重复性和再现性实验中与参考量的一致性更好，偏差更小，精度更高。nnU-Net算法在检测治疗效果方面表现出最高的统计能力，所需的样本量比性能最好的基准测试算法小41%。结论我们开发并系统验证了两种新的WMH分割算法，它们具有良好的泛化能力。全面的、用户友好的处理管道作为预构建的软件容器公开提供，可以应用于广泛的数据集，而无需重新培训或修改。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊