A comprehensive benchmarking of a U-Net based model for midbrain auto-segmentation on transcranial sonography

IF 4.9 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Hong-yu Kang , Wei Zhang , Shuai Li , Xinyi Wang , Yu Sun , Xin Sun , Fang-Xian Li , Chao Hou , Sai-kit Lam , Yong-ping Zheng
{"title":"A comprehensive benchmarking of a U-Net based model for midbrain auto-segmentation on transcranial sonography","authors":"Hong-yu Kang ,&nbsp;Wei Zhang ,&nbsp;Shuai Li ,&nbsp;Xinyi Wang ,&nbsp;Yu Sun ,&nbsp;Xin Sun ,&nbsp;Fang-Xian Li ,&nbsp;Chao Hou ,&nbsp;Sai-kit Lam ,&nbsp;Yong-ping Zheng","doi":"10.1016/j.cmpb.2024.108494","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and objective</h3><div>Transcranial sonography-based grading of Parkinson's Disease has gained increasing attention in recent years, and it is currently used for assistive differential diagnosis in some specialized centers. To this end, accurate midbrain segmentation is considered an important initial step. However, current practice is manual, time-consuming, and bias-prone due to the subjective nature. Relevant studies in the literature are scarce and lacks comprehensive model evaluations from application perspectives. Herein, we aimed to benchmark the best-performing U-Net model for objective, stable and robust midbrain auto-segmentation using transcranial sonography images.</div></div><div><h3>Methods</h3><div>A total of 584 patients who were suspected of Parkinson's Disease were retrospectively enrolled from Beijing Tiantan Hospital. The dataset was divided into training (<em>n</em> = 416), validation (<em>n</em> = 104), and testing (<em>n</em> = 64) sets. Three state-of-the-art deep-learning networks (U-Net, U-Net+++, and nnU-Net) were utilized to develop segmentation models, under 5-fold cross-validation and three randomization seeds for safeguarding model validity and stability. Model evaluation was conducted in testing set in three key aspects: (i) segmentation agreement using DICE coefficients (DICE), Intersection over Union (IoU), and Hausdorff Distance (HD); (ii) model stability using standard deviations of segmentation agreement metrics; (iii) prediction time efficiency, and (iv) model robustness against various degrees of ultrasound imaging noise produced by the salt-and-pepper noise and Gaussian noise.</div></div><div><h3>Results</h3><div>The nnU-Net achieved the best segmentation agreement (averaged DICE: 0.910, IoU: 0.836, HD: 2.793-mm) and time efficiency (1.456-s). Under mild noise corruption, the nnU-Net outperformed others with averaged scores of DICE (0.904), IoU (0.827), HD (2.941 mm) in the salt-and-pepper noise (signal-to-noise ratio, SNR = 0.95), and DICE (0.906), IoU (0.830), HD (2.967 mm) in the Gaussian noise (sigma value, σ = 0.1); by contrast, intriguingly, performance of the U-Net and U-Net+++ models were remarkably degraded. Under increasing levels of simulated noise corruption (SNR decreased from 0.95 to 0.75; σ increased from 0.1 to 0.5), the nnU-Net network exhibited marginal decline in segmentation agreement meanwhile yielding decent performance as if there were absence of noise corruption.</div></div><div><h3>Conclusions</h3><div>The nnU-Net model was the best-performing midbrain segmentation model in terms of segmentation agreement, stability, time efficiency and robustness, providing the community with an objective, effective and automated alternative. Moving forward, a multi-center multi-vendor study is warranted when it comes to clinical implementation.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"258 ","pages":"Article 108494"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260724004875","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Background and objective

Transcranial sonography-based grading of Parkinson's Disease has gained increasing attention in recent years, and it is currently used for assistive differential diagnosis in some specialized centers. To this end, accurate midbrain segmentation is considered an important initial step. However, current practice is manual, time-consuming, and bias-prone due to the subjective nature. Relevant studies in the literature are scarce and lacks comprehensive model evaluations from application perspectives. Herein, we aimed to benchmark the best-performing U-Net model for objective, stable and robust midbrain auto-segmentation using transcranial sonography images.

Methods

A total of 584 patients who were suspected of Parkinson's Disease were retrospectively enrolled from Beijing Tiantan Hospital. The dataset was divided into training (n = 416), validation (n = 104), and testing (n = 64) sets. Three state-of-the-art deep-learning networks (U-Net, U-Net+++, and nnU-Net) were utilized to develop segmentation models, under 5-fold cross-validation and three randomization seeds for safeguarding model validity and stability. Model evaluation was conducted in testing set in three key aspects: (i) segmentation agreement using DICE coefficients (DICE), Intersection over Union (IoU), and Hausdorff Distance (HD); (ii) model stability using standard deviations of segmentation agreement metrics; (iii) prediction time efficiency, and (iv) model robustness against various degrees of ultrasound imaging noise produced by the salt-and-pepper noise and Gaussian noise.

Results

The nnU-Net achieved the best segmentation agreement (averaged DICE: 0.910, IoU: 0.836, HD: 2.793-mm) and time efficiency (1.456-s). Under mild noise corruption, the nnU-Net outperformed others with averaged scores of DICE (0.904), IoU (0.827), HD (2.941 mm) in the salt-and-pepper noise (signal-to-noise ratio, SNR = 0.95), and DICE (0.906), IoU (0.830), HD (2.967 mm) in the Gaussian noise (sigma value, σ = 0.1); by contrast, intriguingly, performance of the U-Net and U-Net+++ models were remarkably degraded. Under increasing levels of simulated noise corruption (SNR decreased from 0.95 to 0.75; σ increased from 0.1 to 0.5), the nnU-Net network exhibited marginal decline in segmentation agreement meanwhile yielding decent performance as if there were absence of noise corruption.

Conclusions

The nnU-Net model was the best-performing midbrain segmentation model in terms of segmentation agreement, stability, time efficiency and robustness, providing the community with an objective, effective and automated alternative. Moving forward, a multi-center multi-vendor study is warranted when it comes to clinical implementation.
基于 U-Net 的经颅超声中脑自动分割模型的综合基准测试。
背景和目的:近年来,基于经颅超声的帕金森病分级越来越受到关注,目前在一些专业中心用于辅助鉴别诊断。为此,准确的中脑分割被认为是重要的第一步。然而,目前的做法是手工操作,耗时长,而且由于主观性,容易出现偏差。文献中的相关研究很少,而且缺乏从应用角度对模型的全面评估。在此,我们旨在为使用经颅超声图像进行客观、稳定和稳健的中脑自动分割的最佳 U-Net 模型设定基准:方法:我们从北京天坛医院回顾性招募了 584 名疑似帕金森病患者。数据集分为训练集(n = 416)、验证集(n = 104)和测试集(n = 64)。利用三种最先进的深度学习网络(U-Net、U-Net+++ 和 nnU-Net)开发分割模型,并进行 5 倍交叉验证和三种随机化种子,以保障模型的有效性和稳定性。在测试集中从三个关键方面对模型进行了评估:(i) 使用 DICE 系数(DICE)、Intersection over Union(IoU)和 Hausdorff Distance(HD)进行的分割一致性评估;(ii) 使用分割一致性指标的标准偏差进行的模型稳定性评估;(iii) 预测时间效率评估;(iv) 模型对由椒盐噪声和高斯噪声产生的不同程度超声成像噪声的鲁棒性评估:nnU-Net 获得了最佳的分割一致性(平均 DICE:0.910,IoU:0.836,HD:2.793-mm)和时间效率(1.456-s)。在轻度噪声破坏情况下,nnU-Net 的表现优于其他网络,在椒盐噪声(信噪比 SNR = 0.95)中的平均得分分别为 DICE (0.904)、IoU (0.827)、HD (2.941 mm)。95),以及高斯噪声(西格玛值,σ = 0.1)下的 DICE (0.906)、IoU (0.830)、HD (2.967 mm)。在模拟噪声损坏水平不断提高的情况下(信噪比从 0.95 降至 0.75;σ 从 0.1 升至 0.5),nnU-Net 网络的分割一致性略有下降,但其性能却与没有噪声损坏时相当:nnU-Net模型是在分割一致性、稳定性、时间效率和鲁棒性方面表现最好的中脑分割模型,为社会提供了一个客观、有效和自动化的替代方案。展望未来,在临床应用方面,有必要进行多中心多供应商研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computer methods and programs in biomedicine
Computer methods and programs in biomedicine 工程技术-工程:生物医学
CiteScore
12.30
自引率
6.60%
发文量
601
审稿时长
135 days
期刊介绍: To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine. Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信