Enhancing CT image segmentation accuracy through ensemble loss function optimization

IF 3.2 2区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Medical physics Pub Date : 2025-04-24 DOI:10.1002/mp.17848

Chengyin Li, Rafi Ibn Sultan, Hassan Bagher-Ebadian, Yao Qiang, Kundan Thind, Dongxiao Zhu, Indrin J. Chetty

{"title":"Enhancing CT image segmentation accuracy through ensemble loss function optimization","authors":"Chengyin Li, Rafi Ibn Sultan, Hassan Bagher-Ebadian, Yao Qiang, Kundan Thind, Dongxiao Zhu, Indrin J. Chetty","doi":"10.1002/mp.17848","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n <p>In CT-based medical image segmentation, the choice of loss function profoundly impacts the training efficacy of deep neural networks. Traditional loss functions like cross entropy (CE), Dice, Boundary, and TopK each have unique strengths and limitations, often introducing biases when used individually.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n <p>This study aims to enhance segmentation accuracy by optimizing ensemble loss functions, thereby addressing the biases and limitations of single loss functions and their linear combinations.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n <p>We implemented a comprehensive evaluation of loss function combinations by integrating CE, Dice, Boundary, and TopK loss functions through both loss-level linear combination and model-level ensemble methods. Our approach utilized two state-of-the-art 3D segmentation architectures, Attention U-Net (AttUNet) and SwinUNETR, to test the impact of these methods. The study was conducted on two large CT dataset cohorts: an institutional dataset containing pelvic organ segmentations, and a public dataset consisting of multiple organ segmentations. All the models were trained from scratch with different loss settings, and performance was evaluated using Dice similarity coefficient (DSC), Hausdorff distance (HD), and average surface distance (ASD). In the ensemble approach, both static averaging and learnable dynamic weighting strategies were employed to combine the outputs of models trained with different loss functions.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n <p>Extensive experiments revealed the following: (1) the linear combination of loss functions achieved results comparable to those of single loss-driven methods; (2) compared to the best non-ensemble methods, ensemble-based approaches resulted in a 2%–7% increase in DSC scores, along with notable reductions in HD (e.g., a 19.1% reduction for rectum segmentation using SwinUNETR) and ASD (e.g., a 49.0% reduction for prostate segmentation using AttUNet); (3) the learnable ensemble approach with optimized weights produced finer details in predicted masks, as confirmed by qualitative analyses; and (4) the learnable ensemble consistently outperforms the static ensemble across most metrics (DSC, HD, ASD) for both AttUNet and SwinUNETR architectures.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n <p>Our findings support the efficacy of using ensemble models with optimized weights to improve segmentation accuracy, highlighting the potential for broader applications in automated medical image analysis.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 7","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/mp.17848","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Background

In CT-based medical image segmentation, the choice of loss function profoundly impacts the training efficacy of deep neural networks. Traditional loss functions like cross entropy (CE), Dice, Boundary, and TopK each have unique strengths and limitations, often introducing biases when used individually.

Purpose

This study aims to enhance segmentation accuracy by optimizing ensemble loss functions, thereby addressing the biases and limitations of single loss functions and their linear combinations.

Methods

We implemented a comprehensive evaluation of loss function combinations by integrating CE, Dice, Boundary, and TopK loss functions through both loss-level linear combination and model-level ensemble methods. Our approach utilized two state-of-the-art 3D segmentation architectures, Attention U-Net (AttUNet) and SwinUNETR, to test the impact of these methods. The study was conducted on two large CT dataset cohorts: an institutional dataset containing pelvic organ segmentations, and a public dataset consisting of multiple organ segmentations. All the models were trained from scratch with different loss settings, and performance was evaluated using Dice similarity coefficient (DSC), Hausdorff distance (HD), and average surface distance (ASD). In the ensemble approach, both static averaging and learnable dynamic weighting strategies were employed to combine the outputs of models trained with different loss functions.

Results

Extensive experiments revealed the following: (1) the linear combination of loss functions achieved results comparable to those of single loss-driven methods; (2) compared to the best non-ensemble methods, ensemble-based approaches resulted in a 2%–7% increase in DSC scores, along with notable reductions in HD (e.g., a 19.1% reduction for rectum segmentation using SwinUNETR) and ASD (e.g., a 49.0% reduction for prostate segmentation using AttUNet); (3) the learnable ensemble approach with optimized weights produced finer details in predicted masks, as confirmed by qualitative analyses; and (4) the learnable ensemble consistently outperforms the static ensemble across most metrics (DSC, HD, ASD) for both AttUNet and SwinUNETR architectures.

Conclusions

Our findings support the efficacy of using ensemble models with optimized weights to improve segmentation accuracy, highlighting the potential for broader applications in automated medical image analysis.

查看原文本刊更多论文

通过集成损失函数优化提高CT图像分割精度。

背景：在基于ct的医学图像分割中，损失函数的选择深刻影响着深度神经网络的训练效果。传统的损失函数如交叉熵（CE）、Dice、Boundary和TopK都有其独特的优势和局限性，在单独使用时往往会引入偏差。目的：本研究旨在通过优化集合损失函数来提高分割精度，从而解决单个损失函数及其线性组合的偏差和局限性。方法：通过损失级线性组合和模型级集成方法，对CE、Dice、Boundary和TopK损失函数进行积分，对损失函数组合进行综合评估。我们的方法使用了两种最先进的3D分割架构，注意力U-Net （AttUNet）和SwinUNETR，来测试这些方法的影响。该研究在两个大型CT数据集队列上进行：一个包含盆腔器官分割的机构数据集和一个包含多器官分割的公共数据集。所有模型都在不同的损失设置下从头开始训练，并使用Dice相似系数（DSC）、Hausdorff距离（HD）和平均表面距离（ASD）来评估其性能。在集成方法中，采用静态平均和可学习的动态加权策略对不同损失函数训练的模型输出进行组合。结果：大量实验表明：(1)损失函数的线性组合与单一损失驱动方法的结果相当；(2)与最佳的非集成方法相比，基于集成的方法导致DSC评分增加2%-7%，同时HD（例如，使用SwinUNETR进行直肠分割减少19.1%）和ASD（例如，使用AttUNet进行前列腺分割减少49.0%）显著降低；(3)经定性分析证实，权重优化后的可学习集成方法在预测掩模中产生了更精细的细节；(4)对于AttUNet和SwinUNETR架构，可学习集成在大多数指标（DSC， HD， ASD）上始终优于静态集成。结论：我们的研究结果支持使用优化权重的集成模型来提高分割精度的有效性，突出了在自动化医学图像分析中更广泛应用的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical physics 医学-核医学

CiteScore

6.80

自引率

15.80%

发文量

660

审稿时长

1.7 months

期刊介绍： Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.