Three-Blind Validation Strategy of Deep Learning Models for Image Segmentation.

Impact Factor: 2.7. JCR Q3, Imaging Science & Photographic Technology

Journal of Imaging, 11(5). Pub Date: 2025-05-21. DOI: 10.3390/jimaging11050170

Andrés Larroza, Francisco Javier Pérez-Benito, Raquel Tendero, Juan Carlos Perez-Cortes, Marta Román, Rafael Llobet


Citations: 0

Abstract

Image segmentation plays a central role in computer vision applications such as medical imaging, industrial inspection, and environmental monitoring. However, evaluating segmentation performance can be particularly challenging when ground truth is not clearly defined, as is often the case in tasks involving subjective interpretation. These challenges are amplified by inter- and intra-observer variability, which complicates the use of human annotations as a reliable reference. To address this, we propose a novel validation framework, referred to as the three-blind validation strategy, that enables rigorous assessment of segmentation models in contexts where subjectivity and label variability are significant. The core idea is to have a third independent expert, blind to labeler identities, assess a shuffled set of segmentations produced by multiple human annotators and/or automated models. This allows for unbiased evaluation of model performance and helps uncover patterns of disagreement that may indicate systematic issues with either human or machine annotations. The primary objective of this study is to introduce and demonstrate this validation strategy as a generalizable framework for robust model evaluation in subjective segmentation tasks. We illustrate its practical implementation in a mammography use case involving dense tissue segmentation while emphasizing its potential applicability to a broad range of segmentation scenarios.
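
The blinding step described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`build_blind_review_set`, `unblind_scores`), the `item-0000` identifier format, and the use of a single numeric score per segmentation are all hypothetical choices made for the example. The abstract does not specify how the third expert records an assessment.

```python
import random
from collections import defaultdict


def build_blind_review_set(segmentations, seed=0):
    """Shuffle segmentations and hide who produced each one.

    segmentations: dict mapping image_id -> {source_name: mask}.
    Returns (review_items, key). review_items is what the blind
    reviewer sees; key maps each blind_id back to its source and
    must be kept away from the reviewer until scoring is finished.
    """
    rng = random.Random(seed)  # seeded so the shuffle is reproducible
    pairs = [
        (image_id, source)
        for image_id, by_source in segmentations.items()
        for source in by_source
    ]
    rng.shuffle(pairs)

    review_items = []
    key = {}
    for index, (image_id, source) in enumerate(pairs):
        blind_id = f"item-{index:04d}"  # hypothetical anonymous identifier
        review_items.append(
            {
                "blind_id": blind_id,
                "image_id": image_id,
                "mask": segmentations[image_id][source],
            }
        )
        key[blind_id] = source
    return review_items, key


def unblind_scores(scores, key):
    """Group the reviewer's scores by source and return the mean per source.

    scores: dict mapping blind_id -> numeric score (assumed rating scale).
    """
    per_source = defaultdict(list)
    for blind_id, score in scores.items():
        per_source[key[blind_id]].append(score)
    return {
        source: sum(values) / len(values)
        for source, values in per_source.items()
    }
```

In use, only `review_items` would be handed to the third expert, and `key` would be applied after all scores are collected, so that per-source averages for human annotators and models can be compared without the reviewer having known which was which.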
