Benchmarking HEp-2 cell segmentation methods in indirect immunofluorescence images - standard models to deep learning

IF 7 2区医学 Q1 BIOLOGY

Computers in biology and medicine Pub Date : 2025-04-26 DOI:10.1016/j.compbiomed.2025.110150

Balaji Iyer , Smruti Deoghare , Krish Ranjan , Bruce J. Aronow , V.B. Surya Prasath

{"title":"Benchmarking HEp-2 cell segmentation methods in indirect immunofluorescence images - standard models to deep learning","authors":"Balaji Iyer , Smruti Deoghare , Krish Ranjan , Bruce J. Aronow , V.B. Surya Prasath","doi":"10.1016/j.compbiomed.2025.110150","DOIUrl":null,"url":null,"abstract":"<div><div>Indirect Immunofluorescence (IIF) stained Human Epithelial (HEp-2) cells are considered the gold standard for detecting autoimmune diseases. Accurate cell segmentation, though often viewed as an intermediary step to downstream tasks like classification, significantly enhances overall performance when executed with precision. In this study, we conduct a systematic literature review of HEp-2 cell segmentation techniques, identifying 28 key papers utilizing traditional image processing, machine learning classifiers, deep convolutional neural networks (CNNs), and generative adversarial network (GAN) frameworks. Building on these insights, we benchmark 17 CNN models without pretraining and 8 CNN models pretrained on ImageNet using both Frozen Encoder and Tunable Encoder strategies on the I3A dataset. Cross-validation (CV) and Benjamini–Hochberg (BH) significance correction were employed to ensure statistical rigor in model comparisons. Domain-Specific Pretraining (DSPT) experiments demonstrated performance improvements, particularly for underrepresented classes, while Data Augmentation strategies (DA-1 and DA-2) revealed distinct impacts across model categories. GAN-based segmentation experiments using the top-performing CNN architectures as generators within a Pix2Pix framework revealed performance degradation due to data limitations and adversarial training instabilities. Nonetheless, GANs displayed class-specific improvements in visual alignment of segmentation masks. Results were evaluated comprehensively across eight performance metrics, including Dice, IOU, Accuracy, Precision, Sensitivity, Specificity, AU-ROC and AU-PR. This work offers a robust benchmarking of state-of-the-art CNN, GAN, and Transformer-based models for HEp-2 cell segmentation, providing valuable insights for future research directions, including ensemble approaches, dynamic patch sampling, and diffusion models.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"192 ","pages":"Article 110150"},"PeriodicalIF":7.0000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525005013","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Indirect Immunofluorescence (IIF) stained Human Epithelial (HEp-2) cells are considered the gold standard for detecting autoimmune diseases. Accurate cell segmentation, though often viewed as an intermediary step to downstream tasks like classification, significantly enhances overall performance when executed with precision. In this study, we conduct a systematic literature review of HEp-2 cell segmentation techniques, identifying 28 key papers utilizing traditional image processing, machine learning classifiers, deep convolutional neural networks (CNNs), and generative adversarial network (GAN) frameworks. Building on these insights, we benchmark 17 CNN models without pretraining and 8 CNN models pretrained on ImageNet using both Frozen Encoder and Tunable Encoder strategies on the I3A dataset. Cross-validation (CV) and Benjamini–Hochberg (BH) significance correction were employed to ensure statistical rigor in model comparisons. Domain-Specific Pretraining (DSPT) experiments demonstrated performance improvements, particularly for underrepresented classes, while Data Augmentation strategies (DA-1 and DA-2) revealed distinct impacts across model categories. GAN-based segmentation experiments using the top-performing CNN architectures as generators within a Pix2Pix framework revealed performance degradation due to data limitations and adversarial training instabilities. Nonetheless, GANs displayed class-specific improvements in visual alignment of segmentation masks. Results were evaluated comprehensively across eight performance metrics, including Dice, IOU, Accuracy, Precision, Sensitivity, Specificity, AU-ROC and AU-PR. This work offers a robust benchmarking of state-of-the-art CNN, GAN, and Transformer-based models for HEp-2 cell segmentation, providing valuable insights for future research directions, including ensemble approaches, dynamic patch sampling, and diffusion models.

查看原文本刊更多论文

间接免疫荧光图像中HEp-2细胞分割方法的标杆-深度学习标准模型

间接免疫荧光（IIF）染色的人上皮细胞（HEp-2）被认为是检测自身免疫性疾病的金标准。虽然准确的细胞分割通常被视为分类等下游任务的中间步骤，但如果精确执行，则可以显著提高整体性能。在本研究中，我们对HEp-2细胞分割技术进行了系统的文献综述，确定了28篇利用传统图像处理、机器学习分类器、深度卷积神经网络（cnn）和生成对抗网络（GAN）框架的关键论文。基于这些见解，我们对17个没有预训练的CNN模型和8个在ImageNet上使用冻结编码器和可调编码器策略在I3A数据集上预训练的CNN模型进行了基准测试。采用交叉验证（CV）和Benjamini-Hochberg （BH）显著性校正来确保模型比较的统计严谨性。特定领域预训练（DSPT）实验证明了性能的提高，特别是对于代表性不足的类，而数据增强策略（DA-1和DA-2）显示了不同模型类别的不同影响。在Pix2Pix框架中使用性能最好的CNN架构作为生成器的基于gan的分割实验揭示了由于数据限制和对抗性训练不稳定性而导致的性能下降。尽管如此，gan在分割掩码的视觉对齐方面显示出特定类别的改进。结果通过8个性能指标进行综合评估，包括Dice、IOU、Accuracy、Precision、Sensitivity、Specificity、AU-ROC和AU-PR。这项工作为HEp-2细胞分割提供了最先进的CNN， GAN和基于transformer的模型的稳健基准，为未来的研究方向提供了有价值的见解，包括集成方法，动态斑块采样和扩散模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers in biology and medicine 工程技术-工程：生物医学

CiteScore

11.70

自引率

10.40%

发文量

1086

审稿时长

74 days

期刊介绍： Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.