ConvNext Mixer-Based Encoder Decoder Method for Nuclei Segmentation in Histopathology Images

IF 2.5 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

International Journal of Imaging Systems and Technology Pub Date : 2024-09-21 DOI:10.1002/ima.23181

Hüseyin Firat, Hüseyin Üzen, Davut Hanbay, Abdulkadir Şengür

{"title":"ConvNext Mixer-Based Encoder Decoder Method for Nuclei Segmentation in Histopathology Images","authors":"Hüseyin Firat, Hüseyin Üzen, Davut Hanbay, Abdulkadir Şengür","doi":"10.1002/ima.23181","DOIUrl":null,"url":null,"abstract":"<p>Histopathology, vital in diagnosing medical conditions, especially in cancer research, relies on analyzing histopathology images (HIs). Nuclei segmentation, a key task, involves precisely identifying cell nuclei boundaries. Manual segmentation by pathologists is time-consuming, prompting the need for robust automated methods. Challenges in segmentation arise from HI complexities, necessitating advanced techniques. Recent advancements in deep learning, particularly Convolutional Neural Networks (CNNs), have transformed nuclei segmentation. This study emphasizes feature extraction, introducing the ConvNext Mixer-based Encoder-Decoder (CNM-ED) model. Unlike traditional CNN based models, the proposed CNM-ED model enables the extraction of spatial and long context features to address the inherent complexities of histopathology images. This method leverages a multi-path strategy using a traditional CNN architecture as well as different paths focused on obtaining customized long context features using the ConvNext Mixer block structure that combines ConvMixer and ConvNext blocks. The fusion of these diverse features in the final segmentation output enables improved accuracy and performance, surpassing existing state-of-the-art segmentation models. Moreover, our multi-level feature extraction strategy is more effective than models using self-attention mechanisms such as SwinUnet and TransUnet, which have been frequently used in recent years. Experimental studies were conducted using five different datasets (TNBC, MoNuSeg, CoNSeP, CPM17, and CryoNuSeg) to analyze the performance of the proposed CNM-ED model. Comparisons were made with various CNN based models in the literature using evaluation metrics such as accuracy, AJI, macro F1 score, macro intersection over union, macro precision, and macro recall. It was observed that the proposed CNM-ED model achieved highly successful results across all metrics. Through comparisons with state-art-of models from the literature, the proposed CNM-ED model stands out as a promising advancement in nuclei segmentation, addressing the intricacies of histopathological images. The model demonstrates enhanced diagnostic capabilities and holds the potential for significant progress in medical research.</p>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"34 5","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ima.23181","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.23181","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Histopathology, vital in diagnosing medical conditions, especially in cancer research, relies on analyzing histopathology images (HIs). Nuclei segmentation, a key task, involves precisely identifying cell nuclei boundaries. Manual segmentation by pathologists is time-consuming, prompting the need for robust automated methods. Challenges in segmentation arise from HI complexities, necessitating advanced techniques. Recent advancements in deep learning, particularly Convolutional Neural Networks (CNNs), have transformed nuclei segmentation. This study emphasizes feature extraction, introducing the ConvNext Mixer-based Encoder-Decoder (CNM-ED) model. Unlike traditional CNN based models, the proposed CNM-ED model enables the extraction of spatial and long context features to address the inherent complexities of histopathology images. This method leverages a multi-path strategy using a traditional CNN architecture as well as different paths focused on obtaining customized long context features using the ConvNext Mixer block structure that combines ConvMixer and ConvNext blocks. The fusion of these diverse features in the final segmentation output enables improved accuracy and performance, surpassing existing state-of-the-art segmentation models. Moreover, our multi-level feature extraction strategy is more effective than models using self-attention mechanisms such as SwinUnet and TransUnet, which have been frequently used in recent years. Experimental studies were conducted using five different datasets (TNBC, MoNuSeg, CoNSeP, CPM17, and CryoNuSeg) to analyze the performance of the proposed CNM-ED model. Comparisons were made with various CNN based models in the literature using evaluation metrics such as accuracy, AJI, macro F1 score, macro intersection over union, macro precision, and macro recall. It was observed that the proposed CNM-ED model achieved highly successful results across all metrics. Through comparisons with state-art-of models from the literature, the proposed CNM-ED model stands out as a promising advancement in nuclei segmentation, addressing the intricacies of histopathological images. The model demonstrates enhanced diagnostic capabilities and holds the potential for significant progress in medical research.

Abstract Image

查看原文本刊更多论文

基于 ConvNext 混合器的编码器解码器方法用于组织病理学图像中的细胞核分割

组织病理学是诊断疾病，尤其是癌症研究的重要依据，它依赖于对组织病理学图像（HIs）的分析。细胞核分割是一项关键任务，包括精确识别细胞核边界。病理学家手动分割非常耗时，因此需要强大的自动方法。HI 的复杂性给分割带来了挑战，因此需要先进的技术。深度学习，尤其是卷积神经网络（CNN）的最新进展改变了细胞核分割的方式。本研究强调特征提取，引入了基于 ConvNext 混合器的编码器-解码器（CNM-ED）模型。与传统的基于 CNN 的模型不同，所提出的 CNM-ED 模型能够提取空间和长上下文特征，以解决组织病理学图像固有的复杂性问题。该方法利用传统 CNN 架构的多路径策略，以及使用 ConvMixer 和 ConvNext 块相结合的 ConvNext 混合器块结构获取定制长上下文特征的不同路径。在最终的分割输出中融合这些不同的特征，可以提高准确性和性能，超越现有的最先进分割模型。此外，我们的多层次特征提取策略比近年来经常使用的 SwinUnet 和 TransUnet 等使用自我关注机制的模型更加有效。我们使用五个不同的数据集（TNBC、MoNuSeg、CoNSeP、CPM17 和 CryoNuSeg）进行了实验研究，以分析所提出的 CNM-ED 模型的性能。使用准确率、AJI、宏 F1 分数、宏交集大于联合、宏精确度和宏召回率等评价指标与文献中各种基于 CNN 的模型进行了比较。结果表明，所提出的 CNM-ED 模型在所有指标上都取得了非常成功的结果。通过与文献中的早期模型进行比较，所提出的 CNM-ED 模型在细胞核分割方面取得了巨大进步，解决了组织病理学图像的复杂性问题。该模型增强了诊断能力，有望在医学研究领域取得重大进展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术

CiteScore

6.90

自引率

6.10%

发文量

138

审稿时长

3 months

期刊介绍： The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.