A robust image segmentation and synthesis pipeline for histopathology

IF 10.7 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2024-09-11 DOI:10.1016/j.media.2024.103344

Muhammad Jehanzaib , Yasin Almalioglu , Kutsev Bengisu Ozyoruk , Drew F.K. Williamson , Talha Abdullah , Kayhan Basak , Derya Demir , G. Evren Keles , Kashif Zafar , Mehmet Turan

{"title":"A robust image segmentation and synthesis pipeline for histopathology","authors":"Muhammad Jehanzaib , Yasin Almalioglu , Kutsev Bengisu Ozyoruk , Drew F.K. Williamson , Talha Abdullah , Kayhan Basak , Derya Demir , G. Evren Keles , Kashif Zafar , Mehmet Turan","doi":"10.1016/j.media.2024.103344","DOIUrl":null,"url":null,"abstract":"<div><p>Significant diagnostic variability between and within observers persists in pathology, despite the fact that digital slide images provide the ability to measure and quantify features much more precisely compared to conventional methods. Automated and accurate segmentation of cancerous cell and tissue regions can streamline the diagnostic process, providing insights into the cancer progression, and helping experts decide on the most effective treatment. Here, we evaluate the performance of the proposed PathoSeg model, with an architecture comprising of a modified HRNet encoder and a UNet++ decoder integrated with a CBAM block to utilize attention mechanism for an improved segmentation capability. We demonstrate that PathoSeg outperforms the current state-of-the-art (SOTA) networks in both quantitative and qualitative assessment of instance and semantic segmentation. Notably, we leverage the use of synthetic data generated by PathopixGAN, which effectively addresses the data imbalance problem commonly encountered in histopathology datasets, further improving the performance of PathoSeg. It utilizes spatially adaptive normalization within a generative and discriminative mechanism to synthesize diverse histopathological environments dictated through semantic information passed through pixel-level annotated Ground Truth semantic masks.Besides, we contribute to the research community by providing an in-house dataset that includes semantically segmented masks for breast carcinoma tubules (BCT), micro/macrovesicular steatosis of the liver (MSL), and prostate carcinoma glands (PCG). In the first part of the dataset, we have a total of 14 whole slide images from 13 patients’ liver, with fat cell segmented masks, totaling 951 masks of size 512 × 512 pixels. In the second part, it includes 17 whole slide images from 13 patients with prostate carcinoma gland segmentation masks, amounting to 30,000 masks of size 512 × 512 pixels. In the third part, the dataset contains 51 whole slides from 36 patients, with breast carcinoma tubule masks totaling 30,000 masks of size 512 × 512 pixels. To ensure transparency and encourage further research, we will make this dataset publicly available for non-commercial and academic purposes. To facilitate reproducibility and encourage further research, we will also make our code and pre-trained models publicly available at <span><span>https://github.com/DeepMIALab/PathoSeg</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"99 ","pages":"Article 103344"},"PeriodicalIF":10.7000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S136184152400269X","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Significant diagnostic variability between and within observers persists in pathology, despite the fact that digital slide images provide the ability to measure and quantify features much more precisely compared to conventional methods. Automated and accurate segmentation of cancerous cell and tissue regions can streamline the diagnostic process, providing insights into the cancer progression, and helping experts decide on the most effective treatment. Here, we evaluate the performance of the proposed PathoSeg model, with an architecture comprising of a modified HRNet encoder and a UNet++ decoder integrated with a CBAM block to utilize attention mechanism for an improved segmentation capability. We demonstrate that PathoSeg outperforms the current state-of-the-art (SOTA) networks in both quantitative and qualitative assessment of instance and semantic segmentation. Notably, we leverage the use of synthetic data generated by PathopixGAN, which effectively addresses the data imbalance problem commonly encountered in histopathology datasets, further improving the performance of PathoSeg. It utilizes spatially adaptive normalization within a generative and discriminative mechanism to synthesize diverse histopathological environments dictated through semantic information passed through pixel-level annotated Ground Truth semantic masks.Besides, we contribute to the research community by providing an in-house dataset that includes semantically segmented masks for breast carcinoma tubules (BCT), micro/macrovesicular steatosis of the liver (MSL), and prostate carcinoma glands (PCG). In the first part of the dataset, we have a total of 14 whole slide images from 13 patients’ liver, with fat cell segmented masks, totaling 951 masks of size 512 × 512 pixels. In the second part, it includes 17 whole slide images from 13 patients with prostate carcinoma gland segmentation masks, amounting to 30,000 masks of size 512 × 512 pixels. In the third part, the dataset contains 51 whole slides from 36 patients, with breast carcinoma tubule masks totaling 30,000 masks of size 512 × 512 pixels. To ensure transparency and encourage further research, we will make this dataset publicly available for non-commercial and academic purposes. To facilitate reproducibility and encourage further research, we will also make our code and pre-trained models publicly available at https://github.com/DeepMIALab/PathoSeg.

查看原文本刊更多论文

用于组织病理学的稳健图像分割和合成管道

尽管与传统方法相比，数字幻灯片图像能够更精确地测量和量化特征，但病理诊断中观察者之间和观察者内部仍然存在很大的诊断差异。对癌细胞和组织区域进行自动、准确的分割可以简化诊断过程，深入了解癌症进展情况，帮助专家决定最有效的治疗方法。在这里，我们评估了所提出的 PathoSeg 模型的性能，该模型的架构包括一个改进的 HRNet 编码器和一个 UNet++ 解码器，并集成了一个 CBAM 块，以利用注意力机制提高分割能力。我们证明，PathoSeg 在实例和语义分割的定量和定性评估方面均优于目前最先进的（SOTA）网络。值得一提的是，我们利用 PathopixGAN 生成的合成数据，有效解决了组织病理学数据集中常见的数据不平衡问题，进一步提高了 PathoSeg 的性能。此外，我们还提供了一个内部数据集，其中包括乳腺癌小管（BCT）、肝脏微/大泡性脂肪变性（MSL）和前列腺癌腺体（PCG）的语义分割掩码，为研究界做出了贡献。在数据集的第一部分，我们总共有 14 张来自 13 名患者肝脏的整张切片图像，其中包含脂肪细胞分割掩膜，总计 951 个掩膜，大小为 512 × 512 像素。第二部分包括来自 13 名患者的 17 张整张幻灯片图像，其中有前列腺癌腺体分割掩膜，共计 30,000 个 512 × 512 像素大小的掩膜。第三部分数据集包含来自 36 名患者的 51 张整张切片图像，其中有乳腺癌小管遮罩，共计 30,000 个遮罩，大小为 512 × 512 像素。为确保透明度并鼓励进一步研究，我们将公开该数据集，供非商业和学术用途使用。为了促进可重复性并鼓励进一步研究，我们还将在 https://github.com/DeepMIALab/PathoSeg 上公开我们的代码和预训练模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.