Transformer based Generative Adversarial Network for Liver Segmentation

Proceedings of the ... International Conference on Image Analysis and Processing. International Conference on Image Analysis and Processing Pub Date : 2022-05-01 DOI:10.48550/arXiv.2205.10663

Ugur Demir, Zheyu Zhang, Bin Wang, M. Antalek, Elif Keles, Debesh Jha, A. Borhani, D. Ladner, Ulas Bagci

{"title":"Transformer based Generative Adversarial Network for Liver Segmentation","authors":"Ugur Demir, Zheyu Zhang, Bin Wang, M. Antalek, Elif Keles, Debesh Jha, A. Borhani, D. Ladner, Ulas Bagci","doi":"10.48550/arXiv.2205.10663","DOIUrl":null,"url":null,"abstract":"Automated liver segmentation from radiology scans (CT, MRI) can improve surgery and therapy planning and follow-up assessment in addition to conventional use for diagnosis and prognosis. Although convolutional neural networks (CNNs) have became the standard image segmentation tasks, more recently this has started to change towards Transformers based architectures because Transformers are taking advantage of capturing long range dependence modeling capability in signals, so called attention mechanism. In this study, we propose a new segmentation approach using a hybrid approach combining the Transformer(s) with the Generative Adversarial Network (GAN) approach. The premise behind this choice is that the self-attention mechanism of the Transformers allows the network to aggregate the high dimensional feature and provide global information modeling. This mechanism provides better segmentation performance compared with traditional methods. Furthermore, we encode this generator into the GAN based architecture so that the discriminator network in the GAN can classify the credibility of the generated segmentation masks compared with the real masks coming from human (expert) annotations. This allows us to extract the high dimensional topology information in the mask for biomedical image segmentation and provide more reliable segmentation results. Our model achieved a high dice coefficient of 0.9433, recall of 0.9515, and precision of 0.9376 and outperformed other Transformer based approaches. The implementation details of the proposed architecture can be found at https://github.com/UgurDemir/tranformer_liver_segmentation.","PeriodicalId":74527,"journal":{"name":"Proceedings of the ... International Conference on Image Analysis and Processing. International Conference on Image Analysis and Processing","volume":"27 1","pages":"340-347"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Conference on Image Analysis and Processing. International Conference on Image Analysis and Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2205.10663","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Automated liver segmentation from radiology scans (CT, MRI) can improve surgery and therapy planning and follow-up assessment in addition to conventional use for diagnosis and prognosis. Although convolutional neural networks (CNNs) have became the standard image segmentation tasks, more recently this has started to change towards Transformers based architectures because Transformers are taking advantage of capturing long range dependence modeling capability in signals, so called attention mechanism. In this study, we propose a new segmentation approach using a hybrid approach combining the Transformer(s) with the Generative Adversarial Network (GAN) approach. The premise behind this choice is that the self-attention mechanism of the Transformers allows the network to aggregate the high dimensional feature and provide global information modeling. This mechanism provides better segmentation performance compared with traditional methods. Furthermore, we encode this generator into the GAN based architecture so that the discriminator network in the GAN can classify the credibility of the generated segmentation masks compared with the real masks coming from human (expert) annotations. This allows us to extract the high dimensional topology information in the mask for biomedical image segmentation and provide more reliable segmentation results. Our model achieved a high dice coefficient of 0.9433, recall of 0.9515, and precision of 0.9376 and outperformed other Transformer based approaches. The implementation details of the proposed architecture can be found at https://github.com/UgurDemir/tranformer_liver_segmentation.

查看原文本刊更多论文

基于变压器的生成对抗网络肝脏分割

从放射学扫描(CT, MRI)中自动分割肝脏可以改善手术和治疗计划以及随访评估，除了常规用于诊断和预后。虽然卷积神经网络(cnn)已经成为标准的图像分割任务，但最近它开始向基于变形金刚的架构转变，因为变形金刚利用了捕获信号中的远程依赖建模能力，即所谓的注意力机制。在这项研究中，我们提出了一种新的分割方法，使用混合方法结合变压器(s)和生成对抗网络(GAN)方法。这种选择背后的前提是，变形金刚的自关注机制允许网络聚合高维特征并提供全局信息建模。与传统方法相比，该机制具有更好的分割性能。此外，我们将此生成器编码到基于GAN的架构中，以便GAN中的判别器网络可以将生成的分割掩码与来自人类(专家)注释的真实掩码进行可信度分类。这使得我们可以提取生物医学图像分割中的高维拓扑信息，并提供更可靠的分割结果。我们的模型获得了0.9433的高骰子系数，0.9515的召回率和0.9376的精度，优于其他基于Transformer的方法。建议架构的实现细节可以在https://github.com/UgurDemir/tranformer_liver_segmentation上找到。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the ... International Conference on Image Analysis and Processing. International Conference on Image Analysis and Processing

自引率

0.00%

发文量