Trans2Unet:核语义分割的神经融合

2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS) Pub Date : 2022-11-21 DOI:10.1109/ICCAIS56082.2022.9990159

Dinh-Phu Tran, Quoc-Anh Nguyen, Van-Truong Pham, Thi-Thao Tran

{"title":"Trans2Unet:核语义分割的神经融合","authors":"Dinh-Phu Tran, Quoc-Anh Nguyen, Van-Truong Pham, Thi-Thao Tran","doi":"10.1109/ICCAIS56082.2022.9990159","DOIUrl":null,"url":null,"abstract":"Nuclei segmentation, despite its fundamental role in histopathological image analysis, is still a challenge work. The main challenge of this task is the existence of overlapping areas, which makes separating independent nuclei more complicated. In this paper, we propose a new two-branch architecture by combining the Unet and TransUnet networks for nuclei segmentation task. In the proposed architecture, namely Trans2Unet, the input image is first sent into the Unet branch whose the last convolution layer is removed. This branch makes the network combine features from different spatial regions of the input image and localizes more precisely the regions of interest. The input image is also fed into the second branch. In the second branch, which is called TransUnet branch, the input image will be divided into patches of images. With Vision transformer (ViT) in architecture, TransUnet can serve as a powerful encoder for medical image segmentation tasks and enhance image details by recovering localized spatial information. To boost up Trans2Unet efficiency and performance, we proposed to infuse TransUnet with a computational-efficient variation called “Waterfall” Atrous Spatial Pooling with Skip Connection (WASP-KC) module, which is inspired by the “Waterfall” Atrous Spatial Pooling (WASP) module. Experiment results on the 2018 Data Science Bowl benchmark show the effectiveness and performance of the proposed architecture while compared with previous segmentation models.","PeriodicalId":273404,"journal":{"name":"2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Trans2Unet: Neural fusion for Nuclei Semantic Segmentation\",\"authors\":\"Dinh-Phu Tran, Quoc-Anh Nguyen, Van-Truong Pham, Thi-Thao Tran\",\"doi\":\"10.1109/ICCAIS56082.2022.9990159\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nuclei segmentation, despite its fundamental role in histopathological image analysis, is still a challenge work. The main challenge of this task is the existence of overlapping areas, which makes separating independent nuclei more complicated. In this paper, we propose a new two-branch architecture by combining the Unet and TransUnet networks for nuclei segmentation task. In the proposed architecture, namely Trans2Unet, the input image is first sent into the Unet branch whose the last convolution layer is removed. This branch makes the network combine features from different spatial regions of the input image and localizes more precisely the regions of interest. The input image is also fed into the second branch. In the second branch, which is called TransUnet branch, the input image will be divided into patches of images. With Vision transformer (ViT) in architecture, TransUnet can serve as a powerful encoder for medical image segmentation tasks and enhance image details by recovering localized spatial information. To boost up Trans2Unet efficiency and performance, we proposed to infuse TransUnet with a computational-efficient variation called “Waterfall” Atrous Spatial Pooling with Skip Connection (WASP-KC) module, which is inspired by the “Waterfall” Atrous Spatial Pooling (WASP) module. Experiment results on the 2018 Data Science Bowl benchmark show the effectiveness and performance of the proposed architecture while compared with previous segmentation models.\",\"PeriodicalId\":273404,\"journal\":{\"name\":\"2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCAIS56082.2022.9990159\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAIS56082.2022.9990159","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

尽管细胞核分割在组织病理图像分析中起着重要的作用，但它仍然是一项具有挑战性的工作。这项任务的主要挑战是重叠区域的存在，这使得分离独立核变得更加复杂。本文将Unet和TransUnet网络结合起来，提出了一种新的双分支结构用于核分割任务。在提出的Trans2Unet架构中，输入图像首先被发送到Unet分支，该分支的最后一个卷积层被移除。该分支使得网络能够结合输入图像不同空间区域的特征，更精确地定位感兴趣的区域。输入图像也被送入第二个分支。在第二个分支中，称为TransUnet分支，输入图像将被分割成图像块。TransUnet采用视觉转换器(Vision transformer, ViT)架构，可以作为医学图像分割任务的强大编码器，并通过恢复局部空间信息来增强图像细节。为了提高Trans2Unet的效率和性能，我们建议在TransUnet中注入一个计算效率高的变体，称为“瀑布”空间池与跳跃连接(WASP- kc)模块，该模块的灵感来自“瀑布”空间池(WASP)模块。2018年数据科学碗基准测试的实验结果表明，与以前的分割模型相比，所提出的架构的有效性和性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Trans2Unet: Neural fusion for Nuclei Semantic Segmentation

Nuclei segmentation, despite its fundamental role in histopathological image analysis, is still a challenge work. The main challenge of this task is the existence of overlapping areas, which makes separating independent nuclei more complicated. In this paper, we propose a new two-branch architecture by combining the Unet and TransUnet networks for nuclei segmentation task. In the proposed architecture, namely Trans2Unet, the input image is first sent into the Unet branch whose the last convolution layer is removed. This branch makes the network combine features from different spatial regions of the input image and localizes more precisely the regions of interest. The input image is also fed into the second branch. In the second branch, which is called TransUnet branch, the input image will be divided into patches of images. With Vision transformer (ViT) in architecture, TransUnet can serve as a powerful encoder for medical image segmentation tasks and enhance image details by recovering localized spatial information. To boost up Trans2Unet efficiency and performance, we proposed to infuse TransUnet with a computational-efficient variation called “Waterfall” Atrous Spatial Pooling with Skip Connection (WASP-KC) module, which is inspired by the “Waterfall” Atrous Spatial Pooling (WASP) module. Experiment results on the 2018 Data Science Bowl benchmark show the effectiveness and performance of the proposed architecture while compared with previous segmentation models.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)

自引率

0.00%

发文量