MT-ONet: Mixed Transformer O-Net for Medical Image Segmentation

Pengfei Zheng
{"title":"MT-ONet: Mixed Transformer O-Net for Medical Image Segmentation","authors":"Pengfei Zheng","doi":"10.1109/ICSMD57530.2022.10058445","DOIUrl":null,"url":null,"abstract":"In the past few years, the deep learning is widely used in the medical industry due to its advantage. Constructed using Convolutional Neural Networks (CNN), the U-Net framework has become the industry standard for solving medical image segmentation tasks. Nonetheless, this framework is incapable of entirely learning all global and remote semantic information. It has been demonstrated that the transformer structure collects more global information than U-Net but less local information than CNN. To improve the performance of segmentation and classification in medical images while maximizing global and local data, we integrate O-Net with Mixed Transformer [1], this fuses the advantages of CNN and Transformer. This enables us to maximize both types of data. We combine CNN, Mixed Transformer, and Local-Global Gaussian-Weighted Self-Attention (LGG-SA) in the encoder component of our proposed O-Net architecture to obtain more global and local background information. The decoder part combines the Mixed Transformer and CNN blocks to obtain the results. The segmentation capability of the proposed network is evaluated by the multi-organ CT dataset containing synaptic information. The results of our trials demonstrate that the proposed MT-ONet can deliver superior segmentation performance relative to cutting-edge methods, resulting in improved classification precision.","PeriodicalId":396735,"journal":{"name":"2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSMD57530.2022.10058445","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the past few years, the deep learning is widely used in the medical industry due to its advantage. Constructed using Convolutional Neural Networks (CNN), the U-Net framework has become the industry standard for solving medical image segmentation tasks. Nonetheless, this framework is incapable of entirely learning all global and remote semantic information. It has been demonstrated that the transformer structure collects more global information than U-Net but less local information than CNN. To improve the performance of segmentation and classification in medical images while maximizing global and local data, we integrate O-Net with Mixed Transformer [1], this fuses the advantages of CNN and Transformer. This enables us to maximize both types of data. We combine CNN, Mixed Transformer, and Local-Global Gaussian-Weighted Self-Attention (LGG-SA) in the encoder component of our proposed O-Net architecture to obtain more global and local background information. The decoder part combines the Mixed Transformer and CNN blocks to obtain the results. The segmentation capability of the proposed network is evaluated by the multi-organ CT dataset containing synaptic information. The results of our trials demonstrate that the proposed MT-ONet can deliver superior segmentation performance relative to cutting-edge methods, resulting in improved classification precision.
MT-ONet:用于医学图像分割的混合变压器O-Net
在过去的几年里,深度学习因其优势被广泛应用于医疗行业。使用卷积神经网络(CNN)构建的U-Net框架已成为解决医学图像分割任务的行业标准。然而,该框架不能完全学习所有全局和远程语义信息。结果表明,变压器结构比U-Net能收集到更多的全局信息,但比CNN能收集到更少的局部信息。为了提高医学图像的分割和分类性能,同时最大化全局和局部数据,我们将O-Net与混合变压器[1]相结合,融合了CNN和Transformer的优点。这使我们能够最大化这两种类型的数据。我们在我们提出的O-Net架构的编码器组件中结合了CNN,混合变压器和局部-全局高斯加权自关注(LGG-SA),以获得更多的全局和局部背景信息。解码器部分结合混合变压器和CNN块来获得结果。通过包含突触信息的多器官CT数据集评估该网络的分割能力。我们的实验结果表明,相对于先进的方法,所提出的MT-ONet可以提供更好的分割性能,从而提高分类精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信