MT-ONet:用于医学图像分割的混合变压器O-Net

2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD) Pub Date : 2022-11-30 DOI:10.1109/ICSMD57530.2022.10058445

Pengfei Zheng

{"title":"MT-ONet:用于医学图像分割的混合变压器O-Net","authors":"Pengfei Zheng","doi":"10.1109/ICSMD57530.2022.10058445","DOIUrl":null,"url":null,"abstract":"In the past few years, the deep learning is widely used in the medical industry due to its advantage. Constructed using Convolutional Neural Networks (CNN), the U-Net framework has become the industry standard for solving medical image segmentation tasks. Nonetheless, this framework is incapable of entirely learning all global and remote semantic information. It has been demonstrated that the transformer structure collects more global information than U-Net but less local information than CNN. To improve the performance of segmentation and classification in medical images while maximizing global and local data, we integrate O-Net with Mixed Transformer [1], this fuses the advantages of CNN and Transformer. This enables us to maximize both types of data. We combine CNN, Mixed Transformer, and Local-Global Gaussian-Weighted Self-Attention (LGG-SA) in the encoder component of our proposed O-Net architecture to obtain more global and local background information. The decoder part combines the Mixed Transformer and CNN blocks to obtain the results. The segmentation capability of the proposed network is evaluated by the multi-organ CT dataset containing synaptic information. The results of our trials demonstrate that the proposed MT-ONet can deliver superior segmentation performance relative to cutting-edge methods, resulting in improved classification precision.","PeriodicalId":396735,"journal":{"name":"2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MT-ONet: Mixed Transformer O-Net for Medical Image Segmentation\",\"authors\":\"Pengfei Zheng\",\"doi\":\"10.1109/ICSMD57530.2022.10058445\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the past few years, the deep learning is widely used in the medical industry due to its advantage. Constructed using Convolutional Neural Networks (CNN), the U-Net framework has become the industry standard for solving medical image segmentation tasks. Nonetheless, this framework is incapable of entirely learning all global and remote semantic information. It has been demonstrated that the transformer structure collects more global information than U-Net but less local information than CNN. To improve the performance of segmentation and classification in medical images while maximizing global and local data, we integrate O-Net with Mixed Transformer [1], this fuses the advantages of CNN and Transformer. This enables us to maximize both types of data. We combine CNN, Mixed Transformer, and Local-Global Gaussian-Weighted Self-Attention (LGG-SA) in the encoder component of our proposed O-Net architecture to obtain more global and local background information. The decoder part combines the Mixed Transformer and CNN blocks to obtain the results. The segmentation capability of the proposed network is evaluated by the multi-organ CT dataset containing synaptic information. The results of our trials demonstrate that the proposed MT-ONet can deliver superior segmentation performance relative to cutting-edge methods, resulting in improved classification precision.\",\"PeriodicalId\":396735,\"journal\":{\"name\":\"2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSMD57530.2022.10058445\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSMD57530.2022.10058445","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在过去的几年里，深度学习因其优势被广泛应用于医疗行业。使用卷积神经网络(CNN)构建的U-Net框架已成为解决医学图像分割任务的行业标准。然而，该框架不能完全学习所有全局和远程语义信息。结果表明，变压器结构比U-Net能收集到更多的全局信息，但比CNN能收集到更少的局部信息。为了提高医学图像的分割和分类性能，同时最大化全局和局部数据，我们将O-Net与混合变压器[1]相结合，融合了CNN和Transformer的优点。这使我们能够最大化这两种类型的数据。我们在我们提出的O-Net架构的编码器组件中结合了CNN，混合变压器和局部-全局高斯加权自关注(LGG-SA)，以获得更多的全局和局部背景信息。解码器部分结合混合变压器和CNN块来获得结果。通过包含突触信息的多器官CT数据集评估该网络的分割能力。我们的实验结果表明，相对于先进的方法，所提出的MT-ONet可以提供更好的分割性能，从而提高分类精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

MT-ONet: Mixed Transformer O-Net for Medical Image Segmentation

In the past few years, the deep learning is widely used in the medical industry due to its advantage. Constructed using Convolutional Neural Networks (CNN), the U-Net framework has become the industry standard for solving medical image segmentation tasks. Nonetheless, this framework is incapable of entirely learning all global and remote semantic information. It has been demonstrated that the transformer structure collects more global information than U-Net but less local information than CNN. To improve the performance of segmentation and classification in medical images while maximizing global and local data, we integrate O-Net with Mixed Transformer [1], this fuses the advantages of CNN and Transformer. This enables us to maximize both types of data. We combine CNN, Mixed Transformer, and Local-Global Gaussian-Weighted Self-Attention (LGG-SA) in the encoder component of our proposed O-Net architecture to obtain more global and local background information. The decoder part combines the Mixed Transformer and CNN blocks to obtain the results. The segmentation capability of the proposed network is evaluated by the multi-organ CT dataset containing synaptic information. The results of our trials demonstrate that the proposed MT-ONet can deliver superior segmentation performance relative to cutting-edge methods, resulting in improved classification precision.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)

自引率

0.00%

发文量