{"title":"DCTNet: A Hybrid Model of CNN and Dilated Contextual Transformer for Medical Image Segmentation","authors":"Xiang Pan, Jiapeng Xiong","doi":"10.1109/ITNEC56291.2023.10082385","DOIUrl":null,"url":null,"abstract":"Medical image segmentation is a prerequisite for the development of medical systems, especially for disease diagnosis and treatment planning. Due to the inherent limitations of convolutional operations, convolutional neural networks (CNNs), although they have become the consensus for various medical image segmentation tasks, show limitations in extracting remote image features. transformer shows superior performance in extracting remote image features, but it cannot capture low-level features. Existing studies have shown that combining CNN and Transformer can give better results. In this paper, we propose a Transformer module that can effectively combine low-level and high-level features while maintaining feature consistency, leveraging the rich context between adjacent keys, and combining it with the classical model of convolutional neural networks to form a new approach to medical image segmentation that effectively connects CNN and Transformer. We conducted experiments on the dataset. The experimental results show that our proposed algorithm is the best in all segmentation metrics.","PeriodicalId":218770,"journal":{"name":"2023 IEEE 6th Information Technology,Networking,Electronic and Automation Control Conference (ITNEC)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 6th Information Technology,Networking,Electronic and Automation Control Conference (ITNEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITNEC56291.2023.10082385","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Medical image segmentation is a prerequisite for the development of medical systems, especially for disease diagnosis and treatment planning. Due to the inherent limitations of convolutional operations, convolutional neural networks (CNNs), although they have become the consensus for various medical image segmentation tasks, show limitations in extracting remote image features. transformer shows superior performance in extracting remote image features, but it cannot capture low-level features. Existing studies have shown that combining CNN and Transformer can give better results. In this paper, we propose a Transformer module that can effectively combine low-level and high-level features while maintaining feature consistency, leveraging the rich context between adjacent keys, and combining it with the classical model of convolutional neural networks to form a new approach to medical image segmentation that effectively connects CNN and Transformer. We conducted experiments on the dataset. The experimental results show that our proposed algorithm is the best in all segmentation metrics.