{"title":"PDNet: an advanced architecture for polyp image segmentation","authors":"Hanqing Liu, Zhipeng Zhao, Ruichun Tang, Peishun Liu, Yixin Chen, Jianjun Zhang, Jing Jia","doi":"10.1117/12.2643392","DOIUrl":null,"url":null,"abstract":"In order to improve the segmentation accuracy of polyp image segmentation under colonoscopy, we propose PVT Dual-Upsampling Net (PDNet). PDNet adopts the encoder network based on Transformer as the backbone network for downsampling, and designs a dual upsampling module based on cascaded fusion network and simple connection network to recover the loss of high-level image features caused by the downsampling process, and obtains a high-level semantic feature map with the same resolution as the input image. The multi-feature fusion module is used to aggregate the low-level feature map and high-level semantic feature map. We validate the model on three publicly available datasets, and our experimental evaluations show that the suggested architecture produces good segmentation results on datasets.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Digital Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2643392","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In order to improve the segmentation accuracy of polyp image segmentation under colonoscopy, we propose PVT Dual-Upsampling Net (PDNet). PDNet adopts the encoder network based on Transformer as the backbone network for downsampling, and designs a dual upsampling module based on cascaded fusion network and simple connection network to recover the loss of high-level image features caused by the downsampling process, and obtains a high-level semantic feature map with the same resolution as the input image. The multi-feature fusion module is used to aggregate the low-level feature map and high-level semantic feature map. We validate the model on three publicly available datasets, and our experimental evaluations show that the suggested architecture produces good segmentation results on datasets.