{"title":"Algorithm for polyp segmentation with local encoding and decoding fusion and multi-scale attention","authors":"Qi Wu, Changming Zhu","doi":"10.1117/12.2682582","DOIUrl":null,"url":null,"abstract":"In recent years, the application of medical image semantic segmentation tasks in medical diagnosis and treatment planning has received widespread attention from the research community. The High-Resolution Network (HRNet) has good adaptability to high-resolution and high-scale medical images. In this paper, a novel high-resolution serial feature fusion encoding and decoding structure is proposed, and a CBAM attention mechanism is fused to construct a module that can jointly focus on spatial, channel, and multi-scale hierarchical information, which can improve the feature representation ability of the model and effectively reduce parameter complexity. We use the HRNet architecture to construct our model. Experimental results show that our method achieves MIoU coefficient of 98.44% on the Kvasir-SEG dataset, which is 1.43 percentage points higher than the original HR-Net model, validating the effectiveness and reliability of our method.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Electronic Information Engineering and Data Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2682582","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, the application of medical image semantic segmentation tasks in medical diagnosis and treatment planning has received widespread attention from the research community. The High-Resolution Network (HRNet) has good adaptability to high-resolution and high-scale medical images. In this paper, a novel high-resolution serial feature fusion encoding and decoding structure is proposed, and a CBAM attention mechanism is fused to construct a module that can jointly focus on spatial, channel, and multi-scale hierarchical information, which can improve the feature representation ability of the model and effectively reduce parameter complexity. We use the HRNet architecture to construct our model. Experimental results show that our method achieves MIoU coefficient of 98.44% on the Kvasir-SEG dataset, which is 1.43 percentage points higher than the original HR-Net model, validating the effectiveness and reliability of our method.