{"title":"基于局部编解码融合和多尺度关注的息肉分割算法","authors":"Qi Wu, Changming Zhu","doi":"10.1117/12.2682582","DOIUrl":null,"url":null,"abstract":"In recent years, the application of medical image semantic segmentation tasks in medical diagnosis and treatment planning has received widespread attention from the research community. The High-Resolution Network (HRNet) has good adaptability to high-resolution and high-scale medical images. In this paper, a novel high-resolution serial feature fusion encoding and decoding structure is proposed, and a CBAM attention mechanism is fused to construct a module that can jointly focus on spatial, channel, and multi-scale hierarchical information, which can improve the feature representation ability of the model and effectively reduce parameter complexity. We use the HRNet architecture to construct our model. Experimental results show that our method achieves MIoU coefficient of 98.44% on the Kvasir-SEG dataset, which is 1.43 percentage points higher than the original HR-Net model, validating the effectiveness and reliability of our method.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Algorithm for polyp segmentation with local encoding and decoding fusion and multi-scale attention\",\"authors\":\"Qi Wu, Changming Zhu\",\"doi\":\"10.1117/12.2682582\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, the application of medical image semantic segmentation tasks in medical diagnosis and treatment planning has received widespread attention from the research community. The High-Resolution Network (HRNet) has good adaptability to high-resolution and high-scale medical images. In this paper, a novel high-resolution serial feature fusion encoding and decoding structure is proposed, and a CBAM attention mechanism is fused to construct a module that can jointly focus on spatial, channel, and multi-scale hierarchical information, which can improve the feature representation ability of the model and effectively reduce parameter complexity. We use the HRNet architecture to construct our model. Experimental results show that our method achieves MIoU coefficient of 98.44% on the Kvasir-SEG dataset, which is 1.43 percentage points higher than the original HR-Net model, validating the effectiveness and reliability of our method.\",\"PeriodicalId\":177416,\"journal\":{\"name\":\"Conference on Electronic Information Engineering and Data Processing\",\"volume\":\"66 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference on Electronic Information Engineering and Data Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2682582\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Electronic Information Engineering and Data Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2682582","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Algorithm for polyp segmentation with local encoding and decoding fusion and multi-scale attention
In recent years, the application of medical image semantic segmentation tasks in medical diagnosis and treatment planning has received widespread attention from the research community. The High-Resolution Network (HRNet) has good adaptability to high-resolution and high-scale medical images. In this paper, a novel high-resolution serial feature fusion encoding and decoding structure is proposed, and a CBAM attention mechanism is fused to construct a module that can jointly focus on spatial, channel, and multi-scale hierarchical information, which can improve the feature representation ability of the model and effectively reduce parameter complexity. We use the HRNet architecture to construct our model. Experimental results show that our method achieves MIoU coefficient of 98.44% on the Kvasir-SEG dataset, which is 1.43 percentage points higher than the original HR-Net model, validating the effectiveness and reliability of our method.