Xuzhen Huang , Yuliang Ma , Xiajin Mei , Zizhuo Wu , Mingxu Sun , Qingshan She
{"title":"基于边界感知和CNN-transformer融合网络的皮肤病灶分割的病灶边界检测","authors":"Xuzhen Huang , Yuliang Ma , Xiajin Mei , Zizhuo Wu , Mingxu Sun , Qingshan She","doi":"10.1016/j.artmed.2025.103190","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional convolutional neural networks often struggle to capture global information and handle ambiguous boundaries during complex skin lesion segmentation tasks. To tackle this challenge, we proposed MPBA-Net, a hybrid network that integrates multi-pooling fusion and boundary-aware refinement. The network integrated Convolutional Neural Network (CNN) and Transformer to generate rich skin lesion feature maps for comprehensive feature extraction. Specifically, we introduced a boundary-aware attention gate (BAAG) module in the Transformer encoder layer and added a boundary cross attention (BCA) module at the end of the network to capture critical skin lesion boundary features. Additionally, we developed a multi-pooling fusion (MPF) module that extracts global multi-scale features by fusing improved Spatial Pyramid (SP) and Atrous Spatial Pyramid Pooling (ASPP). To optimize training, we designed a Point Loss derived from Binary Cross-Entropy (BCE) and combined it with Dice Loss to form a hybrid loss function. This approach not only enhances classification performance but also provides more precise measurement of the similarity between segmentation results and ground truth annotations. Ablation experiments on the ISIC2018 dataset validated the effectiveness of our fusion strategies and network improvements. Comparative experiments on the ISIC2016, ISIC2017, and ISIC2018 datasets showed that the Dice index of MPBA-Net outperformed other comparative segmentation methods in all three datasets, achieving 91.47 %, 87.04 %, and 88.93 %, respectively. Quantitative and qualitative results demonstrate that our method improves skin lesion segmentation accuracy, aiding dermatologists in clinical diagnosis and treatment. Our code is available at <span><span>https://github.com/FengYuchenGuang/MPBA-Net</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"167 ","pages":"Article 103190"},"PeriodicalIF":6.2000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lesion boundary detection for skin lesion segmentation based on boundary sensing and CNN-transformer fusion networks\",\"authors\":\"Xuzhen Huang , Yuliang Ma , Xiajin Mei , Zizhuo Wu , Mingxu Sun , Qingshan She\",\"doi\":\"10.1016/j.artmed.2025.103190\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Traditional convolutional neural networks often struggle to capture global information and handle ambiguous boundaries during complex skin lesion segmentation tasks. To tackle this challenge, we proposed MPBA-Net, a hybrid network that integrates multi-pooling fusion and boundary-aware refinement. The network integrated Convolutional Neural Network (CNN) and Transformer to generate rich skin lesion feature maps for comprehensive feature extraction. Specifically, we introduced a boundary-aware attention gate (BAAG) module in the Transformer encoder layer and added a boundary cross attention (BCA) module at the end of the network to capture critical skin lesion boundary features. Additionally, we developed a multi-pooling fusion (MPF) module that extracts global multi-scale features by fusing improved Spatial Pyramid (SP) and Atrous Spatial Pyramid Pooling (ASPP). To optimize training, we designed a Point Loss derived from Binary Cross-Entropy (BCE) and combined it with Dice Loss to form a hybrid loss function. This approach not only enhances classification performance but also provides more precise measurement of the similarity between segmentation results and ground truth annotations. Ablation experiments on the ISIC2018 dataset validated the effectiveness of our fusion strategies and network improvements. Comparative experiments on the ISIC2016, ISIC2017, and ISIC2018 datasets showed that the Dice index of MPBA-Net outperformed other comparative segmentation methods in all three datasets, achieving 91.47 %, 87.04 %, and 88.93 %, respectively. Quantitative and qualitative results demonstrate that our method improves skin lesion segmentation accuracy, aiding dermatologists in clinical diagnosis and treatment. Our code is available at <span><span>https://github.com/FengYuchenGuang/MPBA-Net</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":55458,\"journal\":{\"name\":\"Artificial Intelligence in Medicine\",\"volume\":\"167 \",\"pages\":\"Article 103190\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence in Medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0933365725001253\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0933365725001253","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Lesion boundary detection for skin lesion segmentation based on boundary sensing and CNN-transformer fusion networks
Traditional convolutional neural networks often struggle to capture global information and handle ambiguous boundaries during complex skin lesion segmentation tasks. To tackle this challenge, we proposed MPBA-Net, a hybrid network that integrates multi-pooling fusion and boundary-aware refinement. The network integrated Convolutional Neural Network (CNN) and Transformer to generate rich skin lesion feature maps for comprehensive feature extraction. Specifically, we introduced a boundary-aware attention gate (BAAG) module in the Transformer encoder layer and added a boundary cross attention (BCA) module at the end of the network to capture critical skin lesion boundary features. Additionally, we developed a multi-pooling fusion (MPF) module that extracts global multi-scale features by fusing improved Spatial Pyramid (SP) and Atrous Spatial Pyramid Pooling (ASPP). To optimize training, we designed a Point Loss derived from Binary Cross-Entropy (BCE) and combined it with Dice Loss to form a hybrid loss function. This approach not only enhances classification performance but also provides more precise measurement of the similarity between segmentation results and ground truth annotations. Ablation experiments on the ISIC2018 dataset validated the effectiveness of our fusion strategies and network improvements. Comparative experiments on the ISIC2016, ISIC2017, and ISIC2018 datasets showed that the Dice index of MPBA-Net outperformed other comparative segmentation methods in all three datasets, achieving 91.47 %, 87.04 %, and 88.93 %, respectively. Quantitative and qualitative results demonstrate that our method improves skin lesion segmentation accuracy, aiding dermatologists in clinical diagnosis and treatment. Our code is available at https://github.com/FengYuchenGuang/MPBA-Net.
期刊介绍:
Artificial Intelligence in Medicine publishes original articles from a wide variety of interdisciplinary perspectives concerning the theory and practice of artificial intelligence (AI) in medicine, medically-oriented human biology, and health care.
Artificial intelligence in medicine may be characterized as the scientific discipline pertaining to research studies, projects, and applications that aim at supporting decision-based medical tasks through knowledge- and/or data-intensive computer-based solutions that ultimately support and improve the performance of a human care provider.