{"title":"VSGNet:基于视觉显著性的皮肤病变分割网络","authors":"Zhefei Cai , Yingle Fan , Tao Fang , Wei Wu","doi":"10.1016/j.eswa.2025.128635","DOIUrl":null,"url":null,"abstract":"<div><div>The accuracy of skin lesion segmentation is of great significance for the subsequent clinical diagnosis. In order to improve the segmentation accuracy, some pioneering works tried to embed multiple complex modules, or used the huge Transformer framework, but due to the limitation of computing resources, these type of large models were not suitable for the actual clinical environment. To address the coexistence challenges of precision and lightweight, we propose a visual saliency guided network (VSGNet) for skin lesion segmentation, which generates saliency images of skin lesions through the efficient attention mechanism of biological vision, and guides the network to quickly locate the target area, so as to solve the localization difficulties in the skin lesion segmentation tasks. VSGNet includes three parts: Color Constancy module, Saliency Detection module and Ultra Lightweight Multi-level Interconnection Network (ULMI-Net). Specially, ULMI-Net uses a U-shaped structure network as the skeleton, including the Adaptive Split Channel Attention (ASCA) module that simulates the parallel mechanism of biological vision dual pathway, and the Channel-Spatial Parallel Attention (CSPA) module inspired by the multi-level interconnection structure of visual cortices. Through these modules, ULMI-Net can balance the efficient extraction and multi-scale fusion of global and local features, and try to achieve the excellent segmentation results at the lowest cost of parameters and computational complexity. To validate the effectiveness and robustness of the proposed VSGNet on three publicly available skin lesion segmentation datasets (ISIC2017, ISIC2018 and PH2 datasets). The experimental results show that compared to other state-of-the-art methods, VSGNet improves the Dice and mIoU metrics by 1.84 % and 3.34 %, respectively, and with a 196 × and 106 × reduction in the number of parameters and computational complexity. This paper constructs the VSGNet integrating the biological vision mechanism and the artificial intelligence algorithm, providing a new idea for the construction of deep learning models guided by the biological vision, promoting the development of biomimetic computational vision as well as.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"292 ","pages":"Article 128635"},"PeriodicalIF":7.5000,"publicationDate":"2025-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"VSGNet: visual saliency guided network for skin lesion segmentation\",\"authors\":\"Zhefei Cai , Yingle Fan , Tao Fang , Wei Wu\",\"doi\":\"10.1016/j.eswa.2025.128635\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The accuracy of skin lesion segmentation is of great significance for the subsequent clinical diagnosis. In order to improve the segmentation accuracy, some pioneering works tried to embed multiple complex modules, or used the huge Transformer framework, but due to the limitation of computing resources, these type of large models were not suitable for the actual clinical environment. 
To address the coexistence challenges of precision and lightweight, we propose a visual saliency guided network (VSGNet) for skin lesion segmentation, which generates saliency images of skin lesions through the efficient attention mechanism of biological vision, and guides the network to quickly locate the target area, so as to solve the localization difficulties in the skin lesion segmentation tasks. VSGNet includes three parts: Color Constancy module, Saliency Detection module and Ultra Lightweight Multi-level Interconnection Network (ULMI-Net). Specially, ULMI-Net uses a U-shaped structure network as the skeleton, including the Adaptive Split Channel Attention (ASCA) module that simulates the parallel mechanism of biological vision dual pathway, and the Channel-Spatial Parallel Attention (CSPA) module inspired by the multi-level interconnection structure of visual cortices. Through these modules, ULMI-Net can balance the efficient extraction and multi-scale fusion of global and local features, and try to achieve the excellent segmentation results at the lowest cost of parameters and computational complexity. To validate the effectiveness and robustness of the proposed VSGNet on three publicly available skin lesion segmentation datasets (ISIC2017, ISIC2018 and PH2 datasets). The experimental results show that compared to other state-of-the-art methods, VSGNet improves the Dice and mIoU metrics by 1.84 % and 3.34 %, respectively, and with a 196 × and 106 × reduction in the number of parameters and computational complexity. This paper constructs the VSGNet integrating the biological vision mechanism and the artificial intelligence algorithm, providing a new idea for the construction of deep learning models guided by the biological vision, promoting the development of biomimetic computational vision as well as.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"292 \",\"pages\":\"Article 128635\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-06-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425022547\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425022547","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Accurate skin lesion segmentation is of great significance for subsequent clinical diagnosis. To improve segmentation accuracy, some pioneering works embedded multiple complex modules or adopted large Transformer frameworks, but such large models are ill-suited to real clinical environments with limited computing resources. To address the twin challenges of precision and lightweight design, we propose a visual saliency guided network (VSGNet) for skin lesion segmentation. VSGNet generates saliency images of skin lesions through the efficient attention mechanism of biological vision and guides the network to quickly locate the target region, thereby alleviating the localization difficulties in skin lesion segmentation tasks. VSGNet consists of three parts: a Color Constancy module, a Saliency Detection module, and an Ultra Lightweight Multi-level Interconnection Network (ULMI-Net). Specifically, ULMI-Net uses a U-shaped network as its skeleton and includes the Adaptive Split Channel Attention (ASCA) module, which simulates the parallel mechanism of the dual pathways of biological vision, and the Channel-Spatial Parallel Attention (CSPA) module, inspired by the multi-level interconnection structure of the visual cortices. Through these modules, ULMI-Net balances the efficient extraction and multi-scale fusion of global and local features, aiming for excellent segmentation results at minimal cost in parameters and computational complexity. We validate the effectiveness and robustness of the proposed VSGNet on three publicly available skin lesion segmentation datasets (ISIC2017, ISIC2018, and PH2). The experimental results show that, compared with other state-of-the-art methods, VSGNet improves the Dice and mIoU metrics by 1.84 % and 3.34 %, respectively, while reducing the number of parameters and the computational complexity by 196× and 106×. This paper constructs VSGNet by integrating biological vision mechanisms with artificial intelligence algorithms, providing a new idea for building deep learning models guided by biological vision and promoting the development of biomimetic computational vision.
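The abstract describes a three-stage pipeline: color constancy, saliency detection, and a lightweight U-shaped ULMI-Net whose attention blocks run channel and spatial branches in parallel. The sketch below is a minimal, hypothetical PyTorch rendering of that pipeline for orientation only; the Shades-of-Gray color-constancy step, the saliency placeholder, and the `CSPABlock`/`TinyULMINet` internals are assumptions, not the paper's actual ASCA/CSPA designs or reported architecture.

```python
# Hedged sketch of the three-stage VSGNet pipeline described in the abstract
# (color constancy -> saliency detection -> lightweight U-shaped network).
# All module internals, names, and hyperparameters are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


def shades_of_gray(img: torch.Tensor, p: float = 6.0, eps: float = 1e-6) -> torch.Tensor:
    """Illustrative color-constancy step (Shades-of-Gray); the paper's exact method is an assumption."""
    # img: (B, 3, H, W) in [0, 1]; per-channel Minkowski-norm illuminant estimate
    illum = img.pow(p).mean(dim=(2, 3), keepdim=True).pow(1.0 / p)
    return (img / (illum + eps)).clamp(0.0, 1.0)


class CSPABlock(nn.Module):
    """Hypothetical channel-spatial parallel attention: two attention branches computed side by side."""

    def __init__(self, channels: int, reduction: int = 4):
        super().__init__()
        self.channel_gate = nn.Sequential(       # squeeze-and-excitation style channel attention
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, 1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1),
            nn.Sigmoid(),
        )
        self.spatial_gate = nn.Sequential(       # 7x7 conv over pooled maps as spatial attention
            nn.Conv2d(2, 1, kernel_size=7, padding=3),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        ca = x * self.channel_gate(x)
        pooled = torch.cat([x.mean(dim=1, keepdim=True), x.amax(dim=1, keepdim=True)], dim=1)
        sa = x * self.spatial_gate(pooled)
        return ca + sa                           # parallel fusion of the two branches


class TinyULMINet(nn.Module):
    """Very small U-shaped stand-in for ULMI-Net: image + saliency map in, lesion mask out."""

    def __init__(self, base: int = 16):
        super().__init__()
        self.enc1 = nn.Sequential(nn.Conv2d(4, base, 3, padding=1), nn.ReLU(inplace=True), CSPABlock(base))
        self.enc2 = nn.Sequential(nn.Conv2d(base, base * 2, 3, stride=2, padding=1), nn.ReLU(inplace=True), CSPABlock(base * 2))
        self.dec = nn.Conv2d(base * 2, base, 3, padding=1)
        self.head = nn.Conv2d(base, 1, 1)

    def forward(self, x: torch.Tensor, saliency: torch.Tensor) -> torch.Tensor:
        x = torch.cat([x, saliency], dim=1)      # saliency map guides the network toward the lesion region
        e1 = self.enc1(x)
        e2 = self.enc2(e1)
        d = F.interpolate(self.dec(e2), size=e1.shape[2:], mode="bilinear", align_corners=False)
        return torch.sigmoid(self.head(d + e1))  # lesion probability map


if __name__ == "__main__":
    img = torch.rand(1, 3, 128, 128)
    img = shades_of_gray(img)                    # stage 1: color constancy
    saliency = img.mean(dim=1, keepdim=True)     # stage 2 placeholder: any saliency detector giving (B, 1, H, W)
    mask = TinyULMINet()(img, saliency)          # stage 3: lightweight U-shaped segmentation
    print(mask.shape)                            # torch.Size([1, 1, 128, 128])
```

The point of the sketch is the data flow, not the exact operators: a saliency map produced before segmentation is concatenated with the image so that a very small U-shaped network can focus its limited capacity on the lesion region.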
Journal Introduction:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.