综合细节协同与多层次动态交互增强显著目标检测

IF 7.5 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-07-04 DOI:10.1016/j.engappai.2025.111579

Bingfeng Li , Boxiang Lv , Qingshan Chen , Xinxin Duan , Xinwei Li

{"title":"综合细节协同与多层次动态交互增强显著目标检测","authors":"Bingfeng Li , Boxiang Lv , Qingshan Chen , Xinxin Duan , Xinwei Li","doi":"10.1016/j.engappai.2025.111579","DOIUrl":null,"url":null,"abstract":"<div><div>Salient Object Detection involves identifying and segmenting the most visually distinctive objects in an image. A key challenge is distinguishing Salient objects from complex backgrounds while preserving global features and minimizing local detail loss. To address this issue, we introduce a Comprehensive-Detail Synergy with Multi-Level Dynamic Interaction for Enhanced Salient Object Detection aimed at enhancing salient object features. Initially, a Multi-Scale Pooling Self-Attention Module is introduced to capture global contextual information of salient objects by combining multi-scale max pooling across spatial dimensions with self-attention. Additionally, to better preserve local details, an Adaptive Channel Enhancement Block is proposed, utilizing an adaptive weighting strategy to prioritize salient channels and enhance the model’s ability to capture intricate local features. Furthermore, to enhance the interaction between features at different levels, a Multi-Level Diffusive Synergy Block is introduced. With the integration of the cross-attention and dynamic diffusion refinement mechanism, it enables deep features to guide shallow features in focusing on salient regions. To alleviate the loss of local details due to excessive deep feature guidance, a Dual-Domain Fusion Attention Module is proposed, which integrates global self-attention with locally enhanced feature extraction units, thereby balancing global context modeling and local detail preservation. The experimental results conducted on six challenging publicly available datasets demonstrate that the proposed method outperforms the state of the art, achieving improvements of 4.9%, 3.3%, 2.4%, 2.3%, 1.2%, and 7.2% in the Weighted Harmonic Mean of Precision and Recall. These results demonstrate that the method improves accuracy and boundary detail.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111579"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comprehensive-Detail Synergy with Multi-Level Dynamic Interaction for Enhanced Salient Object Detection\",\"authors\":\"Bingfeng Li , Boxiang Lv , Qingshan Chen , Xinxin Duan , Xinwei Li\",\"doi\":\"10.1016/j.engappai.2025.111579\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Salient Object Detection involves identifying and segmenting the most visually distinctive objects in an image. A key challenge is distinguishing Salient objects from complex backgrounds while preserving global features and minimizing local detail loss. To address this issue, we introduce a Comprehensive-Detail Synergy with Multi-Level Dynamic Interaction for Enhanced Salient Object Detection aimed at enhancing salient object features. Initially, a Multi-Scale Pooling Self-Attention Module is introduced to capture global contextual information of salient objects by combining multi-scale max pooling across spatial dimensions with self-attention. Additionally, to better preserve local details, an Adaptive Channel Enhancement Block is proposed, utilizing an adaptive weighting strategy to prioritize salient channels and enhance the model’s ability to capture intricate local features. Furthermore, to enhance the interaction between features at different levels, a Multi-Level Diffusive Synergy Block is introduced. With the integration of the cross-attention and dynamic diffusion refinement mechanism, it enables deep features to guide shallow features in focusing on salient regions. To alleviate the loss of local details due to excessive deep feature guidance, a Dual-Domain Fusion Attention Module is proposed, which integrates global self-attention with locally enhanced feature extraction units, thereby balancing global context modeling and local detail preservation. The experimental results conducted on six challenging publicly available datasets demonstrate that the proposed method outperforms the state of the art, achieving improvements of 4.9%, 3.3%, 2.4%, 2.3%, 1.2%, and 7.2% in the Weighted Harmonic Mean of Precision and Recall. These results demonstrate that the method improves accuracy and boundary detail.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"159 \",\"pages\":\"Article 111579\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625015817\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625015817","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

显著目标检测涉及识别和分割图像中视觉上最显著的目标。一个关键的挑战是在保持全局特征和最小化局部细节损失的同时，从复杂的背景中区分出突出的目标。为了解决这个问题，我们引入了一种具有多层次动态交互的全面细节协同，用于增强显著目标检测，旨在增强显著目标特征。首先，引入多尺度池化自注意模块，通过将跨空间维度的多尺度最大池化与自注意相结合，捕获显著目标的全局上下文信息。此外，为了更好地保留局部细节，提出了一种自适应信道增强块，利用自适应加权策略对显著信道进行优先级排序，增强模型捕获复杂局部特征的能力。此外，为了增强不同层次特征之间的相互作用，引入了多级扩散协同块。融合交叉关注和动态扩散细化机制，使深层特征引导浅层特征聚焦突出区域。为了减轻深度特征引导导致的局部细节丢失，提出了一种双域融合关注模块，该模块将全局自关注与局部增强特征提取单元相结合，从而平衡全局上下文建模和局部细节保存。在六个具有挑战性的公开数据集上进行的实验结果表明，所提出的方法优于目前的技术水平，在精度和召回率的加权调和平均值上分别提高了4.9%,3.3%,2.4%,2.3%，1.2%和7.2%。结果表明，该方法提高了图像的精度和边界细节。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Comprehensive-Detail Synergy with Multi-Level Dynamic Interaction for Enhanced Salient Object Detection

Salient Object Detection involves identifying and segmenting the most visually distinctive objects in an image. A key challenge is distinguishing Salient objects from complex backgrounds while preserving global features and minimizing local detail loss. To address this issue, we introduce a Comprehensive-Detail Synergy with Multi-Level Dynamic Interaction for Enhanced Salient Object Detection aimed at enhancing salient object features. Initially, a Multi-Scale Pooling Self-Attention Module is introduced to capture global contextual information of salient objects by combining multi-scale max pooling across spatial dimensions with self-attention. Additionally, to better preserve local details, an Adaptive Channel Enhancement Block is proposed, utilizing an adaptive weighting strategy to prioritize salient channels and enhance the model’s ability to capture intricate local features. Furthermore, to enhance the interaction between features at different levels, a Multi-Level Diffusive Synergy Block is introduced. With the integration of the cross-attention and dynamic diffusion refinement mechanism, it enables deep features to guide shallow features in focusing on salient regions. To alleviate the loss of local details due to excessive deep feature guidance, a Dual-Domain Fusion Attention Module is proposed, which integrates global self-attention with locally enhanced feature extraction units, thereby balancing global context modeling and local detail preservation. The experimental results conducted on six challenging publicly available datasets demonstrate that the proposed method outperforms the state of the art, achieving improvements of 4.9%, 3.3%, 2.4%, 2.3%, 1.2%, and 7.2% in the Weighted Harmonic Mean of Precision and Recall. These results demonstrate that the method improves accuracy and boundary detail.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.