Bingfeng Li , Boxiang Lv , Qingshan Chen , Xinxin Duan , Xinwei Li
{"title":"Comprehensive-Detail Synergy with Multi-Level Dynamic Interaction for Enhanced Salient Object Detection","authors":"Bingfeng Li , Boxiang Lv , Qingshan Chen , Xinxin Duan , Xinwei Li","doi":"10.1016/j.engappai.2025.111579","DOIUrl":null,"url":null,"abstract":"<div><div>Salient Object Detection involves identifying and segmenting the most visually distinctive objects in an image. A key challenge is distinguishing Salient objects from complex backgrounds while preserving global features and minimizing local detail loss. To address this issue, we introduce a Comprehensive-Detail Synergy with Multi-Level Dynamic Interaction for Enhanced Salient Object Detection aimed at enhancing salient object features. Initially, a Multi-Scale Pooling Self-Attention Module is introduced to capture global contextual information of salient objects by combining multi-scale max pooling across spatial dimensions with self-attention. Additionally, to better preserve local details, an Adaptive Channel Enhancement Block is proposed, utilizing an adaptive weighting strategy to prioritize salient channels and enhance the model’s ability to capture intricate local features. Furthermore, to enhance the interaction between features at different levels, a Multi-Level Diffusive Synergy Block is introduced. With the integration of the cross-attention and dynamic diffusion refinement mechanism, it enables deep features to guide shallow features in focusing on salient regions. To alleviate the loss of local details due to excessive deep feature guidance, a Dual-Domain Fusion Attention Module is proposed, which integrates global self-attention with locally enhanced feature extraction units, thereby balancing global context modeling and local detail preservation. The experimental results conducted on six challenging publicly available datasets demonstrate that the proposed method outperforms the state of the art, achieving improvements of 4.9%, 3.3%, 2.4%, 2.3%, 1.2%, and 7.2% in the Weighted Harmonic Mean of Precision and Recall. These results demonstrate that the method improves accuracy and boundary detail.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111579"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625015817","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Salient Object Detection involves identifying and segmenting the most visually distinctive objects in an image. A key challenge is distinguishing Salient objects from complex backgrounds while preserving global features and minimizing local detail loss. To address this issue, we introduce a Comprehensive-Detail Synergy with Multi-Level Dynamic Interaction for Enhanced Salient Object Detection aimed at enhancing salient object features. Initially, a Multi-Scale Pooling Self-Attention Module is introduced to capture global contextual information of salient objects by combining multi-scale max pooling across spatial dimensions with self-attention. Additionally, to better preserve local details, an Adaptive Channel Enhancement Block is proposed, utilizing an adaptive weighting strategy to prioritize salient channels and enhance the model’s ability to capture intricate local features. Furthermore, to enhance the interaction between features at different levels, a Multi-Level Diffusive Synergy Block is introduced. With the integration of the cross-attention and dynamic diffusion refinement mechanism, it enables deep features to guide shallow features in focusing on salient regions. To alleviate the loss of local details due to excessive deep feature guidance, a Dual-Domain Fusion Attention Module is proposed, which integrates global self-attention with locally enhanced feature extraction units, thereby balancing global context modeling and local detail preservation. The experimental results conducted on six challenging publicly available datasets demonstrate that the proposed method outperforms the state of the art, achieving improvements of 4.9%, 3.3%, 2.4%, 2.3%, 1.2%, and 7.2% in the Weighted Harmonic Mean of Precision and Recall. These results demonstrate that the method improves accuracy and boundary detail.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.