Chenxing Xia , Jingjing Wang , Xiuju Gao , Bin Ge , Wenjun Zhao , Kuan-Ching Li , Xianjin Fang , Yan Zhang
{"title":"Dynamic selection fusion network for RGB-D salient object detection","authors":"Chenxing Xia , Jingjing Wang , Xiuju Gao , Bin Ge , Wenjun Zhao , Kuan-Ching Li , Xianjin Fang , Yan Zhang","doi":"10.1016/j.compeleceng.2025.110701","DOIUrl":null,"url":null,"abstract":"<div><div>RGB-D salient object detection (SOD) has garnered increasing attention due to the introduction of depth cues. While RGB images and depth maps have inherent differences, they possess strong semantic consistency in representing objects. However, existing methods often fail to consider the consistent representation of the two modalities when exploring cross-modal interaction. To this end, we propose a novel dynamic selection fusion network (DSFNet) for RGB-D SOD by incorporating cross-modal consistency enhancement and cascade feature refinement. Specifically, in the encoding phase, a cross-modal consistency enhancement (CMCE) module is introduced to dynamically enhance both interaction consistency and semantic consistency representations through attention guidance. In the decoding phase, a cascade feature refinement (CFR) module is designed to progressively refine segmentation results from coarse to fine by incorporating multi-scale pooling and enhancement operations. Experimental evaluation shows that our DSFNet outperforms 14 current state-of-the-art RGB-D SOD methods.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"128 ","pages":"Article 110701"},"PeriodicalIF":4.9000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625006445","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
RGB-D salient object detection (SOD) has garnered increasing attention due to the introduction of depth cues. While RGB images and depth maps have inherent differences, they possess strong semantic consistency in representing objects. However, existing methods often fail to consider the consistent representation of the two modalities when exploring cross-modal interaction. To this end, we propose a novel dynamic selection fusion network (DSFNet) for RGB-D SOD by incorporating cross-modal consistency enhancement and cascade feature refinement. Specifically, in the encoding phase, a cross-modal consistency enhancement (CMCE) module is introduced to dynamically enhance both interaction consistency and semantic consistency representations through attention guidance. In the decoding phase, a cascade feature refinement (CFR) module is designed to progressively refine segmentation results from coarse to fine by incorporating multi-scale pooling and enhancement operations. Experimental evaluation shows that our DSFNet outperforms 14 current state-of-the-art RGB-D SOD methods.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.