Youpeng Yi, Jiawei Xu, Xiaoqin Zhang, Seop Hyeong Park
{"title":"ReBiT-Net: Resource-Efficient Bidirectional Transmission Network for RGB-D Salient Object Detection","authors":"Youpeng Yi, Jiawei Xu, Xiaoqin Zhang, Seop Hyeong Park","doi":"10.1007/s42835-024-01971-z","DOIUrl":null,"url":null,"abstract":"<p>Existing artificial neural network-based methodologies for salient object detection in RGB-depth (RGB-D) images typically require significant memory and computation time. In this paper, we propose ReBiT-Net, an novel and resource-efficient network designed to addresses this issue. ReBiT-Net utilizes a mobile network for feature extraction and incorporates depth map quality to regulate the fusion of multi-modal features, resulting in top-to-bottom refinement of salient objects using salient information. Empirical evaluations conducted on five benchmark datasets demonstrate the superior performance of our model in terms of accuracy (achieving 334 frames per second for an input size of 320 <span>\\(\\times\\)</span> 320) and model parameters (merely 5.1 MB). Moreover, we introduce ReBiT-Net*, a simplified variant of ReBiT-Net, which entails reduced model parameters (4.2 MB) and enhanced processing speed (793 frames per second for a 256 <span>\\(\\times\\)</span> 256 input size). These improvements are achieved through reduced memory requirements and computational demands via the adoption of a smaller input image size.</p>","PeriodicalId":15577,"journal":{"name":"Journal of Electrical Engineering & Technology","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Electrical Engineering & Technology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s42835-024-01971-z","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Existing artificial neural network-based methodologies for salient object detection in RGB-depth (RGB-D) images typically require significant memory and computation time. In this paper, we propose ReBiT-Net, an novel and resource-efficient network designed to addresses this issue. ReBiT-Net utilizes a mobile network for feature extraction and incorporates depth map quality to regulate the fusion of multi-modal features, resulting in top-to-bottom refinement of salient objects using salient information. Empirical evaluations conducted on five benchmark datasets demonstrate the superior performance of our model in terms of accuracy (achieving 334 frames per second for an input size of 320 \(\times\) 320) and model parameters (merely 5.1 MB). Moreover, we introduce ReBiT-Net*, a simplified variant of ReBiT-Net, which entails reduced model parameters (4.2 MB) and enhanced processing speed (793 frames per second for a 256 \(\times\) 256 input size). These improvements are achieved through reduced memory requirements and computational demands via the adoption of a smaller input image size.
期刊介绍:
ournal of Electrical Engineering and Technology (JEET), which is the official publication of the Korean Institute of Electrical Engineers (KIEE) being published bimonthly, released the first issue in March 2006.The journal is open to submission from scholars and experts in the wide areas of electrical engineering technologies.
The scope of the journal includes all issues in the field of Electrical Engineering and Technology. Included are techniques for electrical power engineering, electrical machinery and energy conversion systems, electrophysics and applications, information and controls.