{"title":"A Three-Branch Cross-Modal Interactive Network for RGB-D Salient Defect Detection","authors":"Lisha Cui;Ming Ma;Chaochao Li;Xiaoheng Jiang;Zhiwen Song;Lijian Fan;Mingliang Xu","doi":"10.1109/TIM.2025.3599282","DOIUrl":null,"url":null,"abstract":"RGB defective images are abundant in color and texture, whereas depth images exhibit prominent defect shapes and boundaries. Given this fact, we propose a three-branch cross-modal interactive network for RGB-D salient defect detection (TCI-Net). Specifically, we first perform a real three-stream encoder–decoder network at both the image level and feature level, with each branch utilizing RGB, RGB-D, and depth images as input to fully extract the underlying complementary information from each modality. In the encoder stage, a differential guidance module (DGM) is proposed to guide the RGB branch in learning the boundary shape features of defects, while a fusion perception module (FPM) is devised to facilitate the depth branch in encoding more textual knowledge. In addition, we propose a cross-modal feature refinement module (CFRM) to bridge the feature gap between modalities and enhance information interaction. Finally, in the decoder stage, we incorporate boundary map supervision and a semantic guidance module (SGM) to enhance the details and contextual semantics of the defects, while gradually reconstructing the spatial scale. Extensive experiments and analyses on the RGB-D defect dataset NEU RSDDS-AUG demonstrate that the proposed TCI-Net significantly improves the segmentation accuracy compared with state-of-the-art algorithms.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-11"},"PeriodicalIF":5.9000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11146450/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
RGB defective images are abundant in color and texture, whereas depth images exhibit prominent defect shapes and boundaries. Given this fact, we propose a three-branch cross-modal interactive network for RGB-D salient defect detection (TCI-Net). Specifically, we first perform a real three-stream encoder–decoder network at both the image level and feature level, with each branch utilizing RGB, RGB-D, and depth images as input to fully extract the underlying complementary information from each modality. In the encoder stage, a differential guidance module (DGM) is proposed to guide the RGB branch in learning the boundary shape features of defects, while a fusion perception module (FPM) is devised to facilitate the depth branch in encoding more textual knowledge. In addition, we propose a cross-modal feature refinement module (CFRM) to bridge the feature gap between modalities and enhance information interaction. Finally, in the decoder stage, we incorporate boundary map supervision and a semantic guidance module (SGM) to enhance the details and contextual semantics of the defects, while gradually reconstructing the spatial scale. Extensive experiments and analyses on the RGB-D defect dataset NEU RSDDS-AUG demonstrate that the proposed TCI-Net significantly improves the segmentation accuracy compared with state-of-the-art algorithms.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.