{"title":"Multi-Scale Graph Channel Attention Detectors for Sonar Images in Smart Ocean","authors":"Sensen Li;Yu Zhang;Zhengda Ma;Jie Ding;Binbin Zou","doi":"10.1109/TIM.2025.3551424","DOIUrl":null,"url":null,"abstract":"Object detection in sonar images is challenging since the performance based on generic object detectors is generally not well on sonar datasets and the prior for the scale distribution of the objects is usually ignored. To overcome these issues, in this article, multi-scale graph channel attention (MGCA) detectors are proposed, in which multi-scale box copy-paste is designed for data augmentation and graph transformer channel attention (GTCA) based on ConvNeXt is introduced to learn powerful visual representations for sonar images. GTCA contains three basic modules: graph structure transformation module for feature maps, graph convolution module for aggregating and updating information, and feedforward network module for feature transformation. With multi-scale box copy-paste to balance scale distribution and channel attention network in a graph form to recalibrate feature maps, MGCA detectors can effectively detect multi-scale objects in sonar images through transfer learning. The MGCA detector with Cascade R-CNN (CR) detection head achieves detection accuracies of 95.6 mAP and 72.7 mmAP on the sonar common target detection (SCTD) dataset, outperforming the previous best detector with an increase of 2.8 mAP and 11.8 mmAP. Extensive experimental results and ablation studies on two sonar datasets demonstrate the superiority of the proposed method compared with state-of-the-art methods.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-14"},"PeriodicalIF":5.6000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10930717/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Object detection in sonar images is challenging since the performance based on generic object detectors is generally not well on sonar datasets and the prior for the scale distribution of the objects is usually ignored. To overcome these issues, in this article, multi-scale graph channel attention (MGCA) detectors are proposed, in which multi-scale box copy-paste is designed for data augmentation and graph transformer channel attention (GTCA) based on ConvNeXt is introduced to learn powerful visual representations for sonar images. GTCA contains three basic modules: graph structure transformation module for feature maps, graph convolution module for aggregating and updating information, and feedforward network module for feature transformation. With multi-scale box copy-paste to balance scale distribution and channel attention network in a graph form to recalibrate feature maps, MGCA detectors can effectively detect multi-scale objects in sonar images through transfer learning. The MGCA detector with Cascade R-CNN (CR) detection head achieves detection accuracies of 95.6 mAP and 72.7 mmAP on the sonar common target detection (SCTD) dataset, outperforming the previous best detector with an increase of 2.8 mAP and 11.8 mmAP. Extensive experimental results and ablation studies on two sonar datasets demonstrate the superiority of the proposed method compared with state-of-the-art methods.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.