{"title":"基于残差空间协同注意模块的自监督学习水下目标检测","authors":"Weilin Luo;Chengyu Lin;Huan Zhou","doi":"10.1109/JOE.2025.3556153","DOIUrl":null,"url":null,"abstract":"The performance of underwater target detection techniques is limited by various factors. First, in underwater image data sets, there exists degradation such as color bias, low contrast, and blurring, which affect the accuracy of the detection algorithms. Second, the underwater image data set is difficult to obtain and the cost of making the labeled data sets is high, which also prevents underwater object detection algorithms from fully leveraging their potential. In this article, we propose a self-supervised learning network for underwater target detection. Considering the deficiency that the conventional contrastive learning network pays more attention to the global information and ignores the local information, an auxiliary branch inspired by masked autoencoders is added to the baseline SimSiam network, which collaborates with the main branch to optimize the target network and help the target network learn the local information of the target feature map. A residual spatial cooperative attention module is proposed to be embedded within the proposed self-supervised learning network to obtain remote information through residual structure and construct spatial context features. The method of cooperative attention is used to enhance feature learning ability. Experiments are carried out on a reconstructed underwater target data set. Results show that compared with the baseline network, the method proposed in this article is more suitable for underwater environments and has better mean average precision.","PeriodicalId":13191,"journal":{"name":"IEEE Journal of Oceanic Engineering","volume":"50 3","pages":"1930-1943"},"PeriodicalIF":5.3000,"publicationDate":"2025-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Underwater Target Detection by Residual Spatial Cooperative Attention Module–Based Self-Supervised Learning\",\"authors\":\"Weilin Luo;Chengyu Lin;Huan Zhou\",\"doi\":\"10.1109/JOE.2025.3556153\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The performance of underwater target detection techniques is limited by various factors. First, in underwater image data sets, there exists degradation such as color bias, low contrast, and blurring, which affect the accuracy of the detection algorithms. Second, the underwater image data set is difficult to obtain and the cost of making the labeled data sets is high, which also prevents underwater object detection algorithms from fully leveraging their potential. In this article, we propose a self-supervised learning network for underwater target detection. Considering the deficiency that the conventional contrastive learning network pays more attention to the global information and ignores the local information, an auxiliary branch inspired by masked autoencoders is added to the baseline SimSiam network, which collaborates with the main branch to optimize the target network and help the target network learn the local information of the target feature map. A residual spatial cooperative attention module is proposed to be embedded within the proposed self-supervised learning network to obtain remote information through residual structure and construct spatial context features. The method of cooperative attention is used to enhance feature learning ability. Experiments are carried out on a reconstructed underwater target data set. Results show that compared with the baseline network, the method proposed in this article is more suitable for underwater environments and has better mean average precision.\",\"PeriodicalId\":13191,\"journal\":{\"name\":\"IEEE Journal of Oceanic Engineering\",\"volume\":\"50 3\",\"pages\":\"1930-1943\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-03-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Oceanic Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10994668/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Oceanic Engineering","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10994668/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
The performance of underwater target detection techniques is limited by various factors. First, in underwater image data sets, there exists degradation such as color bias, low contrast, and blurring, which affect the accuracy of the detection algorithms. Second, the underwater image data set is difficult to obtain and the cost of making the labeled data sets is high, which also prevents underwater object detection algorithms from fully leveraging their potential. In this article, we propose a self-supervised learning network for underwater target detection. Considering the deficiency that the conventional contrastive learning network pays more attention to the global information and ignores the local information, an auxiliary branch inspired by masked autoencoders is added to the baseline SimSiam network, which collaborates with the main branch to optimize the target network and help the target network learn the local information of the target feature map. A residual spatial cooperative attention module is proposed to be embedded within the proposed self-supervised learning network to obtain remote information through residual structure and construct spatial context features. The method of cooperative attention is used to enhance feature learning ability. Experiments are carried out on a reconstructed underwater target data set. Results show that compared with the baseline network, the method proposed in this article is more suitable for underwater environments and has better mean average precision.
期刊介绍:
The IEEE Journal of Oceanic Engineering (ISSN 0364-9059) is the online-only quarterly publication of the IEEE Oceanic Engineering Society (IEEE OES). The scope of the Journal is the field of interest of the IEEE OES, which encompasses all aspects of science, engineering, and technology that address research, development, and operations pertaining to all bodies of water. This includes the creation of new capabilities and technologies from concept design through prototypes, testing, and operational systems to sense, explore, understand, develop, use, and responsibly manage natural resources.