基于无监督聚类优化的YOLO水下目标检测高效关注

IF 13.9 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence Review Pub Date : 2025-04-23 DOI:10.1007/s10462-025-11218-6

Xin Shen, Guoliang Yuan, Huibing Wang, Xianping Fu

{"title":"基于无监督聚类优化的YOLO水下目标检测高效关注","authors":"Xin Shen, Guoliang Yuan, Huibing Wang, Xianping Fu","doi":"10.1007/s10462-025-11218-6","DOIUrl":null,"url":null,"abstract":"<div><p>Underwater object detection is a prerequisite for underwater robots to realize ocean exploration and autonomous grasping. However, underwater detection tasks face some inevitable interference factors, such as poor imaging quality, strong environment randomness, and high organism concealment. These phenomena will lead to strong underwater background interference and weak underwater object perception, which greatly aggravates the difficulty of underwater object detection. In order to deal with the above problems, we propose an unsupervised clustering optimization-based efficient attention (UCOEA). Different from the channel-wise strategy, cross-channel strategy and channel grouping strategy, we design a channel clustering strategy, which achieves autonomous dynamic screening of channel information by using the K-Means algorithm. Same types of channel information with high redundancy are learned uniformly to share the same operation. Different types of channel information with high specificity are learned independently to avoid channel noise information interference. Different from the single spatial strategy and multiple spatial strategy, we design a spatial clustering strategy, which achieves autonomous dynamic stripping of spatial information by using the EM algorithm. This strategy can extract multiple required spatial information at one time from different spatial locations. We further assign learnable weight parameters to distinguish dominant information and auxiliary information, which can alleviate spatial noise information interference. Our strategies can better balance additional cost overhead and information processing quality, which is crucial for the proposed attention to achieve fast and accurate underwater information calibration. In order to achieve high-precision and real-time underwater object detection, we propose a combined system of UCOEA underwater adapter and one-stage YOLO detector, which can efficiently detect small, medium and large targets at the same time. Extensive experiments demonstrate the effectiveness of our work. More importantly, we publish an underwater detection dataset DLMU2024 with low image continuity and high data diversity, which provides reliable support for the rapid development of underwater detection research. Our dataset is available at https://github.com/shenxin-dlmu/DLMU2024.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 7","pages":""},"PeriodicalIF":13.9000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11218-6.pdf","citationCount":"0","resultStr":"{\"title\":\"Unsupervised clustering optimization-based efficient attention in YOLO for underwater object detection\",\"authors\":\"Xin Shen, Guoliang Yuan, Huibing Wang, Xianping Fu\",\"doi\":\"10.1007/s10462-025-11218-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Underwater object detection is a prerequisite for underwater robots to realize ocean exploration and autonomous grasping. However, underwater detection tasks face some inevitable interference factors, such as poor imaging quality, strong environment randomness, and high organism concealment. These phenomena will lead to strong underwater background interference and weak underwater object perception, which greatly aggravates the difficulty of underwater object detection. In order to deal with the above problems, we propose an unsupervised clustering optimization-based efficient attention (UCOEA). Different from the channel-wise strategy, cross-channel strategy and channel grouping strategy, we design a channel clustering strategy, which achieves autonomous dynamic screening of channel information by using the K-Means algorithm. Same types of channel information with high redundancy are learned uniformly to share the same operation. Different types of channel information with high specificity are learned independently to avoid channel noise information interference. Different from the single spatial strategy and multiple spatial strategy, we design a spatial clustering strategy, which achieves autonomous dynamic stripping of spatial information by using the EM algorithm. This strategy can extract multiple required spatial information at one time from different spatial locations. We further assign learnable weight parameters to distinguish dominant information and auxiliary information, which can alleviate spatial noise information interference. Our strategies can better balance additional cost overhead and information processing quality, which is crucial for the proposed attention to achieve fast and accurate underwater information calibration. In order to achieve high-precision and real-time underwater object detection, we propose a combined system of UCOEA underwater adapter and one-stage YOLO detector, which can efficiently detect small, medium and large targets at the same time. Extensive experiments demonstrate the effectiveness of our work. More importantly, we publish an underwater detection dataset DLMU2024 with low image continuity and high data diversity, which provides reliable support for the rapid development of underwater detection research. Our dataset is available at https://github.com/shenxin-dlmu/DLMU2024.</p></div>\",\"PeriodicalId\":8449,\"journal\":{\"name\":\"Artificial Intelligence Review\",\"volume\":\"58 7\",\"pages\":\"\"},\"PeriodicalIF\":13.9000,\"publicationDate\":\"2025-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s10462-025-11218-6.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence Review\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10462-025-11218-6\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-025-11218-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

水下物体探测是水下机器人实现海洋探测和自主抓取的前提条件。然而，水下探测任务面临着一些不可避免的干扰因素，如成像质量差、环境随机性强、生物隐蔽性高等。这些现象会导致水下背景干扰强、水下物体感知弱，大大增加了水下物体探测的难度。针对上述问题，我们提出了一种基于无监督聚类优化的高效注意力（UCOEA）。与信道策略、跨信道策略和信道分组策略不同，我们设计了一种信道聚类策略，利用 K-Means 算法实现了信道信息的自主动态筛选。对冗余度高的同类型信道信息进行统一学习，共享同一操作。不同类型的高特异性信道信息独立学习，避免信道噪声信息干扰。与单空间策略和多空间策略不同，我们设计了一种空间聚类策略，利用电磁算法实现空间信息的自主动态剥离。这种策略可以一次性从不同的空间位置提取多种所需的空间信息。我们还进一步分配了可学习的权重参数，以区分主导信息和辅助信息，从而减轻空间噪声信息的干扰。我们的策略能更好地平衡额外成本开销和信息处理质量，这对所提出的注意力实现快速、准确的水下信息校准至关重要。为了实现高精度、实时的水下物体检测，我们提出了一种由 UCOEA 水下适配器和单级 YOLO 检测器组成的组合系统，可以同时高效地检测小型、中型和大型目标。大量实验证明了我们工作的有效性。更重要的是，我们发布了一个图像连续性低、数据多样性高的水下探测数据集 DLMU2024，为水下探测研究的快速发展提供了可靠的支持。我们的数据集可在 https://github.com/shenxin-dlmu/DLMU2024 网站上查阅。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Unsupervised clustering optimization-based efficient attention in YOLO for underwater object detection

Underwater object detection is a prerequisite for underwater robots to realize ocean exploration and autonomous grasping. However, underwater detection tasks face some inevitable interference factors, such as poor imaging quality, strong environment randomness, and high organism concealment. These phenomena will lead to strong underwater background interference and weak underwater object perception, which greatly aggravates the difficulty of underwater object detection. In order to deal with the above problems, we propose an unsupervised clustering optimization-based efficient attention (UCOEA). Different from the channel-wise strategy, cross-channel strategy and channel grouping strategy, we design a channel clustering strategy, which achieves autonomous dynamic screening of channel information by using the K-Means algorithm. Same types of channel information with high redundancy are learned uniformly to share the same operation. Different types of channel information with high specificity are learned independently to avoid channel noise information interference. Different from the single spatial strategy and multiple spatial strategy, we design a spatial clustering strategy, which achieves autonomous dynamic stripping of spatial information by using the EM algorithm. This strategy can extract multiple required spatial information at one time from different spatial locations. We further assign learnable weight parameters to distinguish dominant information and auxiliary information, which can alleviate spatial noise information interference. Our strategies can better balance additional cost overhead and information processing quality, which is crucial for the proposed attention to achieve fast and accurate underwater information calibration. In order to achieve high-precision and real-time underwater object detection, we propose a combined system of UCOEA underwater adapter and one-stage YOLO detector, which can efficiently detect small, medium and large targets at the same time. Extensive experiments demonstrate the effectiveness of our work. More importantly, we publish an underwater detection dataset DLMU2024 with low image continuity and high data diversity, which provides reliable support for the rapid development of underwater detection research. Our dataset is available at https://github.com/shenxin-dlmu/DLMU2024.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Artificial Intelligence Review 工程技术-计算机：人工智能

CiteScore

22.00

自引率

3.30%

发文量

194

审稿时长

5.3 months

期刊介绍： Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.