Underwater target detection algorithm based on multi-scale feature fusion

IF 4 3区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Xiao Chen , Qi Yang , Xiaoqi Ge , Jiayi Chen , Haiyan Wang
{"title":"Underwater target detection algorithm based on multi-scale feature fusion","authors":"Xiao Chen ,&nbsp;Qi Yang ,&nbsp;Xiaoqi Ge ,&nbsp;Jiayi Chen ,&nbsp;Haiyan Wang","doi":"10.1016/j.compeleceng.2025.110327","DOIUrl":null,"url":null,"abstract":"<div><div>Underwater target detection technology is an important way to realize the biological monitoring of the marine ranching. Nevertheless, since light absorption of the water and particles in the water will have a scattering effect on the light, the majority of the captured underwater optical images have low contrast and color bias, which blurs the local details of the measured biological targets. In addition, due to the mutual occlusion of underwater organisms, which will lead to a higher rate of missed targets detection. To address the above problems, we propose a new target detection algorithm for complex underwater environments based on the YOLOv7 (You Only Look Once Version 7). Firstly, in order to enhance the underwater image quality, while improving the color bias and reducing the image noise. By comparing different image enhancement methods, RGHS (Relative Global Histogram Stretching) is used to enhance the quality of dataset. Secondly, in order to realize the network's perception of biological detail features, the MIS (Multi-Gradient Interaction Structure) is proposed. The structural innovation aims to mitigate the challenges posed by target proximity and overlap. Finally, after the backbone feature extraction, we propose the multiscale feature fusion structure. It facilitates the cross-channel communication of semantic information, thereby enhancing the network's capability to express features. The experimental results indicate that our proposed algorithm has achieved a detection accuracy improvement of 1.35 % compared to the original YOLOv7 (You Only Look Once Version 7). Simultaneously, it has reduced instances of missed detections and misjudgments of underwater targets, surpassing common object detection algorithms.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"124 ","pages":"Article 110327"},"PeriodicalIF":4.0000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625002708","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

Underwater target detection technology is an important way to realize the biological monitoring of the marine ranching. Nevertheless, since light absorption of the water and particles in the water will have a scattering effect on the light, the majority of the captured underwater optical images have low contrast and color bias, which blurs the local details of the measured biological targets. In addition, due to the mutual occlusion of underwater organisms, which will lead to a higher rate of missed targets detection. To address the above problems, we propose a new target detection algorithm for complex underwater environments based on the YOLOv7 (You Only Look Once Version 7). Firstly, in order to enhance the underwater image quality, while improving the color bias and reducing the image noise. By comparing different image enhancement methods, RGHS (Relative Global Histogram Stretching) is used to enhance the quality of dataset. Secondly, in order to realize the network's perception of biological detail features, the MIS (Multi-Gradient Interaction Structure) is proposed. The structural innovation aims to mitigate the challenges posed by target proximity and overlap. Finally, after the backbone feature extraction, we propose the multiscale feature fusion structure. It facilitates the cross-channel communication of semantic information, thereby enhancing the network's capability to express features. The experimental results indicate that our proposed algorithm has achieved a detection accuracy improvement of 1.35 % compared to the original YOLOv7 (You Only Look Once Version 7). Simultaneously, it has reduced instances of missed detections and misjudgments of underwater targets, surpassing common object detection algorithms.
基于多尺度特征融合的水下目标检测算法
水下目标探测技术是实现海洋牧场生物监测的重要手段。然而,由于水和水中粒子的光吸收会对光线产生散射效应,因此大多数捕获的水下光学图像对比度低,颜色偏置,从而模糊了被测生物目标的局部细节。此外,由于水下生物的相互遮挡,这将导致更高的目标漏检率。针对上述问题,我们提出了一种基于YOLOv7 (You Only Look Once Version 7)的复杂水下环境目标检测新算法。首先,为了提高水下图像质量,同时改善颜色偏差,降低图像噪声;通过比较不同的图像增强方法,采用RGHS (Relative Global Histogram Stretching,相对全局直方图拉伸)增强数据集的质量。其次,为了实现网络对生物细节特征的感知,提出了多梯度交互结构(MIS)。结构创新旨在缓解目标接近和重叠带来的挑战。最后,在提取主干特征后,提出多尺度特征融合结构。它促进了语义信息的跨通道通信,从而增强了网络表达特征的能力。实验结果表明,该算法的检测精度比原来的YOLOv7 (You Only Look Once Version 7)提高了1.35%,同时减少了对水下目标的漏检和误判,超过了常用的目标检测算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers & Electrical Engineering
Computers & Electrical Engineering 工程技术-工程:电子与电气
CiteScore
9.20
自引率
7.00%
发文量
661
审稿时长
47 days
期刊介绍: The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency. Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信