{"title":"Developing lightweight object detection models for USV with enhanced maritime surface visible imaging","authors":"Longhui Niu, Yunsheng Fan, Ting Liu, Qi Han","doi":"10.1016/j.jvcir.2025.104477","DOIUrl":null,"url":null,"abstract":"<div><div>Maritime surface object detection is a key technology for the autonomous navigation of unmanned surface vehicles (USVs). However, Maritime surface object detectors often face challenges such as large parameter sizes, object size variations, and image degradation caused by complex sea environments, severely affecting the deployment and detection accuracy on USVs. To address these challenges, this paper proposes the LightV7-enhancer object detection framework. This framework is based on the CPA-Enhancer image enhancement module and an improved YOLOv7 detection module for joint optimal learning. First, a new lightweight backbone network, GhostOECANet, was designed based on Ghost modules and improved coordinate attention. Second, by integrating ELAN and Efficient Multi-scale attention, an ELAN-EMA module is constructed to enhance the network’s perception and multi-scale feature extraction capabilities. Additionally, to improve the detection accuracy of small objects, multi-scale object detection layers are added based on the YOLOv5 detection head. The paper also introduces CPA-Enhancer in conjunction with the improved YOLOv7 detection module for joint training to adaptively restore degraded Maritime surface images, thereby improving detection accuracy in complex maritime backgrounds. Finally, the SeaShips dataset and Singapore Maritime Dataset are used to evaluate and compare LightV7-enhancer with other mainstream detectors. The results show that LightV7-enhancer supports object detection in various degraded maritime scenarios, achieving a balance between accuracy and computational complexity compared to other mainstream models. Compared to the baseline YOLOv7, LightV7-enhancer improves mAP by 2.7% and 7.5% on the two datasets, respectively, and has only half the number of parameters of YOLOv7, demonstrating robustness in degraded sea surface scenarios.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"110 ","pages":"Article 104477"},"PeriodicalIF":2.6000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325000914","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Maritime surface object detection is a key technology for the autonomous navigation of unmanned surface vehicles (USVs). However, Maritime surface object detectors often face challenges such as large parameter sizes, object size variations, and image degradation caused by complex sea environments, severely affecting the deployment and detection accuracy on USVs. To address these challenges, this paper proposes the LightV7-enhancer object detection framework. This framework is based on the CPA-Enhancer image enhancement module and an improved YOLOv7 detection module for joint optimal learning. First, a new lightweight backbone network, GhostOECANet, was designed based on Ghost modules and improved coordinate attention. Second, by integrating ELAN and Efficient Multi-scale attention, an ELAN-EMA module is constructed to enhance the network’s perception and multi-scale feature extraction capabilities. Additionally, to improve the detection accuracy of small objects, multi-scale object detection layers are added based on the YOLOv5 detection head. The paper also introduces CPA-Enhancer in conjunction with the improved YOLOv7 detection module for joint training to adaptively restore degraded Maritime surface images, thereby improving detection accuracy in complex maritime backgrounds. Finally, the SeaShips dataset and Singapore Maritime Dataset are used to evaluate and compare LightV7-enhancer with other mainstream detectors. The results show that LightV7-enhancer supports object detection in various degraded maritime scenarios, achieving a balance between accuracy and computational complexity compared to other mainstream models. Compared to the baseline YOLOv7, LightV7-enhancer improves mAP by 2.7% and 7.5% on the two datasets, respectively, and has only half the number of parameters of YOLOv7, demonstrating robustness in degraded sea surface scenarios.
期刊介绍:
The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.