A lightweight YOLO network using temporal features for high-resolution sonar segmentation

IF 2.8 2区 生物学 Q1 MARINE & FRESHWATER BIOLOGY
Sen Gao, Wei Guo, Gaofei Xu, Ben Liu, Yu Sun, Bo Yuan
{"title":"A lightweight YOLO network using temporal features for high-resolution sonar segmentation","authors":"Sen Gao, Wei Guo, Gaofei Xu, Ben Liu, Yu Sun, Bo Yuan","doi":"10.3389/fmars.2025.1581794","DOIUrl":null,"url":null,"abstract":"IntroductionHigh-resolution sonar systems are critical for underwater robots to obtain precise environmental perception. However, the computational demands of processing sonar imagery in real-time pose significant challenges for autonomous underwater vehicles (AUVs) operating in dynamic environments. Current segmentation methods often struggle to balance processing speed with accuracy.MethodsWe propose a novel YOLO-based segmentation framework featuring: (1) A lightweight backbone(ghostnet) network optimized for sonar imagery processing (2) A bypass BiLSTM network for temporal feature learning across consecutive frames. The system processes non-keyframes by predicting semantic vectors through the trained BiLSTM model, selectively skipping computational layers to enhance efficiency. The model was trained and evaluated on a high-resolution sonar dataset collected using an AUV-mounted Oculus MD750d multibeam forward-looking sonar in two distinct underwater environments.ResultsImplementation on Nvidia Jetson TX2 demonstrated significant performance improvements. (1) Processing latency reduced to 87.4 ms (keyframes) and 35.3 ms (non-keyframes)(2)Maintained competitive segmentation accuracy compared to conventional methods and achieved low latency.DiscussionThe proposed architecture successfully addresses the speed-accuracy trade-off in sonar image segmentation through its innovative temporal feature utilization and computational skipping mechanism. The significant latency reduction enables more responsive AUV navigation without compromising perception quality. The newly introduced dataset fills an important gap in high-resolution sonar benchmarking. Future work will focus on optimizing the keyframe selection algorithm and expanding the dataset to include more complex underwater scenarios.","PeriodicalId":12479,"journal":{"name":"Frontiers in Marine Science","volume":"58 1","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Marine Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fmars.2025.1581794","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MARINE & FRESHWATER BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

IntroductionHigh-resolution sonar systems are critical for underwater robots to obtain precise environmental perception. However, the computational demands of processing sonar imagery in real-time pose significant challenges for autonomous underwater vehicles (AUVs) operating in dynamic environments. Current segmentation methods often struggle to balance processing speed with accuracy.MethodsWe propose a novel YOLO-based segmentation framework featuring: (1) A lightweight backbone(ghostnet) network optimized for sonar imagery processing (2) A bypass BiLSTM network for temporal feature learning across consecutive frames. The system processes non-keyframes by predicting semantic vectors through the trained BiLSTM model, selectively skipping computational layers to enhance efficiency. The model was trained and evaluated on a high-resolution sonar dataset collected using an AUV-mounted Oculus MD750d multibeam forward-looking sonar in two distinct underwater environments.ResultsImplementation on Nvidia Jetson TX2 demonstrated significant performance improvements. (1) Processing latency reduced to 87.4 ms (keyframes) and 35.3 ms (non-keyframes)(2)Maintained competitive segmentation accuracy compared to conventional methods and achieved low latency.DiscussionThe proposed architecture successfully addresses the speed-accuracy trade-off in sonar image segmentation through its innovative temporal feature utilization and computational skipping mechanism. The significant latency reduction enables more responsive AUV navigation without compromising perception quality. The newly introduced dataset fills an important gap in high-resolution sonar benchmarking. Future work will focus on optimizing the keyframe selection algorithm and expanding the dataset to include more complex underwater scenarios.
一个轻量级的YOLO网络,利用时间特征进行高分辨率声纳分割
高分辨率声纳系统是水下机器人获得精确环境感知的关键。然而,实时处理声呐图像的计算需求对动态环境下的自主水下航行器(auv)提出了重大挑战。当前的分割方法往往难以平衡处理速度和准确性。方法提出了一种新的基于yolo的分割框架,该框架具有:(1)针对声纳图像处理优化的轻量级骨干(鬼网)网络(ghostnet);(2)跨连续帧进行时间特征学习的旁路BiLSTM网络。该系统通过训练后的BiLSTM模型预测语义向量来处理非关键帧,有选择地跳过计算层以提高效率。该模型在高分辨率声纳数据集上进行了训练和评估,该数据集使用安装在auv上的Oculus MD750d多波束前视声纳在两种不同的水下环境中收集。结果在Nvidia Jetson TX2上的实现显示了显著的性能改进。(1)处理延迟降低到87.4 ms(关键帧)和35.3 ms(非关键帧)(2)与传统方法相比,保持了具有竞争力的分割精度,实现了低延迟。所提出的架构通过其创新的时间特征利用和计算跳过机制成功地解决了声纳图像分割中速度和精度的权衡。显著的延迟减少使AUV导航响应更灵敏,而不影响感知质量。新引入的数据集填补了高分辨率声纳基准测试的重要空白。未来的工作将集中在优化关键帧选择算法和扩展数据集以包括更复杂的水下场景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Marine Science
Frontiers in Marine Science Agricultural and Biological Sciences-Aquatic Science
CiteScore
5.10
自引率
16.20%
发文量
2443
审稿时长
14 weeks
期刊介绍: Frontiers in Marine Science publishes rigorously peer-reviewed research that advances our understanding of all aspects of the environment, biology, ecosystem functioning and human interactions with the oceans. Field Chief Editor Carlos M. Duarte at King Abdullah University of Science and Technology Thuwal is supported by an outstanding Editorial Board of international researchers. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, policy makers and the public worldwide. With the human population predicted to reach 9 billion people by 2050, it is clear that traditional land resources will not suffice to meet the demand for food or energy, required to support high-quality livelihoods. As a result, the oceans are emerging as a source of untapped assets, with new innovative industries, such as aquaculture, marine biotechnology, marine energy and deep-sea mining growing rapidly under a new era characterized by rapid growth of a blue, ocean-based economy. The sustainability of the blue economy is closely dependent on our knowledge about how to mitigate the impacts of the multiple pressures on the ocean ecosystem associated with the increased scale and diversification of industry operations in the ocean and global human pressures on the environment. Therefore, Frontiers in Marine Science particularly welcomes the communication of research outcomes addressing ocean-based solutions for the emerging challenges, including improved forecasting and observational capacities, understanding biodiversity and ecosystem problems, locally and globally, effective management strategies to maintain ocean health, and an improved capacity to sustainably derive resources from the oceans.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信