Three-Stage Distortion-Driven Enhancement Network for Forward-Looking Sonar Image Segmentation

IF 4.3 2区 综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Chengjun Han;Yunlei Shen;Zhi Liu
{"title":"Three-Stage Distortion-Driven Enhancement Network for Forward-Looking Sonar Image Segmentation","authors":"Chengjun Han;Yunlei Shen;Zhi Liu","doi":"10.1109/JSEN.2024.3506831","DOIUrl":null,"url":null,"abstract":"Forward-looking sonar (FLS) image segmentation aims to accurately locate underwater objects, providing essential support for marine engineering. Unlike natural optical images, FLS images have weak semantic content and complex background, posing segmentation challenging for existing models. To address this issue, we propose a novel FLS image segmentation model, the three-stage distortion-driven enhancement network (TDENet), equipped with an extended IoU loss. TDENet employs a three-stage distortion-driven feature processing strategy. Specifically, we propose the three-stage distortion-driven module (TDM), which consists of three stages (i.e., feature distortion, enhancement, and fusion). First, feature distortion introduces dynamic inputs, compelling the model to learn patterns in a distorted feature space, thereby enhancing robustness. In the enhancement stage, global and local interactions between distorted and undistorted features improve the model’s semantic comprehension and detail retention. Finally, feature fusion ensures a comprehensive representation. To refine segmentation maps, we propose the centroid deviation intersection over union loss (CD-IoU loss), incorporating a CD term to the vanilla IoU loss. This term measures the distance and directional discrepancies between the centroids of predicted and ground truth object regions, quantifying their detail differences. Equipped with the CD-IoU loss, our TDENet can better capture details. Extensive experiments on the public Marine Debris Dataset demonstrate that our TDENet outperforms 16 state-of-the-art models.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 2","pages":"3867-3878"},"PeriodicalIF":4.3000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10778210/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Forward-looking sonar (FLS) image segmentation aims to accurately locate underwater objects, providing essential support for marine engineering. Unlike natural optical images, FLS images have weak semantic content and complex background, posing segmentation challenging for existing models. To address this issue, we propose a novel FLS image segmentation model, the three-stage distortion-driven enhancement network (TDENet), equipped with an extended IoU loss. TDENet employs a three-stage distortion-driven feature processing strategy. Specifically, we propose the three-stage distortion-driven module (TDM), which consists of three stages (i.e., feature distortion, enhancement, and fusion). First, feature distortion introduces dynamic inputs, compelling the model to learn patterns in a distorted feature space, thereby enhancing robustness. In the enhancement stage, global and local interactions between distorted and undistorted features improve the model’s semantic comprehension and detail retention. Finally, feature fusion ensures a comprehensive representation. To refine segmentation maps, we propose the centroid deviation intersection over union loss (CD-IoU loss), incorporating a CD term to the vanilla IoU loss. This term measures the distance and directional discrepancies between the centroids of predicted and ground truth object regions, quantifying their detail differences. Equipped with the CD-IoU loss, our TDENet can better capture details. Extensive experiments on the public Marine Debris Dataset demonstrate that our TDENet outperforms 16 state-of-the-art models.
面向前视声纳图像分割的三级失真驱动增强网络
前视声呐(FLS)图像分割的目的是准确定位水下目标,为海洋工程提供必要的支持。与自然光学图像不同,FLS图像语义内容薄弱,背景复杂,对现有模型的分割提出了挑战。为了解决这个问题,我们提出了一种新的FLS图像分割模型,即具有扩展IoU损失的三级失真驱动增强网络(TDENet)。TDENet采用三阶段畸变驱动特征处理策略。具体来说,我们提出了三级失真驱动模块(TDM),它包括三个阶段(即特征失真、增强和融合)。首先,特征失真引入动态输入,迫使模型在扭曲的特征空间中学习模式,从而增强鲁棒性。在增强阶段,扭曲和未扭曲特征之间的全局和局部相互作用提高了模型的语义理解和细节保留能力。最后,进行特征融合,保证了表征的全面性。为了改进分割图,我们提出了联合损失(CD-IoU损失)的质心偏差相交,将CD项合并到普通IoU损失中。该术语测量预测和地面真实目标区域的质心之间的距离和方向差异,量化它们的细节差异。配备CD-IoU损失,我们的TDENet可以更好地捕捉细节。在公共海洋垃圾数据集上进行的大量实验表明,我们的TDENet优于16个最先进的模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Sensors Journal
IEEE Sensors Journal 工程技术-工程:电子与电气
CiteScore
7.70
自引率
14.00%
发文量
2058
审稿时长
5.2 months
期刊介绍: The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信