{"title":"Three-Stage Distortion-Driven Enhancement Network for Forward-Looking Sonar Image Segmentation","authors":"Chengjun Han;Yunlei Shen;Zhi Liu","doi":"10.1109/JSEN.2024.3506831","DOIUrl":null,"url":null,"abstract":"Forward-looking sonar (FLS) image segmentation aims to accurately locate underwater objects, providing essential support for marine engineering. Unlike natural optical images, FLS images have weak semantic content and complex background, posing segmentation challenging for existing models. To address this issue, we propose a novel FLS image segmentation model, the three-stage distortion-driven enhancement network (TDENet), equipped with an extended IoU loss. TDENet employs a three-stage distortion-driven feature processing strategy. Specifically, we propose the three-stage distortion-driven module (TDM), which consists of three stages (i.e., feature distortion, enhancement, and fusion). First, feature distortion introduces dynamic inputs, compelling the model to learn patterns in a distorted feature space, thereby enhancing robustness. In the enhancement stage, global and local interactions between distorted and undistorted features improve the model’s semantic comprehension and detail retention. Finally, feature fusion ensures a comprehensive representation. To refine segmentation maps, we propose the centroid deviation intersection over union loss (CD-IoU loss), incorporating a CD term to the vanilla IoU loss. This term measures the distance and directional discrepancies between the centroids of predicted and ground truth object regions, quantifying their detail differences. Equipped with the CD-IoU loss, our TDENet can better capture details. Extensive experiments on the public Marine Debris Dataset demonstrate that our TDENet outperforms 16 state-of-the-art models.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 2","pages":"3867-3878"},"PeriodicalIF":4.3000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10778210/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Forward-looking sonar (FLS) image segmentation aims to accurately locate underwater objects, providing essential support for marine engineering. Unlike natural optical images, FLS images have weak semantic content and complex background, posing segmentation challenging for existing models. To address this issue, we propose a novel FLS image segmentation model, the three-stage distortion-driven enhancement network (TDENet), equipped with an extended IoU loss. TDENet employs a three-stage distortion-driven feature processing strategy. Specifically, we propose the three-stage distortion-driven module (TDM), which consists of three stages (i.e., feature distortion, enhancement, and fusion). First, feature distortion introduces dynamic inputs, compelling the model to learn patterns in a distorted feature space, thereby enhancing robustness. In the enhancement stage, global and local interactions between distorted and undistorted features improve the model’s semantic comprehension and detail retention. Finally, feature fusion ensures a comprehensive representation. To refine segmentation maps, we propose the centroid deviation intersection over union loss (CD-IoU loss), incorporating a CD term to the vanilla IoU loss. This term measures the distance and directional discrepancies between the centroids of predicted and ground truth object regions, quantifying their detail differences. Equipped with the CD-IoU loss, our TDENet can better capture details. Extensive experiments on the public Marine Debris Dataset demonstrate that our TDENet outperforms 16 state-of-the-art models.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice