{"title":"Syn2Real Domain Generalization for Underwater Mine-Like Object Detection Using Side-Scan Sonar","authors":"Aayush Agrawal;Aniruddh Sikdar;Rajini Makam;Suresh Sundaram;Suresh Kumar Besai;Mahesh Gopi","doi":"10.1109/LGRS.2025.3550037","DOIUrl":null,"url":null,"abstract":"Underwater mine-like object (MLO) detection with deep learning suffers from limitations due to the scarcity of real-world side-scan sonar (SSS) data. This scarcity leads to overfitting, where models perform well on training data but poorly on unseen data. In this letter, we propose a synthetic to real (Syn2Real) domain generalization approach using diffusion models to address this challenge. Synthetic data generated by DDPM and DDIM models effectively enhances the training dataset. The residual noise in the final sampled images improves the model’s ability to generalize to real-world data with inherent noise and high variation. The baseline mask-region-based convolutional neural network (RCNN) model when trained on a combination of synthetic and original SSS training datasets, exhibited approximately a 35% increase in average precision (AP) compared to being trained solely on the original training data. This significant improvement highlights the potential of Syn2Real domain generalization for underwater mine detection.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10919029/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Underwater mine-like object (MLO) detection with deep learning suffers from limitations due to the scarcity of real-world side-scan sonar (SSS) data. This scarcity leads to overfitting, where models perform well on training data but poorly on unseen data. In this letter, we propose a synthetic to real (Syn2Real) domain generalization approach using diffusion models to address this challenge. Synthetic data generated by DDPM and DDIM models effectively enhances the training dataset. The residual noise in the final sampled images improves the model’s ability to generalize to real-world data with inherent noise and high variation. The baseline mask-region-based convolutional neural network (RCNN) model when trained on a combination of synthetic and original SSS training datasets, exhibited approximately a 35% increase in average precision (AP) compared to being trained solely on the original training data. This significant improvement highlights the potential of Syn2Real domain generalization for underwater mine detection.