A momentum-based adversarial training approach for generalization in underwater acoustic target recognition: An individual-vessel perspective.

IF 2.1 2区物理与天体物理 Q2 ACOUSTICS

Journal of the Acoustical Society of America Pub Date : 2025-05-01 DOI:10.1121/10.0036456

Haotian Fang, Yuan Xie, Jiawei Ren, Wenchao Wang, Ji Xu

{"title":"A momentum-based adversarial training approach for generalization in underwater acoustic target recognition: An individual-vessel perspective.","authors":"Haotian Fang, Yuan Xie, Jiawei Ren, Wenchao Wang, Ji Xu","doi":"10.1121/10.0036456","DOIUrl":null,"url":null,"abstract":"<p><p>Underwater passive acoustic recognition, which focuses on classifying targets based on ship-radiated noise, is a key challenge in underwater acoustics. Deep learning-based methods have gained popularity in recent years because of their strong performance. However, these methods often fail to generalize well in real-world scenarios. This work reveals one underlying challenge: the characteristics of ship-radiated noise are influenced by factors such as vessel structures and propulsion systems. Although vessels of the same type may exhibit different patterns in these aspects, vessels of different categories share similarities. As a result, data-driven models often tend to overemphasize individual-specific features, leading to \"overfitting\" and poor generalization. The momentum-based adversarial training (MBAT) framework is proposed to mitigate this challenge. MBAT leverages a momentum adversarial strategy to use category information and individual vessel relationships, helping extract class-discriminative features. A homoscedastic uncertainty algorithm is employed to balance the optimization objectives of category-related and vessel-specific features. These strategies allow the model to capture category-discriminative patterns more effectively and generalize to unseen targets. Experiments on DeepShip and ShipsEar demonstrate that MBAT significantly improves generalization capability on unseen individual vessels, outperforming existing state-of-the-art methods. Visualizations further confirm the efficacy and necessity of the proposed approach.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 5","pages":"3508-3523"},"PeriodicalIF":2.1000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Acoustical Society of America","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1121/10.0036456","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Underwater passive acoustic recognition, which focuses on classifying targets based on ship-radiated noise, is a key challenge in underwater acoustics. Deep learning-based methods have gained popularity in recent years because of their strong performance. However, these methods often fail to generalize well in real-world scenarios. This work reveals one underlying challenge: the characteristics of ship-radiated noise are influenced by factors such as vessel structures and propulsion systems. Although vessels of the same type may exhibit different patterns in these aspects, vessels of different categories share similarities. As a result, data-driven models often tend to overemphasize individual-specific features, leading to "overfitting" and poor generalization. The momentum-based adversarial training (MBAT) framework is proposed to mitigate this challenge. MBAT leverages a momentum adversarial strategy to use category information and individual vessel relationships, helping extract class-discriminative features. A homoscedastic uncertainty algorithm is employed to balance the optimization objectives of category-related and vessel-specific features. These strategies allow the model to capture category-discriminative patterns more effectively and generalize to unseen targets. Experiments on DeepShip and ShipsEar demonstrate that MBAT significantly improves generalization capability on unseen individual vessels, outperforming existing state-of-the-art methods. Visualizations further confirm the efficacy and necessity of the proposed approach.

查看原文本刊更多论文

基于动量的水声目标识别泛化对抗训练方法：个体船舶视角。

水下被动声识别是水声学研究的一个关键问题，主要是基于舰船辐射噪声对目标进行分类。近年来，基于深度学习的方法因其强大的性能而受到欢迎。然而，这些方法往往不能很好地推广到实际场景中。这项工作揭示了一个潜在的挑战：船舶辐射噪声的特征受到船舶结构和推进系统等因素的影响。虽然同一类型的血管在这些方面可能表现出不同的模式，但不同类别的血管有相似之处。因此，数据驱动的模型往往倾向于过度强调个人特定的特征，导致“过度拟合”和较差的泛化。为了缓解这一挑战，提出了基于动量的对抗训练（MBAT）框架。MBAT利用动量对抗策略来使用类别信息和个体血管关系，帮助提取类别区分特征。采用同方差不确定性算法来平衡类别相关和船舶特定特征的优化目标。这些策略使模型能够更有效地捕获类别判别模式，并将其推广到看不见的目标。在DeepShip和ShipsEar上的实验表明，MBAT显著提高了对不可见的单个船舶的泛化能力，优于现有的最先进的方法。可视化进一步证实了所提出方法的有效性和必要性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of the Acoustical Society of America 物理-声学

CiteScore

4.60

自引率

16.70%

发文量

1433

审稿时长

4.7 months

期刊介绍： Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.