Haotian Fang, Yuan Xie, Jiawei Ren, Wenchao Wang, Ji Xu
{"title":"A momentum-based adversarial training approach for generalization in underwater acoustic target recognition: An individual-vessel perspective.","authors":"Haotian Fang, Yuan Xie, Jiawei Ren, Wenchao Wang, Ji Xu","doi":"10.1121/10.0036456","DOIUrl":null,"url":null,"abstract":"<p><p>Underwater passive acoustic recognition, which focuses on classifying targets based on ship-radiated noise, is a key challenge in underwater acoustics. Deep learning-based methods have gained popularity in recent years because of their strong performance. However, these methods often fail to generalize well in real-world scenarios. This work reveals one underlying challenge: the characteristics of ship-radiated noise are influenced by factors such as vessel structures and propulsion systems. Although vessels of the same type may exhibit different patterns in these aspects, vessels of different categories share similarities. As a result, data-driven models often tend to overemphasize individual-specific features, leading to \"overfitting\" and poor generalization. The momentum-based adversarial training (MBAT) framework is proposed to mitigate this challenge. MBAT leverages a momentum adversarial strategy to use category information and individual vessel relationships, helping extract class-discriminative features. A homoscedastic uncertainty algorithm is employed to balance the optimization objectives of category-related and vessel-specific features. These strategies allow the model to capture category-discriminative patterns more effectively and generalize to unseen targets. Experiments on DeepShip and ShipsEar demonstrate that MBAT significantly improves generalization capability on unseen individual vessels, outperforming existing state-of-the-art methods. Visualizations further confirm the efficacy and necessity of the proposed approach.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 5","pages":"3508-3523"},"PeriodicalIF":2.1000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Acoustical Society of America","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1121/10.0036456","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Underwater passive acoustic recognition, which focuses on classifying targets based on ship-radiated noise, is a key challenge in underwater acoustics. Deep learning-based methods have gained popularity in recent years because of their strong performance. However, these methods often fail to generalize well in real-world scenarios. This work reveals one underlying challenge: the characteristics of ship-radiated noise are influenced by factors such as vessel structures and propulsion systems. Although vessels of the same type may exhibit different patterns in these aspects, vessels of different categories share similarities. As a result, data-driven models often tend to overemphasize individual-specific features, leading to "overfitting" and poor generalization. The momentum-based adversarial training (MBAT) framework is proposed to mitigate this challenge. MBAT leverages a momentum adversarial strategy to use category information and individual vessel relationships, helping extract class-discriminative features. A homoscedastic uncertainty algorithm is employed to balance the optimization objectives of category-related and vessel-specific features. These strategies allow the model to capture category-discriminative patterns more effectively and generalize to unseen targets. Experiments on DeepShip and ShipsEar demonstrate that MBAT significantly improves generalization capability on unseen individual vessels, outperforming existing state-of-the-art methods. Visualizations further confirm the efficacy and necessity of the proposed approach.
期刊介绍:
Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.