{"title":"End-to-end underwater acoustic target classification with frequency separation strategy","authors":"Ning Tang , Hao Zhang , Fei Zhou , Wei Huang , Pengfei Wu","doi":"10.1016/j.apor.2025.104554","DOIUrl":null,"url":null,"abstract":"<div><div>Most underwater target classification methods require data preprocessing to convert waveforms into spectrograms. Recent advancements have led to the emergence of waveform-based approaches that directly extract discriminative features from raw acoustic signals, eliminating the need for preprocessing. However, waveform-based architectures often exhibit inferior classification performance compared to spectrogram-based methods. This study identifies two key limitations of existing waveform-based approaches: (1) the lack of explicit frequency feature extraction mechanisms, and (2) information degradation due to downsampling. To address these challenges, we propose an end-to-end frequency separation network (FSNet), which includes two innovative components: a frequency separation (FS) module that explicitly captures discriminative frequency characteristics, and a scale-adaptive max pooling (SAMP) layer that preserves critical information during dimensionality reduction. Comprehensive evaluations on ShipsEar and DeepShip datasets demonstrate that our framework achieves competitive accuracy performance (82.91% on ShipsEar and 78.39% on DeepShip) while maintaining exceptional computational efficiency, requiring only 0.49M parameters—over three times fewer than the second-smallest model.</div></div>","PeriodicalId":8261,"journal":{"name":"Applied Ocean Research","volume":"158 ","pages":"Article 104554"},"PeriodicalIF":4.3000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Ocean Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141118725001427","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, OCEAN","Score":null,"Total":0}
引用次数: 0
Abstract
Most underwater target classification methods require data preprocessing to convert waveforms into spectrograms. Recent advancements have led to the emergence of waveform-based approaches that directly extract discriminative features from raw acoustic signals, eliminating the need for preprocessing. However, waveform-based architectures often exhibit inferior classification performance compared to spectrogram-based methods. This study identifies two key limitations of existing waveform-based approaches: (1) the lack of explicit frequency feature extraction mechanisms, and (2) information degradation due to downsampling. To address these challenges, we propose an end-to-end frequency separation network (FSNet), which includes two innovative components: a frequency separation (FS) module that explicitly captures discriminative frequency characteristics, and a scale-adaptive max pooling (SAMP) layer that preserves critical information during dimensionality reduction. Comprehensive evaluations on ShipsEar and DeepShip datasets demonstrate that our framework achieves competitive accuracy performance (82.91% on ShipsEar and 78.39% on DeepShip) while maintaining exceptional computational efficiency, requiring only 0.49M parameters—over three times fewer than the second-smallest model.
期刊介绍:
The aim of Applied Ocean Research is to encourage the submission of papers that advance the state of knowledge in a range of topics relevant to ocean engineering.