Journal of the Acoustical Society of America: Latest Articles

A few-shot learning method for underwater acoustic target recognition based on generative data augmentation.
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-05-01 DOI: 10.1121/10.0043727
Wei Huang, Qirui Zhang, Bowen Zhao, Tianhe Xu, Hao Zhang
{"title":"A few-shot learning method for underwater acoustic target recognition based on generative data augmentationa).","authors":"Wei Huang, Qirui Zhang, Bowen Zhao, Tianhe Xu, Hao Zhang","doi":"10.1121/10.0043727","DOIUrl":"https://doi.org/10.1121/10.0043727","url":null,"abstract":"<p><p>The recognition of underwater acoustic targets (UATR) is of great significance for the protection of marine diversity and national defense security. The development of deep learning provides new opportunities for UATR, but faces challenges brought by the scarcity of reference samples and complex environmental interference. To tackle this problem, we propose a generative discriminative collaborative framework, a variational auto-encoder boosted learning framework based on latent space completion. Rooted in the core contradiction arising from the incompleteness of intra-class manifolds and the instability of discriminative boundaries, this framework incorporates the premise of latent space continuity. Leveraging a structure-preserving generative reconstruction mechanism, it implicitly supplements the original dataset, which in turn enables the reconstruction of intra-class distributions that are more continuous, integral, and discriminative at the feature level. In this paper, we construct a three-stage pipeline system consisting of auto-clean cut unified preprocessing, latent reconstruction variational auto-encoder multi-scale latent space reconstruction, and an acoustic identification model. 
Furthermore, by establishing a staged modeling workflow, data purification, latent space completion, and discriminative optimization converge on their individual objectives independently while maintaining overall synergy, thus forging a robust recognition paradigm tailored to few-shot learning scenarios.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3773-3791"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147816705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Discrimination of two-highlight versus single-highlight phantom echoes by bottlenose dolphins (Tursiops truncatus).
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-05-01 DOI: 10.1121/10.0043763
Jason Mulsow, Madilyn R Pardini, Sean J Avery, Katie A Christman, Dorian S Houser, James J Finneran
{"title":"Discrimination of two-highlight versus single-highlight phantom echoes by bottlenose dolphins (Tursiops truncatus).","authors":"Jason Mulsow, Madilyn R Pardini, Sean J Avery, Katie A Christman, Dorian S Houser, James J Finneran","doi":"10.1121/10.0043763","DOIUrl":"https://doi.org/10.1121/10.0043763","url":null,"abstract":"<p><p>Range resolution defines the ability to discriminate multiple closely spaced surfaces from a single surface and is a fundamental determinant of dolphin biosonar performance. In this study, three bottlenose dolphins were trained to discriminate \"phantom\" echoes with a single highlight from those with two highlights. The inter-highlight interval of the two-highlight echo was varied to determine the discrimination threshold. Threshold measurements were conducted at echo delays corresponding to simulated target ranges of 2.5-80 m. Discrimination thresholds were 2-3 μs (∼2 mm) for two dolphins with a full-bandwidth of hearing. A third dolphin with high-frequency hearing loss had higher thresholds, between 3 and 5 μs (∼2-4 mm). Thresholds were independent of simulated range, which contrasts with previous reports that the dolphin's accuracy of range determination-as opposed to range resolution-is reduced beyond 10-20 m. Dolphin discriminations in the current study appeared to be based on a spectral cue that consisted of a low-pass filter effect at the highest audible frequencies in the two-highlight echoes. 
It is unknown if dolphins perceive highlight-dependent spectral interference patterns as a spatial separation of target features along a range axis.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3847-3857"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147839468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
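The threshold figures in the abstract above (2-3 μs corresponding to about 2 mm, 3-5 μs to about 2-4 mm) can be checked with a short calculation. The sketch below is only an illustration: it assumes a nominal sound speed of 1500 m/s, which the abstract does not state, and it models the two highlights as equal-amplitude copies of the same echo, so the function names and the comb-filter formula are assumptions rather than the authors' analysis.

```python
# Assumed nominal sound speed in seawater; not given in the abstract.
SOUND_SPEED_M_PER_S = 1500.0


def highlight_separation_mm(interval_us, c=SOUND_SPEED_M_PER_S):
    """Range separation (mm) of two surfaces whose echoes arrive interval_us apart.

    The echo from the farther surface travels the extra distance twice
    (out and back), hence the division by two.
    """
    return c * (interval_us * 1e-6) / 2.0 * 1e3


def first_spectral_notch_khz(interval_us):
    """First null (kHz) of |1 + exp(-j*2*pi*f*tau)| for two equal highlights.

    Summing a signal with a copy delayed by tau cancels where
    2*pi*f*tau = pi, i.e. at f = 1 / (2*tau).
    """
    return 1.0 / (2.0 * interval_us * 1e-6) / 1e3


for interval in (2.0, 3.0, 5.0):
    sep = highlight_separation_mm(interval)
    notch = first_spectral_notch_khz(interval)
    print(f"{interval:.0f} us -> {sep:.2f} mm, first notch near {notch:.0f} kHz")
```

Under these assumptions, 2-3 μs maps to 1.5-2.25 mm and 3-5 μs to 2.25-3.75 mm, in line with the rounded values quoted in the abstract. The roll-off toward a first notch at 100 kHz or higher is one plausible reading of the high-frequency low-pass cue the abstract describes.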
Orthogonal subspace approach for underwater acoustic wave-vector direction estimation based on acousto-optic sensing.
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-05-01 DOI: 10.1121/10.0043813
Xianyang Li, Boyuan Wang, Ruitao Zhang, Yinghe Ma, XiaoXia Yang, Bin Xue
{"title":"Orthogonal subspace approach for underwater acoustic wave-vector direction estimation based on acousto-optic sensing.","authors":"Xianyang Li, Boyuan Wang, Ruitao Zhang, Yinghe Ma, XiaoXia Yang, Bin Xue","doi":"10.1121/10.0043813","DOIUrl":"https://doi.org/10.1121/10.0043813","url":null,"abstract":"<p><p>High-precision wave-vector direction estimation is critical for underwater acoustic positioning, target detection, and tracking. Traditional array-based methods typically require large apertures, whereas a single acoustic vector sensor depends on inter-channel phase consistency and remains underexplored at mid-to-high frequencies. To overcome these limitations, we replace piezoelectric or electromagnetic principles with the acousto-optic effect for acoustic vector sensing, which provides multidimensional, high-order, non-contact sensing and is well suited to wave-vector sensing in the mid- to high- frequency range. Building on the classical MUSIC method and the acousto-optic sensing principle, we develop an orthogonal-subspace wave-vector direction estimation algorithm (named MUSIC-L) tailored to acousto-optic wave-vector sensing and validate it through simulations and experiments. Simulation results show that the proposed method is robust and that the theoretical error is almost independent of angle; at a signal-to-noise ratio of 10 dB with 80 snapshots (2 MHz sampling rate, 75 kHz source), the root mean square error is 1.4°. Finally, we design and fabricate an acousto-optic vector hydrophone prototype (0.5 m × 0.5 m × 0.185 m) and measure the wave-vector direction in an anechoic tank. 
The results show that, with 80 snapshots, the estimation error remains below 1°, with a standard deviation of approximately 0.23°.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3870-3883"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147839461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Assessing the benefits of using plane wave compounding when estimating backscatter coefficients with an in situ bead as a calibration target.
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-05-01 DOI: 10.1121/10.0043333
Yuning Zhao, Zhengchang Kou, Mingrui Liu, Rita J Miller, Gregory J Czarnota, Michael L Oelze
{"title":"Assessing the benefits of using plane wave compounding when estimating backscatter coefficients with an in situ bead as a calibration target.","authors":"Yuning Zhao, Zhengchang Kou, Mingrui Liu, Rita J Miller, Gregory J Czarnota, Michael L Oelze","doi":"10.1121/10.0043333","DOIUrl":"https://doi.org/10.1121/10.0043333","url":null,"abstract":"<p><p>Quantitative ultrasound (QUS) techniques provide tissue characterization and have applications in precision diagnostics and therapy monitoring. A common QUS approach estimates backscatter coefficients (BSCs) using the external reference phantom method, which does not adequately address attenuation and transmission losses. To address this issue, we previously utilized a 2 mm titanium bead as an in situ calibration reference with conventional focused wave (FW) imaging. Plane wave compounding (PWC), which transmits multiple angled plane waves, enables rapid image acquisition while maintaining quality, making it widely useful in Doppler and elastographic imaging. However, its role in BSC estimation with an in situ bead had not been investigated. We hypothesized that PWC would outperform FW, as diffraction effects vary more slowly away from the bead depth when using PWC. We tested this hypothesis by comparing BSC estimation in phantoms and in vivo rabbit mammary tumors. Results indicated that PWC with in situ calibration better compensated for attenuation loss and reduced variability in scatterer property estimates compared to FW. Specifically, the scatterer diameter variance was 15.2 μm² when using PWC versus 45.4 μm² when using FW where the focus was offset from the bead depth. 
These findings demonstrate the superiority of using PWC with an in situ calibration target for accurate, efficient BSC estimation.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3896-3909"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147839538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Wake-induced variations in noise levels and amplitude modulation for two interacting wind turbines.
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-04-01 DOI: 10.1121/10.0043240
Jules Colas, Ariane Emmanuelli, Didier Dragna, Richard J A M Stevens
{"title":"Wake-induced variations in noise levels and amplitude modulation for two interacting wind turbines.","authors":"Jules Colas, Ariane Emmanuelli, Didier Dragna, Richard J A M Stevens","doi":"10.1121/10.0043240","DOIUrl":"https://doi.org/10.1121/10.0043240","url":null,"abstract":"<p><p>The influence of turbine-turbine interactions on sound propagation is investigated using numerical simulations. Three configurations are examined: turbines aligned downstream of each other, placed side by side, and arranged in a staggered pattern. The simulation framework combines large-eddy simulations for aerodynamic interactions, an aeroacoustic source model to simulate turbine sound emission, and parabolic equation methods for sound propagation. When a second turbine is positioned directly downstream, wake-induced flow focusing enhances sound pressure levels (SPLs) and amplitude modulation (AM) by several decibels downwind. In side-by-side and staggered configurations, SPL increases are limited (<2 dBA), and AM is generally reduced because of spatial averaging. Distinct AM patterns emerge in regions where acoustic contributions from both turbines are comparable. For identical rotor speeds, AM is strongly affected by the angular offset between rotors. When rotor speeds differ slightly, beating effects occur, resulting in intermittent AM. 
These findings highlight the sensitivity of AM to rotor dynamics, a key factor influencing sound perception, with implications for environmental impact and turbine siting.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 4","pages":"3048-3061"},"PeriodicalIF":2.3,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147628019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
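The beating effect mentioned in the abstract above, where slightly different rotor speeds produce intermittent amplitude modulation, follows from simple arithmetic. The sketch below assumes the modulation of each turbine occurs at its blade passing frequency and that both rotors have three blades; neither the blade count nor the rotor speeds used here come from the abstract, and the function names are hypothetical.

```python
def blade_passing_frequency_hz(rotor_rpm, n_blades=3):
    """Blade passing frequency: blades passing a fixed point per second."""
    return n_blades * rotor_rpm / 60.0


def beat_period_s(rpm_a, rpm_b, n_blades=3):
    """Period of the slow envelope produced by two nearby modulation rates.

    Two modulations at f_a and f_b drift in and out of phase with a
    beat frequency |f_a - f_b|; identical speeds give no beating.
    """
    df = abs(
        blade_passing_frequency_hz(rpm_a, n_blades)
        - blade_passing_frequency_hz(rpm_b, n_blades)
    )
    if df == 0.0:
        return float("inf")
    return 1.0 / df


# Illustrative rotor speeds (assumed, not from the paper).
print(beat_period_s(12.0, 12.1))
```

With the illustrative speeds of 12.0 and 12.1 rpm, the blade passing frequencies are about 0.600 Hz and 0.605 Hz, so the modulation would strengthen and fade on a cycle of roughly 200 s.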
Bayesian machine learning framework for time-domain prediction of multirotor vehicle noise.
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-04-01 DOI: 10.1121/10.0043469
Howon Lee, Jeongwoo Ko, Pranay Seshadri, Juergen Rauleder
{"title":"Bayesian machine learning framework for time-domain prediction of multirotor vehicle noisea).","authors":"Howon Lee, Jeongwoo Ko, Pranay Seshadri, Juergen Rauleder","doi":"10.1121/10.0043469","DOIUrl":"https://doi.org/10.1121/10.0043469","url":null,"abstract":"<p><p>This work presents a Bayesian machine learning framework developed to predict aeroacoustic time-series signals generated by a quadrotor vehicle in forward flight at varying velocities. In this effort, a Gaussian process (GP) regression model is trained using a database of simulated signals produced by the Comprehensive Multi-rotor Noise Assessment framework. Unlike traditional frequency-domain models, the GP model directly predicts the time-domain signal, inherently capturing both amplitude and phase information of relevant frequency components. This capability is achieved by partitioning the tonal and broadband components during pre-processing, and capturing each component via a blade passage frequency-informed Fourier kernel and a Gaussian likelihood model, respectively. The resulting model is probabilistic in nature, inherently capturing the associated prediction uncertainty. Quantitative evaluations demonstrate strong agreement with ground truth signals in both time and frequency domains, with mean loudness errors of 1.11% in decibels and 5.55% in sones. The mean psychoacoustic annoyance error is found to be approximately 10%. 
The model is also computationally efficient compared to traditional physics-based solvers, requiring 0.1803 s to generate a time-series signal sampled at 44 100 Hz on a single NVIDIA A100 GPU (NVIDIA, Santa Clara, CA).</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 4","pages":"3418-3435"},"PeriodicalIF":2.3,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147717169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Frequency-domain multiphysics modeling of photoacoustic responses in strongly scattering multilayered human skin.
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-04-01 DOI: 10.1121/10.0043328
Sangmo Kang
{"title":"Frequency-domain multiphysics modeling of photoacoustic responses in strongly scattering multilayered human skin.","authors":"Sangmo Kang","doi":"10.1121/10.0043328","DOIUrl":"https://doi.org/10.1121/10.0043328","url":null,"abstract":"<p><p>This work presents a frequency-domain semi-analytical framework for modeling photoacoustic wave generation and propagation in strongly scattering multilayered human skin. The approach extends previous semi-analytical methods for non-scattering multilayered media by incorporating photon diffusion and its coupling with heat conduction and thermoelastic wave generation within a unified Fourier-domain formulation, enabling accurate characterization of frequency-dependent acoustic responses. A key contribution is the physically consistent treatment of photon transport at refractive-index discontinuities, resolving inconsistencies that can arise when Robin-type conditions are applied at internal interfaces in some diffusion-based layered models. This ensures photon-flux continuity and optical energy conservation across anatomical interfaces, which is critical for accurate predictions in scattering-dominant media. The formulation is validated against benchmark reference results, demonstrating excellent agreement in pressure amplitude and phase. Application to a realistic nine-layer anatomical skin model enables depth-resolved analysis of photon fluence, thermal behavior, and acoustic pressure under physiologically relevant conditions. 
The proposed framework provides an energy-conserving and generalizable foundation for quantitative photoacoustic analysis in layered biological media, offering improved physical fidelity for biomedical ultrasound and photoacoustic imaging applications.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 4","pages":"3181-3200"},"PeriodicalIF":2.3,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147645759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Discrimination and suppression of reverberation by leveraging the property disparity between target echoes and rough seabed reverberation.
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-04-01 DOI: 10.1121/10.0043478
Yifei Zou, Xiukun Li, Ge Yu
{"title":"Discrimination and suppression of reverberation by leveraging the property disparity between target echoes and rough seabed reverberation.","authors":"Yifei Zou, Xiukun Li, Ge Yu","doi":"10.1121/10.0043478","DOIUrl":"https://doi.org/10.1121/10.0043478","url":null,"abstract":"<p><p>In shallow water environments, reverberation, which is generated from the rough seabed, remains one of the major sources of background interference in active sonar systems. This paper proposes a reverberation suppression method that leverages multidimensional property differences, designed for a rough seabed. The approach establishes a link between variations in seabed reflection and scattering coefficients and the spectral property of reverberation signals, providing a physical basis for effective suppression. Using the Wigner-Ville distribution, reverberation suppression is formulated as the identification and utilization of time-frequency feature disparities between reverberation and target echoes. The method consists of two stages. First, seed region growing is enhanced using a multi-feature framework and Rényi entropy to suppress reverberation in the time-frequency plane. The multi-feature framework evaluates the stability and similarity of the instantaneous frequency sequence to screen the initial target echo position, whereas the Rényi entropy of the time-frequency domain adaptively adjusts the growing thresholding. In the second stage, the extracted time-frequency results are combined with a de-chirp procedure to reconstruct the time-domain echoes. 
Simulation and experimental results under various shallow water conditions demonstrate that the proposed method effectively suppresses reverberation while accurately preserving the target echo.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 4","pages":"3499-3514"},"PeriodicalIF":2.3,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147774905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
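The abstract above uses the Rényi entropy of the time-frequency domain to adapt a region-growing threshold. As a rough illustration of the quantity itself, the sketch below computes an order-alpha Rényi entropy for a small grid of time-frequency values. It is not the authors' procedure: the choice of alpha = 3, the use of magnitudes to avoid negative cells (a Wigner-Ville distribution can take negative values), and the function name are all assumptions made here.

```python
import math


def renyi_entropy_bits(tf_energy, alpha=3.0):
    """Order-alpha Renyi entropy (bits) of a 2-D time-frequency grid.

    tf_energy is a list of rows. Magnitudes are normalized to sum to one
    so the grid can be treated like a probability distribution; this is a
    simplification adopted for the sketch.
    """
    if alpha == 1.0:
        raise ValueError("alpha = 1 is the Shannon limit; use alpha != 1")
    cells = [abs(v) for row in tf_energy for v in row]
    total = sum(cells)
    if total == 0.0:
        raise ValueError("time-frequency grid has no energy")
    power_sum = sum((v / total) ** alpha for v in cells)
    return math.log2(power_sum) / (1.0 - alpha)


# Energy spread evenly over 16 cells versus packed into a single cell.
spread = [[1.0] * 4 for _ in range(4)]
compact = [[0.0] * 4 for _ in range(4)]
compact[1][2] = 1.0

print(renyi_entropy_bits(spread))
print(renyi_entropy_bits(compact))
```

A diffuse grid scores high (log2 of the number of occupied cells when the energy is uniform) and a concentrated one scores near zero, which is why such a measure can indicate how cluttered a time-frequency plane is.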
Sensitivity to interaural phase as a function of frequency: Age effects measured with behavioral and electrophysiological tasks.
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-04-01 DOI: 10.1121/10.0043472
John H Grose, Emily Buss
{"title":"Sensitivity to interaural phase as a function of frequency: Age effects measured with behavioral and electrophysiological tasks.","authors":"John H Grose, Emily Buss","doi":"10.1121/10.0043472","DOIUrl":"https://doi.org/10.1121/10.0043472","url":null,"abstract":"<p><p>Temporal fine-structure processing, as measured with binaural tasks, declines with increasing age in adults. The purpose of this study was to determine whether an electrophysiological test of binaural temporal processing could be used as a proxy measure for behavioral performance to reliably capture this age dependence. The behavioral measure was the upper frequency limit for differentiating in-phase from out-of-phase tones. This was assessed in both quiet and in background noise. The electrophysiological measure was the acoustic change complex elicited by epochs of interaurally out-of-phase frequency modulation carried by tones of different frequencies. Adults with normal/near-normal hearing were tested on both measures. There were 20 participants in each of the three age groups categorized as young, middle-aged, and older. The upper frequency limit measured behaviorally declined with age. It also declined in the presence of background noise, but the noise effect was equivalent across age groups. The robustness of the acoustic change complex also declined with age and as a function of the carrier frequency. 
However, correlations between the behavioral and electrophysiological measures were modest at best, suggesting that the electrophysiological test as implemented in this study did not provide a robust proxy for behavioral performance.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 4","pages":"3489-3498"},"PeriodicalIF":2.3,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147775002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Spectrogram-derived graphs and inductive learning for multi-label avian vocalization detection in field recordings.
IF 2.3 Zone 2 Physics and Astrophysics
Journal of the Acoustical Society of America Pub Date : 2026-04-01 DOI: 10.1121/10.0043592
Noumida A, Rajeev Rajan
{"title":"Spectrogram-derived graphs and inductive learning for multi-label avian vocalization detection in field recordings.","authors":"Noumida A, Rajeev Rajan","doi":"10.1121/10.0043592","DOIUrl":"https://doi.org/10.1121/10.0043592","url":null,"abstract":"<p><p>This paper presents a methodology that employs inductive spatial geometric deep learning networks to detect multiple avian vocalizations from field recordings. Initially, a graph is constructed from the Mel-spectrogram of each audio file using a trained deep convolutional neural network (Deep CNN). The extracted features are used to build a node-feature graph, which is then processed by two spatial inductive graph-based models: graph sample and aggregation (GraphSAGE) and the graph attention network (GAT), for multi-label classification. To enhance the robustness and generalization of the Deep CNN, SpecAugment is applied to generate additional Mel-spectrograms via data augmentation. The proposed framework is evaluated on the Xeno-canto bird sound database and compared against state-of-the-art methods. The results demonstrate that the proposed inductive spatial graph-based approach outperforms existing techniques, achieving macro F1-scores of 0.90 with GraphSAGE and 0.92 with GAT. 
We further replaced Deep CNN with AudioProtoPNet-20 and evaluated GAT on the Xeno-canto dataset, obtaining a macro F1-score of 0.93.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 4","pages":"3598-3609"},"PeriodicalIF":2.3,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147775038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
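The abstract above reports macro F1-scores for multi-label classification. For readers unfamiliar with the metric, the sketch below computes it from binary label matrices: an F1-score is computed per label and the scores are averaged with equal weight. The toy labels are invented for illustration, and scoring a label with no positives as 0.0 is a convention chosen here, not something stated in the abstract.

```python
def macro_f1(y_true, y_pred):
    """Macro-averaged F1 for multi-label data.

    y_true and y_pred are lists of equal-length 0/1 lists, one row per
    recording and one column per label (for example, per species).
    """
    n_labels = len(y_true[0])
    scores = []
    for k in range(n_labels):
        pairs = [(t[k], p[k]) for t, p in zip(y_true, y_pred)]
        tp = sum(1 for t, p in pairs if t == 1 and p == 1)
        fp = sum(1 for t, p in pairs if t == 0 and p == 1)
        fn = sum(1 for t, p in pairs if t == 1 and p == 0)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0.0 here when empty.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / n_labels


# Four toy recordings, three hypothetical species labels.
truth = [[1, 0, 1], [0, 1, 0], [1, 1, 0], [0, 0, 1]]
preds = [[1, 0, 1], [0, 1, 1], [1, 0, 0], [0, 0, 1]]
print(round(macro_f1(truth, preds), 4))
```

In this toy case the per-label scores are 1.0, 2/3, and 0.8, so a single weak label pulls the macro average down to about 0.82 even though most predictions are correct.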