{"title":"Estimating band importance for environmental sound recognition using deep learninga).","authors":"Eric M Johnson, Eric W Healy","doi":"10.1121/10.0043736","DOIUrl":"10.1121/10.0043736","url":null,"abstract":"<p><p>Environmental sound recognition (ESR) enables listeners to interpret complex acoustic environments, yet the frequency regions that support recognition are poorly understood. This study used deep learning to model ESR in competing speech and estimate frequency band-importance functions (BIFs) underlying recognition performance. Trial-level responses were collected from 46 listeners who identified 25 everyday sounds mixed with speech across a wide range of target-to-masker ratios. Two model variants were evaluated: one trained to mimic human performance, which was trained on soft labels derived from listener responses, and one trained for maximum accuracy, which was trained on ground-truth correct sound labels, enabling a direct comparison between perceptually driven and task-optimal band-importance patterns. The human-trained model closely reproduced key features of human performance, whereas the ground-truth-trained model exceeded human accuracy and showed highly reliable performance across cross-validation folds. BIFs were estimated by bandstop filtering the target signal and quantifying the resulting drop in recognition accuracy. Both model variants yielded reproducible BIFs with five prominent peaks (∼0.43, 0.77, 1.46, 2.6, and 9.7 kHz), largely driven by subsets of sounds having sharply tuned spectral dependence. This convergence across training objectives suggests that human performance closely reflects the task-optimal frequencies for segregating environmental sounds from speech maskers.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3804-3818"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147816708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Andrea Beni, Andrea P C Bresciani, Julien Christophe, Christophe Schram
{"title":"Performance and noise emission analysis of coaxial co-rotating propellersa).","authors":"Andrea Beni, Andrea P C Bresciani, Julien Christophe, Christophe Schram","doi":"10.1121/10.0043731","DOIUrl":"https://doi.org/10.1121/10.0043731","url":null,"abstract":"<p><p>The trade-off between performance and noise emissions has progressively established itself as a key factor in the emergence of Urban Air Mobility. This work addresses the relationship between the propulsive efficiency and noise emissions of drone-sized coaxial, co-rotating propellers operating in hover, for varying phase offset ϕ and axial separation Δz between the rotors. Experimentally, an antagonistic behavior emerged between power loading and broadband noise, while a modulation of the tonal noise with varying phase offset was observed. The physical mechanism modulating tonal noise is explained using an analytical approach accounting for the acoustic interferences that occur for unevenly spaced rotors, based on Goldstein's formulation for the tonal noise generated by rotating dipole sources. Furthermore, a combination of a commercial panel method and a rotating dipole source model is proposed as a fast-turnaround approach to predict aerodynamic loads and tonal noise. The acoustic predictions obtained for the second shaft harmonic are consistent with the experimental trends, given the inherent limitations of the solver employed, and assess the potential of the proposed method for a multidisciplinary early-stage design phase.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3910-3924"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147839464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zengxian Guo, Jia Li, Yingxian Guo, Songsong Zhang, Teng Ma, Guojun Wang, Qiang Wang, Xiaoji Liu, Zhijia Zhang, Hao Wei, Lin Wang
{"title":"Fourier modal method with enhanced transmittance matrix for diffraction analysis of multilayer two-dimensional acoustic metamaterial gratings.","authors":"Zengxian Guo, Jia Li, Yingxian Guo, Songsong Zhang, Teng Ma, Guojun Wang, Qiang Wang, Xiaoji Liu, Zhijia Zhang, Hao Wei, Lin Wang","doi":"10.1121/10.0043728","DOIUrl":"https://doi.org/10.1121/10.0043728","url":null,"abstract":"<p><p>This paper adapts a well-established optical Fourier modal method to acoustics and proposes a Fourier modal method with an enhanced transmittance matrix for investigating the diffraction characteristics of multilayer two-dimensional acoustic metamaterial gratings. This method analytically solves the acoustics equation in the Fourier domain and matches the analytical solutions at boundaries to obtain the overall diffraction information. Numerical results demonstrate that this method yields results consistent with those of the finite-element method for both oblique and normal incidence, while offering significant computational advantages compared with full-wave spatial discretization for relatively simple periodic unit-cell geometries. The primary practical value of the framework lies in the rapid prediction of the overall reflection and transmission of multilayer periodic acoustic gratings, which is particularly attractive for parameter sweeps and design optimization. When higher-order propagating channels are open, this method also provides direct access to the contributions of individual diffraction orders. The proposed framework therefore provides a fast and accurate tool for the analysis and design of multilayer acoustic-metamaterial gratings.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3835-3844"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147839510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Heart heard in motion.","authors":"Damien Garcia","doi":"10.1121/10.0043734","DOIUrl":"https://doi.org/10.1121/10.0043734","url":null,"abstract":"","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"R9-R10"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147816711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Morphology modulation and its forefront implication of a three-dimensional particulate-fluid system by weaving acoustic field.","authors":"Zhenghui Qiao","doi":"10.1121/10.0043733","DOIUrl":"https://doi.org/10.1121/10.0043733","url":null,"abstract":"<p><p>Relied on three-dimensional metamaterial-based particulate-fluid system, an acoustic weave platform increases the sound pressure amplitude by frequency bandgap peak gain (Q-factor 4.5) in acoustically regular air media system with ways that a conventional method cannot. The interesting morphology alternations of the aggregation, fluidization, and trapping for numerous expanded-polystyrene particles (1-7.5 mm) were experimentally observed by engineering acoustic field in the low-frequency range of <0.8 kHz, improving the weak phenomenon in the absence of acoustic-metamaterial design. With vertical square-waveguide arrayed uniformly 12 of Helmholtz sound sources, the platform modulates the acoustic wave-packet movement and amplifies resonantly the time-spatial Y-shape-bifurcated-aggregation 54.7°-long-short-range-attraction wave phenomenon of complex macro soft-matter particles. Through experiment coinciding with simulation and theory, the main behaviors' phenomena were accurately explained by acoustic radiation force and secondary radiation force joint with the modulated three-dimensional acoustic field. The particle fluidization and trapping occur on contrary acoustic gradient fields at 220 and 250 Hz, respectively. There exist several vertically parallel \"chiral\" layer thin-film-aggregation stripes of millimeter-scale particles also obviously appearing at 220 Hz, more intuitively displaying the quasi-waves' constructive and destructive interferences of mm-scale particles themselves for wave-particle duality theory. The macro wavelike character helps to conveniently modulate collectively the environmental behaviors of fly-ash.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3792-3803"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147816746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
João Fatela, Anne Heimes, Michael Vorländer, Massimiliano Masullo, Luigi Maffei
{"title":"Acoustic radiosity simulation with custom bidirectional reflectance distribution implementation in three dimensionsa).","authors":"João Fatela, Anne Heimes, Michael Vorländer, Massimiliano Masullo, Luigi Maffei","doi":"10.1121/10.0043765","DOIUrl":"https://doi.org/10.1121/10.0043765","url":null,"abstract":"<p><p>Geometrical acoustics methods provide efficient techniques to estimate sound propagation properties in virtual spaces. Advances based on acoustic radiance transfer principles facilitate the simulation of environments with complex scattering distributions via the implementation of bidirectional reflectance distribution functions (BRDF). Though significant work has been developed in the optimization of such methods, the effects of complex BRDF implementation have seldom been studied. In this article, two scattering distribution modeling approaches are compared, based on random incidence and bidirectional scattering coefficients, respectively. Their implementation in an acoustic radiosity (AR) simulation tool is discussed. The simulation of a single façade shows that random-incidence modeling in the radiosity tool aligns with state-of-the-art ray-tracing (RT) simulation. Bidirectional scattering approach results of retro-reflective surface simulations are closest to a reference wave-based simulation output, meaning they capture more of the spatial details of the scattering distribution. The simulation of a street canyon scene further highlights how the choice of scattering modeling leads to significant differences in the energy time curve. The results evidence minor artifacts due to the AR and RT implementations. Major computational performance metrics are also assessed.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3925-3940"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147839455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Observing long-term adaptation and generalization to an untrained set of head-related transfer functions.","authors":"Julie Meyer, Lorenzo Picinali","doi":"10.1121/10.0043815","DOIUrl":"https://doi.org/10.1121/10.0043815","url":null,"abstract":"<p><p>Adaptation to altered or non-individual head-related transfer functions (HRTFs) has been widely studied in individuals with normal hearing. Studies have shown that after an adaptation period, individuals can significantly improve their sound localization performance with altered cues. Recently, it has been shown that after training with one set of HRTFs, improvements can also be observed with another set, a phenomenon referred to as generalization. This study provides significantly more evidence of this phenomenon by including a larger number of participant groups and a longer training period compared to prior work. The experiment involved four groups of participants: two control groups (one with individual HRTFs, one with non-individual HRTFs) only performing localization tests and two training groups (one with individual HRTFs, one with non-individual HRTFs) completing both localization tests and training sessions. In all cases, the stimulus consisted of a male speech sentence reproduced binaurally. Both training groups showed significant and comparable improvements in localization performance. Moreover, generalization effects were observed and found to be similar between the two training groups.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3884-3895"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147839527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sound-field-projection synthesis using latent diffusion model for acousto-optic reconstruction.","authors":"Risako Tanigawa, Kenji Ishikawa, Noboru Harada, Yasuhiro Oikawa","doi":"10.1121/10.0043764","DOIUrl":"https://doi.org/10.1121/10.0043764","url":null,"abstract":"<p><p>Acousto-optic sensing (AOS) is a non-contact method for measuring sound using light, suitable for environments where microphones are impractical, such as confined spaces or within airflow. Despite its effectiveness, AOS captures line-integrated sound pressure along the optical path, resulting in signals that cannot be interpreted as those from point-wise microphones. To obtain the sound pressure distribution in three-dimensional (3D) space, volumetric sound-field reconstruction is required. Existing methods require multi-directional line-integrated projection data, typically obtained either by deploying multiple devices or by repeatedly exciting the sound source. The former requires deploying multiple high-cost devices, which is often impractical, whereas the latter is not applicable to sound sources that cannot be reproduced consistently. To overcome these limitations, we propose a task called sound projection synthesis, which synthesizes data in specified directions based on observation data. We achieve this by using a latent diffusion model conditioned on observed projections and view angles to generate new sound-field projections. A model pretrained on natural images was fine-tuned using sound-field data and optimized with a pixel-wise loss. Experiments demonstrate that the model can generate realistic projection data. Combining nine observed views with nine generated views improved 3D reconstruction accuracy compared with using only nine observed views.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3858-3869"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147839451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yongshuai Gao, Yan Wang, Yu Hao, Yuang Li, Zhiyao Du
{"title":"Underwater source range and depth estimation using a vertical dual-hydrophone configuration in a deep sea bottom bounce areaa).","authors":"Yongshuai Gao, Yan Wang, Yu Hao, Yuang Li, Zhiyao Du","doi":"10.1121/10.0043782","DOIUrl":"https://doi.org/10.1121/10.0043782","url":null,"abstract":"<p><p>This paper addresses the challenge of pulse source localization in a deep sea bottom bounce area using multipath time differences via a vertical dual-hydrophone configuration. By analyzing the variation trends of multipath time-delay differences with respect to source range and depth in deep sea bottom bounce area, a set of analytical expressions relating multipath time-delay differences to source range and depth is derived based on geometric relationships. Furthermore, by integrating ray tracing technology with simulated annealing and Bayesian inversion methods, source localization within the range-depth two-dimensional plane is achieved. Simulation results indicate that the source positions calculated using the geometric analytical expressions can provide starting values for the other two methods. The Bayesian inversion approach demonstrates higher accuracy in estimating source range compared to simulated annealing. However, simulated annealing yields better depth estimation results with higher concentration. Sea trial results confirm the effectiveness of both methods in localizing cooperative sources under high signal-to-noise ratio conditions.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3941-3956"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147839546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Won Gu Jung, Do Yong Kim, Jung Wook Lee, Yeon June Kang
{"title":"Deep learning-based approach for linking microstructural and macroscopic acoustic properties of sound-absorbing polyurethane foam.","authors":"Won Gu Jung, Do Yong Kim, Jung Wook Lee, Yeon June Kang","doi":"10.1121/10.0043476","DOIUrl":"https://doi.org/10.1121/10.0043476","url":null,"abstract":"<p><p>Sound absorption in porous materials is fundamentally governed by their microstructural morphology yet establishing a quantitative and design-oriented relationship between microstructure and acoustic behavior remains challenging. To address this challenge, a deep learning-based acoustic modeling framework is proposed for analyzing the microstructure of flexible polyurethane (PU) foam and predicting its acoustic performance. A microscopic analysis model is developed to semantically segment SEM images using a U-Net model and quantitatively extract the distribution parameters of microstructural properties, including cell size, pore size, pore shape factor, and strut thickness. An artificial neural network model is developed to model the relationship between these microstructural parameters and acoustic performance measured using an impedance tube, based on 210 flexible PU foam samples including thermally aged and non-aged materials from multiple manufacturers. Finally, the proposed approach is validated through comprehensive performance evaluation, comparison with experiments and alternative methods, and analysis of microstructural parameter contributions. The validated framework establishes a quantitative link between microscale morphology and acoustic performance, providing data-driven acoustic insights and practical guidance for acoustic material design, including feature selection, optimization of acoustic performance, and fabrication of sound-absorbing materials for applications such as automotive and construction.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 5","pages":"3819-3834"},"PeriodicalIF":2.3,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147816703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}