Brian Roberts, Stephen D Holmes, Christopher J Darwin
{"title":"Perception of concurrent sentences: Within and across-formant grouping in harmonic and frequency-shifted speech.","authors":"Brian Roberts, Stephen D Holmes, Christopher J Darwin","doi":"10.1121/10.0043024","DOIUrl":"https://doi.org/10.1121/10.0043024","url":null,"abstract":"<p><p>Keyword identification in one of two simultaneous sentences is substantially improved when they differ in fundamental frequency (F0); this effect is greatest for almost continuously voiced speech. ΔF0 may act by improving voice identification [better first-formant (F1) definition or better across-formant grouping] or voice tracking. Sentences were monotonized and resynthesized to give a range of ΔF0s (0-10 semitones; F0s = 90-160 Hz). Sentences were additionally resynthesized after applying a frequency shift of 25% of F0 to the monotonized excitation source-making it inharmonic but with regularly spaced components-while preserving the original formant frequencies. Sentence pairs were created by embedding shorter targets within longer interferers. The large improvement with increasing ΔF0 found for harmonic sentences was reduced but still substantial for frequency-shifted sentences. In both cases, swapping target and interferer F0s across spectral regions (below vs above 800 Hz, F1 vs higher formants) caused substantial intelligibility loss only for large ΔF0s, indicating an important effect of changes in across-formant grouping. Listeners made few target-tracking errors, but these errors were more frequent for smaller ΔF0s and for stimuli with more ambiguous pitches. The results extend the range of perceptual phenomena usually attributed to harmonic processing to grouping by spectral regularity.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2487-2497"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147468383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shane V Lympany, Michael M James, Matthew F Calton, Alexandria R Salton, Katrina Pedersen, Mark K Transtrum, Kent L Gee
{"title":"A geospatial model of global ambient sound levelsa).","authors":"Shane V Lympany, Michael M James, Matthew F Calton, Alexandria R Salton, Katrina Pedersen, Mark K Transtrum, Kent L Gee","doi":"10.1121/10.0042979","DOIUrl":"https://doi.org/10.1121/10.0042979","url":null,"abstract":"<p><p>This paper presents ambient | global, an ambient soundscape model developed to predict global ambient sound levels from all anthropogenic, biological, and geophysical sources. The soundscape model adopts a geospatial approach by modeling the ambient sound level as a function of geospatial features at a location. The soundscape model consists of an ensemble of four machine learning regression models fitted at acoustic measurement sites where both the geospatial features and ambient sound levels are known. The fitted model is then applied to predict ambient sound levels at any location where the geospatial features are known. The results quantify the spatial, temporal, and spectral patterns of ambient sound levels across the world under various scenarios. This paper presents maps of the existing ambient sound levels across the world in terms of the daytime overall A-weighted L50, or median sound level, and partitions the existing sound levels into their natural and anthropogenic constituents. Ultimately, the soundscape model will enable research into the impacts of humans and nature on the ambient soundscape and the impacts of ambient sound levels on humans and nature across the world.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2444-2459"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147468412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nicole Miller-Viacava, Regis Ferriere, Nicholas R Friedman, Timothy C Mullet, Jérôme Sueur, Jacob Willie, Christian Lorenzi
{"title":"Spectro-temporal modulation templates for human perception of animal vocalisations in natural soundscapesa).","authors":"Nicole Miller-Viacava, Regis Ferriere, Nicholas R Friedman, Timothy C Mullet, Jérôme Sueur, Jacob Willie, Christian Lorenzi","doi":"10.1121/10.0042827","DOIUrl":"https://doi.org/10.1121/10.0042827","url":null,"abstract":"<p><p>The human ability to hear animal vocalisations in natural settings was studied by asking naive participants to detect vocalisations in 11 520 samples from a large database of soundscapes varying in terms of habitat (latitude), moment of the day, and precipitation period. Results show that agreement was moderate/good across participants, and the number of vocalisation detections varied systematically as a function of the three ecological factors: Vocalisations were more often detected in the most equatorial habitats, at sunrise and sunset, and for the highest precipitation period. These results are consistent with the latitudinal gradient of biodiversity and diel and seasonal cycles of animal vocal activity. Analysis of modulation power spectra for sounds categorised as containing vocalisations indicates that listeners may base their decisions upon a variety of spectro-temporal modulation cues that are mostly influenced by the openness of the habitat (savannah and desert versus forests). This finding may reflect either adaptation of animal vocal production to the specific sound-propagation characteristics of their native environment, and/or the filtering and distortion effects caused by sound-propagation within natural habitats. This study reveals important ecological constraints on sensory mechanisms involved in detection of vocalisations and provides modulation templates involved in this process for further research.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2425-2443"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147468432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Statistical wave field theory: Anisotropic wave fields under Neumann's boundary condition.","authors":"Roland Badeau","doi":"10.1121/10.0042450","DOIUrl":"https://doi.org/10.1121/10.0042450","url":null,"abstract":"<p><p>The statistical wave field theory mathematically establishes the statistical laws of the solutions to the wave equation in a bounded domain. It provides the closed-form expressions of the power distribution and the correlations of the wave field jointly over time, frequency, and space, which hold at high frequency and after many reflections, in terms of the geometry and the specific admittance of the boundary surface. This theory was originally developed in the particular case of mixing rooms, which are characterized by a diffuse wave field, based on the theory of dynamical billiards and on Weyl-like asymptotic laws. Then it was extended to the finite family of special polyhedra, where the wave field is anisotropic, based on a simpler geometric approach related to mathematical crystallography. In this paper, we introduce a unified version of the theory dedicated to a class of semi-mixing billiards. In the case of Neumann's boundary condition, we show that the wave field is stationary, but it is generally anisotropic. In particular, the correlation between two spatial positions at a given frequency is different from the well-known cardinal sine formula that characterizes diffuse acoustic fields.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2265-2280"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147433619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Evi Ofekeze, Jeffrey B Johnson, Hans-Peter Marshall
{"title":"A deep learning approach for infrasound signal detection in snow-covered terrain.","authors":"Evi Ofekeze, Jeffrey B Johnson, Hans-Peter Marshall","doi":"10.1121/10.0043176","DOIUrl":"10.1121/10.0043176","url":null,"abstract":"<p><p>Infrasound signal detection using arrays is an established method for detecting signals from a variety of cultural and geophysical sources. Array processing requires analysis from two or more sensors at an infrasound station. By employing array techniques, signals are isolated from various sources of noise by computing coherence between recorded signals across spatially distributed sensors at a station. Array processing requires multiple sensors and intra-array comparisons that potentially increases time required for event detection. This study presents a signal detection method utilizing deep learning to identify high-quality signals in infrasound waveforms with data from a single infrasound sensor, which might permit event discrimination without using arrays. The approach involves using audio signal processing techniques, applied to infrasound frequencies, to extract a feature space, which is then used to train a deep learning model. A 25-day continuous dataset featuring diverse infrasonic events that include potential avalanches, vehicle traffic, and explosions, collected from Little Cottonwood Canyon, Utah, USA was analyzed for this study. The data provide a convenient testbed for high-quality signal identification using single channel infrasound. The sensor network was motivated by the goal of identifying snow avalanches. Performance was assessed for an optimized model over a range of test data.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2795-2809"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147503951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Arthur Varon, Julien Bonnel, Jerome Mars, Yongsung Park, Remi Emmetiere
{"title":"A deep learning approach to broadband modal propagation in various shallow water waveguides.","authors":"Arthur Varon, Julien Bonnel, Jerome Mars, Yongsung Park, Remi Emmetiere","doi":"10.1121/10.0043177","DOIUrl":"https://doi.org/10.1121/10.0043177","url":null,"abstract":"<p><p>Normal mode simulations of underwater acoustic propagation can be computationally intensive, particularly for broadband signals or iterative applications like inversion. An approach using neural network (NN) is introduced to approximate and accelerate these simulations. The NN predicts modal parameters, such as the horizontal wavenumbers and modal depth functions. Modal parameters are predicted and can subsequently be used to compute propagation for arbitrary source-receiver configurations. To address the challenge of dynamic ocean environments or unknown seabeds, the model is trained across different range-independent environments. Training data were generated using the Kraken normal mode code for shallow oceanic waveguides with variable environmental parameters and frequencies within 50-500 Hz. The proposed NN is conditioned on mode and frequency, enabling efficient broadband predictions. Evaluated on environments unseen during training, the NN accurately approximates modal parameters. Once trained, the proposed approach can reduce computation time for modal parameters by an order of magnitude compared to conventional codes, such as Kraken. This efficiency could support demanding applications, like geoacoustic inversion or simulations on computationally constrained platforms.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2889-2901"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147513061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Acoustic interference pattern restoration in shallow water with internal solitary waves based on time-invariant characteristics.","authors":"Ziyu Zhang, Chao Sun, Lei Xie, Qixuan Zhu","doi":"10.1121/10.0042992","DOIUrl":"https://doi.org/10.1121/10.0042992","url":null,"abstract":"<p><p>Acoustic interference patterns (AIPs) typically exhibit a regular striation structure in shallow water. However, internal solitary waves (ISWs) commonly found in the ocean can cause severe distortion in AIPs, thereby presenting significant challenges to acoustic signal processing methods reliant on AIPs. Grounded in coupled mode theory, this paper elucidates the mechanism of AIP distortion induced by ISWs. Specifically, ISWs introduce several time-varying \"coupled\" components while maintaining the time-invariant \"adiabatic\" components. Leveraging the distinct temporal characteristics of these two classes of components, this paper proposes a time-averaging processing (TAP) method. This method averages the AIPs from multiple snapshots to preserve the time-invariant \"adiabatic\" components while suppressing the time-varying \"coupled\" components. Consequently, the TAP method restores the distorted AIP to its striation structure. The efficacy of the proposed method is substantiated through validation using simulated data. Moreover, the TAP method provides a foundation for enhancing the performance of underwater acoustic signal processing methods reliant on AIPs.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2211-2226"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147433906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hossein Zafari, Tohid Mardaneh, Ali Tarkashvand, Zahra Daneshjoo
{"title":"Stiffener-induced stiffness/mass suppression of the exterior sound field in sandwich structures with open-cell foam cores: Periodically arranged stiffeners.","authors":"Hossein Zafari, Tohid Mardaneh, Ali Tarkashvand, Zahra Daneshjoo","doi":"10.1121/10.0042816","DOIUrl":"https://doi.org/10.1121/10.0042816","url":null,"abstract":"<p><p>Sandwich cylindrical shells with porous polymeric foam cores and periodically distributed stiffeners are attractive candidates for lightweight control of surrounding acoustic pressure, yet their acoustic scattering in realistic multilayered configurations remains poorly characterized. This study develops an analytical framework for predicting the total scattering cross section of a sandwich cylindrical shell under oblique plane wave incidence for a specific configuration combining a functionally graded (FG) outer facesheet, an FG open-cell porous polymeric foam core, an isotropic inner layer separated by air gaps, orthogrid (ring-string) stiffeners, and coupled interior and exterior acoustic fluids. Structural dynamics are modeled using first-order shear deformation theory and Hamilton's principle, while the foam core is treated as an FG viscoelastic medium with a frequency-dependent complex modulus. Continuity of acoustic pressure and normal velocity at the fluid-structure interfaces couples the interior and exterior acoustic fields and yields the scattered pressure and total scattering cross section. Numerical results show that periodic ring-string stiffening strongly attenuates low-frequency resonances and shifts the first major scattering peak to higher frequencies, thereby reducing low-frequency scattering levels. The stiffened configuration also reduces the peak magnitude of the surrounding acoustic pressure (from approximately ±1.5 Pa to ±1.0 Pa at 10 kHz), indicating a weaker scattered field.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2009-2024"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147377966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christina J Naify, Bogdan-Ioan Popa, Serife Tol, Michael R Haberman
{"title":"Introduction to the special issue on active and tunable acoustic metamaterialsa).","authors":"Christina J Naify, Bogdan-Ioan Popa, Serife Tol, Michael R Haberman","doi":"10.1121/10.0043097","DOIUrl":"https://doi.org/10.1121/10.0043097","url":null,"abstract":"<p><p>Acoustic metamaterials are a class of architected materials with dynamic properties that are designed at the sub-wavelength scale to achieve exotic or unique macroscopic response. Although early concepts of acoustic metamaterials relied on static configurations, recent research has further expanded the limits of acoustic customization by incorporating active or tunable responses. This article provides an introduction to the special issues of The Journal of the Acoustical Society of America and JASA Express Letters on active and tunable acoustic metamaterials and begins with a brief description of the general categories of active control and tunable response included in the contributions to the special issue and, then, provides a brief description of the articles in this special issue, grouped by general category, and how the research presented in these works contribute to the advancement of acoustic metamaterial research.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2591-2595"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147486421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive noise reduction and speech intelligibility using shape memory metamaterial-based hearing protection earmuffs.","authors":"Musa'ab Ejaz, Nabihah Sallih, Venkata Somi Reddy Janga, Yasir Mujahid, Masturah Mesri, Mohamed Latheef, Leong Yin Liong, Chrisminder Dain, Tong Boon Tang, Normani Zakaria, Umair Khan","doi":"10.1121/10.0043163","DOIUrl":"https://doi.org/10.1121/10.0043163","url":null,"abstract":"<p><p>Conventional passive earmuffs, which have a single number rating (SNR) of 28-32 dB, can effectively protect users' hearing but block useful lower-intensity sound waves, such as speech, causing communication difficulties. This leads employees to remove their earmuffs to comprehend important information or to avoid miscommunication while wearing earmuffs, which might lead to accidents. This problem can be solved using pressure level-dependent passive earmuffs with selective frequency and intensity filtration to allow lower intensity waves (such as speech) to pass through while blocking high-intensity waves (hazardous noise). This study evaluates speech intelligibility and hearing protection of an in-house developed pressure level-dependent passive earmuff prototype made of shape memory metamaterial inserts. Hearing protection and speech intelligibility were evaluated in a controlled environment. Numerical simulations of earcups and inserts were performed to optimize their design and obtain the required noise reduction at various intensities. The prototype can ensure an in-ear sound pressure level of less than 82 dB (across frequencies) up to 110 dB pink noise. The prototype exhibits fair speech intelligibility in both quiet and noisy environments. User-experience feedback indicated that 85% of the participants preferred the shape memory metamaterial earmuffs over commercially available earmuffs, for use in diverse working environments.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 3","pages":"2647-2669"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147491171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}