{"title":"End-to-end underwater acoustic transmission loss prediction with adaptive multi-scale dilated network.","authors":"Zhao Sun, Yongxian Wang, Wei Liu","doi":"10.1121/10.0034857","DOIUrl":"https://doi.org/10.1121/10.0034857","url":null,"abstract":"<p><p>Underwater acoustic propagation is a complex phenomenon in the ocean environment. Traditional methods for calculating acoustic propagation loss rely on solving complex partial differential equations. Deep learning methods, leveraging their robust nonlinear approximation capabilities, can model various physical phenomena effectively, significantly reducing computation time and cost. Despite considerable advancements in the study of various inverse underwater acoustic problems, research focused on forward physical modeling is still nascent. This study proposes an end-to-end architecture for predicting underwater acoustic transmission loss (TL). This architecture employs a data-driven approach capable of swiftly and accurately predicting the complete acoustic field. It employs a U-Net model integrated with an adaptive multi-scale dilated module, named MultiScale-DUNet, which effectively predicts by assimilating multi-scale acoustic field information. It is demonstrated that the MultiScale-DUNet is capable of predicting acoustic TL in complex two-dimensional ocean environments within the end-to-end framework. The results indicate that the MultiScale-DUNet can rapidly predict the acoustic TL while maintaining high accuracy under computationally inexpensive conditions. This end-to-end technology for predicting underwater acoustic TL holds broad application prospects in fields such as underwater exploration and real-time underwater monitoring.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"382-395"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143007440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiangyu Meng, Zheng Yang, Zhijuan Zhu, Lingyun Ye, Xinglin Sun
{"title":"Underwater low-power electromagnetic transducers with Central permanent magnet integration.","authors":"Xiangyu Meng, Zheng Yang, Zhijuan Zhu, Lingyun Ye, Xinglin Sun","doi":"10.1121/10.0034844","DOIUrl":"https://doi.org/10.1121/10.0034844","url":null,"abstract":"<p><p>High-efficiency electromagnetic transducers are crucial for enabling the self-sustained operation of underwater electromagnetic sound sources under power-constrained conditions as noted by Hao, Xie, and Ma [Proceedings of the 2019 Western China Acoustics Academic Conference, Guangzhou, China (November 5-9, 2019)]. This paper proposes a permanent magnet drive technology to enhance the electromechanical conversion efficiency of can-type electromagnetic transducers under low-power driving conditions. The can-type transducers consist of coils, an armature, and a cylindrical magnetic core with a central pillar, similar to the pot core proposed by Cui, Xu, Xu, and Shui [Electr. Eng. 101, 911-919 (2019)]. The proposed driving technology involves adding a permanent magnet to the central pillar to introduce a direct current excitation bias, thereby adjusting the static operating point and the electromechanical conversion efficiency of the transducer. In this paper, a theoretical model of the electromagnetic driving force is established, and finite element simulations show that adding a permanent magnet increases the electromechanical efficiency of can-type transducers from less than 1% to 84.43% under low-power conditions. Furthermore, prototypes of underwater can-type electromagnetic transducers with and without a permanent magnet at the core center were fabricated in this study. The experimental results confirmed that the permanent magnet drive technology significantly improves electromechanical conversion efficiency by adding a permanent magnet to the core center.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"302-313"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143007460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Response to \"Comment on 'similar susceptibility to temporary hearing threshold shifts despite different audiograms in harbor porpoises and harbor seals' \" [J. Acoust. Soc. Am. 157, 538-541 (2025)].","authors":"Robin Gransier, Ronald A Kastelein","doi":"10.1121/10.0035453","DOIUrl":"https://doi.org/10.1121/10.0035453","url":null,"abstract":"<p><p>In their Comment, Tougaard et al. [(2025). J. Acoust. Soc. Am. 157, 538-541] question our conclusion that despite their different audiograms, harbor seals (Phoca vitulina) and harbor porpoises (Phocoena phocoena) have similar susceptibility to temporary hearing threshold shift caused by loud sounds, and claim that our selection of data for analysis was biased. In this Response, we clarify our methods and uphold our original conclusions.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"618-621"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143059446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Biological, linguistic, and individual factors govern voice qualitya).","authors":"Jody Kreiman, Yoonjeong Lee","doi":"10.1121/10.0034848","DOIUrl":"https://doi.org/10.1121/10.0034848","url":null,"abstract":"<p><p>Voice quality serves as a rich source of information about speakers, providing listeners with impressions of identity, emotional state, age, sex, reproductive fitness, and other biologically and socially salient characteristics. Understanding how this information is transmitted, accessed, and exploited requires knowledge of the psychoacoustic dimensions along which voices vary, an area that remains largely unexplored. Recent studies of English speakers have shown that two factors related to speaker size and arousal consistently emerge as the most important determinants of quality, regardless of who is speaking. The present findings extend this picture by demonstrating that in four languages that vary fundamental frequency (fo) and/or phonation type contrastively (Korean, Thai, Gujarati, and White Hmong), additional acoustic variability is systematically related to the phonology of the language spoken, and the amount of variability along each dimension is consistent across speaker groups. This study concludes that acoustic voice spaces are structured in a remarkably consistent way: first by biologically driven, evolutionarily grounded factors, second by learned linguistic factors, and finally by variations within a talker over utterances, possibly due to personal style, emotional state, social setting, or other dynamic factors. Implications for models of speaker recognition are also discussed.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"482-492"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143023642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ivo Dobrev, Jongwoo Lim, Namkeun Kim, Johannes Niermann, Christof Röösli, Flurin Pfiffner
{"title":"Exploration of the dynamics of otic capsule and intracochlear pressure: Numerical insights with experimental validation.","authors":"Ivo Dobrev, Jongwoo Lim, Namkeun Kim, Johannes Niermann, Christof Röösli, Flurin Pfiffner","doi":"10.1121/10.0034859","DOIUrl":"https://doi.org/10.1121/10.0034859","url":null,"abstract":"<p><p>The otic capsule and surrounding temporal bone exhibit complex 3D motion influenced by frequency and location of the bone conduction stimulus. The resultant correlation with the intracochlear pressure is not sufficiently understood, thus is the focus of this study, both experimentally and numerically. Experiments were conducted on six temporal bones from three cadaver heads, with BC hearing aid stimulation applied at the mastoid and classical BAHA locations across 0.1-20 kHz. Three-dimensional motions were measured on various skull regions, including the promontory and stapes. Intracochlear pressure was measured using a custom acoustic receiver. The experiment was digitally recreated by a custom finite element model (FEM), based on the LiUHead, with the addition of an auditory periphery. The Young's modulus of the cortical bone domain within the FEM was varied between 4, 8, and 20 GPa. The predicted differential intracochlear pressures aligned with experimental data for most frequencies, and showed that skull deformation, particularly in the otic capsule, depends on skull material properties. Both experimental and FEM results indicated that the otic capsule behaves as a rigid accelerometer, imposing inertial loads on cochlear fluids, even above 7 kHz. Future work should explore the solid-fluid interactions between the otic capsule and cochlear contents.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"554-568"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143052755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sparse loudspeaker array design for wideband frequency-invariant beamforming with multiple targets.","authors":"Keyu Pan, Simiao Chen, Yiming Han, Xuelei Feng, Yong Shen","doi":"10.1121/10.0034626","DOIUrl":"https://doi.org/10.1121/10.0034626","url":null,"abstract":"<p><p>Beamforming technology using loudspeaker arrays is widely used in sound applications, but current sparse array design methods focus on optimizing a single beam for a single target direction, limiting their applicability to multi-channel sound systems. This paper presents a design method for sparse loudspeaker line arrays to generate wideband frequency-invariant beams in multiple target directions. A model based on tapped delay lines is developed and a two-stage design approach is proposed. In the first stage, a compressive sensing framework combined with an extended fast iterative shrinkage-thresholding algorithm is employed to determine the optimal locations of active drivers. In the second stage, the excitations of the active drivers are calculated using the least squares method. Numerical simulations show a 32.3% reduction in the number of drivers compared to that of traditional uniform linear loudspeaker arrays while maintaining frequency-invariant properties. The proposed algorithm exhibits considerable computational efficiency, rendering it highly suitable for addressing large-scale optimization problems and substantially reducing the design time. This design method is applicable to sound projection scenarios and can be extended to other multi-channel beamforming systems to achieve reliable beam performance and reduce system costs.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"369-381"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143007450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Does sound symbolism need sound?: The role of articulatory movement in detecting iconicity between sound and meaninga).","authors":"Mutsumi Imai, Sotaro Kita, Kimi Akita, Noburo Saji, Masato Ohba, Miki Namatame","doi":"10.1121/10.0034832","DOIUrl":"https://doi.org/10.1121/10.0034832","url":null,"abstract":"<p><p>Ever since de Saussure [Course in General Lingustics (Columbia University Press, 1916)], theorists of language have assumed that the relation between form and meaning of words is arbitrary. However, recently, a body of empirical research has established that language is embodied and contains iconicity. Sound symbolism, an intrinsic link language users perceive between word sound and properties of referents, is a representative example of iconicity in language and has offered profound insights into theories of language pertaining to language processing, language acquisition, and evolution. However, on what basis people detect iconicity between sound and meaning has not yet been made clear. One way to address this question is to ask whether one needs to be able to hear sound to detect sound symbolism. Here, it is shown that (1) deaf-and-Hard-of-Hearing (DHH) participants, even those with profound hearing loss, could judge the sound symbolic match between shapes and words at the same level of accuracy as hearing participants do; and (2) restriction of articulatory movements negatively affects DHH individuals' judgments. The results provided support for the articulatory theory of sound symbolism and lead to a possibility that linguistic symbols may have emerged through iconic mappings across different sensory modality-in particular, oral gesture and sensory experience of the world in the case of speech.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"137-148"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142950684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Junpeng Lai, Zihan Liu, Morteza Karimi, Mahdi Farahikia, Weili Cui, Johar Pourghader, Sara Aghazadeh, Changhong Ke, Ronald Miles
{"title":"A small cavity for detecting sound-induced flow.","authors":"Junpeng Lai, Zihan Liu, Morteza Karimi, Mahdi Farahikia, Weili Cui, Johar Pourghader, Sara Aghazadeh, Changhong Ke, Ronald Miles","doi":"10.1121/10.0034788","DOIUrl":"https://doi.org/10.1121/10.0034788","url":null,"abstract":"<p><p>A study is presented of a method for creating an acoustic flow sensor that is generally compatible with current silicon microfabrication processes. An aim of this effort is to obtain a design consisting of a minimal departure from the existing designs employed in mass-produced silicon microphones. Because the primary component in all of these microphones is the cavity behind the pressure-sensing diaphragm, we begin with a study of the acoustic particle velocity within a cavity in a planar surface. The sound within the cavity is caused by the external plane sound wave traveling parallel to the cavity's open surface. It is shown that with suitable dimensions of the cavity, the acoustic particle velocity simultaneously flows inward at one end and outward at the other end of the single open cavity surface. A simple analytical model is presented to estimate the required length and depth of the cavity such that the acoustic particle velocity into and out of the opening is a reasonable approximation to that of a plane traveling sound wave in the free field. Measurements of the acoustic particle velocity into and out of the cavity are in close agreement with both the simple model and a more detailed finite element model. Agreement between two dissimilar modeling approaches and experiments suggests that the dominant features of the system have been accounted for. By redirecting the acoustic particle velocity into and out of the cavity opening rather than the flow being parallel to the plane surface, this configuration greatly facilitates the design and fabrication of structures intended to sense the acoustic flow.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"29-42"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11710855/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142950634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Target an arbitrary probability of response using weighted staircase procedures.","authors":"Eric C Hoover","doi":"10.1121/10.0034861","DOIUrl":"10.1121/10.0034861","url":null,"abstract":"<p><p>Threshold estimation procedures are widely used to measure the stimulus level corresponding to a specified probability of response. The weighted up-and-down procedure, familiar to many due to its use in standard pure-tone audiometry, allows the experimenter to target any probability of response by using different ascending and descending step sizes. Unfortunately, thresholds have a signed mean error that made using weighted staircases inadvisable. The current study evaluated a correction to eliminate the error. Monte Carlo simulations of weighted staircases were used to test the effectiveness of the proposed correction for yes-no and forced-choice tasks with Gaussian and log-Weibull psychometric functions. Results showed that the proposed correction was effective over a wide range of step size magnitudes and ratios with a symmetric psychometric function and less effective when there was asymmetry due to the shape of the function or a high guess or lapse rate. The proposed correction facilitates the use of weighted staircases to target an arbitrary probability of response.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"191-202"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11735050/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142978931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Theoretical modeling and modal analysis of multi-element coupled transducers.","authors":"Hao Sun, Yu Lan, Wei Lu, Tianfang Zhou","doi":"10.1121/10.0034862","DOIUrl":"https://doi.org/10.1121/10.0034862","url":null,"abstract":"<p><p>Low-frequency transducers are considerably smaller than the wavelength. When multiple low-frequency transducers are closely packed, they couple with the surrounding water and form a transducer-water-transducer coupling structure called multi-element coupled transducers (MCT). This study presents a theoretical model of the MCT based on radiation and mutual radiation theory and analyzes it under multiple resonance frequencies and vibration modes. The MCT comprising N single-degree-of-freedom elements possesses N resonance frequencies and corresponding vibration modes, and the resonance frequency varies with the spacing between elements. Additionally, the elements have different vibration amplitudes, with the phases of vibration among the elements manifesting as different combinations of in-phase and anti-phase in different vibration modes. Validation is performed through finite element analysis (FEA) and measurements of binary and ternary coupled transducer prototypes. The prototypes comprise bender transducers closely packed in a vertical configuration. The theoretical predictions agree with the FEA and measurement results. Thus, the theoretical model contributes to enhancing the understanding of acoustic transducer behavior in coupled configurations and optimizing the design of low-frequency acoustic arrays.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"433-444"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143007456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}