JASA express letters: latest articles

The stability of articulatory and acoustic oscillatory signals derived from speech.
IF 1.2
JASA express letters Pub Date : 2025-04-01 DOI: 10.1121/10.0036389
Jessica Campbell, Dani Byrd, Louis Goldstein
Abstract: Articulatory underpinnings of periodicities in the speech signal are unclear beyond a general alternation of vocal tract opening and closing. This study evaluates a modulatory articulatory signal that captures instantaneous change in vocal tract posture and its relation with two acoustic oscillatory signals, comparing stabilities to the progression of vowel and stressed vowel onsets. Modulatory signals can be calculated more efficiently than labeling linguistic events. These signals were more stable in periodicity than acoustic vowel onsets and not different from stressed vowel onsets, suggesting that an articulatory modulation function can provide a useful method for indexing foundational periodicities in speech without tedious annotation.
Volume 5, Issue 4. Journal Article. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12010241/pdf/
Citations: 0
Evaluating synthesized speech intelligibility in noise.
IF 1.2
JASA express letters Pub Date : 2025-04-01 DOI: 10.1121/10.0036397
Ye Yang, Dathan Nguyen, Katherine Chen, Fan-Gang Zeng
Abstract: Humans can modify their speech to improve intelligibility in noisy environments. With the advancement of speech synthesis technology, machines may also synthesize voices that remain highly intelligible in noise condition. This study evaluates both the subjective and objective intelligibility of synthesized speech in speech-shaped noise from three major speech synthesis platforms. It was found that synthesized voices have a similar intelligibility range to human voices, and some synthesized voices were more intelligible than human voices. It was also found that two modern automatic speech recognition systems recognized 10% more words than human listeners.
Volume 5, Issue 4. Journal Article.
Citations: 0
Enhancing speech intelligibility in optical microphone systems through physics-informed data augmentation.
IF 1.2
JASA express letters Pub Date : 2025-04-01 DOI: 10.1121/10.0036356
Jia-Wei Chen, Jia-Hui Li, Yi-Hao Jiang, Yi-Chang Wu, Ying-Hui Lai
Abstract: Laser doppler vibrometers (LDVs) facilitate noncontact speech acquisition; however, they are prone to material-dependent spectral distortions and speckle noise, which degrade intelligibility in noisy environments. This study proposes a data augmentation method that incorporates material-specific and impulse noises to simulate LDV-induced distortions. The proposed approach utilizes a gated convolutional neural network with HiFi-GAN to enhance speech intelligibility across various material and low signal-to-noise ratio (SNR) conditions, achieving a short-time objective intelligibility score of 0.76 at 0 dB SNR. These findings provide valuable insights into optimized augmentation and deep-learning techniques for enhancing LDV-based speech recordings in practical applications.
Volume 5, Issue 4. Journal Article.
Citations: 0
Perception-production link mediated by position in the imitation of Korean nasal stops.
IF 1.2
JASA express letters Pub Date : 2025-03-01 DOI: 10.1121/10.0036057
Jiwon Hwang, Yu-An Lu
Abstract: This study explores how perceptual cues in two positions influence imitation of Korean nasal stops. As a result of initial denasalization, nasality cues are secondary in the initial position but primary in the medial position. Categorization and imitation tasks using CV (consonant-vowel) and VCV (vowel-consonant-vowel) items on a continuum from voiced oral to nasal stops were completed by 32 Korean speakers. Results revealed categorical imitation of nasality medially, whereas imitation was gradient or minimal initially. Furthermore, individuals requiring stronger nasality cues to categorize a nasal sound produced greater nasality in imitation. These findings highlight a perception-production link mediated by positional cue reliance.
Volume 5, Issue 3. Journal Article.
Citations: 0
Spatial grouping as a method to improve personalized head-related transfer function prediction.
IF 1.2
JASA express letters Pub Date : 2025-03-01 DOI: 10.1121/10.0036032
Keng-Wei Chang, Yih-Liang Shen, Tai-Shih Chi
Abstract: The head-related transfer function (HRTF) characterizes the frequency response of the sound traveling path between a specific location and the ear. When it comes to estimating HRTFs by neural network models, angle-specific models greatly outperform global models but demand high computational resources. To balance the computational resource and performance, we propose a method by grouping HRTF data spatially to reduce variance within each subspace. HRTF predicting neural network is then trained for each subspace. Results show the proposed method performs better than global models and angle-specific models by using different grouping strategies at the ipsilateral and contralateral sides.
Volume 5, Issue 3. Journal Article.
Citations: 0
Variation in the production of nasal coarticulation by speaker age and speech style.
IF 1.2
JASA express letters Pub Date : 2025-03-01 DOI: 10.1121/10.0036227
Georgia Zellou, Michelle Cohn
Abstract: This study investigates apparent-time variation in the production of anticipatory nasal coarticulation in California English. Productions of consonant-vowel-nasal words in clear vs casual speech by 58 speakers aged 18-58 (grouped into three generations) were analyzed for degree of coarticulatory vowel nasality. Results reveal an interaction between age and style: the two younger speaker groups produce greater coarticulation (measured as A1-P0) in clear speech, whereas older speakers produce less variable coarticulation across styles. Yet, duration lengthening in clear speech is stable across ages. Thus, age- and style-conditioned changes in produced coarticulation interact as part of change in coarticulation grammars over time.
Volume 5, Issue 3. Journal Article.
Citations: 0
Voice assistant technology continues to underperform on children's speech.
IF 1.2
JASA express letters Pub Date : 2025-03-01 DOI: 10.1121/10.0036052
Holly Bradley, Madeleine E Yu, Elizabeth K Johnson
Abstract: Voice assistant (VA) technology is increasingly part of children's everyday lives. But how well do these systems understand children? No study has asked this with children under 5 years old. Here, two versions of Siri, and one of Alexa, were tested on their ability to transcribe utterances produced by 2-, 3-, and 5-year-olds. Human listeners (mothers and undergraduates) were also tested. Results showed that while Siri's performance on children's speech has improved in recent years, even the newest Siri and Alexa models struggle with children's speech. Human listeners far outperformed VA systems with all ages, especially with the youngest children's speech.
Volume 5, Issue 3. Journal Article.
Citations: 0
Bone mineral density and hydroxyapatite alignment in leg cortical bone influence on ultrasound velocity.
IF 1.2
JASA express letters Pub Date : 2025-03-01 DOI: 10.1121/10.0036082
Shuta Kodama, Hiroshi Mita, Norihisa Tamura, Daisuke Koyama, Mami Matsukawa
Abstract: Bone diagnosis using x-ray techniques, such as computed tomography and dual-energy x-ray absorptiometry, can evaluate bone mineral density (BMD) and microstructure but does not provide elastic properties. This study investigated the ultrasonic properties of racehorse leg cortical bone, focusing on the relationship between wave velocity, BMD, and hydroxyapatite (HAp) crystallite alignment. The results showed a strong correlation between wave velocity and BMD, suggesting that quantitative ultrasound-obtained wave velocity is primarily influenced by BMD, followed by the HAp alignment direction.
Volume 5, Issue 3. Journal Article.
Citations: 0
A method of reference phase velocity selecting for bearing estimation with a horizontal line array in shallow water.
IF 1.2
JASA express letters Pub Date : 2025-03-01 DOI: 10.1121/10.0035934
Dai Liu, Feilong Zhu, Yanjun Zhang, Zhaohui Peng
Abstract: In shallow water environments, choosing an appropriate reference phase velocity for direction-of-arrival estimation with a beamformed underwater horizontal line array is very important. The direction of the maximum beamformer output power will deviate from the true source bearing when a mismatched reference phase velocity was used. This Letter analyzed the intrinsic relationship between the reference phase velocity and normal mode amplitude distribution, source bearing, array aperture, and then proposed a multi-parameter weighted reference phase velocity selection method, which has improved the accuracy of source bearing estimation. Numerical simulation and experimental results validated the effectiveness of this method.
Volume 5, Issue 3. Journal Article.
Citations: 0
Effects of spatial asymmetry and voice-gender differences between talkers on spatial release from masking in normal-hearing listeners.
IF 1.2
JASA express letters Pub Date : 2025-03-01 DOI: 10.1121/10.0036249
Yonghee Oh, Josephine Kinder, Phillip Friggle, Caroline Cuthbertson
Abstract: This study investigated how a listener's spatial release from masking (SRM) performance is affected by spatial asymmetry and voice-gender differences between talkers in multi-talker listening situations. The amounts of SRM were measured with symmetric and asymmetric (toward the right or left) masker configurations in same-gender and different-gender target-masker conditions. The results showed that the SRM was co-varied by talkers' voice-gender differences and spatial asymmetry cues: maximized in the same-gender and asymmetrical target-maskers condition and minimized in the different-gender and symmetrical target-maskers condition. Those findings suggest that the talkers' asymmetry and voice-gender differences could contribute to the variation in SRM independently.
Volume 5, Issue 3. Journal Article.
Citations: 0