{"title":"听觉外围系统的频谱表征能否解释在噪声掩盖下元音产生的变化?","authors":"Yasufumi Uezu, Masato Akagi, Masashi Unoki","doi":"10.1121/10.0039109","DOIUrl":null,"url":null,"abstract":"<p><p>This study examines how auditory spectral representations in the peripheral auditory system explain changes in vowel production under noisy conditions, especially when lower formants (F1 and F2) are masked. Ten adult male Japanese speakers produced sustained vowels /a/ and /i/ under quiet and noisy conditions involving three noise types (broadband, low-pass, and high-pass) at 75 and 85 dB. We analyzed vocal intensity and the amplitudes and frequencies of the F1 and F2. Auditory spectral representations, simulated using a loudness model, were used to estimate excitation patterns in the auditory periphery. Most noise conditions significantly increased vocal intensity and the amplitude of both formants. F1 frequency consistently shifted upward under high-intensity broadband noise, while F2 shifts depended on vowel and noise type, shifting upward for /a/ and downward for /i/. Some patterns could not be explained by power spectra alone. Instead, they were better accounted for by frequency-specific masking effects, reflected in overlapping excitation patterns in the auditory spectral representation. These overlaps indicated reduced self-audibility in specific frequency bands, triggering compensatory adjustments. The findings highlight how auditory masking influences speech production, supporting a perceptually grounded model of auditory-motor control in noisy environments.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"158 3","pages":"1737-1750"},"PeriodicalIF":2.3000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Can the spectral representation of the auditory peripheral system explain vowel production changes under noises masking lower formants?\",\"authors\":\"Yasufumi Uezu, Masato Akagi, Masashi Unoki\",\"doi\":\"10.1121/10.0039109\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This study examines how auditory spectral representations in the peripheral auditory system explain changes in vowel production under noisy conditions, especially when lower formants (F1 and F2) are masked. Ten adult male Japanese speakers produced sustained vowels /a/ and /i/ under quiet and noisy conditions involving three noise types (broadband, low-pass, and high-pass) at 75 and 85 dB. We analyzed vocal intensity and the amplitudes and frequencies of the F1 and F2. Auditory spectral representations, simulated using a loudness model, were used to estimate excitation patterns in the auditory periphery. Most noise conditions significantly increased vocal intensity and the amplitude of both formants. F1 frequency consistently shifted upward under high-intensity broadband noise, while F2 shifts depended on vowel and noise type, shifting upward for /a/ and downward for /i/. Some patterns could not be explained by power spectra alone. Instead, they were better accounted for by frequency-specific masking effects, reflected in overlapping excitation patterns in the auditory spectral representation. These overlaps indicated reduced self-audibility in specific frequency bands, triggering compensatory adjustments. The findings highlight how auditory masking influences speech production, supporting a perceptually grounded model of auditory-motor control in noisy environments.</p>\",\"PeriodicalId\":17168,\"journal\":{\"name\":\"Journal of the Acoustical Society of America\",\"volume\":\"158 3\",\"pages\":\"1737-1750\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Acoustical Society of America\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1121/10.0039109\",\"RegionNum\":2,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Acoustical Society of America","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1121/10.0039109","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
Can the spectral representation of the auditory peripheral system explain vowel production changes under noises masking lower formants?
This study examines how auditory spectral representations in the peripheral auditory system explain changes in vowel production under noisy conditions, especially when lower formants (F1 and F2) are masked. Ten adult male Japanese speakers produced sustained vowels /a/ and /i/ under quiet and noisy conditions involving three noise types (broadband, low-pass, and high-pass) at 75 and 85 dB. We analyzed vocal intensity and the amplitudes and frequencies of the F1 and F2. Auditory spectral representations, simulated using a loudness model, were used to estimate excitation patterns in the auditory periphery. Most noise conditions significantly increased vocal intensity and the amplitude of both formants. F1 frequency consistently shifted upward under high-intensity broadband noise, while F2 shifts depended on vowel and noise type, shifting upward for /a/ and downward for /i/. Some patterns could not be explained by power spectra alone. Instead, they were better accounted for by frequency-specific masking effects, reflected in overlapping excitation patterns in the auditory spectral representation. These overlaps indicated reduced self-audibility in specific frequency bands, triggering compensatory adjustments. The findings highlight how auditory masking influences speech production, supporting a perceptually grounded model of auditory-motor control in noisy environments.
期刊介绍:
Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.