{"title":"Effect of speaker variability on tone perception for L2 learners: Perspectives of machine learning.","authors":"Hui Zhang, Zizhu Wang, Weitong Liu","doi":"10.1121/10.0037073","DOIUrl":null,"url":null,"abstract":"<p><p>Mandarin tones 2 and 3 share similar acoustic characteristics, posing a challenge for L2 learners to distinguish. This difficulty is further compounded by both inter- and intra-speaker variability. This study employed machine learning methods to examine the effectiveness of eight acoustic cues for tone 2 and tone 3 classification in speech that were produced by 20 speakers under normal and loud speaking modes (experiment 1). Additionally, we compared the perception between native listeners and medium-to-advanced Thai L2 learners of Mandarin, including perceptual accuracy, cue-weighting strategies, and perceptual space (experiment 2). Results of experiment 1 show that temporal cues are more effective than height-related cues for tone 2 and tone 3 classifications in speech with inter- and intra- speaker variability. In experiment 2, medium-to-advanced Thai L2 learners produced more confusions than native listeners. The ranking of perceptual acoustic cues is generally similar to native listeners, suggesting that these L2 learners have generally mastered the correct cue-weighting strategy for distinguishing between tones 2 and 3. Additional analyses showed that the confusion stems from learners' perceptual biases. Specifically, learners allocate narrower space for tone 3 category than native listeners, with atypical tone 3 exemplars being misidentified as tone 2.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"158 1","pages":"391-406"},"PeriodicalIF":2.1000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Acoustical Society of America","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1121/10.0037073","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Mandarin tones 2 and 3 share similar acoustic characteristics, posing a challenge for L2 learners to distinguish. This difficulty is further compounded by both inter- and intra-speaker variability. This study employed machine learning methods to examine the effectiveness of eight acoustic cues for tone 2 and tone 3 classification in speech that were produced by 20 speakers under normal and loud speaking modes (experiment 1). Additionally, we compared the perception between native listeners and medium-to-advanced Thai L2 learners of Mandarin, including perceptual accuracy, cue-weighting strategies, and perceptual space (experiment 2). Results of experiment 1 show that temporal cues are more effective than height-related cues for tone 2 and tone 3 classifications in speech with inter- and intra- speaker variability. In experiment 2, medium-to-advanced Thai L2 learners produced more confusions than native listeners. The ranking of perceptual acoustic cues is generally similar to native listeners, suggesting that these L2 learners have generally mastered the correct cue-weighting strategy for distinguishing between tones 2 and 3. Additional analyses showed that the confusion stems from learners' perceptual biases. Specifically, learners allocate narrower space for tone 3 category than native listeners, with atypical tone 3 exemplars being misidentified as tone 2.
期刊介绍:
Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.