Comparisons of Mandarin on-focus expansion and post-focus compression between native speakers and L2 learners: Production and machine learning classification
Jing Wu , Jun Liu , Ting Wang , Sunghye Cho , Yong-cheol Lee
{"title":"Comparisons of Mandarin on-focus expansion and post-focus compression between native speakers and L2 learners: Production and machine learning classification","authors":"Jing Wu , Jun Liu , Ting Wang , Sunghye Cho , Yong-cheol Lee","doi":"10.1016/j.specom.2025.103280","DOIUrl":null,"url":null,"abstract":"<div><div>Korean and Mandarin are reported to have on-focus expansion and post-focus compression in marking prosodic focus. It is not clear whether Korean L2 learners of Mandarin benefit from this prosodic similarity in the production of focused tones or encounter difficulty due to the interaction between tone and intonation in a tonal language. This study examined the prosodic focus of Korean L2 learners of Mandarin through a production experiment, followed by the development of a machine learning classification to automatically detect learners’ production of focused elements. Learners were divided into two groups according to proficiency level (advanced and intermediate) and were directly compared with Mandarin native speakers. Production results showed that intermediate-level speakers did not show any systemic modulations for focus marking. Although the advanced-level speakers performed better than the intermediate group, their prosodic effects of focus were significantly different from those of native speakers in both focus and post-focus positions. The machine learning classification of focused elements reflected clear focus-cueing differences among the three groups. The accuracy rate was about 86 % for the native speakers, 49 % for the advanced learners, and about 34 % for the intermediate learners. The results of this study suggest that on-focus expansion and post-focus compression are not automatically transferred across languages, even when those languages share similar acoustic correlates of prosodic focus. This study also underscores that the difficulty in acquiring the prosodic structure of a tone language lies mainly in mastering tone acquisition, which impacts non-tonal language learners, leading to ineffective performance of on-focus expansion and post-focus compression.</div></div>","PeriodicalId":49485,"journal":{"name":"Speech Communication","volume":"173 ","pages":"Article 103280"},"PeriodicalIF":3.0000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Speech Communication","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167639325000950","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Korean and Mandarin are reported to have on-focus expansion and post-focus compression in marking prosodic focus. It is not clear whether Korean L2 learners of Mandarin benefit from this prosodic similarity in the production of focused tones or encounter difficulty due to the interaction between tone and intonation in a tonal language. This study examined the prosodic focus of Korean L2 learners of Mandarin through a production experiment, followed by the development of a machine learning classification to automatically detect learners’ production of focused elements. Learners were divided into two groups according to proficiency level (advanced and intermediate) and were directly compared with Mandarin native speakers. Production results showed that intermediate-level speakers did not show any systemic modulations for focus marking. Although the advanced-level speakers performed better than the intermediate group, their prosodic effects of focus were significantly different from those of native speakers in both focus and post-focus positions. The machine learning classification of focused elements reflected clear focus-cueing differences among the three groups. The accuracy rate was about 86 % for the native speakers, 49 % for the advanced learners, and about 34 % for the intermediate learners. The results of this study suggest that on-focus expansion and post-focus compression are not automatically transferred across languages, even when those languages share similar acoustic correlates of prosodic focus. This study also underscores that the difficulty in acquiring the prosodic structure of a tone language lies mainly in mastering tone acquisition, which impacts non-tonal language learners, leading to ineffective performance of on-focus expansion and post-focus compression.
期刊介绍:
Speech Communication is an interdisciplinary journal whose primary objective is to fulfil the need for the rapid dissemination and thorough discussion of basic and applied research results.
The journal''s primary objectives are:
• to present a forum for the advancement of human and human-machine speech communication science;
• to stimulate cross-fertilization between different fields of this domain;
• to contribute towards the rapid and wide diffusion of scientifically sound contributions in this domain.