{"title":"The effect of rhythm on inter-gestural coupling of onset and vowel gestures and predictive timing in stuttering","authors":"Mona Franke , Simone Falk , Nicole Benker , Phil Hoole","doi":"10.1016/j.wocn.2025.101432","DOIUrl":"10.1016/j.wocn.2025.101432","url":null,"abstract":"<div><div>In this study we investigate articulatory timing in fluent speech production in persons who stutter (PWS) and persons who do not stutter (PWNS) by focusing on consonant–vowel (CV)-timing, which refers to the coupling of onset consonant and vowel gestures, as well as on predictive timing, which describes the synchronization of the speech onset to a rhythmic event. These two timing mechanisms are particularly interesting to investigate in relation to stuttering, given that CV-timing is especially challenging for PWS and that they exhibit differences in predictive timing related to speech-motor and manual-motor tasks, suggesting that disturbances in inter-gestural coordination and auditory-motor integration may contribute to stuttering. To shed further light on this, we examine CV-timing and predictive timing under different rhythmic conditions.</div><div>Twenty German-speaking adults (10 PWS and 10 PWNS) were recorded using electromagnetic articulography (EMA). Participants produced target words that started with a bilabial onset, followed by a vowel (/a/, /o/, or /u/) and were embedded in a carrier phrase in four different conditions: Unpaced (speaking), Tapping (speaking while concurrently tapping), Metronome (synchronizing speech to a metronome), and Metronome+Tapping (speaking to a metronome while concurrently tapping).</div><div>We found evidence for both CV-timing and predictive timing differences between PWS and PWNS. Our results suggest that in general, PWS time CV gestures closer together. However, CV-timing differences were linked to condition in an unexpected way. As to predictive timing, PWS initiated their speech later to a metronome beat than PWNS but they did not differ when timing speech to their own finger tapping, indicating that motor-pacing may stabilize the speech motor system of PWS. In the Metronome+Tapping condition, the groups appeared to rely on different rhythmic cues. While PWNS timed their speech more towards the metronome beat, PWS synchronized their speech onset closer to the finger tap. We discuss that this difference could result from differences in CV-timing. Furthermore, the potential for future research on the interplay of non-verbal and verbal motor systems and the possible benefit for the stuttering population is discussed.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"112 ","pages":"Article 101432"},"PeriodicalIF":1.9,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144679165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of PhoneticsPub Date : 2025-09-01Epub Date: 2025-07-11DOI: 10.1016/j.wocn.2025.101429
Anna Balas , Krzysztof Hwaszcz , Kamil Kaźmierski , Magdalena Wrembel
{"title":"Perceived cross-linguistic similarity of retroflexes in trilingual, bilingual and native listener groups","authors":"Anna Balas , Krzysztof Hwaszcz , Kamil Kaźmierski , Magdalena Wrembel","doi":"10.1016/j.wocn.2025.101429","DOIUrl":"10.1016/j.wocn.2025.101429","url":null,"abstract":"<div><div>This paper investigates the perceived cross-linguistic similarity of retroflexes from a broad multilingual perspective by employing trilingual and bilingual learners and native users as three distinct listener groups. Previous research has demonstrated that L2 learners rely on their L1 in non-native speech perception. However, no study has examined how L3 learners perceive differences between retroflex sounds in their L1, L2, and L3. In a series of three parallel studies, we examined cross-linguistic similarity of Norwegian retroflexes and similar retroflex and non-retroflex sounds by trilingual (L1 Polish, L2 English and L3 Norwegian), bilingual (L1 Polish, L2 English) and Norwegian control (L1 Norwegian, L2 English) listeners. The listeners assessed similarity between the Norwegian and Polish or English sounds primarily based on the place and manner of articulation rather than retroflexion. The results, where condition specifies the presence or absence of agreement in terms of retroflexion and place/manner of articulation, demonstrated that all the two-way interactions: condition:language, condition:group, language:group and the three-way interaction were significant. The study revealed that experience with a given language did not influence similarity ratings in a wholesale manner but rather in a precise manner related to the presence or absence of retroflexion. The results also showed that the perceived cross-linguistic similarity by multilinguals was gradient in nature. The study provides new insights into research on the perception of retroflexes and multilingual perception by participants differing in the amount of experience with the languages of the stimuli: from L1 controls through L2 and L3 learners to naïve listeners.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"112 ","pages":"Article 101429"},"PeriodicalIF":1.9,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144605211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of PhoneticsPub Date : 2025-09-01Epub Date: 2025-07-10DOI: 10.1016/j.wocn.2025.101431
Jeanne Brown , Morgan Sonderegger
{"title":"A sociophonetic study of creaky voice across language, gender and age in Canadian English-French bilinguals","authors":"Jeanne Brown , Morgan Sonderegger","doi":"10.1016/j.wocn.2025.101431","DOIUrl":"10.1016/j.wocn.2025.101431","url":null,"abstract":"<div><div>This paper examines the acoustic correlates of creaky voice across language, gender and year of birth to investigate 1) the reliability of cross-linguistic differences in voice quality, 2) the direction and extent of gender differences with respect to creaky voice, and 3) the existence of an ongoing sound change targeting voice quality. Spontaneous speech from 49 Canadian English-French bilingual speakers was collected from publicly available online data sources. This corpus was processed and a range of acoustic measures of voice quality extracted using an automated pipeline with manual checks. Results do not show strong nor consistent evidence for cross-linguistic differences in creak. Regarding gender, men’s voices are unequivocally creakier, indicated by more unreliable f0 tracks, lower H1*–H2*, lower CPP and lower HNR < 500 Hz. As for age, results generally show more creak for older speakers, CPP and HNR < 500 Hz values increasing with YOB while other acoustic measures show no significant differences, suggesting that these effects are more likely due to vocal aging than sound change in progress. Contrary to popular perception and recent work claiming that young women are leaders in creaky voice use, this study finds that acoustic correlates of creak show the exact opposite: men’s voices are creakier and if anything, younger speakers are less creaky. Possible reasons for this discrepancy, reviewing recent perceptual work on creaky voice, are discussed.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"112 ","pages":"Article 101431"},"PeriodicalIF":1.9,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144596614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of PhoneticsPub Date : 2025-09-01Epub Date: 2025-07-12DOI: 10.1016/j.wocn.2025.101428
Yuyu Zeng , Chang Wang , Jie Zhang
{"title":"Cascading activation in spoken word production drives incomplete neutralization: An internet-based study of Mandarin 3rd tone sandhi","authors":"Yuyu Zeng , Chang Wang , Jie Zhang","doi":"10.1016/j.wocn.2025.101428","DOIUrl":"10.1016/j.wocn.2025.101428","url":null,"abstract":"<div><div>Incomplete neutralization occurs when two underlying contrastive sounds are phonologically neutralized but remain phonetically distinct (e.g., “latter” and “ladder” become homophonous when the intervocalic stops are flapped in American English). Its proper understanding is foundational to phonology and speech production. Using the incomplete neutralization of the Mandarin 3rd tone sandhi as a test case (T3 + T3 → T2 + T3), we confirmed the presence of this incomplete neutralization with generalized additive modeling (GAMM) and growth curve analysis (GCA). Crucially, we found that the two tones (T2 and T3) became more neutralized when speakers were additionally required to perform a concurrent verbal working memory task while speaking; similar patterns were found when pseudowords were tested, although the overall effects were weaker. Since the concurrent verbal working memory task is expected to add processing load and decrease cascading activation in the spoken word production process, our results suggest that cascading activation, which permits upstream distinctions to surface in downstream acoustics, drives incomplete neutralization. Our study shows how embracing cascading activation can inform the long-standing debate between discrete vs. exemplar representations/operations surrounding incomplete neutralization. How cascading activation is compatible with the core assumptions of generative phonology is also discussed.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"112 ","pages":"Article 101428"},"PeriodicalIF":1.9,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144605212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of PhoneticsPub Date : 2025-09-01Epub Date: 2025-07-15DOI: 10.1016/j.wocn.2025.101433
Bihua Chen , Isabelle Darcy
{"title":"Effects of sentential context on nonnative recognition of reduced speech: Does meaning explain it all?","authors":"Bihua Chen , Isabelle Darcy","doi":"10.1016/j.wocn.2025.101433","DOIUrl":"10.1016/j.wocn.2025.101433","url":null,"abstract":"<div><div>In casual speech, reduction of segments or even syllables is common. Native (L1) listeners recover these reduced forms by recruiting not only semantic and syntactic but also fine-grained acoustic cues in the surrounding utterance. Whether second-language (L2) listeners exploit the same constellation of cues is still poorly understood. We therefore compared 21 L1 English listeners and 21 Mandarin learners of English as they identified reduced targets (e.g., /tuɪnə/ ‘too into’) presented in one of three contexts: Isolation, Textual (orthographic context only), and Auditory (orthography plus the original carrier sentence). Accuracy patterns revealed a graded facilitation hierarchy. For L2 listeners, semantic-syntactic information alone (Textual) boosted recognition relative to Isolation, and adding acoustic context produced a further significant gain. Nevertheless, both effects were smaller for L2 than for L1 listeners, indicating less effective contextual integration in the L2 processing mechanism. The findings refine accounts of reduced-speech perception by showing that L2 listeners can harness acoustic context, but less efficiently than L1 listeners.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"112 ","pages":"Article 101433"},"PeriodicalIF":1.9,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144632575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of PhoneticsPub Date : 2025-07-01Epub Date: 2025-05-09DOI: 10.1016/j.wocn.2025.101415
Morgan Sonderegger , Márton Sóskuthy
{"title":"Advancements of phonetics in the 21st century: Quantitative data analysis","authors":"Morgan Sonderegger , Márton Sóskuthy","doi":"10.1016/j.wocn.2025.101415","DOIUrl":"10.1016/j.wocn.2025.101415","url":null,"abstract":"<div><div>Phonetic research in the 21st century has relied heavily on quantitative analysis. This article reviews the evolution of common practices and the emergence of newer techniques. Using a detailed literature survey, we show that most work follows a mainstream, which has shifted from ANOVAs to mixed-effects regression models over time. Alongside this mainstream, we highlight the increasing use of a diverse methodological toolbox, especially Bayesian methods and dynamic methods, for which we provide comprehensive reviews. Bayesian methods, as well as frequentist methods beyond linear and logistic regression, offer flexibility in model specification, interpretation, and incorporation of prior knowledge. Dynamic methods, such as GAMs and functional data analysis, capture non-linear patterns in acoustic and articulatory data. Machine learning techniques, such as random forests, expand the questions and types of data phoneticians can analyze. We also discuss the growing importance of open science practices promoting replicability and transparency. We argue that the future lies in a diverse methodological toolbox, with techniques chosen based on research questions and data structure.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"111 ","pages":"Article 101415"},"PeriodicalIF":1.9,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143923730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of PhoneticsPub Date : 2025-07-01Epub Date: 2025-06-18DOI: 10.1016/j.wocn.2025.101427
Michael C. Stern, Jason A. Shaw
{"title":"Nonlinear second-order dynamics describe labial constriction trajectories across languages and contexts","authors":"Michael C. Stern, Jason A. Shaw","doi":"10.1016/j.wocn.2025.101427","DOIUrl":"10.1016/j.wocn.2025.101427","url":null,"abstract":"<div><div>We investigate the dynamics of labial constriction trajectories during the production of /b/ and /m/ in English and Mandarin in two prosodic contexts. We find that, across languages and contexts, the ratio of instantaneous displacement to instantaneous velocity generally follows an exponential decay curve from movement onset to movement offset. We formalize this empirical discovery in a differential equation and, in combination with an assumption of point attractor dynamics, derive a nonlinear second-order dynamical system describing labial constriction trajectories. The equation has only two parameters, <span><math><mrow><mi>T</mi></mrow></math></span> and <span><math><mrow><mi>r</mi></mrow></math></span>. <span><math><mrow><mi>T</mi></mrow></math></span> corresponds to the target state and <span><math><mrow><mi>r</mi></mrow></math></span> corresponds to movement rapidity. Thus, each of the parameters corresponds to a phonetically relevant dimension of control. Nonlinear regression demonstrates that the model provides excellent fits to individual movement trajectories. Moreover, trajectories simulated from the model qualitatively match empirical trajectories, and capture key kinematic variables like duration and peak velocity. The model constitutes a proposal for the dynamics of individual articulatory movements, and thus offers a novel foundation from which to understand additional influences on articulatory kinematics like prosody, inter-movement coordination, and stochastic noise.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"111 ","pages":"Article 101427"},"PeriodicalIF":1.9,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144307382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of PhoneticsPub Date : 2025-07-01Epub Date: 2025-05-22DOI: 10.1016/j.wocn.2025.101418
Ji Young Kim
{"title":"Relative importance of stress correlates in native listeners’ identification of Spanish lexical stress produced by monolingual and bilingual speakers","authors":"Ji Young Kim","doi":"10.1016/j.wocn.2025.101418","DOIUrl":"10.1016/j.wocn.2025.101418","url":null,"abstract":"<div><div>Spanish has many minimal stress pairs, and lexical stress in Spanish is marked primarily via suprasegmental cues. Thus, sensitivity to suprasegmental information is crucial for spoken-word identification in Spanish. Using stimuli produced by speakers of Mexican Spanish with varying language learning experience (i.e., monolingual speakers, heritage speakers, L2 learners), this study examines native listeners’ identification of Spanish lexical stress under enhanced variability in phonetic cues. Our data demonstrate that listeners exploit various stress correlates in the speech signal and assign different weights to them, which is context-specific; when there is a pitch accent, native listeners mainly attend to f0-related cues, whereas in the absence of a pitch accent, intensity plays a stronger role. Our data also show that clustering based on stress correlates is not consistent with the predetermined monolingual-heritage-L2 group division, which indicates that language learning experience alone is not sufficient to explain how Spanish speakers mark stress. This study highlights the importance of incorporating variable speech data in speech perception research and adopting a data-driven, individual-centered approach to speaker grouping in cross-sectional studies.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"111 ","pages":"Article 101418"},"PeriodicalIF":1.9,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144115751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of PhoneticsPub Date : 2025-07-01Epub Date: 2025-05-10DOI: 10.1016/j.wocn.2025.101417
Quentin Zhen Qin , Rui Jin , Ruofan Wu
{"title":"The role of prior knowledge in second-language learners’ overnight consolidation of Cantonese tones","authors":"Quentin Zhen Qin , Rui Jin , Ruofan Wu","doi":"10.1016/j.wocn.2025.101417","DOIUrl":"10.1016/j.wocn.2025.101417","url":null,"abstract":"<div><div>This study examines the role of prior (tonal) knowledge in memory consolidation of non-native tones after an overnight sleep. While memory consolidation is beneficial in learning new sounds in a second language, only new linguistic information consistent with the existing knowledge is often prioritized for consolidation. What remains unclear from the research is whether prior tonal knowledge from a native language (i.e., pitch contour signaling the Mandarin contour-tone system) influences an overnight consolidation of tone learning. The study adopts an overnight design, using Cantonese contour and level tones contrasting in pitch contour and height, for two perceptual learning experiments conducted separately on Mandarin and English-speaking novice learners of Cantonese. The first experiment found that Mandarin-speaking participants showed a stronger effect of consolidation in novel words contrasting in contour tones than in level tones, thanks to their prior knowledge of contour tones. The consolidation effect was predicted by rough estimates of deep-sleep length. Without prior knowledge of tones, English-speaking L2 learners in the second experiment showed an (unexpected) offline improvement for both contour and level tones. Overall, the findings suggest a preferential effect on overnight consolidation of contour tones when the cues contrasting L2-Cantonese tones are consistent with L1-Mandarin prior knowledge.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"111 ","pages":"Article 101417"},"PeriodicalIF":1.9,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143931342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of PhoneticsPub Date : 2025-07-01Epub Date: 2025-06-14DOI: 10.1016/j.wocn.2025.101425
Elizabeth K. Johnson , Katherine S. White
{"title":"Advancements in phonetics in the 21st century: Infant speech development","authors":"Elizabeth K. Johnson , Katherine S. White","doi":"10.1016/j.wocn.2025.101425","DOIUrl":"10.1016/j.wocn.2025.101425","url":null,"abstract":"<div><div>Infant speech perception emerged as a field late in the 20th century. Early work focused on defining the initial state, and documenting the timecourse of changes in speech perception over the first year of life. At the turn of the century, attention shifted from studying <em>when</em> children became attuned to their native language, to asking <em>how</em> children achieved this transformation. Statistical learning became the dominant mechanism to explain language development. But, as researchers pushed the bounds of statistical learning, different questions took center stage: given the complexity of spoken language, how do infants determine which regularities to track? And are the patterns infants track influenced by their unique language learning environment? Inspired by these questions, researchers have shifted to studying acquisition across more diverse contexts, and to using dense corpora and big data approaches to examine how individual differences in children’s input relate to speech perception in the lab. In this paper, we first review this progression, summarizing how the field has arrived at the current state of the art. We then argue that the time is ripe for the development of new theoretical approaches, and sketch out the loose contours of SLED, a new 21st-century proposal that emphasizes the role of sociophonetic variation and the richness of the speech signal in early development. With advanced tools in hand and data from a wide variety of learning contexts increasingly available, we are excited to see how the field will evolve over the next 25 years.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"111 ","pages":"Article 101425"},"PeriodicalIF":1.9,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144289030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}