{"title":"Advancements of phonetics in the 21st century: Theoretical and empirical issues of spoken word recognition in phonetic research","authors":"Natasha Warner","doi":"10.1016/j.wocn.2023.101275","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101275","url":null,"abstract":"<div><p>How do listeners understand what they are hearing? Humans hearing speech perform spoken word recognition, recognizing what words they are hearing in a speech stream in order to understand the meaning. Phonetics refers to the properties of the speech at a detailed level, particularly below the level of segmental phonemic distinctions. In order to recognize spoken words, listeners have to extract information from the detailed acoustic signal in some way, but theories differ about whether listeners extract phonemes, whole words, or other units, by what mechanism, and they differ on what kinds of information are stored in the lexicon. The process of spoken word recognition can be affected by any number of situations such as the speaker or listener being a non-native of the language or dialect, being a child, having a speech/hearing disability, hearing speech in noise, the speech itself containing variability, or many other situations. Any of these situations can shed light on theoretical questions by giving a fuller picture of how listeners recognize words. This chapter examines what we have learned in these first ∼21 years of the 21st century about how phonetics interacts with spoken word recognition.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"101 ","pages":"Article 101275"},"PeriodicalIF":1.9,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49754243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Stop voicing perception in the societal and heritage language of Spanish-English bilingual preschoolers: The role of age, input quantity and input diversity","authors":"Simona Montanari , Jeremy Steffman , Robert Mayr","doi":"10.1016/j.wocn.2023.101276","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101276","url":null,"abstract":"<div><p>This is the first study to examine stop voicing perception in the societal (English) and heritage language (Spanish) of bilingual preschoolers. The study a) compares bilinguals’ English perception patterns to those of monolinguals; b) it examines how child-internal (age) and external variables (input quantity and input diversity) predict English and Spanish perceptual performance; and c) it compares bilinguals’ perception patterns across languages. Perception was assessed through a forced-choice minimal-pair identification task in which children heard synthesized audio stimuli that varied systematically along a /p-b/ and /t-d/ Voice Onset Time (VOT) continuum and were asked to match them with one of two pictures for each contrast. The results of Bayesian mixed-effects logistic regression analyses indicate that the bilinguals’ category boundary for English stops was impacted by their experience with Spanish, with more short-lag VOT tokens being perceived as voiceless consistent with Spanish VOT. Age solely predicted English perceptual skills, whereas input quantity was the only moderator of Spanish perceptual performance. Finally, the bilingual children showed separate stop voicing contrasts in each language, although perceptual performance was already more mature in English by preschool age. Implications for theories of bilingual speech learning and the role of sociolinguistic variables are discussed.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"101 ","pages":"Article 101276"},"PeriodicalIF":1.9,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49727390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Compensatory effects of foot structure in segmental durations of Soikkola Ingrian disyllables and trisyllables","authors":"N. Kuznetsova, Irina Brodskaya, E. Markus","doi":"10.1016/j.wocn.2023.101246","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101246","url":null,"abstract":"","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"100 1","pages":"101246"},"PeriodicalIF":1.9,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55304165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Natalia Kuznetsova , Irina Brodskaya , Elena Markus
{"title":"Compensatory effects of foot structure in segmental durations of Soikkola Ingrian disyllables and trisyllables","authors":"Natalia Kuznetsova , Irina Brodskaya , Elena Markus","doi":"10.1016/j.wocn.2023.101246","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101246","url":null,"abstract":"<div><p>This acoustic study explores compensatory influences of foot structure on segmental duration and quantity in the foot nuclei of 22 trisyllabic and four disyllabic structures in vanishing Soikkola Ingrian (Finnic). A robust ternary quantity contrast of consonants is confirmed for both disyllables and trisyllables. While in the shortest disyllables the contrast is “pure” (i.e., not significantly reinforced by the durations of other segments), in all trisyllables it is enhanced through the durationally inverse (compensatory) effects in other segments. In this, the situation in trisyllables is closer to that attested in other languages with ternary consonantal quantity than the situation in disyllables. The phonological quantity contrast has been lost from the second syllable vowel of trisyllables, and its duration is now inversely related to the first syllable complexity. In the segments preceding this vowel, all compensatory effects are purely phonetic. Shorter segmental durations and stronger compensatory effects in trisyllables than in disyllables indicate tendencies for both polysegmental and polysyllabic shortening. We discuss a potential relation of observed compensatory effects of shortening and lengthening (a “half-long” vowel) to foot isochrony and metrical stress.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"100 ","pages":"Article 101246"},"PeriodicalIF":1.9,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49756255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Looking within events: Examining internal temporal structure with local relative rate","authors":"Sam Tilsen , Mark Tiede","doi":"10.1016/j.wocn.2023.101264","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101264","url":null,"abstract":"<div><p>This paper describes a method for quantifying temporally local variation in the relative rates of speech signals, based on warping curves obtained from dynamic time warping. Although the use of dynamic time warping for signal alignment is well established in speech science, its use to estimate local rate variation is quite rare. Here we introduce an extension of the local relative rate method that supports the quantification of variability in local relative rate, both within and across a set of events. We show how measures of temporal variation derived from this analysis method can be used to characterize the internal temporal structure of events. In order to achieve this, we first provide an overview of the standard dynamic time warping algorithm. We then introduce the local relative rate measure and describe our extensions, applying them to an articulatory and acoustic dataset of consonant-vowel-consonant syllable productions.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"100 ","pages":"Article 101264"},"PeriodicalIF":1.9,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49756644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"L1 vowel perceptual boundary shift as a result of L2 vowel learning","authors":"Chikako Takahashi","doi":"10.1016/j.wocn.2023.101265","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101265","url":null,"abstract":"<div><p>The current study investigated second language (L2) vowel learning influence on first language (L1) vowel perception. We examined how late L2-English learners’ perception of L1-Japanese vowels is influenced by learning to perceive a new L2-English vowel. The study compared L1/L2 perception task results from 60 late L1-Japanese learners of L2-English with those of monolingual Japanese (N = 21) and English speakers (N = 16). To further test hypotheses put forward in the revised Speech Learning Model (SLM-r: Flege & Bohn, 2021), that L2 input distribution is associated with L1/L2 phonetic learning, information on L2-learner participants’ L2 dominance was gathered. The results showed clear L1 perceptual drift in a subgroup of L2-learner participants who were NOT nativelike in L2 English /i-ɪ/ categorization but were L2 dominant. The results support the claim that L2 input plays an important role in reorganizing the L1 phonetic system. However, they also highlight the importance of separating L2 dominance related factors (e.g., L2 input/use) and L2 perceptual ability in investigating L1-L2 phonetic interaction.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"100 ","pages":"Article 101265"},"PeriodicalIF":1.9,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49766703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Do children better understand adults or themselves? An acoustic and perceptual study of the complex sibilant system of Polish","authors":"Marzena Żygis, D. Pape, M. Jaskula, L. Koenig","doi":"10.1016/j.wocn.2023.101227","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101227","url":null,"abstract":"","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"100 1","pages":"101227"},"PeriodicalIF":1.9,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55304154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiaojuan Zhang, Bing Cheng, Yu Zou, Xujia Li, Yang Zhang
{"title":"Cognitive factors in nonnative phonetic learning: Impacts of inhibitory control and working memory on the benefits and costs of talker variability","authors":"Xiaojuan Zhang, Bing Cheng, Yu Zou, Xujia Li, Yang Zhang","doi":"10.1016/j.wocn.2023.101266","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101266","url":null,"abstract":"","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"100 1","pages":"101266"},"PeriodicalIF":1.9,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55304199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"L1 vowel perceptual boundary shift as a result of L2 vowel learning","authors":"C. Takahashi","doi":"10.1016/j.wocn.2023.101265","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101265","url":null,"abstract":"","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"100 1","pages":"101265"},"PeriodicalIF":1.9,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55304187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An acoustic study of rhythmic synchronization with natural English speech","authors":"Tamara Rathcke , Chia-Yuan Lin","doi":"10.1016/j.wocn.2023.101263","DOIUrl":"https://doi.org/10.1016/j.wocn.2023.101263","url":null,"abstract":"<div><p>Sensorimotor synchronization as a means of studying rhythmic perception-action coupling has been extensively researched across a large number of temporally regular structures including music while little is known about synchronization with speech. The present study fills this gap by applying a sensorimotor synchronization paradigm to natural speech and studying acoustic landmarks that may serve as perceptual anchors of rhythmic movement in spoken sentences. Five rhythmically relevant types of acoustic landmarks were identified in twenty sentences of English containing syllables with vocalic and non-vocalic nuclei. The landmarks were either manually defined or algorithm-generated and included nucleus onsets, peaks and onsets of inter-syllabic and inter-stress timescales, moments of the fastest energy change (approximating the P-center location), and timepoints of combined pitch and periodic power. Sensorimotor synchronization data from 32 native English participants were examined with regards to the location of an increased synchronization activity in the proximity of the predefined landmarks. The results demonstrated that participants synchronized with syllable-size units regardless of the type of syllable nucleus (vowel or consonant) and that their taps were consistently timed close to nucleus onsets. Hereby, the manually defined nucleus onsets predicted synchronization peaks as well as the algorithm-generated moments of the fastest energy change around nucleus onsets (i.e., a model of the P-center location) did. In contrast, other landmarks did not constitute a stable acoustic anchor of sensorimotor synchronization with English speech. The synchronization performance was not influenced by either acoustic F0-information or by phonological tune specifications. These findings provide new evidence for the proposals that rhythmic attention in natural speech may be locked on to fast spectral changes within a syllable as the smallest structuring unit of prosodic hierarchy.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"100 ","pages":"Article 101263"},"PeriodicalIF":1.9,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49756482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}