Language and SpeechPub Date : 2024-09-01Epub Date: 2023-09-15DOI: 10.1177/00238309231195263
Jiwon Hwang, Yu-An Lu
{"title":"The Effect of Distributional Restrictions in Speech Perception: A Case Study From Korean and Taiwanese Southern Min.","authors":"Jiwon Hwang, Yu-An Lu","doi":"10.1177/00238309231195263","DOIUrl":"10.1177/00238309231195263","url":null,"abstract":"<p><p>In Korean, voiced oral stops can occur intervocalically as allophones of their voiceless lenis counterparts; they can also occur initially as variants of nasal stops as a result of initial denasalization (e.g., /motu/→[<b>b</b>o<b>d</b>u] \"all\"). However, neither [ŋ] nor [ɡ] (the denasalized variant of the velar nasal) is allowed in the initial position due to the phonotactic restriction against initial [ŋ] in Korean. Given the distribution of nasal and voiced stops in Korean, this study draws on the idea of cue informativeness, exploring (a) whether Korean listeners' attention to nasality and voicing cues is based on the distributional characteristics of nasal and voiced stops, and (b) whether their attention can be generalized across different places of articulation without such linguistic experience. In a forced-choice identification experiment, Korean listeners were more likely than Taiwanese listeners to perceive items on the voiced oral-to-nasal stop continua as nasal when they occurred in the initial position than in the intervocalic position, with the exception of velar stops. The results demonstrate that the Korean listeners attended to the nasality cue more reliably in the medial position than in the initial position, since the nasality cue in this position is less informative due to initial denasalization. Two additional forced-choice identification experiments suggested that upon hearing initial velar nasal [ŋ], Korean listeners variably employed different perceptual strategies (i.e., vowel insertion and place change) to repair the phonotactic illegality. These findings provide support for exemplar models of speech perception in which cue attention is specific to the position of a word, and to segments rather than to features.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"744-771"},"PeriodicalIF":1.1,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10591760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Albandary Aldossari, Ryan Andrew Stevenson, Yasaman Rafat
{"title":"An Investigation of Language-Specific and Orthographic Effects in L2 Arabic geminate production by Advanced Japanese- and English-speaking learners.","authors":"Albandary Aldossari, Ryan Andrew Stevenson, Yasaman Rafat","doi":"10.1177/00238309241267876","DOIUrl":"https://doi.org/10.1177/00238309241267876","url":null,"abstract":"<p><p>Research has indicated that second-language learners have difficulty producing geminates accurately. Previous studies have also shown an effect of orthography on second-language speech production. We tested whether the existence of a contrast in the first language phonology for length aids the second-language production of the same contrast. Furthermore, we examined the effect of exposure to orthographic input on geminate consonant production in a cross-script context. We tested the production of Arabic geminate-singleton stop consonants [/bː/-/b/, /tː/-/t/, /dː/-/d/, and /kː/-/k/], a nasal stop consonant /mː/-/m/, and an emphatic stop consonant /tˤː/-/tˤ/, as well as the effect of the diacritic used in Arabic to mark gemination in a delayed imitation task and two reading tasks (ortho-with diacritics and ortho-without diacritics). A comparison of the productions of advanced Japanese-speaking learners, English-speaking learners, and an Arabic control group showed that both learner groups were able to produce Arabic geminate stops; however, the Japanese-speaking learners exhibited an advantage over the English-speaking learners in the auditory-only task and in the presence of diacritics, highlighting the fact that orthographic effects may occur in some cross-script contexts.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241267876"},"PeriodicalIF":1.1,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142114557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Violeta Gómez-Vicente, Gema Esquiva, Carmen Lancho, Kawthar Benzerdjeb, Antonia Angulo Jerez, Eva Ausó
{"title":"Importance of Visual Support Through Lipreading in the Identification of Words in Spanish Language.","authors":"Violeta Gómez-Vicente, Gema Esquiva, Carmen Lancho, Kawthar Benzerdjeb, Antonia Angulo Jerez, Eva Ausó","doi":"10.1177/00238309241270741","DOIUrl":"https://doi.org/10.1177/00238309241270741","url":null,"abstract":"<p><p>We sought to examine the contribution of visual cues, such as lipreading, in the identification of familiar (words) and unfamiliar (phonemes) words in terms of percent accuracy. For that purpose, in this retrospective study, we presented lists of words and phonemes (adult female healthy voice) in auditory (A) and audiovisual (AV) modalities to 65 Spanish normal-hearing male and female listeners classified in four age groups. Our results showed a remarkable benefit of AV information in word and phoneme recognition. Regarding gender, women exhibited better performance than men in both A and AV modalities, although we only found significant differences for words but not for phonemes. Concerning age, significant differences were detected in word recognition in the A modality between the youngest (18-29 years old) and oldest (⩾50 years old) groups only. We conclude visual information enhances word and phoneme recognition and women are more influenced by visual signals than men in AV speech perception. On the contrary, it seems that, overall, age is not a limiting factor for word recognition, with no significant differences observed in the AV modality.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241270741"},"PeriodicalIF":1.1,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142074522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jonathan Barnes, Alejna Brugos, Nanette Veilleux, Stefanie Shattuck-Hufnagel
{"title":"Segmental Influences on the Perception of High Pitch Accent Scaling in American English.","authors":"Jonathan Barnes, Alejna Brugos, Nanette Veilleux, Stefanie Shattuck-Hufnagel","doi":"10.1177/00238309241255319","DOIUrl":"https://doi.org/10.1177/00238309241255319","url":null,"abstract":"<p><p>Researchers investigating a broad array of questions in spoken language prosody routinely base their arguments on measurements taken from the F0 contours of representative speech samples. These analyses, however, frequently involve abstracting F0 contours away from the segmental strings that bear them, potentially overlooking in the process the role played by segmental qualities such as sonority or periodicity in the realization of F0 patterns by speakers and their interpretation by listeners. This paper reports the results of two experiments investigating how perception of F0 contours is affected by the segmental string over which those contours are realized. The first focuses on gaps in F0 contours created by voiceless obstruents such as stops and fricatives, while the second investigates F0 intervals spanning lower-sonority voiced segments, such as nasals and voiced fricatives. While these two scenarios might at first seem unrelated, we argue that listeners treat both with a single mechanism in perception, namely, by reducing (potentially to zero) the amount of weight accorded to those portions of the contour for determination of the speaker's intended F0 scaling level. We present an account of both effects within a unified model of F0 scaling perception called TCoG-F, with discussion of its implications for phonetic and phonological intonation research going forward.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241255319"},"PeriodicalIF":1.1,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141903521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mónica Anna Wagner, Mirjam Broersma, James M McQueen, Roeland van Hout, Kristin Lemhöfer
{"title":"The Case for a Quantitative Approach to the Study of Nonnative Accent Features.","authors":"Mónica Anna Wagner, Mirjam Broersma, James M McQueen, Roeland van Hout, Kristin Lemhöfer","doi":"10.1177/00238309241256653","DOIUrl":"https://doi.org/10.1177/00238309241256653","url":null,"abstract":"<p><p>Research with nonnative speech spans many different linguistic branches and topics. Most studies include one or a few well-known features of a particular accent. However, due to a lack of empirical studies, little is known about how common these features are among nonnative speakers or how uncommon they are among native speakers. Moreover, it remains to be seen whether findings from such studies generalize to lesser-known features. Here, we demonstrate a quantitative approach to study nonnative accent features using Dutch-accented English as an example. By analyzing the phonetic distances between transcriptions of speech samples, this approach can identify the features that best distinguish nonnative from native speech. In addition, we describe a method to test hypotheses about accent features by checking whether the prevalence of the features overall varies between native and nonnative speakers. Furthermore, we include English speakers from the United States and United Kingdom and native Dutch speakers from Belgium and The Netherlands to address the issue of regional accent variability in both the native and target language. We discuss the results concerning three observed features. Overall, the results provide empirical support for some well-known features of Dutch-accented English, but suggest that others may be infrequent among nonnatives or in fact frequent among natives. In addition, the findings reveal potentially new accent features, and factors that may modulate the expression of known features. Our study demonstrates a fruitful approach to study nonnative accent features that has the potential to expand our understanding of the phenomenon of accent.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241256653"},"PeriodicalIF":1.1,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141898936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Aspiring to Aspirate: L2 Acquisition of English Word-Initial /p/ Over 10 Years.","authors":"Murray J Munro, Tracey M Derwing, Kazuya Saito","doi":"10.1177/00238309241264296","DOIUrl":"https://doi.org/10.1177/00238309241264296","url":null,"abstract":"<p><p>This short report describes a longitudinal examination of the acquisition of English-aspirated stops by an initial cohort of 24 adult Slavic-language (Russian, Ukrainian, and Croatian) speakers. All had arrived in Canada with low oral English proficiency, and all were enrolled in the same language instruction program at the outset. Initial bilabial stops in CVCs were recorded at eight testing times: six during the first year of the study, again at year 7, and finally at year 10. Intelligibility was evaluated through a blind listening assessment of the stop productions from the first seven testing times. Voice onset times (VOT) were measured for /p/ from all eight times. Mean /p/ intelligibility improved-mainly during a proposed Window of Maximal Opportunity for L2 speech acquisition-but remained below 100%, even after 7 years. For some speakers, early /p/ productions were minimally aspirated, with VOT increasing over time but remaining intermediate between L1 English and L1 Slavic-Language values at 10 years. However, inter-speaker variability was dramatic, with some speakers showing full intelligibility throughout the study and others showing many unintelligible productions at all times. Individual learning trajectories tended to be non-linear and often non-cumulative. Overall, these findings point to a developmental process that varies considerably from one learner to another. It also demonstrates the serious drawbacks of relying on group means to characterize the process of L2 segmental learning.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241264296"},"PeriodicalIF":1.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141857093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Articulatory and Acoustic Representation of Second-Language French Vowels.","authors":"Madeleine Oakley","doi":"10.1177/00238309241259748","DOIUrl":"https://doi.org/10.1177/00238309241259748","url":null,"abstract":"<p><p>This study examines how L1 English-L2 French learners use L1 articulatory and acoustic categories to produce L2 vowels that are both similar to and different from their L1 vowels. Previous studies examining the relationship between L1 and L2 sound inventories have found that learners reuse L1 phone categories to produce L2 phones that are perceived as similar, but importantly, there is a lack of articulatory data included in these types of studies, which has reinforced the assumption that vowel categories can be solely represented by their acoustic properties. The present study uses ultrasound tongue imaging data and videos of lip rounding in addition to acoustic data to examine how L1 English-L2 French learners produce the French vowels /i y u e ø o/ compared with their English vowels /i u e o/. The results focus on individual paths to category formation to show how learners articulate L2 vowels, and reveal that they tend to reuse L1 tongue body gestures to produce the French vowels /i u e o/, and lip rounding gestures to produce the round vowels /y u o/. This study demonstrates that transfer of articulatory gestures depends on vowel quality and emphasizes the importance of using articulatory data to inform theories of L2 category formation.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241259748"},"PeriodicalIF":1.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141861624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Incidental Learning of Collocations Under Different Input Modes and the Mediating Role of Perceptual Learning Style.","authors":"Xin Yuan, Jing Tang","doi":"10.1177/00238309241266864","DOIUrl":"https://doi.org/10.1177/00238309241266864","url":null,"abstract":"<p><p>This study investigated how input modes (reading vs. listening) and learners' perceptual learning style (visual vs. auditory) affected the incidental learning of collocations. A total of 182 college students were first assigned to either a visual or auditory group based on their performance on a perceptual learning style questionnaire. Each style group was subsequently subdivided into three groups who were exposed to a series of texts containing unfamiliar collocation items under one of the input conditions: written input, aural input, or no input. Results of the study indicated that both written and aural input led to gains in collocational knowledge, and aural input was more effective than written input. Furthermore, the study provided empirical evidence that there was a moderating role of perceptual learning style on incidental collocation learning. The auditory learners under aural input showed the highest rate of collocation learning among all treatment subgroups.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241266864"},"PeriodicalIF":1.1,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141794058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modeling Lexical Tones for Speaker Discrimination.","authors":"Ricky K W Chan, Bruce Xiao Wang","doi":"10.1177/00238309241261702","DOIUrl":"https://doi.org/10.1177/00238309241261702","url":null,"abstract":"<p><p>Fundamental frequency (F0) has been widely studied and used in the context of speaker discrimination and forensic voice comparison casework, but most previous studies focused on long-term F0 statistics. Lexical tone, the linguistically structured and dynamic aspects of F0, has received much less research attention. A main methodological issue lies on how tonal F0 should be parameterized for the best speaker discrimination performance. This paper compares the speaker discriminatory performance of three approaches with lexical tone modeling: discrete cosine transform (DCT), polynomial curve fitting, and quantitative target approximation (qTA). Results show that using parameters based on DCT and polynomials led to similarly promising performance, whereas those based on qTA generally yielded relatively poor performance. Implications modeling surface tonal F0 and the underlying articulatory processes for speaker discrimination are discussed.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241261702"},"PeriodicalIF":1.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141768005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Incorporating Frequency Effects in the Lexical Access of Mandarin Tone 3 Sandhi.","authors":"Feier Gao, Chien-Jer Charles Lin","doi":"10.1177/00238309241260062","DOIUrl":"https://doi.org/10.1177/00238309241260062","url":null,"abstract":"<p><p>Mandarin tone 3 sandhi refers to the phenomenon whereby a tone 3 syllable changes to a tone 2 when followed by another tone 3. This phonological process creates a deviation between the tonal forms realized at morphemic (/tone3-tone3/) and word ([tone2-tone3]) levels, posing questions in terms of how disyllabic tone 3 sandhi words are represented and accessed. The current study conducted three cross-modal lexical decision priming experiments to investigate this issue. Experiment 1 manipulated the frequencies of the initial morpheme and whole word, showing that the higher initial-character frequency against the whole word gives stronger activation to the underlying representation and the lower frequency of the initial character leads to stronger activation of the surface tone. Experiments 2 and 3 operationalized the relative frequency of the initial tone 3 morpheme's realization as a sandhi tone, finding that the competition between the two tonal realizations also influences how T3 sandhi words are accessed. Specifically, the more frequently the T3 morpheme surfaces as a T2 allomorph, the less activated the underlying representation becomes in the mental lexicon. Our results indicate a complex interplay between morpheme, word, and the associated tonal representations in the mental lexicon and that these factors co-determine the lexical access of tone 3 sandhi.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241260062"},"PeriodicalIF":1.1,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141762454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}