Speech Prosody 2022 | Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-146
L. V. Maastricht, M. Hoetjes, Lisette van der Heijden
{"title":"Learning L2 Prosody using Gestures: The Role of Individual Differences related to Musicality","authors":"L. V. Maastricht, M. Hoetjes, Lisette van der Heijden","doi":"10.21437/speechprosody.2022-146","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-146","url":null,"abstract":"The present study aimed to disentangle the influence of gesture type, physical involvement level, and individual differences in learner characteristics, i.e., working memory (WM) capacity and musicality, in determining the effectiveness of L2 lexical stress training. To this end, 60 native speakers of Dutch read aloud Spanish phrases containing cognates, which were counterbalanced for lexical stress position compared to their Dutch counterpart (e.g., ‘piRÁmides’ in Spanish, ‘piraMIdes’ in Dutch). They did so as a pre-test before receiving lexical stress training (T1) and as a post-test both directly after training (T2), and approximately one hour later (T3). Subjects received lexical stress training in one of five conditions varying in gesture type and physical involvement level: audio-visual (AV), AV-beat-perception, AV-beat-production, AV-metaphoric-perception, AV-metaphoric-production. Between T2 and T3, subjects performed a WM capacity and musical aptitude task. The results show that irrespective of training condition subjects significantly improved their L2 lexical stress production from T1 to T2 and T3. Although differences between training conditions were non-significant, there were several significant three-way interactions between WM capacity or musical aptitude and testing time and training condition. 
This underlines the importance of considering task and learner characteristics in determining the gestural benefit in learning L2 prosody.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131502553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Speech Prosody 2022 | Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-129
N. Holliday
{"title":"Kamala Harris, Maya Rudolph and the Prosody of Parody","authors":"N. Holliday","doi":"10.21437/speechprosody.2022-129","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-129","url":null,"abstract":"Despite advances in the studies of both ethnolinguistic and prosodic variation, linguists still know relatively little about how individual speakers may use prosody to construct and perform aspects of their identity in dynamic ways. One novel way to study how individuals employ both personal and ethnolinguistic variation is to examine salient linguistic features that occur both in a natural context and in parodies of that same context. The current study analyzes speech from U.S. Vice President Kamala Harris and actor Maya Rudolph, who frequently parodies Harris on the American television program Saturday Night Live. Using a comparative analysis of data coded in the Autosegmental Metrical Phonology framework using MAE-ToBI conventions, I show that Rudolph closely mirrors several of the unique prosodic patterns employed by Harris, but that Rudolph does not simply mimic her; rather she employs exaggerated versions of Harris’ patterns, especially higher F0 peaks and more phrase-initial falsetto phonation. The results of this study expand our knowledge about how specific idiolectal variants connected to social and ethnic styles are enregistered as part of the public discourse. 
Additionally, it demonstrates the value of examining parodic performance to better understand and contextualize the speech of public figures.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131291581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Anticipatory marking of (non-corrective) contrastive focus by the Initial Rise in French","authors":"Axel Barrault, J. German, Pauline Welby","doi":"10.21437/speechprosody.2022-73","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-73","url":null,"abstract":"This study addresses tonal marking of non-corrective contrastive focus in French. Speakers read aloud sentences composed of two parallel clauses, where the structure of the post-verbal constituent under focus was varied by the presence or absence in the second clause of the final adjective appearing in the first clause (signaling Noun-focus or NP-focus, respectively). This way, we were able to test whether the Initial Rise (i.e., LHi) is a marker of the span of an upcoming non-corrective contrast in the second clause. We posited that French speakers mark contrast tonally in the first and/or second clause. Corroborating previous findings, we found that a faster speech rate is associated with fewer Initial Rises. More importantly, an Initial Rise occurs on the direct object of the first clause more often in narrow focus. Additionally, the height of the Initial Rise peak does not depend on focus structure. These results suggest that anticipatory use of the Initial Rise may signal an upcoming contrast, and they reveal additional complexity in the tonal encoding of the left edge of a contrastively focused constituent.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128128920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Speech Prosody 2022 | Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-116
Chunyu Ge, Wenwei Xu, Wentao Gu, P. Mok
{"title":"An electroglottographic study of phonation types in tones of Suzhou Wu Chinese","authors":"Chunyu Ge, Wenwei Xu, Wentao Gu, P. Mok","doi":"10.21437/speechprosody.2022-116","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-116","url":null,"abstract":"Tonal systems often involve cues other than F0, such as phonation types. This paper investigated the phonation distinction of tones in Suzhou Wu Chinese. Simultaneous electroglottographic (EGG) and audio data of isolated syllables collected from eight speakers aged above 65 were analyzed. Closed quotient (CQ) and peak increase contact (PIC) were measured on the EGG signals. Generalized Additive Mixed Models (GAMMs) were fitted to analyze the time course of CQ and PIC, with tone and speaker’s gender as smoothing terms. CQ and PIC were lower for low register than high register tones, with smaller differences in female than male speakers. The time courses of CQ and PIC also varied with gender. The correlations between EGG and acoustic measurements were also calculated. H1*-H2* and H1*-A1* were more strongly correlated with CQ, whereas the correlations between them and PIC were weak. This paper showed that low register tones in Suzhou Wu were pronounced with breathy voice, which was more prominent at the vowel onset, while the degree of breathiness and its time course differed between females and males. 
The EGG measurements and their correlations with acoustic measurements provide evidence to explicate the evolution of tones in Suzhou Wu.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116863555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Arciuli, Kate Philips, B. Bailey, A. Forndran, Adam T. Vogel, K. Ballard
{"title":"Production of Lexical Stress Matures Late in Typically Developing Children","authors":"J. Arciuli, Kate Philips, B. Bailey, A. Forndran, Adam T. Vogel, K. Ballard","doi":"10.21437/speechprosody.2022-80","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-80","url":null,"abstract":"","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114071963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Speech Prosody 2022 | Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-108
Nadja Schauffler, Fabian Schubö, T. Bernhart, Gunilla Eschenbach, Julia Koch, Sandra Richter, Gabriel Viehhauser, Thang Vu, Lorenz Wesemann, Jonas Kuhn
{"title":"Prosodic realisation of enjambment in recitations of German poetry","authors":"Nadja Schauffler, Fabian Schubö, T. Bernhart, Gunilla Eschenbach, Julia Koch, Sandra Richter, Gabriel Viehhauser, Thang Vu, Lorenz Wesemann, Jonas Kuhn","doi":"10.21437/speechprosody.2022-108","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-108","url":null,"abstract":"A salient feature of poetry is the organisation in lines and stanzas. Versification thus serves as a structural (and aesthetic) layer that either conforms to syntactic units or breaks them up. If line breaks disrupt syntactic units, we speak of enjambment - the line suggests a pause while the syntactic information continues. Since prosodic boundaries typically reflect syntactic boundaries, the question is which information is marked - the line or the syntactic unit. In a preliminary study with one professional speaker of German, we investigate how line breaks are prosodically realised in recitations of twenty poems by Friedrich Hölderlin. We compare cases of enjambment to line breaks without enjambment and look at lengthening of the line-final segment, frequency and duration of pauses and F0 reset across the line break, which are typical cues used for the prosodic marking of phrase boundaries in speech. We found that while pauses and F0 reset are tuned down in cases of enjambment, the line break is still prosodically marked by means of final lengthening. 
This preliminary result supports the idea that a speaker can convey both the syntactic continuity as well as the versification of the poem by strengthening different cues.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114888859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Speech Prosody 2022 | Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-157
Adam A. Bramlett, Seth Wiener
{"title":"jTRACE modeling of L2 Mandarin learners’ spoken word recognition at two time points in learning","authors":"Adam A. Bramlett, Seth Wiener","doi":"10.21437/speechprosody.2022-157","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-157","url":null,"abstract":"This study used the TRACE model of spoken word recognition to simulate adult second language (L2) learners’ spoken word recognition at two time points in learning. The pre-existing architecture of jTRACE with the TRACE-T phonology was used to simulate spoken Mandarin word recognition by adult L2 learners at week 1 and week 15 of structured elementary classroom learning. A modified lexicon with reduced tonal information was used to capture recognition during week 1 of learning. Partially restored tonal information was used to capture the change observed at week 15. jTRACE simulations were validated by comparing the results to eye fixation data taken at week 1 and week 15. The eye-tracking task consisted of viewing four Mandarin words written in pinyin while one of the words was presented auditorily. Roughly half of the trials contained words that were segmentally and tonally contrastive (e.g., gān, chá, pǐ, xiàn). The remaining trials contained a target and competitor that were segmentally identical but tonally contrastive (e.g., mā, má, pěn, gòng). Proportions of looks to the target were calculated and compared to the jTRACE simulations using multiple linear regression. 
The results showed evidence of activation and recognition, thereby corroborating our modeling approach.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121723130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Synchronous speech and semantic incongruity: what do outliers tell us about it?","authors":"Veronica P. Siqueira, B. Medeiros","doi":"10.21437/speechprosody.2022-42","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-42","url":null,"abstract":"Synchronous speech, considered to be an easily performed task, is investigated in two experimental conditions designated as original (OC) and altered (AC), with focus on the outliers’ behavior. The hypothesis raised is that AC, which offers semantic incongruities, would lead individuals to a poorer synchronization performance, i.e., producing greater lag duration. Divided equally into two groups (A and B), 24 dyads were recorded reading two fables in Brazilian Portuguese, in both original and altered conditions. Asynchrony duration was obtained by extracting the lag between vowel onsets, after aligning speakers’ waveforms in each dyad. Considering results related to the entire dataset, speakers are able to synchronize in both conditions (OC and AC). However, a great number of outliers was observed throughout the dataset. Their distribution in AC is significantly different from the distribution in OC, the former showing greater values for both variance and mean. In this exploratory study, one promising explanation for these results will be discussed taking into account aspects such as the outliers’ location throughout the text. 
These initial results prompt further investigation, in order to verify a more accurate relation between the outliers’ duration and the semantic incongruities' place of occurrence.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121797160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Speech Prosody 2022 | Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-103
J. Volín, Radek Skarnitzl
{"title":"The Impact of Prosodic Position on Post-Stress Rise in Three Genres of Czech","authors":"J. Volín, Radek Skarnitzl","doi":"10.21437/speechprosody.2022-103","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-103","url":null,"abstract":"In general phonetics, stressed syllables are described as more prominent due to greater duration, higher intensity, less steep spectral slope and/or higher fundamental frequency. However, there are languages in which lexical stress commonly manifests with a post-stress rise (L*+H). Previous studies dedicated to post-stress rises in Czech were limited in material and/or methodology. Our current study extends the material to sizeable samples of three genres of speech: professional story-telling, poetry reciting and news reading. Over 30,000 syllables were manually labelled in terms of their accent-group status. The main focus of the study was the step between the stressed and the following syllable, but apart from the frequency of occurrence and size of the step, we also examined the influence of the position within a prosodic phrase. The results suggest that the post-stress rise should be considered a typical pitch accent in Czech, but that it does not occur uniformly across the examined genres and prosodic positions.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115051498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"How prosody affects ASR performance in conversational Austrian German","authors":"Saskia Wepner, Barbara Schuppler, G. Kubin","doi":"10.21437/speechprosody.2022-40","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-40","url":null,"abstract":"Currently available Automatic Speech Recognition (ASR) systems achieve good word error rates (WER) for read speech (2–10%), but not for conversational speech (20–40%), a speaking style especially relevant for dialogue systems, as they become more conversational and interactional. Here, we analyse how prosody affects WER in a Kaldi-based speech recognition system for a corpus of conversational Austrian German. This analysis is a step towards improving ASR systems and increasing our knowledge about which aspects are relevant to consider for ASR of conversational speech. For this purpose, we compare a typical language model (LM) with an oracle LM trained on the utterances from the whole corpus, thus knowing each possible N-gram in advance. We find that short, deaccented words have the lowest recognition accuracy, which also cannot be compensated for by the oracle LM. Despite our overall high WERs, the highly prominent words were recognised significantly better. Our findings suggest that reporting global WERs for an ASR system of conversational speech does not predict its usefulness in dialogue systems. 
Given the role of prominent words in carrying meaning and function in conversation, our analysis is relevant for researchers developing automatic speech understanding systems.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122542908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}