{"title":"Variability in hesitations in Punjabi semi-spontaneous narrative speech: An automatic clustering based analysis","authors":"Farhat Jabeen, P. Wagner","doi":"10.21437/diss.2023-15","DOIUrl":null,"url":null,"abstract":"This research offers a first analysis of hesitations in Punjabi, an under-researched language, in conjunction with a cross-linguistic comparison. We show speaker related variation in the frequency of hesitations in Punjabi. Variability was also observed in the form of filled pauses which comprised vowels or vowel-consonant sequences with nasals or obstruents. The vowels in filled pauses differed based on their segmental context and individual speakers. Automatic clustering showed that (lexical-ized) filled pauses were grouped by F0 register, instead of F0 contour. These results (1) have cross-linguistic significance and (2) provide insights for modeling hesitations in speech technological systems","PeriodicalId":224600,"journal":{"name":"Disfluency in Spontaneous Speech (DiSS) Workshop 2023","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Disfluency in Spontaneous Speech (DiSS) Workshop 2023","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21437/diss.2023-15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This research offers a first analysis of hesitations in Punjabi, an under-researched language, in conjunction with a cross-linguistic comparison. We show speaker related variation in the frequency of hesitations in Punjabi. Variability was also observed in the form of filled pauses which comprised vowels or vowel-consonant sequences with nasals or obstruents. The vowels in filled pauses differed based on their segmental context and individual speakers. Automatic clustering showed that (lexical-ized) filled pauses were grouped by F0 register, instead of F0 contour. These results (1) have cross-linguistic significance and (2) provide insights for modeling hesitations in speech technological systems