{"title":"Measuring speaking rate: how do objective measurements correlate with audio-perceptual ratings?","authors":"Jenny Iwarsson, Josefine Naes, Rikke Hollen","doi":"10.1080/14015439.2021.1988702","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Although speaking rate is central for many speech disorders, no consensus exists regarding the measurement of this feature. The purpose of this study was to examine the correlations between perceptual listener evaluations and various measures of speaking rate. Furthermore, the study investigated the relationship between speaking rate and articulation rate and how pauses in speech affect the perceived tempo.</p><p><strong>Method: </strong>Nine healthy females were selected to produce stimuli representing three habitual speech tempi during semi-spontaneous speech: slow (<i>n</i> = 3), neutral (<i>n</i> = 3) and fast (<i>n</i> = 3). Speaking rate was analyzed both by manual calculation and through automatic detection by a script to the computer-based program Praat. Thirty untrained male and female listeners evaluated the recordings with regard to speech tempo on visual analogue scales from <i>very slow</i> to <i>very fast</i>.</p><p><strong>Results: </strong>Large, significant correlations (Pearson's <i>r</i>) were found between all objective measures of speaking rate and perceptual listener evaluations. Words/minute showed the largest correlation (.91), followed by syllables/second (.89), while articulation rate (pauses excluded) as automatically measured by the script, showed the smallest correlation (.69). Possible explanations for the findings are discussed.</p><p><strong>Conclusion: </strong>Untrained listeners' evaluation of speech tempo in normal subjects correlated strongly with objective measurements. The results both support the use of auditive-perceptual evaluation of tempo and the use of automatic script analysis for clinical use. Speaking rate (pauses included) showed better consistency with perceptual listener evaluations than articulation rate.</p>","PeriodicalId":49903,"journal":{"name":"Logopedics Phoniatrics Vocology","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Logopedics Phoniatrics Vocology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/14015439.2021.1988702","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: Although speaking rate is central for many speech disorders, no consensus exists regarding the measurement of this feature. The purpose of this study was to examine the correlations between perceptual listener evaluations and various measures of speaking rate. Furthermore, the study investigated the relationship between speaking rate and articulation rate and how pauses in speech affect the perceived tempo.
Method: Nine healthy females were selected to produce stimuli representing three habitual speech tempi during semi-spontaneous speech: slow (n = 3), neutral (n = 3) and fast (n = 3). Speaking rate was analyzed both by manual calculation and through automatic detection by a script to the computer-based program Praat. Thirty untrained male and female listeners evaluated the recordings with regard to speech tempo on visual analogue scales from very slow to very fast.
Results: Large, significant correlations (Pearson's r) were found between all objective measures of speaking rate and perceptual listener evaluations. Words/minute showed the largest correlation (.91), followed by syllables/second (.89), while articulation rate (pauses excluded) as automatically measured by the script, showed the smallest correlation (.69). Possible explanations for the findings are discussed.
Conclusion: Untrained listeners' evaluation of speech tempo in normal subjects correlated strongly with objective measurements. The results both support the use of auditive-perceptual evaluation of tempo and the use of automatic script analysis for clinical use. Speaking rate (pauses included) showed better consistency with perceptual listener evaluations than articulation rate.
期刊介绍:
Logopedics Phoniatrics Vocology is an amalgamation of the former journals Scandinavian Journal of Logopedics & Phoniatrics and VOICE.
The intention is to cover topics related to speech, language and voice pathology as well as normal voice function in its different aspects. The Journal covers a wide range of topics, including:
Phonation and laryngeal physiology
Speech and language development
Voice disorders
Clinical measurements of speech, language and voice
Professional voice including singing
Bilingualism
Cleft lip and palate
Dyslexia
Fluency disorders
Neurolinguistics and psycholinguistics
Aphasia
Motor speech disorders
Voice rehabilitation of laryngectomees
Augmentative and alternative communication
Acoustics
Dysphagia
Publications may have the form of original articles, i.e. theoretical or methodological studies or empirical reports, of reviews of books and dissertations, as well as of short reports, of minor or ongoing studies or short notes, commenting on earlier published material. Submitted papers will be evaluated by referees with relevant expertise.